Chetan Kardekar
Data Engineer
Spark and Hadoop Developer having 5+ years of IT experience in Apache spark, AWS, Databricks and Hadoop ecosystems . Understands the complex processing needs of big data and have experience developing codes and modules in scala, python and java to address those needs by performing Data Analytics.
**************@*****.***
201, SwamiKrupa Residency,
Baner, PUNE, India
linkedin.com/in/chetan-kardekar-
a2820079
SKILLS
Apache Spark AWS
Scala Python
Hive Kafka SQL
Hbase Java EMR
Databricks
Elastic Search
Amazon S3 Redshift
AirFlow (Basics)
SparkStreaming
LANGUAGES
English
Full Professional Proficiency
Hindi
Full Professional Proficiency
Marathi
Full Professional Proficiency
HOBBIES
Travelling and being
gourmet
Sketching and Painting
WORK EXPERIENCE
09/2017 – Present
Big Data Engineer
QuEST Global
Client - HP
Technology Stack - AWS Services - Amazon S3, Kinesis, Redshift, EMR, Spark 2.4, Scala 2.11, Hive, Databricks, Java, Python, SQL
Experienced in gathering analytics requirements from PO's and delivering in CI/CD approach. Developed a module to clean the data coming from the source data streams of AWS kinesis. Developed a parser to convert the Json streams into parquet which is to be used by all the downstream spark jobs in project Pipeline.
Development of spark jobs using scala to process terabytes of data and optimize them using custom catalyst optimizer and other spark optimization techniques. Involved in designing of database and efficient Redshift tables. Writing scripts to deploy the spark jobs in EMR.
Migration of running the Spark Jobs from Amazon EMR to Databricks. 06/2015 – 05/2017
Big Data Engineer
Saama Technologies
Clients- CSAA Insurance Group, Swiss Mobiliar
Technology Stack - Spark 1.5.x, Elastic Search 1.x, HDP 2.3(Hive, Oozie), Scala, Python, Hbase, Kafka, Spark Streaming
Creation of Hive tables for claim and policy databases using partitioning and bucketing functionality of hive for table creation.
Development of spark jobs for creating various data stores for claims as well as policy and calculative fields to identify fraudulent claims.
Development of Email Notification feature in Scala to notify the fraud claim details to claimants through mail.
Development of cron jobs as well as Oozie Jobs.
Developing Data Scrapper for Facebook, Indeed, Twitter, Kompass.ch,etc (Swiss websites) Development of Score Model is Spark for the data scrapped from above mentioned websites. Development of Kafka Producer in Java for reading the customer care call recordings which are already converted from voice to text and Kafka consumer in scala through Spark Streaming. Developing the code in scala for performing churn analytics on the call text data and writing it to hbase.
EDUCATION
04/2012 – 04/2015
Master's Degree in Computer Applications Under Engineering G.E.S.College of Engineering affiliated to University of Pune Grade - Distinction
04/2009 – 04/2012
Bachelor's Degree in Computer Science
C.M.C.S College affiliated to University of Pune
Grade - First Class
Achievements/Tasks
Achievements/Tasks