Madhura
Gaikwad
Summary
Results-driven Data Engineer
with 10+ years of experience
across Insurance,
Automotive, Banking, and
Cybersecurity domains.
Expertise in building
scalable, automated data
pipelines in cloud and big
data ecosystems (AWS,
Hadoop, Databricks). Skilled
in processing large-scale
structured & semi-structured
data using Spark (Scala &
Python), SQL, and Kafka.
Strong experience with
CI/CD (Jenkins, GitHub),
orchestration (Airflow), and
containerization (Docker).
Proven ability to work in
Agile teams and translate
business requirements into
actionable data solutions
Skills
• Languages & Tools:
Python, Scala, SQL, Spark, Git,
Jenkins, Docker, Shell Script
• Cloud & Platforms:
AWS (EMR, S3, RDS, ECS)
Databricks, Cloudera,
Hortonworks
• Big Data:
Hadoop, Hive, Kafka, Airflow,
Postgres, Parquet, JSON, CSV
• Modeling & Reporting:
Tableau, QlikView,
Data lake/data warehouse
Big Data Engineer
+1-248-***-**** ***.*******@*****.*** Windsor, Ontario, Canada EXPERIENCE
Big Data Engineer (Sept 2022 – April 2024)
SecurityScorecard Inc.,
Toronto, ON
● Designed and implemented ingestion pipelines for associating IPs/domains with companies across structured and semi- structured data sources
● Boosted data processing efficiency by 25% using Spark
(Scala/Python) and AWS (EMR, S3, RDS, ECS)
● Built and optimized PostgreSQL tables; applied efficient partitioning and indexing techniques
● Developed and monitored CI/CD pipelines using Jenkins and GitHub; automated data workflows using Airflow, Docker, cron, shell script
Big Data Engineer (Nov 2019 – June 2020)
Pattonlabs- (Contract at Blue Cross Blue Shield),
Detroit, MI
● Built robust Scala-Spark ETL pipelines to ingest, cleanse and process healthcare claims and member data into Hadoop data lake
● Improved processing time and job reliability while supporting agile delivery using Jira and Git
Data Analyst- Data Engineer (Sept 2017 – Nov
2019)
TEKsystems-(Contract at Ford Motor Company),
Dearborn, MI
● Identified and enhanced new and existing data sources (third party data, warranty, safety and Manufacturing) to optimize business opportunities
● Migrated safety data (NHTSA) from SAS to Hive in Hadoop environment; implemented partitions and buckets to enhance query performance downstream safety analytics
● Integrated Kafka-streamed data and transformed it using Spark
● Ensured compliance with GDPR by designing secure handling processes for PII data
Senior Software Engineer (Aug 2015 – Sept 2016)
Danske Bank-Denmark,
Bangalore, India
● Engineered Spark jobs with Scala to handle large-scale
• Methodologies:
Agile/Scrum,
CI/CD, Version Control,
Data Governance (GDPR)
KEY
ACHIEVEMENTS
Resolved a critical bug
retaining a $1M customer at
SecurityScorecard
CERTIFICATIONS
• Coursera- Python and
Pandas for Data Engineering
-Duke University
• Coursera - Virtualization,
Docker, and Kubernetes
for Data Engineering -
Duke University
AWARDS
ITC Infotech - 'Spot Award'
for Outstanding
performance
Wipro - 'Feather in My Cap'
for Client Recognition
Ford Motor Company –
'Recognition of Teamwork'
aggregations and OLTP integration via Sqoop
● Managed Agile sprints and served as Scrum Master for an 11- member team
● Maintained Hive/DB2 environments and designed end-to-end ETL architecture documentation
Associate IT Consultant (July 2013 – July 2015)
ITC Infotech,
Bangalore, India
• Performed analysis, design, development, testing, review and implementation of Hadoop, Hive and Mainframe Projects
• Oversaw rollout of project at client location in Copenhagen, Denmark
Senior Software Engineer (Nov 2008 – July 2013)
Wipro Technologies,
Pune, India
• Executed various Insurance and Banking projects for clients like Friends Provident, LIC, Citibank, ABSA
• Led design, development, unit testing, review and implementation of Mainframe Projects
Education
Master of Science in Software Engineering
Birla Institute of Technology and Science-Pilani, Rajasthan India Bachelor of Computer Science
The University of Pune - Maharashtra, India