SANDEEP AHUJA
LEAD DATA ENGINEER
TECH STACK
Python Programming
DBMS: Teradata, Oracle,
Redshift, PostgreSQL
AWS: Glue, Lambda, SNS,
S3, Aurora, Redshift
Terraform
Snowflake: Snowpipe,
Virtual Warehouses
Hadoop: Big Data, HDFS,
Spark, Hive, Hue, Sqoop.
CI/CD: Jenkins, UDeploy,
Git
ETL: Informatica, Glue.
Agile: Rally, Jira.
Other: Unix scripting,
SQL, HiveQL.
INDUSTRY CERTIFICATIONS
CONTINUOUS LEARNING
SnowPro Core Certification – Jun 2021
AWS Certified Developer Associate – Mar 2021
AWS Certified Architect Associate – Apr 2021
AWS Certified Cloud Practitioner – Mar 2021
Hortonworks Certified Associate (HCA) – Sep 2018
Hortonworks Certified Spark Developer
(HDPCD)
CCA Spark and Hadoop Developer (CCA175)
Databricks Certified Developer Apache Spark 2.x
MapR Certified Data Analyst – MCP
MapR Certified HBase Developer (MCHBD 1.0)
Other courses to gain conceptual knowledge
on various AI/ML algorithms and techniques.
EXPERIENCE
Built ETL jobs on AWS Cloud: Lambda or Glue
(PySpark/Python).
Contributed a reusable Python script that calls a REST API and stores the JSON response in an RDBMS table.
Loaded nested ND-JSON into tables via PySpark.
Delivered an AWS Lambda utility (object-oriented Python) to orchestrate execution of the data pipelines used by over 15 teams enterprise-wide.
Built a Jenkins solution to execute Glue jobs.
Built an event-based solution to load S3 file data into an RDBMS table.
Ingested external vendor data into Cigna cloud storage (Redshift and Teradata Vantage).
Aug 2020 - Present Evernorth (Cigna)
SOFTWARE ENGINEERING SENIOR ADVISOR
EDUCATION
Bachelor of Engineering July 2006 - June 2010
Computer Science
Rajiv Gandhi Proudyogiki Vishwavidyalaya, Bhopal (India)
PERSONAL PROFILE
A Data Engineer and delivery lead handling multiple Big Data/Cloud projects, leading a data team at Evernorth (Cigna) that is responsible for gathering data from various sources, transforming it, and storing it optimally in AWS Cloud, and for securely sharing data with external vendors as per partner agreements. My focus on data integrity and cross-functional integration across teams and domains has consistently improved overall efficiency and optimized data pipelines. Delivered projects worth $1.2M this year within business-critical, agreed-upon timelines. Hired more than 19 engineers, both onshore and offshore. Mentored my team of engineers from their first day of onboarding, coaching them and keeping them engaged and motivated by regularly sharing fun, challenging problems and driving team-building sessions.
What I will contribute: Leverage my 12+ years of industry knowledge to build high-quality data pipelines, collaborate with clients and business users to improve satisfaction, and establish a positive, motivated work environment in which the team works as a single unit to deliver, learn and grow together.
CONTACT
Phone: +1-412-***-****
Email: adyl9v@r.postjobfree.com
LinkedIn: www.linkedin.com/in/sandeep-ahuja/
Completed a POC project on the external D&B API during the RFP phase and presented a visual flow of the business use case, which helped finalize the terms of a new contract and saved $800K.
Involved in data modeling and converting business
requirements into technical design, building,
testing and deploying ETL code to load data.
Organized and participated in brainstorming
sessions with BAs, SAs and subject matter experts to
find optimal solutions.
July 2015 - Aug 2020 Accenture USA (Onshore)
DATA ENGINEER, ETL DEVELOPER
Worked with data modelers when creating new
tables and choosing optimal columns for
indexing.
Designed ETL solutions for the data warehouse:
dimensions, fact tables.
Developed a reusable Unix shell script to load ESRI
(geometry) data into PostgreSQL.
Saved over 1 TB of space each year by creating an
archival solution.
Automated daily sales file processing, which
reduced manual errors and saved $50K/month.
Sep 2010 - June 2015 Accenture India (Offshore)
ETL DEVELOPER, SUPPORT ANALYST