SENIOR DATA ENGINEER
SUBIKSHA RANGANATHAN
EDUCATION
SASTRA University
SKILLS
AWS
Linux
Python
HDFS
Kafka
PySpark
Streaming
Batch Processing
MySQL
Spark-SQL
Azure
Data Frames
Data Factory
Hive
Databricks
Apache Spark
AWS Glue
Azure Synapse
AWS Redshift
WORK EXPERIENCE
OAK STREET HEALTH
August 2024 - Present
SENIOR DATA ENGINEER
TISTA SCIENCE AND TECHNOLOGY CORPORATION
June 2023 - August 2024
SENIOR DATA ENGINEER
Facilitated POCs for data migration to Azure, delivering 100% efficiency.
Achieved 100% consistency in a team of 4 members while driving POCs on data workflows in Azure Data Factory.
Strengthened POCs to set up a DevOps environment using Jira and Azure code deploy on Databricks and ADLS Gen2 environments.
Achieved proficiency in Azure Data Factory, Databricks, ADLS Gen2, Delta Lake, and other Azure services.
PROFILE
Skilled Big Data Engineer with a 9-year record of designing and implementing efficient ETL processes.
Proficient in Python, SQL, PySpark, Azure Cloud, and cloud-based data platforms. Adept at collaborating
with cross-functional teams to deliver high-quality solutions, saving around 20% in project costs.
3070 Ballester Road, Indian Land, SC 29707
*************@*****.***
B.Tech - Electronics and Communication Engineering
CONTACT
Led the design and implementation of a data pipeline, reducing data processing time by 25%.
Reduced redundant activities across 3 departments by building out data warehouse, data modelling, and data ingestion activities.
Implemented data quality checks, resulting in a 15% improvement in data accuracy.
Conducted performance tuning on SQL queries,
optimizing data retrieval by 20%.
SENIOR DATA ENGINEER
PNC BANK
DECEMBER 2018 - JUNE 2022
Orchestrated a retail lending project using PySpark and AWS Cloud, optimizing lending processes for new banks and achieving a 20% reduction in processing time.
Applied data cleaning techniques, resolving 95% of duplicates, null values, and datatype issues, ensuring data integrity for downstream teams.
Implemented Slowly Changing Dimensions (SCD) strategies, handling updates seamlessly and enhancing data accuracy by 25%.
DATA ENGINEER
SAFECO INSURANCE
OCTOBER 2016 - NOVEMBER 2018
Mastered Hive queries with 100% consistency by creating and querying Hive tables to retrieve useful analytical information.
Initiated the use of Sqoop to import large data sets on 10 machines from traditional RDBMS to HDFS.
Pioneered data crunching, data ingestion, and data transformation activities.
TEST ENGINEER
WIPRO TECHNOLOGIES
JUNE 2013 - JUNE 2018
Developed API automation scripts using Rest Assured and Postman.
Reviewed functional/design specifications and other relevant documents to extract test requirements.
Identified test cases and automated them using Selenium WebDriver, Maven, TestNG, and Java (Eclipse IDE), and contributed to the creation of the framework.
Made extensive use of Selenium WebDriver in Java to automate UI test case execution.
Performed database testing, writing and executing complex SQL queries.