ASHMITHA PARUCHURI BALAJI
Phone: 510-***-**** Email: ad1i7l@r.postjobfree.com LinkedIn Location: Milpitas, California
Summary
Experienced Azure Data Engineer, well versed in SQL, Python, and cloud technologies. Results-driven, ambitious engineer with a constant drive to learn new technology stacks to accomplish the tasks at hand and an eye for optimizing projects.
Skills
Languages Python, SQL
Databases MySQL, Postgres
Libraries & Tools Airflow, PySpark, Microsoft Power BI, Pandas, Matplotlib, Microsoft Excel
Cloud Technologies Azure (Data Factory, Databricks, Data Lake, Synapse, Blob Storage), AWS (S3, EC2, Redshift)
Other Docker, Git, CSV, JSON, Linux, Windows
Employment History
Azure Data Engineer, IT OPENDOORS Feb 2022 – April 2023
Technologies – Python, SQL, Azure Data Factory, Data Lake Gen2, Databricks, Synapse
• Contributed to on-premises to cloud data migration.
• Gathered requirements through direct interaction with the client.
• Performed ETL on large datasets.
• Developed pipelines, linked services, and datasets in Azure Data Factory.
• Scheduled ADF pipelines with tumbling window triggers to automate jobs.
• Stored and processed data using Azure Data Lake Gen2 and Azure Databricks.
• Attended daily stand-up calls and sprint planning meetings.
Academic Projects (Selected)
Twitter Data Pipeline
• Developed a Python script to collect tweets for a given user using Twitter APIs.
• Developed a Python script to extract and transform the required data from the tweets and store it in CSV files.
• Deployed Apache Airflow on AWS EC2 to automate the above process and store the CSV files in AWS S3.
Formula One Data Pipeline
• Configured Azure Databricks clusters for efficient data processing.
• Applied PySpark and Spark SQL for intricate data transformations and implemented Delta Lake for secure and versioned data management.
• Orchestrated end-to-end data pipelines in Azure Data Factory.
Education
Bachelor of Technology, Anna University, India June 2013 – Apr 2017
Major – Information Technology