data engineer, SQL, Python, ETL Operations

Location:

Kansas City, KS, 66101

Posted:

January 16, 2024


Resume:

BHAGYA SREE PALAPARTHI

913-***-**** *************@*****.*** Overland Park, Kansas

Summary:

Enthusiastic and detail-oriented Data Analytics Intern with a strong background in data analysis and a passion for leveraging insights to inform business decisions. Seeking an internship opportunity with Cox to contribute to the Intelligence team's projects and gain hands-on experience in a dynamic and collaborative environment.

Education:

Master of Science in Computer Science: GPA 3.6/4.0

University of Central Missouri – Missouri, Jan 2023 to May 2024

Bachelor of Science and Technology: GPA 3.7/4.0

University College of Engineering JNTU Narasaraopet, Aug 2017 to June 2021

Technical Skills:

Programming Languages and Frameworks: Python, Spark, Hive

Cloud Platforms: Azure Cloud

Database and Data Warehouse: SQL, NoSQL

Data Processing: ETL processes, Data Transformation

Tools: Azure Data Factory, Azure Databricks

Version Control: Git

Data Visualization: Power BI

Work Experience:

Data Engineer – Infosys (Hyderabad, India), Sept 2021 - Dec 2022:

• Spearheaded the design and implementation of scalable data solutions in Azure Cloud, ensuring efficient processing and management of large datasets.

• Executed end-to-end data pipeline development, from data ingestion to storage and retrieval, optimizing workflows for enhanced performance.

• Collaborated with cross-functional teams to identify business requirements and implemented data solutions to address specific needs.

• Conducted performance tuning and troubleshooting for data pipelines, ensuring seamless operation and minimizing downtime.

• Experienced in working with Azure Blob Storage and Data Lake Storage and in loading data into Azure Synapse Analytics (DW).

• Proficient in writing complex Spark (PySpark) user-defined functions (UDFs), Spark SQL, and HiveQL.

• Experience working with Azure services such as Data Lake, Data Lake Analytics, SQL Database, Synapse, Databricks, and Data Factory.

• Worked with data warehousing, ETL, and reporting tools such as Informatica and Power BI.

• Familiar with Agile and Waterfall methodologies. Handled several client-facing meetings with strong communication skills.

• Working knowledge of Azure cloud components (Databricks, Data Lake, Blob Storage, Data Factory).

• Hands-on experience with Azure Cloud Services, Azure Synapse Analytics, Azure SQL, Data Factory, and Azure Data Lake.

• Created batch and streaming pipelines in Azure Data Factory (ADF) using linked services, datasets, and pipelines to extract, transform, and load data.

• Created Azure Data Factory (ADF) batch pipelines to ingest data incrementally from relational sources into Azure Data Lake Storage (ADLS Gen2) and then load it into Delta tables after cleansing.

• Created Hive tables, and loaded and analyzed data using Hive queries.

• Wrote Hive queries on the analyzed data for aggregation and reporting.

• Transformed and copied data from JSON files stored in Data Lake Storage into an Azure Synapse Analytics table using Azure Databricks.

• Used several RDD transformations to filter the data loaded into Spark SQL.

• Maintained and enhanced data warehouses, ensuring data integrity, security, and availability.

Projects:

Songs Listening Analysis (AWS)

Collected a songs dataset from Kaggle and built an ETL pipeline that extracts the data from S3, stages it in Redshift, and transforms it into a set of dimensional tables, which are analyzed in Power BI to find insights into what songs users are listening to.

Data Lake with Spark & AWS S3

Built an ETL pipeline over song and log files in S3, applying transformations such as partitioning, lambda functions, and filters in PySpark to convert the data into fact and dimension tables.

Certifications:

Microsoft Azure Fundamentals (AZ-900), Microsoft Azure Data Fundamentals (DP-900), HackerRank SQL (Basic and Intermediate) certifications, IBM SQL certification.
