Post Job Free

Resume

Sign in

Data Engineer Analysis

Location:
Hayward, CA
Salary:
80000
Posted:
December 04, 2023

Contact this candidate

Resume:

EDUCATION

Masters in computer science, CALIFORNIA STATE UNIVERSITY, EAST BAY, GPA - 3.9 /4.0 MAY 2023 Relevant Courses: Statistics and probability, ML, Web systems development, Advanced algorithms, Programming for software engineering, Database management/development.

VENKATA RAMANA A

Hayward, California, US ad1oxl@r.postjobfree.com 510-***-**** linkedin My_Github Tableau SUMMARY

Results-driven Data Engineer with 3 years of experience of designing, developing, and implementing ETL solutions for development and complex data integration projects. Skilled in data base development, data modeling, data analysis, and design for data warehousing. Proficient in Cloud technologies, and documentation of the entire data life cycle. Experience in working with DBAs, data analysts, developers, and production support team to identify and fix data issues that across platforms. Experienced in working in a dynamic and high- pressure environment. Ability to manage multiple tasks simultaneously while ensuring timely completion. A team player with excellent communication and problem-solving skills.

TECHNICAL SKILLS

Tools: Azure, Hadoop, Apache Spark, AWS, Apache airflow, Kafka, JIRA, GIT, Excel, Tableau, PowerBI, Docker, Kubernetes, ETL, Snowflake, AWS Glue, Azure Synapse, Azure Data Factory, and Data bricks.

Data Analysis Activity: Data Modelling, DB Performance Tuning, Data Profiling, Data Mining/analysis, Data wrangling.

Scripting Languages: Unix Shell Scripting, Bash Scripting, HTML, CSS. Databases: DB2, MS SQL, Oracle, NoSQL, Cassandra.

Programming: Proficient in Python, SQL, Advanced SQL, PL/SQL, C++, Java, JavaScript. EXPERIENCE

Data Engineer, GROWME.AI, CA Mar’2023 - Present

Collaborated with teams to identify and collect data from diverse datasets related to college applications information and store in a centralized data warehouse at ADLS GEN2. Developing and maintaining ETL pipelines using Python (Pandas, NumPy), SQL, and Pyspark in Azure data factory and used Apache airflow to schedule and trigger ADF pipelines.

Performing Exploratory Data Analysis and generated insights to help high school students make informed college decisions.

Developed dashboards and reports using Tableau to present key findings, target audience, trends...etc. using KPI’s to stakeholders. Data Engineer, Banner Health Oct’2022 – Mar’2023

Tasked with analyzing and optimizing the sales data for the company’s pharma products . Collecting, cleaning using python and SQL, and integrating multi source Banner health sales data into Azure Synapse using optimized ETL to reduce ingestion time by 25%.

Utilized window functions to increase the underperforming category sales by 10% and specific underperformed regional sales by 18%.

Scheduled pipelines at regular intervals using Apache airflow. Building reports on result data using PowerBI embedding T-SQL queries.

. Continuous learning in data manipulation, querying, visualization, and reporting. Data Engineer, Cal Fresh Hope Nov’2021 – Oct’2022

Tasked with on CSU student’s food stamps project. Collaborated and collected students’ financial data from respective CSU campuses and performed ETL. Designed and developed data pipelines to Extract, Transform, and load data from various sources into a centralized data repository in Azure ADLS GEN 2 for analytics. Conducted data cleaning and enriching using SQL, pandas, and other python libraries to ensure the completeness of data. Implemented automation whenever needed.

Performed Descriptive and Exploratory Data Analysis to identify students who meet the criteria for being underpaid.

Also created insightful dashboards with the enriched data using tableau to let stakeholders understand the behavior of data easily. With the usage of Azure Data Factory pipelines the orchestration time reduced by 16% and data warehousing costs reduced by 22%. Junior Data Engineer, OSI Technologies Oct’2020 – June’2021

As a Junior Data engineer, proactively participated in the monitoring and optimization of ETL performance, undertaking tasks such as database tuning, query optimization, and performance enhancements for ETL workflows. Contributed significantly to data definitions, data storage infrastructure enhancement, and data mining architectures, promoting a data-centric approach to decision-making.

Worked closely with cross-functional teams, including software developers, testers, and product managers, to understand project requirements and deliver software solutions that met or exceeded expectations. Machine Learning Intern, Internshala Apr’2020 – Aug’2020

Prepared training and testing data which is scraped using beautiful soup for Machine Learning team and then applied feature scaling and ML algorithms to it to build an ML model on the given housing data. This improved the accuracy by 18%. ACADEMIC PROJECTS

1. Covid 19 Reporting and Analysis - Technology and skills: Azure, ADF, ADLS Gen2, PowerBI, Azure synapse, Databricks - Covid19. 2. YouTube Data Analysis end-end project - Technology and skills: AWS, AWS Glue, AWS Athena, AWS Lambda - Youtube_Data_Analysis 3. F1 racing data analysis - Technology and Skills: Azure, Databricks, Spark core, SparkSQL, ETL. - F1_racing 4. NYC Taxi data - Technology and skills: Azure, Synapse, Serverless SQL, PowerBI - NYC_Taxi_Data



Contact this candidate