Data Engineer Big

Location:
Annandale, VA
Posted:
April 17, 2024

Resume:

Sai Rohit U

ad42vg@r.postjobfree.com +1-716-***-**** linkedin.com/in/sairohituddagiri

SUMMARY

Data Engineer with 4+ years of experience in designing and implementing data solutions using cloud-based platforms. Skilled in ETL development, dimensional modeling, and data analysis, demonstrated through successful projects in healthcare and financial sectors. Proficient in Python, SQL, and various big data technologies, with a Master's degree in Data Sciences and hands-on certifications in AWS and Snowflake.

SKILLS

Programming Languages: Python, R, SQL, JavaScript, HTML, CSS, React, REST API

Big Data Technologies: Snowflake, Google BigQuery, Azure, AWS, Databricks, Informatica, Apache Spark, Hadoop

Databases: MySQL, PostgreSQL, SSIS, Oracle, Teradata, MongoDB, PL/SQL, T-SQL

Data and ML Skills: Tableau, Power BI, TensorFlow, Scikit, NumPy, Pandas, HuggingFace

DevOps: Git, Docker, Kubernetes, Linux, Shell scripting, Version Control, GitLab, JIRA, Jenkins, Agile

EXPERIENCE

PCE Lab, Buffalo, NY Mar 2023 – Dec 2023

MACHINE LEARNING RESEARCH ASSISTANT

• Developed a data model using business rules and an ERD for an innovative data management system, resulting in a 25% decrease in data retrieval time.

• Used SQL, Python, and statistical analysis tools to analyze patient survey datasets, identifying key trends and patterns that informed decision-making and resulted in a 15% increase in operational efficiency.

• Engineered a patient data analysis tool on Google BigQuery, resulting in a 40% reduction in data processing time for healthcare professionals managing diabetes patients.

Deloitte, Hyderabad, India Feb 2021 – July 2022

SENIOR DATA ENGINEER

• Collaborated with data architects to optimize a data warehousing solution on Snowflake (an MPP platform), streamlining data flows, cutting monthly storage by 30+ TB, and supporting real-time analytics of clinical trial data.

• Communicated with business stakeholders to understand data requirements and collaborated with 4 data engineers on the ETL implementation, resulting in a 100% client satisfaction rate.

• Enhanced ETL pipeline processes by automating data extraction from SQL databases and flat files into Snowflake using the IICS integration tool, reducing manual effort by 75% and ensuring timely data availability for analysis.

• Developed a data transformation framework using Apache Spark in an EMR environment, making previously inaccessible datasets available to 150+ end clients.

• Built Continuous Integration and Delivery pipelines leveraging Docker and Kubernetes, decreasing deployment time by 40% and ensuring consistent and reliable releases.

TATA Consultancy Services, Bangalore, India Jan 2019 – Jan 2021

DATA ENGINEER

• Migrated a Financial Data Warehouse of 100PB+ data from Teradata (on-premises) to Snowflake Data Warehouse (cloud), resulting in a more efficient and cost-effective data storage solution for the company.

• Developed and automated a data processing pipeline using AWS Glue that processed over 10 TB of data per month, improving the handling of over 3 billion financial transactions monthly and increasing processing speed by 25%.

• Implemented a pipeline to automate data loading from AWS S3 to Snowflake using AWS SQS, Snowpipe, and Snowflake Tasks, increasing data accuracy and reducing manual effort by 60%.

• Developed and implemented real-time streaming data processing solutions using Kafka, resulting in a 30% increase in overall data processing efficiency.

CERTIFICATIONS

• Snowflake Hands on Essentials – Data Engineer.

• AWS Certified Cloud Practitioner, AWS – Storage Data Migration.

EDUCATION

University at Buffalo, The State University of New York, Buffalo, New York

Master of Science in Data Sciences and Applications [GPA: 4.0/4.0] Dec 2023


