Data Engineer Machine Learning

Location:
Los Angeles, CA, 90012
Salary:
80000
Posted:
July 08, 2024

Resume:

MANJUNATH CHIMBILI

Data Engineer

***************@*****.*** +1-929-***-**** Jersey City, NJ LinkedIn GitHub

PROFESSIONAL SUMMARY:

Results-oriented Data Engineer with 4 years of experience specializing in processing and analyzing numeric data.

Proven expertise in designing efficient data pipelines, ensuring data integrity, and optimizing data storage solutions.

Strong background in SQL, Python, and data warehousing technologies.

Collaborates with cross-functional teams to deliver actionable insights and drive data-driven decision-making.

PROFESSIONAL EXPERIENCE:

UPS, NJ Mar 2023 – Present

Data Engineer

Built scalable ETL procedures in Python that ingest, clean, and transform large datasets from several sources, cutting data processing time by 30%.

Used Apache Spark and Kafka to build data pipelines that support real-time analytics and processing.

Improved query performance by 25% and shortened data retrieval times through the optimization of SQL queries and database schemas.

Executed advanced data analytics using PySpark, boosting customer targeting strategy by 40%.

Engineered and launched an Apache Kafka-based real-time data processing pipeline that processed 7 TB per day and reduced latency by 65%.

Collaborated with a cross-functional team of data scientists and machine learning engineers to build predictive models, increasing sales by 15%.

Implemented Docker-based deployment workflows, streamlining continuous integration/continuous deployment (CI/CD) processes by 30%.

Architected and maintained PostgreSQL databases, resulting in a 50% improvement in query response time.

Designed and implemented machine learning models in conjunction with data scientists for predictive analytics using numerical datasets.

Applied rigorous testing and validation processes to ensure the integrity and quality of the data.

Created a real-time data processing system that can handle streaming data from Internet of Things devices and provide real-time analytics and monitoring using Spark Streaming and Apache Kafka.

Trigent, India Jul 2018 – May 2021

Data Engineer

Helped build data marts and warehouses that enabled efficient storage and retrieval of numerical data.

Generated and analysed data using sophisticated SQL queries, giving business stakeholders useful insights.

Automated data extraction and transformation with Python scripts, increasing workflow efficiency by 20%.

Conducted ETL operations using Apache Spark and Hadoop, reducing data preparation time by 60%.

Enhanced data warehousing strategy with data modelling techniques, leading to a 20% improvement in data interpretation.

Optimized SQL queries with efficient schema design, decreasing data redundancy by 25%.

Developed and managed AWS Redshift clusters to handle massive sets of raw data, leading to enhanced system performance.

Oversaw an optimization effort on Amazon Redshift for an already-existing data warehouse.

Advised on moving the on-premises data infrastructure to the AWS cloud, which improved reliability and scalability.

Significantly shortened data processing time through performance tuning and optimization of ETL procedures.

Project comprised schema redesign and refined ETL procedures, which decreased query response times by 40%.

TECHNICAL SKILLS:

Programming Languages: Python, SQL, Java, R, Scala, Shell Scripts, SAS.

Data Engineering Tools: Apache Spark, Apache Kafka, Airflow.

Databases: MySQL, PostgreSQL, MongoDB, Oracle, NoSQL.

Data Warehousing: Amazon Redshift, Google BigQuery, Snowflake.

Big Data Technologies: HBase, Apache Spark, Hadoop, Hive, Kafka, Pig, MapReduce.

ETL Tools: Talend, Informatica, AWS Glue.

Cloud Platforms: AWS, Google Cloud Platform (GCP), Microsoft Azure.

Data Visualization Tools: Tableau, Power BI, Matplotlib, Seaborn.

Other Tools: Docker, Git, Jenkins, Terraform, Linear Regression, Logistic Regression, Decision Trees, Random Forests.

EDUCATION:

Masters in Artificial Intelligence, University at Buffalo, Buffalo, NY, Dec 2022

Bachelors in Mechanical Engineering, Jawaharlal Nehru Technological University, Hyderabad, Telangana, Jun 2019

CERTIFICATION:

AWS Solutions Architect Associate: AWS Certified

Azure Fundamentals: Azure Certified


