
Data Engineer Engineering

Location:
Visnagar, Gujarat, India
Posted:
July 26, 2024


Resume:

Manideep Reddy

Data Engineering

FL ****************@*****.*** 774-***-**** LinkedIn

SUMMARY

• Experienced Data Engineer with 5+ years of expertise in designing, building, and maintaining robust data infrastructure and pipelines.

• Proficient in utilizing technologies such as Python, SQL, Hadoop, and Spark to process and manage large datasets efficiently.

• Skilled in data modeling, ETL processes, and database management, with a proven track record of optimizing data workflows and ensuring data integrity.

• Adept at collaborating with data scientists, analysts, and cross-functional teams to deliver scalable data solutions that support business objectives. Committed to continuous learning and leveraging the latest tools and techniques to enhance data engineering practices and drive innovation.

EXPERIENCE

BlackRock, Inc. NY Data Engineer Apr 2023 - Present

• Architected an end-to-end data processing pipeline that consolidated multi-source datasets, boosting analytics efficiency by 40%.

• Optimized data retrieval processes using partitioning and indexing strategies in NoSQL databases, reducing latency by 50%.

• Spearheaded the migration of large-scale data warehouses to cloud-based platforms, leading a team of 5 and improving system scalability.

• Developed complex ETL scripts to clean, transform, and integrate 10TB+ datasets, which enhanced data quality for critical business decisions.

• Initiated a company-wide data governance strategy that enforced data standards and streamlined data access for cross-functional teams.

• Experienced in using Azure Data Factory for orchestrating and automating data workflows and ETL processes.

• Experienced in managing and securing data environments with Azure's comprehensive suite of data services and tools.

Zensar Technologies Limited India Data Engineer July 2018 - July 2022

• Experienced in leveraging AWS services like EC2, S3, Redshift, and Glue for scalable and efficient data engineering solutions.

• Proficient in using Python for developing data processing pipelines, performing data analysis, and implementing machine learning algorithms.

• Proficient in using Power BI for creating interactive data visualizations and reports to support data-driven decision-making.

• Skilled in using Kubernetes for orchestrating containerized applications and managing scalable deployments across clusters.

• Experienced in using Amazon Redshift for scalable, high-performance data warehousing and complex query processing.

• Experienced in integrating Apache Kafka with data processing frameworks for real-time analytics.

• Skilled in AWS EMR for big data processing using Hadoop, Spark, and other distributed frameworks.

CERTIFICATIONS

• AWS Certified Developer - Associate (DVA-C02), expires Dec 2026

• Machine Learning with Python from IBM

• Data Science with Python and R programming

• Data Visualization using Tableau

PROJECTS

Job Portal

• Currently developing a sample job portal application using Java, Spring Boot, and MySQL, which will serve a ReactJS front end.

Anime App

• Developed a dynamic React Native application that retrieves and presents data through the MyAnimeList (MAL) API.

Earthquake App

• Developed a React application that retrieves and displays real-time earthquake data using the United States Geological Survey (USGS) API.

Cloud Resume

• Launched my resume on the cloud using AWS S3 static hosting.

• Set up an S3 bucket for secure storage and configured static hosting for optimal performance.

SKILLS

Programming Languages: Python, Java, Scala, SQL, PL/SQL, T-SQL, Unix Shell Scripting

Big Data Technologies: Hadoop, Spark, Hive, Pig, Sqoop, MapReduce, HBase, EMR

Methodologies: SDLC, Agile, Waterfall

DevOps Tools: Git, Jenkins, Docker, Kubernetes

ETL Tools: SSIS, Informatica PowerCenter, Erwin, Talend, DataStage

Databases: SQL Server, PostgreSQL, MySQL, Oracle, Snowflake, DynamoDB

Data Pipelines: Apache Airflow, AWS Step Functions, Luigi, Prefect, Oozie

Streaming Technologies: Amazon Kinesis, Apache Spark, Apache Kafka, Apache Hive

Libraries: Pandas, NumPy, Matplotlib, SciPy, Scrapy, TensorFlow, PyTorch, Scikit-learn, NLTK, Plotly, Keras, PyMC3

Data Visualization: Microsoft Excel, Power BI, Tableau, IBM Cognos, QlikView, QuickSight, Seaborn, SSRS

Cloud Platforms: AWS (EC2, S3, Redshift, Glue), Azure (Azure Data Factory, Azure Databricks), GCP

Data Warehousing: Amazon (Redshift, DynamoDB, RDS, Athena), Azure (Synapse, Blob Storage, Data Lake), BigQuery, Teradata, Snowflake

EDUCATION

Clark University, Worcester, MA

Master's in Data Analytics

Sathyabama Institute of Technology and Science (SITS), Chennai, India

Bachelor of Electronics and Communication Engineering


