Manideep Reddy
Data Engineering
FL ****************@*****.*** 774-***-**** Linked In
SUMMARY
• Experienced Data Engineer with 5+ years of expertise in designing, building, and maintaining robust data infrastructure and pipelines.
• Proficient in utilizing technologies such as Python, SQL, Hadoop, and Spark to process and manage large datasets efficiently.
• Skilled in data modeling, ETL processes, and database management, with a proven track record of optimizing data workflows and ensuring data integrity.
• Adept at collaborating with data scientists, analysts, and cross-functional teams to deliver scalable data solutions that support business objectives. Committed to continuous learning and leveraging the latest tools and techniques to enhance data engineering practices and drive innovation. EXPERIENCE
BlackRock, Inc. NY Data Engineer Apr 2023 - Present
• Architected an end-to-end data processing pipeline that consolidated multi-source datasets, boosting analytics efficiency by 40%.
• Optimized data retrieval processes using partitioning and indexing strategies in NoSQL databases, reducing latency by 50%.
• Spearheaded the migration of large-scale data warehouses to cloud-based platforms, leading a team of 5 and improving system scalability.
• Developed complex ETL scripts to clean, transform, and integrate 10TB+ datasets, which enhanced data quality for critical business decisions.
• Initiated a company-wide data governance strategy that enforced data standards and streamlined data access for cross-functional teams.
• Experienced in using Azure Data Factory for orchestrating and automating data workflows and ETL processes.
• Experienced in managing and securing data environments with Azure's comprehensive suite of data services and tools.
Zensar Technologies Limited India Data Engineer July 2018 - July 2022
• Experienced in leveraging AWS services like EC2, S3, Redshift, and Glue for scalable and efficient data engineering solutions.
• Proficient in using Python for developing data processing pipelines, performing data analysis, and implementing machine learning algorithms.
• Proficient in using Power BI for creating interactive data visualizations and reports to support data-driven decision-making.
• Skilled in using Kubernetes for orchestrating containerized applications and managing scalable deployments across clusters.
• Experienced in using Amazon Redshift for scalable, high-performance data warehousing and complex query processing.
• Experienced in integrating Apache Kafka with data processing frameworks for real-time analytics.
• Skilled in AWS EMR for big data processing using Hadoop, Spark, and other distributed frameworks. CERTIFICATIONS
• AWS Certified Developer Associate (AWS DVA-C02) Expiration date: Dec 2026
• Machine Learning with Python from IBM
• Data Science with Python and R programming
• Data Visualization using Tableau
PROJECTS
Job Portal
• Currently developing a sample job portal application using Java, Spring Boot, and MySQL which will serve a ReactJS front end.
Anime App
• Developed a dynamic React Native application designed to retrieve and present data through the MyAnimeList
(MAL) API
Earthquake App
• Developed a React application that retrieves and displays real-time earthquake data by implementing the United States Geological Survey (USGS) API
Cloud Resume
• Launched my resume on the cloud using AWS S3 static hosting.
• Set up an S3 bucket for secure storage and configured static hosting for optimal performance. SKILLS
Programming Languages: Python, Java, Scala, SQL, PL/SQL, T-SQL, Unix Shell Scripting Big Data Technologies: Hadoop, Spark, Hive, Pig, Sqoop, MapReduce, HBase, EMR Methodologies: SDLC, Agile, Waterfall
DevOps Tools: Git, Jenkins, Docker, Kubernetes
ETL Tools: SSIS, Informatica PowerCenter, Erwin, Talend, Data Stage Databases: SQL Server, PostgreSQL, MySQL, Oracle, Snowflake, DynamoDB Data Pipelines: Apache Airflow, AWS Step Function, Luigi, Prefect, Oozie Streaming Technologies: Amazon Kinesis, Apache Spark, Apache Kafka, Apache Hive Libraries: Pandas, NumPy, Matplotlib, SciPy, ScarPy, TensorFlow, PyTorch, Scikit-learn, NLTK, Plotly, Keras, PyMc3 Data Visualization: Microsoft Excel, Power BI, Tableau, IBM Cognos, QlickView, QuickSight, Seaborn, SSRS Cloud Platforms: AWS (EC2, S3, Redshift, Glue), Azure (Azure Data Factory, Azure Databricks), GCP Data Warehousing: Amazon (Redshift, Dynamo DB, RDS, Athena), Azure (Synapse, BLOB, Data Lake), Big Query, Teradata, Snowflake
EDUCATION
Clark University, Worcester, MA
Masters in Data Analytics
Sathyabama Institute of Technology and Science (SITS), Chennai, India Bachelor of Electronics and Communication Engineering