Yogesh Reddy Kovvuru
Hyderabad, India — 824-***-**** — **************@*****.*** — LinkedIn
Professional Summary
Data Engineer and Backend Developer with 4+ years of experience building the infrastructure required for the optimal extraction, transformation, and loading (ETL) of data at scale. Solid expertise in Python and SQL, actively leveraging AWS, GCP, and Databricks ecosystems to power robust big data architectures. Proven track record of designing scalable data pipelines, automating manual processes, and optimizing data delivery. Strong collaborator adept at working with Technical Architects, Product Owners, and stakeholders to resolve technical challenges and drive innovative data analytics solutions.
Technical Skills
• Programming: Python (Solid Expertise), SQL, PySpark
• Data Engineering: ETL/ELT Pipelines, Data Pipeline Architecture, Data Ingestion & Transformation
• Cloud Platforms: AWS (EMR, EC2, RDS, Batch, Lambda, S3), GCP, Azure
• Data Warehouses: AWS Redshift, Google BigQuery (GBQ), Databricks
• Databases: PostgreSQL, MySQL, Amazon RDS, NoSQL (MongoDB, DynamoDB)
• DevOps & Tools: Git, CI/CD, Apache Airflow, REST APIs
Certifications
• Google Cloud Certified – Professional Data Engineer (PDE)
• Google Cloud Certified – Cloud Data Practitioner
• Databricks Certified Data Engineer Associate
Professional Experience
Senior Analyst – Data Engineer Jan 2025 – Present
Accenture
• Built and orchestrated end-to-end ETL jobs leveraging Databricks and Apache Airflow for the scalable ingestion, transformation, and storage of high-volume healthcare datasets.
• Authored highly optimized, reusable PySpark code to calculate and aggregate various business metrics, applying core data structures to process complex workflows efficiently.
• Set up comprehensive data validation frameworks for ingested data, ensuring high fidelity and automating manual quality checks.
• Collaborate cross-functionally with Technical Architects, Product Owners, and Executives to support data infrastructure needs and resolve technical issues.
Data Engineer May 2023 – Dec 2024
YuktaMedia
• Built backend infrastructure for optimal extraction, transformation, and loading (ETL) of ad revenue data using SQL and AWS big data technologies (EMR, EC2, RDS).
• Developed and managed scalable data pipelines that integrate with cloud data warehouses such as AWS Redshift and with S3 data lakes.
• Created tools for data management and analytics workflows, effectively integrating NoSQL solutions (DynamoDB) for fast, flexible data retrieval.
• Communicated clearly across engineering and product teams to deliver optimized reporting workflows and automated billing solutions.
Data Engineer Feb 2022 – Apr 2023
Tata Consultancy Services (TCS)
• Designed and executed robust backend ETL processes to ingest millions of daily transactions, utilizing Python and PostgreSQL for structured storage.
• Built automated data delivery pipelines, working comfortably within relational and NoSQL database systems to support Customer 360 analytics.
• Engineered anomaly detection modules, built on sound backend development practices, to improve data quality and pipeline reliability.
Education
B.Tech – Electronics and Communication Engineering Aug 2020
Karunya University, Coimbatore, India
Languages
English (Fluent) — Telugu (Native) — Hindi (Conversational)