Sai Sarvani Tondepu
DATA ENGINEER
**********@*****.*** 503-***-**** www.linkedin.com/in/sai-sarvani-tondepu
PROFESSIONAL SUMMARY
Data Engineer with 5+ years of experience designing, implementing, and optimizing data architectures. Skilled in data pipeline development, ETL processes, and database management, with a proven ability to leverage big data technologies to improve data accessibility, processing speed, and reporting accuracy. Experienced in building scalable data solutions to support real-time analytics and reporting. Strong expertise in SQL, Python, Spark, and AWS, with a solid foundation in data warehousing and data modeling.
TECHNICAL SKILLS
• Programming Languages: Python, SQL, C++
• Big Data Technologies: Apache Spark, Hadoop, Hive
• ETL Tools: Informatica, Talend
• Cloud Platforms: AWS (Redshift, S3, EMR), Azure
• Databases: MySQL, PostgreSQL, MongoDB, Cassandra
• Data Warehousing: Snowflake, Redshift, BigQuery
• Data Modeling: Star Schema, Snowflake Schema, Data Vault
• Reporting Tools: Tableau, Power BI
• Others: Docker, Kubernetes, Airflow, Git
EXPERIENCE
January 2024 – Present
Data Engineer, MCG Healthcare, Seattle, USA
• Developed and maintained robust ETL (Extract, Transform, Load) pipelines to ingest, transform, and load large-scale healthcare data from multiple sources, ensuring data accuracy and integrity.
• Designed and optimized databases and data warehouses for efficient data storage, processing, and retrieval, enabling advanced analytics and insights for healthcare operations and clinical decision-making.
• Worked closely with cross-functional teams, including data scientists, healthcare analysts, and clinical professionals, to understand data needs and deliver insights supporting healthcare quality, cost-effectiveness, and patient outcomes.
• Implemented data governance standards and data quality checks to ensure compliance with HIPAA and other healthcare data regulations, safeguarding patient privacy and data security.
• Automated data integration and workflow processes to streamline data availability and reduce time-to-insight for healthcare stakeholders.
• Developed and maintained data dictionaries and metadata management practices to ensure consistency and transparency across healthcare data systems.
• Created data visualizations and reports to support clinical and operational stakeholders in understanding trends, patterns, and potential areas for improvement in healthcare services.
• Troubleshot and optimized data workflows to reduce processing times, improve data accuracy, and increase operational efficiency across healthcare data environments.
October 2022 – December 2023
Data Engineer, Asian Paints, India
• Data Pipeline Development: Built, tested, and optimized end-to-end data pipelines, ensuring seamless data flow between internal systems and analytical platforms.
• Data Collection and Integration: Integrated data from multiple sources, including ERP systems, CRM tools, and external APIs, to create centralized data repositories for reporting and analytics.
• ETL Process Management: Designed and managed ETL (Extract, Transform, Load) processes, ensuring data integrity and compliance with company standards.
• Database Management: Administered and maintained databases, including performance tuning, query optimization, and backup management.
• Data Quality Assurance: Conducted data validation and troubleshooting to ensure accuracy, consistency, and completeness in processed datasets.
• Collaboration with Data Analysts and Stakeholders: Worked closely with cross-functional teams, including data analysts and product managers, to understand data needs and deliver relevant insights.
• Automating Data Workflows: Developed scripts and automation tools to streamline repetitive tasks, improving efficiency and reducing the margin of error in data processing.
• Documentation and Reporting: Maintained thorough documentation for data processes, data models, and technical procedures, supporting transparency and knowledge sharing across the team.
July 2020 – October 2022
Junior Data Engineer, Suneratech, India
• Data Pipeline Development: Assisted in the design, development, and maintenance of scalable data pipelines for processing and integrating large volumes of structured and unstructured data from various sources.
• Data Cleansing and Transformation: Collaborated with senior data engineers to clean, transform, and enrich raw data, ensuring accuracy and consistency for downstream analysis and reporting.
• Database Management: Supported database management tasks, including querying, optimization, and schema design for both relational and NoSQL databases to ensure high performance and reliability.
• Automation and Monitoring: Developed and maintained automated scripts to monitor data workflows, ensuring data quality and the timely processing of datasets.
• Collaboration with Cross-Functional Teams: Worked closely with data scientists, analysts, and other engineers to understand data requirements and contribute to the creation of data-driven solutions for business needs.
EDUCATION
August 2015 – May 2020
Bachelor’s in Architecture, GITAM University, India
Diploma in Data Science
SKILLS
• Project Management
• Data Analysis
• Communication
• Organization
• Problem-Solving
• Time Management
• Data Modeling
• Collaboration
• Critical Thinking
• Efficiency
• Detail-Oriented
• Adaptability