Parth Dodia Data Engineer
+1-551-***-**** ************@*****.***
linkedin.com/in/parthdodia98/ github.com/parthdodia parthdodia.com SUMMARY
Data Engineer with over 4 years of experience in designing scalable ETL pipelines, optimizing cloud architecture, and enhancing real- time data processing. Experienced in leveraging AWS, Spark, Python to streamline financial and analytical data workflows, improving decision-making and operational efficiency. Adept at implementing data governance frameworks, real-time monitoring systems, and multi-cloud architectures and devops to drive accuracy, reduce costs, and enhance performance across large-scale datasets. EXPERIENCE
Berkshire Hathaway USA
Data Engineer Sep 2024 – Present
• Collaborated with a team of 10+ (engineers, analysts, data scientists) to develop scalable ETL pipelines leveraging Kinesis, S3, Glue, and Spark to process 1M+ financial transactions monthly, enhancing data accessibility for business teams
• Developed real-time financial dashboards, integrating 30+ data sources using Spark, AWS QuickSight, and Airflow, enabling 50% faster executive decision-making and improving revenue forecasting accuracy by 5%
• Designed a data synchronization framework using AWS DMS, Kafka, and Lambda, ensuring real-time data consistency across 200+ systems, reducing financial discrepancies by 35%, and enhancing fraud detection efficiency by 25%
• Optimized CI/CD pipelines and performance tuning, implementing AWS CodePipeline, Docker, and Kubernetes, reducing deployment cycle time by 2 hours, and cutting financial report processing time by 30% Aplus Datalytics India
Data Engineer Jan 2019 – Aug 2022
• Engineered scalable machine learning pipelines using AWS SageMaker, processing over 5TB of data daily, improving predictive analytics accuracy by 30%, and driving a 15% increase in revenue through data-driven insights
• Established a robust data governance framework, enhancing data quality across 15+ databases and 10,000+ data files, reducing data inconsistencies by 40%, and cutting compliance-related costs by 20%
• Led a team of 3 analysts to streamline data lake architecture and automate ingestion processes, reducing query retrieval times and manual data entry errors by 40%, and saving 100+ operational hours per month accelerating reporting processes
• Constructed a real-time data monitoring system, integrating alert mechanisms and anomaly detection, which reduced data pipeline failures by 35% and improved incident resolution time by 20%
• Implemented a resilient multi-cloud data architecture, reducing disaster recovery time by 50%, and optimizing infrastructure costs by
$100K annually, increasing the efficiency of cloud resource utilization by 10% PROJECTS
CloudMart: AI-Driven Multi-Cloud E-Commerce Platform
• Developed and deployed a multi-cloud e-commerce platform using AWS, Google Cloud, and Azure, leveraging Terraform, Docker, and Kubernetes (Amazon EKS) for scalable infrastructure
• Orchestrated CI/CD pipelines (AWS CodePipeline & CodeBuild) for automated deployment and utilized AWS Lambda to integrate with Google Cloud BigQuery for data processing
• Integrated Amazon Bedrock (Claude Sonnet 3) for intelligent product recommendations, improving customer engagement, and OpenAI (GPT-4o) for AI-driven customer support, reducing response time
• Leveraged Google Cloud BigQuery to analyze order history and identify trends, used Azure Text Analytics for sentiment detection with 90% accuracy, helping enhance customer support quality SKILLS
Programming / Data Processing: Python, Scala, SQL, Apache Spark, Kafka, Airflow Cloud / Infrastructure: AWS, Git, Terraform, Docker, Kubernetes Database / Visualization: PostgreSQL, MongoDB, Snowflake, Power Bi, Tableau AI / Machine Learning: AWS Sagemaker, Bedrock, Scikit-Learn, Generative AI Data Engineering / Automation: Data Pipeline Orchestration, Real-time Data Processing, Performance Optimization, DevOps Certifications: IBM Data Engineering Professional (Link), MultiCloud DevOps and AI (Link) EDUCATION
Pace University, New York City, NY Master of Science in Information Systems May 2024 Mumbai University, Mumbai, India Bachelor of Technology in Electronics and Telecommunications May 2020