Siddhant Mukherjee
Data Engineer / GCP Engineer / ETL Developer
Google Cloud Data Engineer with nearly 3 years of experience building scalable data solutions. Specializes in Google Cloud Platform (BigQuery, Dataflow, Pub/Sub, Dataproc), with expertise in building pipelines, optimizing data warehousing, and enabling data-driven decision-making.
*********************@*****.*** +91-766******* Pune, India
WORK EXPERIENCE
Data Engineering, Management, and Governance Analyst
Accenture
12/2022 - Present, Pune, Maharashtra
Quality Champions Award from the client.
Best Performer Award from supervisor.
Best People Recognition from manager.
Obtained P4 (advanced proficiency in BigQuery).
PROJECTS
1. Cadbury Mondelez International - Inbound Damages (08/2023)
Developed and automated Talend ETL jobs for the Inbound Damages project, enabling seamless ingestion of incremental and historical data from SAP and SharePoint into Google Cloud BigQuery. Streamlined stored procedure execution within BigQuery using Talend orchestration, improving data pipeline efficiency and reliability.
2. Cadbury Mondelez International - Lighout Brazil Ingestion Project (2024)
Worked with the Brazil Data Ingestion team to implement and automate diverse ingestion strategies using Talend, DBeaver, and a custom framework. Ensured data accuracy and consistency by testing and deduplicating records in BigQuery.
3. Cadbury Mondelez International - Digital Procurement (2024 - Present)
Optimized BigQuery queries with partitioning, improving runtimes by 40%. Migrated legacy ETL/SQL processes to GCP and Talend, managing data in GCS (CSV, Parquet, JSON). Automated Talend jobs with DIAT approvals. Used GitHub for CI/CD and DBeaver for validation. Authored TDDs and runbooks, and conducted unit testing in dev/QA.
SKILLS
Programming: Python, SQL, Java
Databases: MySQL, PostgreSQL, SQL Server
NoSQL Databases: MongoDB
ETL & Data Pipelines: Apache Airflow, Talend
Cloud Platforms: Google Cloud Platform (BigQuery, Dataflow, Pub/Sub, Dataproc)
Data Warehousing: BigQuery
Data Lake & Lakehouse: Delta Lake, Apache Iceberg
DevOps & Infrastructure: CI/CD (GitHub Actions, Jenkins)
Data Quality & Governance: Lineage, cataloging, security, GDPR/PII compliance
Monitoring & Logging: Prometheus, Grafana, ELK stack
Business Understanding: Mapping data needs to business goals
CERTIFICATIONS
1. Google Cloud Associate Cloud Engineer
2. Google Cloud Digital Leader
ORGANIZATIONS
HighRadius (05/2021 - 06/2021)
Internship
LANGUAGES
English
Professional Working Proficiency
Hindi
Full Professional Proficiency
Bengali
Full Professional Proficiency