Post Job Free
Sign in

Data Engineer

Location:
Kansas City, MO
Posted:
September 11, 2025

Contact this candidate

Resume:

ARUN SUMANTH POLINENI

• Data Engineer • 816-***-**** • Kansas City, MO – 64111 • LinkedIn •***************@*****.*** PROFESSIONAL SUMMARY

Results-driven Data Engineer with 5+ years of experience in architecting and optimizing robust data pipelines across Azure and AWS, consistently delivering measurable improvements in processing efficiency, cost reduction, and data quality. Proven expertise in ADF, Databricks, Snowflake, Redshift, Python, and Power BI, with a track record of driving business insights through scalable ETL, CI/CD automation, and large-scale data processing solutions. EDUCATION

Master of Science, Computer Science - 3.63/4.0 GPA Jan 2023 – May 2024 University of Missouri - Kansas City, Kansas City, MO TECHNICAL SKILLS

• Programming Languages: Python, SQL, Apache Spark, PySpark, Spark SQL, DAX, Scala, Java

• Data Modeling and ETL: ETL Processes, Data Warehousing, Data Modeling, Apache NiFi, Informatica PowerCenter, SSIS, Apache Flink, Apache Druid, Apache Beam, Apache Airflow, Talend, Medallion Architecture

• Cloud Technologies: Microsoft Azure (Data Factory, Databricks, Synapse, Data Lake Storage, Logic Apps, Cosmos DB, ADLS, Azure Key Vault), AWS (S3, EC2, Redshift, Glue, Lambda, RDS), GCP Big Query, Microsoft Fabric

• Databases & Warehouses: MySQL, SQL Server, Azure DB, Postgres SQL, Mongo DB, Snowflake, AWS Redshift, Azure Synapse

• Big Data Technologies: Apache Spark, Apache Hadoop, Apache Kafka

• Machine Learning: Logistic Regression, Decision Trees, Random Forests, PyTorch, AWS SageMaker

• Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn, Seaborn, TensorFlow

• Devops & Infrastructure as Code (IaC): Azure DevOps, Jenkins, Kubernetes, Terraform

• Tools: SSMS, Power BI, Tableau, SAS, Visual Studio, Jupyter, Microsoft Word, Excel, Splunk

• Project Management Methodologies: Agile and Waterfall

• Other Technologies: Version control tools (Git & Github), Linux, Data Governance (Collibra DGC), UNIX, API and Web Services. WORK EXPERIENCE

Data Engineer Jan 2024 - Present

S&P Global, USA

● Designed and deployed scalable 40+ ETL pipelines using Python, PySpark, Azure Data Factory, SSIS, and Databricks, cutting data processing time by up to 30%.

● Implemented Medallion Architecture within Azure Databricks and Snowflake, ensuring high-quality, analytics- ready data layers.

● Orchestrated complex data workflows with Apache Airflow, enhancing pipeline reliability and automation and reducing manual intervention by 40%.

● Engineered secure, scalable data platforms on Azure Data Lake Storage, Synapse Analytics, and Snowflake, optimizing storage, query performance, and cost-efficiency by 35%.

● Processed large-scale datasets with PySpark, improving performance for high-volume data workloads.

● Conducted advanced data analysis using Python (Pandas, NumPy) and SQL, yielding actionable insights for strategic decisions.

● Developed dynamic Power BI dashboards integrated with Snowflake and Azure, boosting real-time business intelligence capabilities.

● Implemented robust data validation frameworks, increasing pipeline reliability and data accuracy by over 20%.

● Built reusable ETL components for ingesting data (APIs, files, databases) into Snowflake and Azure Data Lake, reducing new pipeline development time by 30%.

● Worked in the finance domain, supporting market intelligence and analytics platforms with secure, scalable data pipelines to power financial insights.

Data Engineer Mar 2021 - Dec 2022

Accenture Solutions, Hyderabad, India

● Developed ETL pipelines using AWS Glue, Informatica PowerCenter, and PySpark, enhancing data integration and processing efficiency by 25%.

● Optimized real-time streaming pipelines with Kafka and Spark Streaming, reducing latency and accelerating high- frequency data processing by over 25%.

● Managed data storage solutions using MongoDB, Amazon Redshift, and Snowflake, reducing compute costs by 20% and improving analytics performance.

● Automated scalable data workflows via Apache Airflow and Spark, reducing AutoML model training time by 30%.

● Engineered cloud-native data platforms leveraging AWS services (EC2, S3, Lambda, Glue, Redshift, Snowflake) for secure, scalable analytics.

● Provisioned and managed AWS data environments using Terraform, enabling repeatable and secure infrastructure deployment.

● Developed data solutions for clients in retail and supply chain sectors, integrating ML models for demand forecasting and anomaly detection.

● Automated CI/CD pipelines with AWS CodePipeline and CodeBuild, streamlining continuous integration, testing, and deployment.

● Containerized data processing applications using Docker and deployed them via Kubernetes clusters for scalable, portable execution environments.

Azure Data Engineer Mar 2019 - Mar 2021

Cashify, Hyderabad, India

● Designed and deployed 50+ parameterized Azure Data Factory (ADF) pipelines for automated ETL, improving efficiency by 30%.

● Implemented ETL in Azure Databricks using Spark and PySpark, boosting performance by 25%.

● Monitored pipeline executions, resolving discrepancies and ensuring robust data processing.

● Built automated workflows in ADF and Databricks with Logic Apps and Functions for alerts and logging, speeding up issue resolution by 50%.

● Established CI/CD pipelines in Azure DevOps, reducing ADF component deployment time across environments by 50%.

● Performed SQL analysis in Snowflake, uncovering insights to support data-driven decisions.

● Collaborated with stakeholders to align data solutions with business needs, generating ad-hoc reports using Snowflake and Power BI.

● Implemented data governance and compliance frameworks (GDPR, CCPA) using Collibra and data classification tools to enforce policies and maintain audit readiness. CERTIFICATIONS

● Microsoft Certified Azure Fundamentals (AZ-900)

● Microsoft Certified Azure Data Fundamentals (DP-900)

● Microsoft Certified Fabric Data Engineer (DP-700) ACHIEVEMENTS

● Awarded “Rookie of the Month” for outstanding contributions to pipeline automation and performance optimization.

● Recognized with “Monthly Grammy Award” for exceptional teamwork and successful project deliveries.



Contact this candidate