SRINIDHI ALLANKI
Langhorne, PA - ***** +1-484-***-**** ************@*****.*** Linkedin
SUMMARY
Data Engineer with 4+ years of experience building and optimizing cloud-based data pipelines and analytics platforms across consulting, energy, and infrastructure domains. Expertise in Databricks (PySpark, SQL, Delta Lake), ETL/ELT design, and data modeling on AWS and Azure. Skilled in Airflow, Kafka, and Terraform for workflow automation, with strong proficiency in Snowflake, Redshift, and Synapse. PROFESSIONAL EXPERIENCE
Data Engineer McKinsey & Company PA, USA March 2025 – Present
● Engineered and optimized batch/real-time data pipelines using Apache Airflow, Spark, Kafka, and Python, processing 1.5M+ daily records with sub-minute latency for analytics and operational reporting.
● Refactored SparkSQL and dbt workflows, reducing query and refresh times by 30%+ and boosting analytics performance.
● Unified fragmented datasets in Amazon Redshift, PostgreSQL, and Snowflake, implementing compression, partition pruning, and SQL tuning to reclaim terabytes of storage annually.
● Designed and developed AWS QuickSight dashboards for executive reporting, enabling interactive visualizations, ad-hoc analysis and automated data refreshes that improved decision-making across business units.
● Delivered production-ready datasets via RESTful APIs, speeding up ML model deployment by weeks. Data Engineer Dell Technologies Hyderabad, India May 2022 – December 2022
● Spearheaded the development of scalable ETL pipelines with Apache Spark, PySpark, and AWS Glue, processing 500K+ logs daily from S3 and MongoDB, saving 20+ hours weekly in manual prep.
● Built SQL transformations in Snowflake to harmonize telemetry and ticket datasets, powering root cause analysis and predictive analytics across 10+ customer-facing service lines.
● Automated metadata extraction and delivery into Amazon Redshift, integrating AWS Kinesis streams for real-time Power BI dashboards, enabling daily decision-making for technical support leads.
● Streamlined anomaly detection using PySpark and AWS Lambda/SNS to quickly address support ticket surges.
● Implemented schema versioning, data integrity checks, and CI/CD automation with dbt, ensuring consistency during weekly production pushes in Agile cycles.
● Provisioned Terraform-based staging environments for parallel workflow testing, optimizing cloud resources.
● Partnered with product managers, analysts, and customer success teams to deliver insights for executive reviews, reducing high-priority escalations by ~14%.
Data Engineer Tata Consultancy Services Hyderabad, India August 2019 – April 2022
● Automated infrastructure lifecycle management and patch deployments across RedHat 6/7/8 using Ansible, Shell scripting, and AWS CloudFormation, reducing manual effort by 40%.
● Architected reusable CloudFormation templates to enable scalable, reliable multi-region deployments.
● Tuned Linux systems to support high-throughput batch and streaming data pipelines, enhancing performance and efficiency.
● Managed and optimized AWS environments (EC2, S3, RDS, VPC) for 3,500+ Linux instances with 99% uptime.
● Developed AWS QuickSight dashboards using Redshift and RDS for real-time KPI insights.
● Designed and executed RDS-MySQL failover simulations to validate disaster recovery and ensure high availability.
● Implemented proactive monitoring with CloudWatch and Orion, cutting incident resolution time by 30%.
● Strengthened security by enforcing IAM policies, encrypted EBS volumes, and key rotation aligned with AWS best practices.
● Established IaC CI/CD with GitLab, enhancing deployment reliability and traceability.
● Partnered with network, security, DBA, and application teams to streamline infrastructure deployments and cloud migrations. Data Analyst Adani India February 2019 – August 2019
● Queried 2+ years, operational data, SQL (MySQL, PostgreSQL), anomalies, energy consumption patterns, industrial zones, early detection, overuse incidents.
● Developed automated Excel reports, PivotTables, VLOOKUP, weekly reporting efficiency (30%), site performance reviews.
● Created exploratory data visualizations, Power BI, power distribution KPIs, department heads, variance, 6 regional grids.
● Collaborated with engineers, sensor data, smart meters, Python (Pandas), datasets, validation, trend analysis.
● Documented Findings, recommendations, load analysis, optimization plan, 12% reduction, one facility. PROJECTS
Implementation of Secured Layer Over Gmail Communication
● Built a secure email communication layer by applying public-key cryptography to encrypt and decrypt message content, ensuring data confidentiality during Gmail-based exchanges.
● Implemented key generation and message handling logic using Python and cryptographic libraries, allowing only authorized recipients to access email content through private key validation. IoT-Based Smart Electronic Trolley for Supermarkets
● Developed an IoT-based smart trolley using RFID and microcontrollers to automate item tracking and billing, eliminating manual checkout.
● Integrated with a custom Android app and wireless modules to sync real-time billing data with store inventory and central dashboards. TECHNICAL SKILLS
Programming & Scripting: Python, Shell Scripting, Java, JavaScript, HTML, CSS Query & Data Languages: SQL (PostgreSQL, MySQL, SQL Server, PL/SQL, SparkSQL, HiveQL) Cloud Platforms: AWS (EC2, S3, RDS, VPC, IAM, CloudWatch, CloudTrail, Route53, CloudFormation) Data Engineering: Apache Airflow, Spark, Hadoop, dbt, Pipeline Automation (Batch & Stream) Data Warehousing: Snowflake, Redshift, Amazon RDS, SQL Server Databases: PostgreSQL, MySQL, MongoDB, DynamoDB
BI & Reporting: Power BI, Tableau, Excel (PivotTables, VLOOKUP), ServiceNow (reporting integration) DevOps & Automation: Git, GitHub, GitLab, CI/CD (GitLab CI, Jenkins), Docker, Ansible, Terraform (basic) APIs & Integration: RESTful APIs, Webhooks, Event-driven Architecture Monitoring & Observability: Prometheus, AWS CloudWatch, SolarWinds Orion Development Tools: Postman, VS Code, IntelliJ IDEA, Eclipse, PuTTY, Linux Command Line Software Engineering Practices: Clean code & documentation, Debugging & optimization, SDLC Data Architecture Concepts: OLAP Cubes, Distributed Data Processing EDUCATION
M.S. in Information Systems Technology Wilmington University DE, USA January 2023 – December 2024 B.Tech. in Electronics & Communication Engineering SNIST Hyderabad, India July 2018 – July 2022 CERTIFICATION
AWS Certified Solutions Architect – Associate Red Hat Certified System Administrator (RHCSA)