SREEJA BALUSUPATI
+1-312-***-**** ********************@*****.*** linkedin.com/in/sreeja-balusupati
Summary
Results-driven Data Engineer with 2+ years of experience designing, building, and optimizing large-scale data pipelines, cloud architectures, and analytics platforms. Skilled in SQL, Python, Spark, Databricks, Power BI, and AWS/Azure with proven success in ETL development, data modeling, and workflow automation. Adept at collaborating with cross-functional teams to solve complex data problems, enhance governance, and deliver secure, scalable, and cost-optimized data solutions. Strong foundation in cloud data engineering, infrastructure-as-code (IaC), and real-time analytics integration.
Education
Bhoj Reddy Engineering College for Women, Hyderabad, India Aug 2019 – Jul 2023
Bachelor of Technology in Electronics and Communication Engineering CGPA: 8.19
Webster University, San Antonio, USA Aug 2023 – Jul 2025
Master's in Information Technology and Management CGPA: 3.7
Experience
Honeywell Aug 2023 – Present
Data Engineer Houston, USA
• Designed, developed, and optimized ETL pipelines using Python, SQL, and Spark to process 10M+ records across supply chain and trading data.
• Deployed data workflows on Databricks and AWS Redshift, enabling scalable analytics and reducing reporting cycle times by 25%.
• Implemented data quality checks, monitoring, and validation frameworks ensuring accuracy and timeliness of high-volume data feeds.
• Partnered with product managers, data scientists, and IT teams to integrate predictive models into production pipelines.
• Automated cloud infrastructure provisioning (S3, Lambda, Redshift) using Terraform and AWS CloudFormation.
• Enhanced governance by defining data access policies, IAM roles, and encryption standards to meet compliance requirements.
Deloitte Jan 2023 – Jul 2023
Data Engineer Intern Bangalore, India
• Conducted SQL- and Excel-based analyses to identify inefficiencies in logistics and vendor performance.
• Supported the design of BI dashboards and KPIs (retains, runouts, benchmarking) for client supply chain operations.
• Collaborated with stakeholders to align data visualization with business and compliance requirements.
• Supported the design of data ingestion and transformation workflows with SQL and Python for logistics and supply chain clients.
• Assisted in building data marts and reporting layers for KPIs such as inventory turnover and vendor benchmarking.
• Participated in agile sprints (JIRA/Confluence), contributing to scalable BI and analytics platform enhancements.
Projects
Retail Sales & Inventory Data Pipeline Python, SQL Server, Azure Data Factory, Power BI
• Developed ETL pipelines to ingest and transform retail sales & inventory data into an enterprise data lake.
• Built DAX-powered Power BI dashboards for real-time KPI monitoring.
• Result: Reduced stockouts by 10% and improved demand forecasting.
Delivery Route Optimization with AI-Enhanced Insights Python, Pandas, GeoPandas, Matplotlib, Databricks
• Analyzed logistics and delivery data within Databricks to identify high-delay zones.
• Built and deployed Python-based optimization scripts to model efficient routing strategies.
• Integrated geospatial analysis using GeoPandas with visual outputs in Matplotlib.
• Result: Suggested zoning changes reduced delivery time by 18%, improving SLA compliance.
Customer Churn Prediction with Machine Learning Python, Scikit-learn, Excel, Azure Synapse
• Built logistic regression and decision tree models to predict churn probability.
• Designed a data pipeline pulling data into Azure Synapse for large-scale processing.
• Delivered retention-focused recommendations to business stakeholders.
• Result: Achieved 82% accuracy and reduced churn through proactive outreach strategies.
Technical Skills
• Programming & Query: SQL, T-SQL, DAX, Python, Java, Scala
• Data Engineering & ETL: Azure Data Factory, Databricks, Spark, Delta Lake, Snowflake, Synapse, Kafka
• Databases & Warehousing: SQL Server, MySQL, PostgreSQL, BigQuery
• BI & Analytics: Power BI (DAX), Tableau, Scikit-learn (ML models, feature engineering, evaluation)
• DevOps & Collaboration: Azure DevOps (CI/CD), GitHub, JIRA, Confluence, MS Teams
• Core & Soft Skills: Data validation, pipeline debugging, anomaly detection, problem-solving, adaptability, stakeholder communication, teamwork, documentation, agile delivery
Certifications
• Power BI Data Analyst Associate – Microsoft
• SQL (Advanced) – HackerRank Skill Certification
Involvement and Achievements
• Participated in technical seminars focused on AI, IoT, and Data Analytics
• Certified for participation in Embedded Workshop – Embedded Solutions, Hyderabad
• Active contributor to university data science club projects and hackathons