Data Engineer Machine Learning

Location:
Hyderabad, Telangana, India
Posted:
September 10, 2025

Resume:

Sri Mounika Jammalamadaka

San Jose, CA **********.***@*****.*** 408-***-**** linkedin.com/in/srimounikaj/

PROFESSIONAL SUMMARY

• Data professional with 5+ years of experience in data engineering, analysis, and product support.

• Expertise in Python, SQL, big data tools (Spark, Hadoop), and cloud platforms (AWS, Azure) for scalable data solutions.

• Proven ability to design ETL pipelines, optimize data workflows, and automate reporting for business intelligence.

• Strong analytical skills with experience in predictive modeling, statistical analysis, and data visualization (Tableau, Power BI).

• Collaborated with cross-functional teams to translate business requirements into data-driven solutions.

• Hands-on experience with machine learning frameworks (Scikit-learn, TensorFlow) and deploying models into production.

• Proficient in data warehousing (Snowflake, Redshift) and database management (PostgreSQL, MySQL).

• Improved data processing efficiency by 30%+ through query optimization and pipeline enhancements.

• Adept at Agile methodologies, version control (Git), and CI/CD practices for data projects.

• Certified in [relevant certifications, e.g., AWS Data Analytics, Google Data Engineer].

• Strong communication skills with experience presenting insights to stakeholders and executives.

TECHNICAL SKILLS

• Programming Languages: Python, SQL, R, Scala

• Big Data Tools: Apache Spark, Hadoop, Kafka, Hive

• Cloud Platforms: AWS (S3, Glue, Lambda), Azure (Data Factory, Synapse)

• Data Warehousing: Snowflake, Redshift, BigQuery

• Databases: PostgreSQL, MySQL, MongoDB

• ETL/Data Pipelines: Airflow, Talend, Informatica

• Machine Learning: Scikit-learn, TensorFlow, PyTorch

• Data Visualization: Tableau, Power BI, Matplotlib, Seaborn

• Version Control: Git, GitHub, Bitbucket

• Agile & DevOps: Jira, Docker, Kubernetes

• Statistical Analysis: Hypothesis testing, regression, A/B testing

• Other Tools: Pandas, NumPy, PySpark, Databricks

WORK EXPERIENCE

Dewberry, Data Engineer, Fairfax, VA, Mar 2024 – Present

Responsibilities:

• Designed and deployed scalable ETL pipelines using PySpark and AWS Glue to process 10TB+ of IoT sensor data, improving processing speed by 40%.

• Automated data ingestion from AWS S3, APIs, and on-prem databases into Snowflake, reducing manual effort by 25 hours/month.

• Built real-time data quality checks (e.g., outlier detection, null validation) using Python and Great Expectations, ensuring 99.8% accuracy.

• Developed Tableau dashboards for sustainability metrics, enabling clients to track CO2 emissions and energy usage trends.

• Optimized complex SQL queries for geospatial datasets, reducing runtime from 2 hours to 30 minutes.

• Collaborated with DevOps to containerize data pipelines using Docker and schedule workflows via Airflow.

• Integrated machine learning models (Scikit-learn) into pipelines to predict infrastructure maintenance needs.

• Migrated legacy SQL Server databases to Snowflake, improving query performance by 60%.

• Documented data lineage and metadata in Collibra for compliance with GDPR and CCPA regulations.

• Trained 15+ analysts on Snowflake best practices and self-service analytics tools.

• Partnered with software engineers to design REST APIs for internal data access.

• Reduced cloud costs by 20% through right-sizing AWS resources and implementing auto-scaling.

Environment: Python, SQL, PySpark, AWS (S3, Glue, Lambda), Snowflake, Tableau, Airflow, Docker, Scikit-learn

Nemetschek India, Technical Consultant, India, Feb 2019 – Jun 2023

Responsibilities:

• Resolved 500+ Tier-2/3 tickets for BIM software clients, addressing data integration and API issues.

• Developed Python scripts to automate data validation, reducing error rates by 25%.

• Analyzed Elasticsearch logs to debug performance bottlenecks, decreasing system downtime by 35%.

• Created Power BI reports for customer usage patterns, influencing product roadmap decisions.

• Collaborated with data engineers to design SQL-based alerts for anomalous user behavior.

• Standardized JIRA workflows for bug tracking, improving resolution time by 50%.

• Conducted 50+ training sessions for clients on data extraction tools (e.g., Revit API, Forge).

• Reverse-engineered legacy VBA macros, migrating them to Python for better maintainability.

• Prototyped an NLP chatbot (TensorFlow) to handle repetitive customer queries.

• Spearheaded documentation overhaul, reducing new hire onboarding time by 4 weeks.

• Assisted QA team in writing PyTest scripts for data integrity validation.

• Proposed schema changes to optimize PostgreSQL databases, improving query speed by 40%.

Environment: Python, SQL, Elasticsearch, Power BI, JIRA, PostgreSQL, REST APIs, TensorFlow

EDUCATION

• Master’s in Data Analytics at San Jose State University, CA

• Bachelor’s in Computer Science at GITAM University, India


