Post Job Free
Sign in

Data Science Engineering

Location:
Denton, TX, 76201
Salary:
110000
Posted:
October 15, 2025

Contact this candidate

Resume:

SAI NAGA MANISH JUTURU

940-***-**** *********@*****.*** LinkedIn

PROFESSIONAL SUMMARY

Data professional with over six years of experience spanning data science, data engineering, and analytics, delivering measurable impact in telecommunications, healthcare, and marketing domains. Skilled at transforming raw data into actionable insights, building scalable data pipelines, and applying AI/ML techniques to solve complex business problems. Proficient in Python, SQL, GCP, Snowflake, and big data tools, with a proven record of improving decision making, operational efficiency, and model performance in enterprise environments.

TECHNICAL SKILLS

Programming & Scripting: Python, SQL, R, Java, Bash, Scala, Shell Scripting

Data Engineering & ETL: Snowflake, BigQuery, Apache Airflow, Kafka, Spark, Hadoop, Dataproc, Dataflow, Informatica, HVR, Fivetran, Airbyte, Qlik, Apache NiFi

Databases: MySQL, PostgreSQL, Oracle, MongoDB, Cassandra, Redis

ML/AI & Analytics: TensorFlow, Scikit learn, PyTorch, Vertex AI, XGBoost, NLP, OpenCV, Keras, Pandas, NumPy, Statsmodels, Prophet

Visualization & BI Tools: Tableau, Power BI, Looker, Matplotlib, Seaborn, Plotly, D3.js, SSRS

Cloud Platforms: Google Cloud Platform (GCP), AWS (EC2, S3, RDS, Lambda), Azure (Data Factory, Synapse)

Big Data & Storage: Hive, HDFS, Parquet, Avro, Delta Lake

DevOps & CI/CD: Git, GitHub, GitLab CI/CD, Jenkins, Docker, Kubernetes, Terraform

Other Tools: Jupyter Notebook, VS Code, PyCharm, Postman, REST APIs, JSON, YAML

PROFESSIONAL EXPERIENCE

Data Scientist Comcast May 2023 – Present

Designed and maintained large scale data pipelines in Snowflake and GCP BigQuery to process and analyze subscriber behavior, network performance, and streaming content metrics for business insights.

Developed predictive analytics models to optimize content recommendations, improving viewer engagement by 18% and reducing churn rates in targeted customer segments.

Built interactive Tableau and Looker dashboards tracking network usage, service reliability, and customer satisfaction, enabling leadership to make data driven operational decisions.

Applied NLP techniques to analyze customer feedback from multiple channels, identifying key drivers of dissatisfaction and guiding service improvement initiatives.

Collaborated with engineering and product teams to deploy machine learning models into production using Vertex AI, ensuring scalability and reliability for millions of users.

Designed and deployed deep learning models to detect anomalies in network performance data, reducing outage detection time by 40%.

Fine tuned transformer based language models for automated classification and sentiment analysis of customer service transcripts, improving accuracy by 22% compared to baseline models.

Leveraged Apache NiFi and Airbyte for automated ingestion of real time network logs and customer interaction data into analytics pipelines, reducing manual integration time by 45%.

Data Engineer S&S Health Jul 2022 – May 2023

Designed and implemented robust ETL pipelines using Apache Airflow, Spark, Snowflake, and Apache NiFi to integrate healthcare claims, pharmacy, and patient data into a centralized repository.

Migrated on premises claims data warehouse to GCP using HVR and Fivetran for secure and high volume replication, reducing infrastructure costs by 25% and improving query performance by 40%.

Created optimized SQL schemas to handle high volume structured and semi structured medical data, ensuring faster data retrieval and scalability for analytics workloads.

Developed automated data validation and anomaly detection scripts to identify potential billing errors and fraudulent claims, reducing manual review time by 30%.

Partnered with data science teams to build predictive models for patient risk scoring, achieving an 87% accuracy rate in identifying high risk cases.

Data Engineer Dentsu May 2019 – Dec 2021

Engineered and maintained real time streaming and batch data pipelines using Apache Airflow, Kafka, Spark, and Fivetran, processing 5TB+ of marketing data daily for analytics and campaign optimization.

Integrated diverse marketing and CRM datasets into Snowflake and GCP BigQuery, creating a centralized, analytics ready repository that provided a unified customer view for stakeholders.

Optimized SQL queries, indexing, and partitioning strategies, reducing query execution times by over 60% and delivering an annual compute cost saving of $80K.

Automated ETL workflows through Airflow DAGs, Apache NiFi, and HVR, reducing manual intervention by 95%, minimizing operational bottlenecks, and increasing delivery speed for analytics teams.

Collaborated with data science teams to generate feature rich datasets for advanced predictive modeling, improving ad targeting accuracy by 18% and campaign ROI.

Data Analyst Deloitte Apr 2018 – Apr 2019

Performed comprehensive financial data analysis for compliance and regulatory projects, increasing audit accuracy by 12% and improving the efficiency of risk detection.

Developed SQL based data marts to consolidate and streamline compliance data, reducing reporting turnaround from three days to four hours and increasing audit readiness.

Designed and automated interactive Power BI dashboards that allowed senior auditors to track compliance metrics in real time, eliminating the need for manual reporting cycles.

Partnered with cross functional teams to define key performance indicators (KPIs) and establish standardized reporting frameworks, ensuring consistent monitoring across all compliance projects.

Built data validation and anomaly detection scripts to identify inconsistencies in transactional data, improving accuracy by 15% and reducing manual data cleaning efforts.

EDUCATION

Master of Science in Information Systems and Technologies – University of North Texas

Bachelor of Computer Science and Engineering – GITAM University

CERTIFICATIONS

Google Cloud Certified – Associate Cloud Engineer

Snowflake SnowPro Core Certification

Google Cloud Professional Data Engineer Certification

PROJECTS

Fraud Detection with Vertex AI – Built and deployed a machine learning pipeline on GCP to detect fraudulent credit card transactions in real time with 92% precision.

Healthcare Data Warehouse Migration – Led the migration of claims data from on premise SQL Server to Snowflake on GCP, reducing query time by 45%.

Customer Churn Prediction – Created a churn prediction model using Python and XGBoost for a telecom client, increasing campaign ROI by 20%.



Contact this candidate