Post Job Free
Sign in

Senior Data Scientist with Generative AI & MLOps Expertise

Location:
Overland, MO
Salary:
80000
Posted:
March 09, 2026

Contact this candidate

Resume:

Sravan Kumar Uppoju

Data Scientist ML Engineer Generative AI

+1-636-***-**** ******************@*****.*** St Louis, MO LinkedIn: linkedin.com/in/sravan-kumar-uppoju-6443a9183/ Github: github.com/sravankumaruppoju PROFESSIONAL SUMMARY

Data Scientist and AI/ML Engineer with 5+ years of experience designing and deploying production-ready machine learning and generative AI solutions across healthcare, banking, retail, and service domains. Experienced across the full machine learning lifecycle including data ingestion, exploratory data analysis, feature engineering, model training, evaluation, deployment, and monitoring. Proficient in Python, SQL, Spark, and cloud platforms (AWS, Azure, GCP) with hands-on experience building predictive models, recommendation systems, time-series forecasting models, and LLM-powered applications such as Retrieval-Augmented Generation

(RAG) systems. Skilled in implementing scalable ML pipelines, experimentation frameworks, and responsible AI practices including model explainability and bias detection.

SKILLS

Programming: Python, SQL, R

ML and AI: Supervised Learning, Unsupervised Learning, Deep Learning, NLP, Model Evaluation, Predictive Modeling, Regression, Classification, Feature Selection, Statistical Analysis, Hyperparameter Tuning, A/B Testing, Recommendation Systems, Churn Prediction, Feature stores

Time Series: ARIMA, Prophet, LSTM

Generative AI: GPT-4, BERT, LLaMA, Retrieval-Augmented Generation (RAG), Prompt Engineering, LangChain, LangGraph, LLM Fine-Tuning, Vector Databases, FAISS, Chroma, Embeddings, Multi-Agent Systems, Knowledge Graphs

Frameworks and Libraries: NumPy, Pandas, Scikit-learn, TensorFlow, PyTorch, Keras, Hugging Face, SciPy

Big Data and Data Engineering: Apache Spark, Hadoop, Hive, Kafka, Airflow, ETL Pipelines, Data Modeling, Data validation

Databases: SQL Server, MySQL, Oracle, MongoDB, Snowflake, Neo4j

Cloud: AWS EC2, S3, Lambda, IAM, Azure Data Factory, Data Lake, Databricks, GCP Vertex AI, BigQuery

MLOps and DevOps: MLflow, Experiment Tracking, Model Versioning, Model Registry, Docker, CI/CD, Model Deployment, REST APIs, Model Monitoring

Visualization and BI: Power BI, Tableau, Advanced Excel, KPI dashboards

Methodologies: Agile, SDLC, Git, GitHub, Jupyter Notebook

Responsible AI and Compliance: SHAP, AIF360, Fairlearn, HIPAA Generative AI Projects:

Enterprise Knowledge Assistant (LLM + RAG System)

Technologies: Python, LangChain, OpenAI (GPT-4/ChatOpenAI), Chroma Vector DB, Embeddings, Streamlit, Prompt Engineering

Developed an end-to-end Retrieval-Augmented Generation (RAG) platform enabling users to query enterprise PDF documents using natural language.

Built document ingestion pipeline including PDF parsing, text chunking, and embedding generation for unstructured data.

Indexed embeddings in a Chroma vector database and implemented semantic similarity search for contextual document retrieval.

Integrated GPT-4 via LangChain to generate context-aware responses grounded in retrieved documents, reducing hallucinations.

Designed an interactive Streamlit chat interface supporting document upload and conversational Q&A.

Deployed the application on Streamlit Cloud and optimized performance using caching and session-based processing. Live App: https://enterprise-rag-assistant-ai.streamlit.app EXPERIENCE

Senior Data Scientist Anheuser-Busch NYC, NY Aug 2024 – Present

Built churn prediction and recommendation models (XGBoost, Random Forest), improving customer retention by 12% and engagement metrics.

Implemented A/B Testing frameworks for ranking and targeting models, improving response relevance by 23%.

Designed and maintained time-series forecasting models (ARIMA, Prophet, LSTM) on Azure Databricks with MLflow and Grafana, achieving 87% accuracy.

Performed exploratory data analysis (EDA) and feature engineering on large behavioral and transactional datasets.

Evaluated model performance using ROC-AUC, precision, recall, confusion matrix, and cross-validation techniques to ensure model robustness.

Developed scalable feature engineering pipelines and feature store integrations using PySpark and Spark SQL to support large-scale machine learning training datasets.

Developed enterprise Power BI dashboards with DAX and RLS, integrating Generative AI generated insights and reducing report refresh time by 40%.

Developed end-to-end MLOps pipelines across AWS, Azure, GCP (Vertex AI, BigQuery) with CI/CD automation using Jenkins and Tekton, reducing release cycles by 30%.

Monitored production model performance, tracked prediction drift, and scheduled periodic retraining to maintain model Sravan Kumar Uppoju

Data Scientist ML Engineer Generative AI

+1-636-***-**** ******************@*****.*** St Louis, MO LinkedIn: linkedin.com/in/sravan-kumar-uppoju-6443a9183/ Github: github.com/sravankumaruppoju accuracy and reliability.

Integrated REST APIs to deploy ML and LLM-powered services, enabling secure enterprise access to OpenAI and Hugging Face models.

Developed LLM-based RAG pipelines using GPT-4, LangChain, and FAISS for enterprise document search and summarization bots.

Implemented Agentic AI workflows with LangGraph and MCP, enabling memory-augmented, multi-agent conversational systems.

Designed graph-based AI solutions using Neo4j and knowledge graphs for entity linking and semantic search. Data Scientist NantHealth India May 2022 – May 2023

Designed ETL and NLP pipelines for healthcare claims, eligibility, and clinical data, handling structured and unstructured datasets.

Developed HIPAA compliant workflows, redacting sensitive PII and integrating healthcare NLP models via secure APIs like FHIR, HL7, X12.

Built LLM-powered RAG pipelines and multi-agent AI workflows for context-aware healthcare document retrieval and claims automation.

Applied predictive modeling, statistical analysis, and ML and NLP techniques to improve operational efficiency and accuracy.

Developed interactive Tableau dashboards to visualize claims and operational KPIs, accelerating stakeholder decision- making.

Explored Generative AI models Claude, LLaMA for document summarization and internal AI agents.

Implemented Explainable AI (XAI) techniques including SHAP and Fairlearn for model explainability and bias detection.

Collaborated cross-functionally to deliver analytics solutions that improved process efficiency and supported strategic business decisions.

Data Engineer Bank of America India Oct 2019 – Mar 2022

Built and optimized SQL-based ETL pipelines and reporting workflows for multi-platform financial data.

Developed Power BI and Tableau dashboards for internal teams, implementing RLS and usage tracking.

Conducted customer segmentation, CLV analysis, and profitability modeling, helping optimize client strategies.

Provided L1 and L2 production support and resolved data pipeline and BI report issues, improving SLA compliance.

Automated KPI monitoring and reporting workflows to reduce manual effort by 30%.

Collaborated with finance, marketing, and operations teams to translate complex datasets into actionable insights.

Developed ad-hoc analytics and trend analysis reports for internal and client-facing stakeholders. Junior Data Analyst Avon Technologies India Apr 2019 – Oct 2019

Built analytics pipelines for clinical and product data using Python, Hive, Hadoop, and Spark SQL.

Developed predictive models using Python and R to analyze product adoption, market trends, and operational efficiency.

Designed dashboards in Power BI and Tableau, enabling real-time visualization of KPIs and business metrics.

Processed large-scale datasets 1TB or more, performing aggregation, transformation, and cleaning for analysis.

Automated data ingestion and preprocessing workflows, reducing manual effort and errors.

Conducted exploratory and statistical analysis to identify trends and insights for marketing and product teams. EDUCATION

Master of Science in Computer and Information Sciences, Saint Louis University, St Louis, MO, May 2025 Bachelor of Technology (B.Tech) - Computer Science Engineering Jawaharlal Nehru Technological University Hyderabad, India, 2019

CERTIFICATIONS

Microsoft Certified: Azure Data Engineer Associate

IBM Data Science Professional Certificate

Microsoft Certified: Power BI Data Analyst Associate

AWS Certified Cloud Practitioner



Contact this candidate