Post Job Free
Sign in

Data Analyst Science

Location:
Austin, TX
Posted:
September 10, 2025

Contact this candidate

Resume:

Riyanshi Bohra

352-***-**** **************@*****.*** Portfolio LinkedIn GitHub Austin, TX (Open to relocation) Education

University of Arizona Tucson, AZ

Master of Science (M.S.) in Data Science; GPA: 3.78/4.0 Aug 2023 – May 2025 Coursework: Machine Learning (ML), Applied Natural Language Processing (NLP), Data Mining & Warehousing Manipal University Jaipur Jaipur, India

Bachelor of Technology (B.Tech) in Information Technology; GPA: 3.72/4.0 Jul 2019 – May 2023 Coursework: Relational Databases, Data Structures & Algorithms, Operating Systems, Probability & Statistics Professional Experience

Data Analyst (Product & Data Associate) Jul 2025 – Present UGenome AI Austin, TX

• Automated genomics data workflows to process high-throughput sequencing data, reducing clinical reporting cycles from 3-4 weeks to under 2 days for AI-driven diagnostics

• Designed QA validation scripts to validate multi-source datasets (500K+ records per batch), resolving over 1000 data inconsistencies and ensuring clean, production-ready inputs for 3 ML workflows

• Integrated validated datasets into ProPEx AI pipelines, collaborating with AI teams to cut preprocessing time by 40% and deliver pharmacogenomic insights across 12 initial clinical deployments Data Analyst II (Research Professional) Jan 2024 – May 2025 Zuckerman College of Public Health, University of Arizona Tucson, AZ

• Built Python and SQL pipelines to analyze 1B public health records across 15 years, enabling automated dashboard updates for 3 statewide policy reports impacting ~2M students

• Applied logistic regression and time-based modeling in R across 1,200 school sites, uncovering equity gaps and informing a $625K reallocation strategy

• Designed and maintained Tableau dashboards and interactive reports tracking 30+ KPIs across 200 districts, enhancing cross-functional visibility and supporting ongoing health equity evaluations Data Science Intern (Applied ML Engineer) Jul 2022 – Sep 2022 PwC (PricewaterhouseCoopers) Mumbai, India

• Engineered ETL pipelines using Python, SQL, and Apache Spark to ingest and clean 15,000+ pharmaceutical manufacturing records/month, improving data reproducibility and reducing cycle time by 45%

• Analyzed 7+ years of production data to forecast yield efficiency, increasing throughput by 20% (from 400K to 480K units) and driving $50,000 in annual cost savings

• Designed 5 Power BI dashboards with embedded A/B testing metrics to track KPIs across 7 production lines, reducing manual workload by 350+ hours/week

Technical Skills

Languages: Python (Pandas, NumPy, Scikit-learn, Matplotlib), SQL (MySQL, Oracle SQL), R, JavaScript Frameworks & Libraries: Flask, FastAPI, React, Next.js, TensorFlow, PyTorch, SpaCy, Hugging Face Transformers Data & BI Tools: Tableau, Power BI, Looker, Microsoft Excel (Pivot Tables, VBA), A/B Testing (SciPy, Statsmodels) Developer Tools & Cloud: Git/GitHub, VS Code, Jupyter Notebook, Cursor, RStudio, AWS, Azure Generative AI: LangChain, LangGraph, VectorDB (Pinecone, Deep Lake), Retrieval-Augmented Generation (RAG) Project Experience

SafeDrive AI: Real-time Distracted Driving Detection Python, TensorFlow, Computer Vision GitHub

• Created a real-time analytics pipeline to extract distraction patterns from 50,000+ dashcam images, achieving 92% accuracy and enabling daily trend reports on driving behavior across 200 drivers

• Delivered behavior summary dashboards for a 3-month pilot using edge computing outputs, reducing manual tagging needs by 30% and informing intervention strategies with location-based charts and risk metrics Metropolitan Climate Profiling Python (Scikit-learn, Matplotlib, Pandas), XGBoost, Tableau GitHub

• Analyzed 300,000+ climatic data points across 3 cities using statistical modeling and ML-based clustering, identifying 12 hotspots and mapping long-term urban heat trends

• Developed interactive Tableau dashboards, visualizing 10+ climate risk indicators and enabling 30 urban planners to prioritize interventions across 50 high-risk neighborhoods, improving climate adaptation strategies



Contact this candidate