Cherry Gupta
317-***-**** Indianapolis, IN, ***** *********@*****.*** LinkedIn: www.linkedin.com/in/cherry-gupta-27b95a173
SKILLS
Programming Skills: Python, R, SAS, Hadoop, MATLAB, C++, WordPress, Oxygen XML Editor, XML, HTML, HL7, JSON.
Data Extraction & Visualization: MySQL, PostgreSQL, NoSQL (MongoDB, Cassandra), Apache Spark, Tableau, Power BI, DHIS2, AWS QuickSight, Microsoft Excel, Adobe Creative Suite, LDAVis, VOSviewer.
Certifications: Human subject certification for Biomedical research.
Libraries: NumPy, Pandas, Scikit-learn, Open Computer Vision (OpenCV), TensorFlow, Keras, PyTorch, Azure VMs.
Machine Learning (ML): Regression Analysis, Convolutional Neural Networks (CNN), Clustering, XGBoost, Factor Analysis, Anomaly Detection, A/B Testing, Random Forest, Classification Analysis, Statistical Analysis.
Health Informatics: Electronic Health Records (EHR), OpenEMR, Clinical Decision Support Systems (CDSS), HIPAA law.
Data Standards: SNOMED CT, ICD, LOINC, CPT, RxNorm.
Program Management: Qualtrics, JIRA
Medical: Health/Medical Terminologies, Pharmacology, Public Health, Physiology, Histology, Pathology, Surgery, Anatomy.
EDUCATION
Master of Science in Health Informatics Indiana University, Indianapolis, US (August 2023 – May 2025)
Bachelor of Dental Surgery Government College of Dentistry (GDC) Indore, India (September 2014 – March 2019)
WORK EXPERIENCE
Indiana University, Indianapolis, Indiana (August 2024 – Present)
Data Science Researcher
Developed SQL and Python-based data pipelines to extract, prepare, and automate public health data collection, improving dataset accessibility by 25% and enabling real-time analysis.
Synthesized research findings using PRISMA guidelines, developed Tableau dashboards for KPI tracking, and presented insights via LDAVis and VOSviewer, enhancing engagement among 50+ stakeholders.
Randolph County Caring Community Partnership, Moberly, Missouri (May 2024 – August 2024)
Data Analytics Intern
Developed complex ETL pipelines using SQL and Apache Spark to process large-scale healthcare datasets, reducing data processing time by 30% and improving KPI tracking efficiency for Medicaid and COVID-19 projects.
Designed Power BI dashboards and reports integrated into DHIS2 to monitor health program performance for 10,000+ clients, improving reporting accuracy by 15% and ensuring real-time access to critical data for stakeholders.
BizOne Soft Pvt. Ltd, Mumbai, India (March 2022 – July 2023)
Data Analyst
Designed SQL-based data warehouse solutions and Python predictive models, integrating data for 1,800+ daily patients, reducing manual handling by 40% and boosting annual revenue by 14% through enhanced data analysis.
Implemented predictive analytics using R and Tableau dashboards on Apache Spark for early intervention, reducing patient churn rate by 18% through personalized care recommendations and optimizing resource allocation.
Rely Home Dental Services and Dental Solutions, Indore, India (March 2021 – February 2022)
Clinical Data Analyst
Developed machine learning forecasting models for patient care trends and analyzed clinical data, improving inventory efficiency by 20% and enhancing decision-making for dental care providers.
Engineered interactive Tableau dashboards with custom SQL queries to visualize complex treatment metrics and patient outcomes, enabling real-time data-driven care planning and increasing treatment efficiency by 12%.
Government College of Dentistry, Indore, India (April 2019 – July 2020)
Clinical Intern
Conducted statistical analysis in R on 1,200+ oral submucous fibrosis patient records, finding that 70% of cases were moderate to severe, and developed predictive models to enhance early detection strategies for dental care providers.
Analyzed 5,000+ patient dental records using R, revealing 78% caries prevalence before age 17; created Tableau dashboards to improve early intervention strategies by 12%.
ACADEMIC PROJECTS
Advanced Statistical Analysis and Predictive Modeling for Heart Failure Prediction (January 2024 – March 2024)
Conducted statistical analysis in R on heart failure data and created Tableau dashboards to visualize risk factors, improving early intervention strategies by 15% and enhancing risk assessment accuracy.
Enhanced PCOS Diagnosis with Machine Learning (August 2023 – December 2023)
Used SQL for efficient data extraction, engineered ML models for PCOS detection, and built Tableau dashboards on Apache Spark to visualize predictors, improving diagnostic accuracy by 22% and enhancing early identification.