Post Job Free
Sign in

Data Analyst Machine Learning

Location:
Boston, MA
Posted:
March 22, 2025

Contact this candidate

Resume:

Akshita Jain

Boston, MA +1-413-***-**** ********@**.*** Portfolio GitHub LinkedIn

SUMMARY

Results driven Data Analyst with hands-on experience in healthcare data analysis, predictive modeling, and data visualization. Proven ability to conduct non-routine analyses using Python, R, and SQL to solve complex business problems. Adept at building ETL pipelines, implementing machine learning algorithms, and generating actionable insights to improve operational efficiency. Enthusiastic about enhancing healthcare outcomes through data- driven solutions.

EDUCATION

Boston University, Metropolitan College Sep 2023 – Dec 2024 Master of Science in Applied Data Analytics Boston, MA Relevant Coursework: Data Visualization, Data Science with Python, Statistical Analysis, Advanced Machine Learning and Neural Networks Akhilesh Das Gupta Institute of Technology & Management Sep 2019 – Jul 2023 Bachelor of Technology in Computer Science & Engineering Delhi, India Relevant Coursework: Object Oriented Programming, Advanced Database Management, Software Development, and Information Systems SKILLS

Programming Languages: Python, R, SAS, SQL

Data Analysis & Modeling: Predictive Modeling, Statistical Analysis, Data Wrangling, Algorithm Designing, ETL pipelines, Time Series Analysis, A/B Testing, EHR Data Processing

Visualization Tools: Looker, Tableau, Power BI, Microsoft Excel (Advanced: Pivot Tables, Complex Formulas) Big Data & ETL: Snowflake, Airflow, Oracle DB, MySQL, PostgreSQL Frameworks: Seurat, Scanpy, TensorFlow, Keras, PyTorch, SciPy, Scikit-learn, Pandas, NumPy, Matplotlib, Plotly PROFESSIONAL EXPERIENCE

Blue Star Contractor LLC Jun 2024 – Aug 2024

Data Analyst Intern Remote

• Conducted cost analysis using SQL to extract and transform project expense data, identifying procurement inefficiencies that reduced material costs by 5% and improved project margins.

• Engineered interactive dashboards in Power BI and Tableau with custom DAX measures to optimize resource allocation across multiple sites, reducing idle workforce and equipment time by 10% and driving operational efficiency. KareXpert Feb 2022 – Mar 2023

Clinical Data Analyst Delhi, India

• Analyzed 50,000+ patient records using Python (Pandas, NumPy) and SQL, applying advanced statistical methodologies (outlier detection, trend analysis, predictive modeling) to improve healthcare delivery efficiency by 20%.

• Collaborated with clinical research coordinators to interpret cardiopulmonary exercise data, while designing Tableau and Power BI dashboards with advanced features, accelerating clinical decision-making by 35%. 1Gen Apr 2020 – Aug 2021

Healthcare Data Analyst Delhi, India

• Leveraged Python, SQL, and ML algorithms (regression models, and clustering techniques) to analyze 15,000+ EHRs, uncovering significant trends that boosted precision medicine adoption by 20% and reduced patient stress by 15%.

• Enhanced operational efficiency by 30% by automating EHR and genomic data processing with Python, reducing data handling time, and advancing mindfulness-based healthcare insights through Tableau dashboards. ACADEMIC PROJECTS

Healthcare Data Integration and Transformation Pipeline Airflow Snowflake Tableau GitHub Actions

• Developed end-to-end ETL pipelines using Airflow to orchestrate the extraction of data from Oracle DB, perform transformation tasks using Python (via Airflow tasks), and load the transformed data into Snowflake for analysis, successfully processing over 500GB of healthcare data.

• Automated CI/CD workflows with GitHub Actions, streamlining deployment and reducing the deployment time by 50%. Created Tableau dashboards to provide interactive, real-time data visualizations, maintaining 99.9% uptime for continuous access to critical healthcare insights. Single Cell RNA Sequencing Analysis Seurat R ggplot2

• Performed single-cell RNN sequencing analysis on 2,700 Peripheral Blood Mononuclear Cells (PBMC) using Seurat, including data preprocessing, clustering, and visualization with PCA and UMAP.

• Identified differentially expressed genes (MS4A1, CD79A, CD8) and biomarkers across 9 distinct cell clusters, optimizing feature selection and normalization techniques to enhance biological insights. ADDITIONAL EXPERIENCE

Bioinformatics and Computational Biology Lab, Delhi Technological University Jan 2023– Aug 2023 Undergraduate Research Intern Delhi, India

• Single-Cell RNA-Seq Analysis for Human Lung Cell Atlas – Conducted cell annotation, data integration with HCA, QC filtering, and clustering on lung single-cell RNA-seq data to study cellular heterogeneity.

• Computational & Bioinformatics Expertise – Utilized Seurat (R) and Scanpy/AnnData (Python) for single-cell data analysis. Developed Unix-based pipeline automation scripts to process large-scale biological datasets efficiently.



Contact this candidate