Post Job Free
Sign in

Machine Learning Data Science

Location:
West Springfield, MA
Posted:
October 07, 2025

Contact this candidate

Resume:

Arihant Barjatya

413-***-**** *********@*****.*** arihant-barjatya/ arihunter

Education

University of Massachusetts Amherst Amherst, MA

MS Computer Science Courses: Advanced ML, RL, IR, Distributed Systems, Computational Biology May 2026 Indian Institute of Technology Guwahati Guwahati, India Bachelors Electrical Engg Courses: Computer Vision, Optimization, Information Theory July 2018 – July 2022 Experience

Expedia Group Jun 2025 – Aug 2025

Machine Learning Science Intern Seattle, WA

• Developed a GitHub PR tool to detect i18n defects in message files, reducing review time from 41 hrs to seconds.

• Fine-tuned open-source LLMs like Qwen Coder on Databricks and GPT models, achieving an F1 score of 0.71. Johnson & Johnson MedTech Jan 2025 – Jun 2025

Data Science Co-op Danvers, MA

• Built an LLM-based document parser to extract structured data from clinical notes, boosting efficiency by 50%.

• Built a pipeline to prepare fine-tuning datasets from proprietary sources, then fine-tuned and evaluated open-source language and time-series foundational models on them. Neohumans Dec 2023 – July 2024

Founding AI Engineer Bengaluru, India

• Implemented long-term memory for AI companions using RAG techniques, increasing CRR by 70%.

• Created fine-tuning engine for open-source LLMs, combining synthetic Hinglish data generation, fine-tuning with Deepspeed, deployment services, and alignment using the DPO algorithm, reducing model launch time by 50%. Learnmigo Sept – Dec 2023

Co-Founder Bengaluru, India

• Addressed inconsistent education quality in India by developing custom AI bots based on Bloom’s Taxonomy.

• Led tech team to develop customizable agentic chatbot helping teachers create student specific study plans.

• Spearheaded the development of an AI tutor app for K-12 students, attracting over 5,000 users in the first month Oracle Aug 2022 – Sept 2023

Member of Technical Staff - Performance Engineering Bengaluru, India

• Automated LOB data collection, profiling, and reporting on Oracle DB, resulting in an 80% speedup.

• Successfully migrated internal dashboards to a secure HTTPS environment, improving data security. Internships Jan 2021 – Aug 2022

Undergraduate Remote

• Salesken (ML Engineer May–Aug 2022): Fine-tuned BERT on proprietary data to guide SDR pitching; real-time transcript analysis shipped with FastAPI + Milvus.

• AIDASH (Data Scientist Jan–Apr 2022): Architected vegetation-segmentation pipeline ( 67% faster) and coregistration model ( 30% lower loss); added road-network extraction tracked via MLflow.

• Optum (UHG) (Product Apr–Jun 2021): Built ML models and a Django web app delivering cost-efficient insights to boost health-plan ratings.

• AI Palette (Data Engineer, Jan–Apr 2021): Optimized food-trend pipelines on AWS, cutting runtime by 75 % and automating them with Airflow to double daily refresh cadence. Technical Skills

Programming Languages: Python, C,C++, Golang

Deep Learning / AI: PyTorch, Deepspeed, Hugging Face, LangChain, vLLM, TensorRT, Ray, SageMaker, Azure ML Data Science: Numpy, Pandas, Matplotlib, Scikit-learn, OpenCV, PySpark Development: Docker, Git, Django, AWS, PostgreSQL, FastAPI Projects

• Investigated GRPO for adapting open-source LLMs to medical-domain tasks at the UMass BioNLP Lab.

• Synergizer AI copilot for business development; FastBio & RiskMate, copilots for biological research & risk analysis.



Contact this candidate