Arihant Barjatya
413-***-**** *********@*****.*** arihant-barjatya/ arihunter
Education
University of Massachusetts Amherst Amherst, MA
MS Computer Science Courses: Advanced ML, RL, IR, Distributed Systems, Computational Biology May 2026 Indian Institute of Technology Guwahati Guwahati, India Bachelors Electrical Engg Courses: Computer Vision, Optimization, Information Theory July 2018 – July 2022 Experience
Expedia Group Jun 2025 – Aug 2025
Machine Learning Science Intern Seattle, WA
• Developed a GitHub PR tool to detect i18n defects in message files, reducing review time from 41 hrs to seconds.
• Fine-tuned open-source LLMs like Qwen Coder on Databricks and GPT models, achieving an F1 score of 0.71. Johnson & Johnson MedTech Jan 2025 – Jun 2025
Data Science Co-op Danvers, MA
• Built an LLM-based document parser to extract structured data from clinical notes, boosting efficiency by 50%.
• Built a pipeline to prepare fine-tuning datasets from proprietary sources, then fine-tuned and evaluated open-source language and time-series foundational models on them. Neohumans Dec 2023 – July 2024
Founding AI Engineer Bengaluru, India
• Implemented long-term memory for AI companions using RAG techniques, increasing CRR by 70%.
• Created fine-tuning engine for open-source LLMs, combining synthetic Hinglish data generation, fine-tuning with Deepspeed, deployment services, and alignment using the DPO algorithm, reducing model launch time by 50%. Learnmigo Sept – Dec 2023
Co-Founder Bengaluru, India
• Addressed inconsistent education quality in India by developing custom AI bots based on Bloom’s Taxonomy.
• Led tech team to develop customizable agentic chatbot helping teachers create student specific study plans.
• Spearheaded the development of an AI tutor app for K-12 students, attracting over 5,000 users in the first month Oracle Aug 2022 – Sept 2023
Member of Technical Staff - Performance Engineering Bengaluru, India
• Automated LOB data collection, profiling, and reporting on Oracle DB, resulting in an 80% speedup.
• Successfully migrated internal dashboards to a secure HTTPS environment, improving data security. Internships Jan 2021 – Aug 2022
Undergraduate Remote
• Salesken (ML Engineer May–Aug 2022): Fine-tuned BERT on proprietary data to guide SDR pitching; real-time transcript analysis shipped with FastAPI + Milvus.
• AIDASH (Data Scientist Jan–Apr 2022): Architected vegetation-segmentation pipeline ( 67% faster) and coregistration model ( 30% lower loss); added road-network extraction tracked via MLflow.
• Optum (UHG) (Product Apr–Jun 2021): Built ML models and a Django web app delivering cost-efficient insights to boost health-plan ratings.
• AI Palette (Data Engineer, Jan–Apr 2021): Optimized food-trend pipelines on AWS, cutting runtime by 75 % and automating them with Airflow to double daily refresh cadence. Technical Skills
Programming Languages: Python, C,C++, Golang
Deep Learning / AI: PyTorch, Deepspeed, Hugging Face, LangChain, vLLM, TensorRT, Ray, SageMaker, Azure ML Data Science: Numpy, Pandas, Matplotlib, Scikit-learn, OpenCV, PySpark Development: Docker, Git, Django, AWS, PostgreSQL, FastAPI Projects
• Investigated GRPO for adapting open-source LLMs to medical-domain tasks at the UMass BioNLP Lab.
• Synergizer AI copilot for business development; FastBio & RiskMate, copilots for biological research & risk analysis.