Post Job Free
Sign in

Machine Learning Data Scientist

Location:
San Francisco, CA
Salary:
170000
Posted:
August 22, 2025

Contact this candidate

Resume:

Profile

Accomplished Senior Machine Learning Engineer & Data Scientist with a strong record of delivering production-ready AI solutions that bridge advanced research and real-world impact. Skilled in building end- to-end systems across computer vision, NLP, and generative AI, with hands-on expertise in large-scale data engineering, cloud MLOps, and model deployment. Known for transforming complex problems into scalable architectures using Bayesian inference, causal modeling, and deep learning frameworks. Experience spans healthcare, e-commerce, transportation, and industrial inspection, consistently driving efficiency, automation, and measurable business outcomes. Data Engineering & MLOps

•Big Data: PySpark, Databricks, GCP BigQuery

•Pipelines & Orchestration: Airflow, AWS Step

Functions, Glue

•CI/CD & Automation: Docker, Kubernetes,

Terraform, MLflow

•ETL Development & Data Onboarding

Software Development

•Backend: Python (Flask, Django), Node.js

(Express, NestJS)

•API Development: RESTful, Serverless

Architectures

•Frontend Integration: Next.js, SEO Optimization

•Security & Authentication: OAuth2, Clerk

Programming Languages

•Python (primary for ML/AI, data processing,

automation)

•SQL (data querying, analytics, pipeline

integration)

•C++ (computer vision & embedded systems

optimization)

•JavaScript (API integration, front-end

collaboration)

•Bash/Shell Scripting (automation, deployment)

Professional Experience

Senior Machine Learning Software Engineer/Data

Scientist, Meriti Inc

07/2021 – 08/2025

Skills

Machine Learning & AI

•Computer Vision: YOLOv5, U-Net, SAM, Stable

Diffusion, ControlNet, LoRA, OpenCV

•NLP: BERT, NER, Topic Modeling, Sentiment

Analysis, RAG, MCP

•Generative AI: Multi-view fashion generation,

real-time video captioning, text-to-image

synthesis

•Statistical Methods: Bayesian Inference, Causal

Inference

•Recommender Systems: Next Best Action

modeling, XGBoost-based recommendations

Cloud Platforms

•AWS: SageMaker, S3, Lambda, API Gateway,

EC2, RDS, Fargate

•GCP: BigQuery, AI Platform

Programming & Tools

•Languages: Python, SQL, JavaScript, TypeScript

•Frameworks: PyTorch, TensorFlow, scikit-learn

•DevOps: Git, CI/CD pipelines, Docker Compose

Security & Compliance

•HIPAA, GDPR, data encryption, de-identification

protocols

• Database migration across Databricks and GCP BigQuery.

• Developed Bayesian theory based algorithm to assess e-commerce product performance

• Implemented a Model Context Protocol (MCP) infrastructure to support scalable context management and real-time data synchronization.

• Designed and developed a robust MLOps pipeline from data ingestion to model deployment using Kubeflow, MLflow, and Seldon Core on EKS.

• Developed a RAG chatbot by utilizing LangChain and deploying Mixtral Large 7B on SageMaker Jumpstart, with Hugging Face LLM DLC powered by TGI on AWS.

• Topic modeling and Sentiment Analysis based on online customer review

• Extract product information through named entity recognition (NER) techniques. Jeffery Wood Machine Learning Engineer

**************@*******.***

+1-350-***-****

San Francisco, CA, US

linkedin.com/in/jeffery-w-30597571

• Automated the ETL data pipeline using Apache Airflow, Kafka, Spark, and Tableau, thereby enhancing scalability through best practices.

• Developed OCR pipeline to extract text from the railroad images using Yolov5.

• Developed spring defect detection on high-speed running train wheel images

• Created a hotel recommendation system using the HyDE approach alongside Redis and the OpenAI API

• Developed Next Best Action system for marketing content recommendation in the pharmaceutical commercialization process leveraging Terraform, Sagemaker, Airflow, S3, Docker, and Fargate.

• Designed various prompt techniques, including chain-of-thought and step-back prompting, to improve model performance.

•Led the end-to-end development of a LLM-powered Conversational AI system designed for hospital use, enabling physicians to interact with patient data via natural language for diagnosis support.

•Ensured secure handling of sensitive patient data in compliance with HIPAA/GDPR standards, including encrypted storage and de-identification protocols.

•Led a cross-functional team of 4+ developers, researchers, and data engineers in delivering a production-grade Conversational AI system under tight clinical compliance constraints.

•Mentored junior machine learning engineers and interns, providing guidance on model development, code quality, and best practices in applied AI.

•Acted as a technical lead in cross-department collaboration between AI/ML, product management, and clinical domain experts to align system capabilities with real-world hospital workflows. Tech Lead/AI Engineer, Voxel Cloud 08/2019 – 06/2021 Los Angeles, CA

•Designed and implemented computer vision solutions for automated medical imaging analysis using OpenCV and PyTorch, achieving 95% accuracy in anomaly detection.

•Collaborated with designers and product owners to implement security measures, safeguarding the web application from threats and vulnerabilities.

•Developed and maintained back-end services and APIs using NestJS (Node.js) and Python Flask, ensuring high performance and scalability.

•Collaborated with cross-functional teams in an agile environment to identify and develop software solutions aligned with business requirements.

•Implemented SEO using Next.js and Python Django, contributing to the success of a cutting-edge SaaS product.

Data Scientist, Cleerly 06/2017 – 08/2019 New York, USA

•Developed a serverless architecture using AWS Lambda and API Gateway to achieve a cost-effective and scalable backend solutian.

•Managed AWS infrastructure, including EC2 instances, 53 buckets, and RDS databases, ensuring high availability and data security.

•Built and maintained back-end server-side applications using Node.js, ensuring efficient communication between the Server and Client-side components.

•Enhanced an e-commerce site's search capabilities by leveraging Algolia to provide users with fast and relevant search results.

•Implemented user authentication and authorization using industry-standard protocols like OAuth2 and Clerk, resulting in improved security and reduced a risk of data breaches.

•Developed a RESTful API using Node.js and Express, enabling the company to expose its services to external users and systems, resulting in improved efficiency and flexibility.

•Leveraged Salesforce CRM to streamline customer relationship management, enhancing sales and marketing processes.

•Implemented Semantic HTML5 markup throughout the project, enhancing accessibility and improving Search Engine Optimization (SEO) by providing clear and meaningful structure to web content. Software Engineer Internship, Cisco 05/2016 – 12/2016

•Device Health Visibility: Provide a snapshot of device health, readiness for upgrade or maintenance window, general dashboard status of network elements, etc. 1 . Leverage automated health check software capability being created by our cBR-8 CMTS team that has been built on Open Daylight, includes capabilities to pull both CLI and SNMP information, and then incorporate into a GUI based dashboard for monitoring. 2 . Focus on the automation of the health check scripts, both the pulling of the information off the devices as well as the correlation and analysis of the information. 3 . Access health information in real time at all times and alarm specifically on failure of those health checks as a critical alarms. Iterate off the platform to incorporate more newly identified metrics to gather and present as health check inputs.

4. System wide resource usage.

5. Built on open source software and is portable to support 3rd party vendors as well.

•Alarm Reduction/Correlation :

1. Bring hiearchical approach to event/trap correlation to reduce alarm noise, correlate multiple events 2. Incorporate peer neighbor box event correlation as part of the process to further reduce alarms 3. Correlate alarms in device, process and service levels Test automation infrastructure to ensure high- quality and on-going development of ODL-based tools. Education

Master's degree, Computer Science,

Virginia Polytechnic Institute and State University 2015 – 2017

BS, Biology, General, Lanzhou University 2010 – 2014



Contact this candidate