Post Job Free
Sign in

Data Scientist Machine Learning

Location:
Massapequa Park, NY
Salary:
190000
Posted:
July 29, 2025

Contact this candidate

Resume:

Thomas A. Collins, Data Scientist

Massapequa Park, NY 11762 516-***-**** ******.*******.*******.**@*****.***

www.linkedin.com/in/thomasacollins www.github.com/tcollins1984 www.datacamp.com/portfolio/thomasanthonycollinstc

SUMMARY

Data Scientist with 8+ years of experience delivering AI-driven solutions, machine learning models, and advanced analytics across diverse industries. Skilled in modern AI technologies including LLMs, NLP, and Generative AI, as well as traditional ML and statistical modeling. Adept at designing scalable workflows, automating analytical processes, and developing data products that deliver actionable business insights. Proficient in Python, SQL, and leading data tools such as Databricks, AWS S3, and Tableau, with a proven track record of collaborating across teams to translate data into strategic value.

WORK EXPERIENCE

Data Scientist Aug 2018 – Dec 2020, Senior Data Analyst Jan 2021 – Dec 2024 Elsevier New York, NY

- Applied LLaMA-based LLMs and NLP methods to enhance text classification and improve data retrieval for large-scale datasets.

- Designed prompt engineering workflows and GenAI solutions for internal data analysis tasks.

- Developed and optimized analytical workflows using Python, PySpark, and SQL on Databricks, significantly reducing manual effort.

- Managed and optimized big data environments (100M+ rows), improving scalability and reliability.

- Built secure AWS S3 processes for efficient storage and retrieval of critical datasets.

- Authored database documentation to streamline onboarding and ensure consistent processes.

- Delivered data mining projects that provided actionable insights for senior stakeholders.

- Partnered with Marketing to integrate data insights into campaigns, increasing data-driven decision-making.

NLP Data Scientist Amenity Analytics

New York, NY Jan 2018 – Jun 2018

- Developed NLP models to analyze sentiment in financial texts, improving accuracy of analytical outputs.

- Processed and cleaned large datasets using Python Pandas, optimizing model performance.

- Created visual insights and reports using Tableau, Matplotlib, Seaborn, and Pandas to support business needs.

Data Scientist Kathy Kuo Home

New York, NY Jul 2017 – Dec 2017

- Conducted A/B testing and performance analysis to enhance user experience and conversion rates.

- Built and maintained Tableau dashboards for sales and marketing KPIs.

- Partnered with Marketing teams to improve campaign performance through analytics.

Data Scientist / Business Intelligence United Capital Source LLC

New York, NY May 2016 – Jun 2017

- Wrote SQL queries to mine CRM data for improved business targeting.

- Performed financial data analysis using R, Python, and VBA to refine client-lender matching.

- Led weekly data strategy discussions to promote a data-centric culture.

- Contributed data analysis that supported a 25% increase in small business funding, exceeding $100 million.

Physics Teacher Democracy Prep Public Schools

New York, NY Aug 2015 – May 2016

- Designed and delivered curriculum aligned with NYS Board of Regents standards.

- Used data-driven insights to track and improve student performance.

Adjunct Assistant Professor City University of New York

New York, NY Sep 2013 – Aug 2015

- Instructed astronomy and Physics lab sessions, facilitating hands-on learning.

Adjunct Associate Professor of Physics Hofstra University

Hempstead, NY Sep 2014 – Dec 2014

- Educated students in Physics 2 and labs, focusing on practical and theoretical aspects.

EDUCATION

Stevens Institute of Technology Ph.D., Physics Hoboken, NJ Sep 2007 – Jul 2012

Stevens Institute of Technology Graduate Program, Financial Engineering Hoboken, NJ Sep 2011 – May 2012

Stevens Institute of Technology M.S., Physics Hoboken, NJ Sep 2007 – May 2010

New York University B.A., Physics/Mathematics New York, NY Aug 2003 – Jan 2007

SKILLS

Technical Skills: Python SQL R VBA Power BI Tableau Pandas PySpark NumPy SKLearn AWS S3 Spark Databricks Matplotlib Seaborn Excel GitHub LaTeX Atlassian Confluence

AI & Analytics Expertise: LLMs (LLaMA, OpenAI) Prompt Engineering AI Agent Development NLP & Text Analytics Machine Learning Predictive Modeling Recommendation Systems Workflow Automation Big Data Processing (100M+ rows) Time Series Analysis Hypothesis Testing Multivariate Regression

LANGUAGES

English Spanish



Contact this candidate