Post Job Free

Resume

Sign in

Senior Data Scientist

Location:
Marrero, LA, 70072
Posted:
January 29, 2024

Contact this candidate

Resume:

PHILIP MARIAZIALE

Contact: 218-***-**** Email: ads5oe@r.postjobfree.com

DATA SCIENTIST ML ENGINEER AI SCIENTIST

Attuned to the latest trends and advancements in this field, I am consistently delivering impeccable results through my dedication to handling multiple functions and activities under high-pressure environment with tight deadlines./

EXECUTIVE SNAPSHOT

•Experienced Data Scientist & Full Stack Machine Learning Engineer with over 11+ years of experience in applying deep learning, artificial intelligence, and statistical methods to data science problems to increase understanding and enhance profits as well as market share of the company.

•Brilliant in developing algorithms and implementing novel approaches to non-trivial business problems in a timely and efficient manner; possess experience in knowledge databases and language ontologies.

•Good knowledge of executing solutions with common NLP frameworks and libraries in Python (HuggingFace, NLTK, spaCy, genism) or Java (Stanford CoreNLP, NLP4J). Familiarity with the application of Neural Networks, Support Vector Machines (SVM), and Random Forest.

•Stays up to date with the current research in data science, operations research, and Natural Language Processing to ensure we are leveraging best-in-class techniques, algorithms, and technologies.

•Possess knowledge of remote sensing; well versed in identifying/ creating the appropriate algorithm to discover patterns, and validate their findings using an experimental and iterative approach.

•Strong interpersonal & analytical skills, with abilities to multi-task & adapt, handling risks under high-pressure environments; creative problem solver, able to think logically and pay close attention to detail

PROFESSIONAL EXPERIENCE

Since Mar 2023-Present with United Health Group (Through Optum), Eden Prairie, Minnesota, U.S.

As a Senior Data Scientist - MLOps

•Led a pivotal role in a team-oriented project, overseeing crucial stages such as the development of an unsupervised outlier identification algorithm and the implementation of the CI/CD pipeline

•Executed unsupervised outlier detection through a unique approach, utilizing five distinct outlier detection methods to label the dataset and aggregating their findings to generate an outlier_percent column for streamlined filtering

•Spearheaded the design and implementation of the CI/CD pipeline, leveraging Azure Cloud, Snowflake, Databricks, Jenkins, Docker, and Kubernetes. The pipeline, hosted on Databricks, seamlessly pulled raw data from Snowflake, conducted ETL operations, identified outliers, and uploaded processed data to Azure Blob Storage

•Orchestrated Jenkins tasks for data preparation, merging dataset files into a unified CSV, and storing it in the workspace's datastore with versioning. Subsequently, initiated the model training task in Azure, executing code on a Databricks cluster and saving the output as a new model in AzureML

•Successfully deployed models using an Embedded Architecture approach, integrating models with the app within a Docker image for efficient deployment

•Implemented creational design patterns in the CI/CD pipeline for reusability and behavioral patterns in algorithms and integrations for improved efficiency

•Worked within a team structure led by a Data Science Manager, collaborating with 3 Data Scientists to achieve project objectives

•Utilized tools such as Snowflake, Jenkins, Azure Cloud, Docker, Databricks, PySpark, and Twistloc to streamline various aspects of the project

•Adopted a canary deployment release process, starting with limited access and gradually expanding over time to ensure a smooth and controlled deployment of models

•Contributed significantly to the project's overarching goal: developing a workflow to personalize Medicare plan recommendations for members seeking new plans during annual enrollment

Feb 2021-Mar 2023 with Regions Financial Bank, New York

As a Senior AI Scientist

•Developed a script for deploying updated Docker images to EC2 instances, ensuring efficient and timely updates.

•Facilitated discussions with Lot18 representatives to address project progress, conceptual exploration, and resolution of blockers or errors.

•Implemented A/B testing, confirming a 9% increase in repeat customers for populations using the recommender system.

•Engineered a Hybrid Mixed recommender system by integrating collaborative filtering, content-based, and demographic recommender techniques.

•Conducted A/B testing to optimize the most effective recommender system, addressing the "cold start" problem with a Demographic-based recommender system.

•Utilized Pandas and NumPy for data preprocessing, cleaning, and feature engineering, employing Python for handling missing values in the dataset.

•Implemented text preprocessing techniques such as stemming and lemmatization to streamline the corpus for efficient analysis.

•Applied Keras and TensorFlow for developing predictive algorithms and solving analytical problems.

•Constructed an NLP-based filter using embedding and BERT in TensorFlow and Keras for advanced text analysis.

•Constructed an NLP-based filter utilizing embedding and BERT in Tensorflow and Keras

May 2019 - Feb 2021 with Levi Strauss & Co. Company, San Francisco CA

As an ML-Ops Engineer

•Built a personalized in-session product recommendation engine

•Wrote scripts in Python that automated text summarization and clustering.

•Involved in Next-Best offer prediction and designed Microassortments for Next-Gen stores

•Performed Anomaly Detection and Root Cause Analysis

•Prepared data for collaboration with machine learning models

•Unified consumer profile with probabilistic record linkage

•Accountable for Visual search for similar and complementary products

•Architected, built, maintained, and improved new and existing suite of algorithms and their underlying systems

•Analyzed large data sets applied machine learning techniques and developed predictive models, statistical models, and developed and enhanced statistical models by leveraging best-in-class modeling techniques

•Implemented end-to-end solutions for batch and real-time algorithms along with requisite tooling around monitoring, logging, automated testing, performance testing, and A/B testing

•Worked closely with data scientists and analysts to create and deploy new product features on the e-commerce website, in-store portals, and the Levi's mobile app

•Established scalable, efficient, automated processes for data analyses, model development, validation and implementation

•Implemented deployment solutions using Tensorflow, Keras, Docker, and Elastic Kubernetes Service

•Executed Model Drift Monitoring and Retraining Strategies

Jul 2017 - May 2019 with Credit Suisse, New York City (REMOTE)

As a Machine Learning Engineer

•Develop a fraud detection system for financial transactions to enhance security

•Utilize Pandas for data cleaning and transformation, ensuring accuracy and reliability. Store data on Amazon S3

•Extract essential transaction features using Pandas and NumPy for model development

•Implement advanced ML algorithms with Scikit-learn and TensorFlow, leveraging AWS SageMaker for scalable training

•Optimize model performance using Scikit-learn and AWS SageMaker for hyperparameter tuning

•Build a real-time alerting system using AWS Lambda and SNS for immediate notifications

•Seamlessly integrate the model with the existing bank infrastructure using AWS Lambda and API Gateway

•Design for efficiency and scalability using AWS ECS for container orchestration and AWS Lambda for serverless execution

•Enhance transparency using SHAP (SHapley Additive exPlanations) and AWS XAI tools

•Monitor precision, recall, and F1 score using Scikit-learn metrics and AWS CloudWatch

•Establish a feedback loop using AWS Step Functions for model retraining and improvement

•Create concise documentation using Jupyter Notebooks, Sphinx, and store on AWS S3 for effective knowledge transfer and collaboration

Oct 2015 - Jul 2017 with New York Life Insurance, New York, New York

As a Data Scientist

•Engineered personalized product recommendations through the implementation of advanced machine learning algorithms, with a primary focus on Collaborative Filtering to cater to the unique needs of existing customers and drive the acquisition of new customers

•Spearheaded the creation and deployment of a diverse set of ML algorithms, leveraging logistic regression, random forest, KNN, SVM, neural network, linear regression, lasso regression, and k-means for comprehensive and effective modeling

•Pioneered the development of optimization algorithms tailored for data-driven models, extending their applicability to various machine learning paradigms, including supervised and unsupervised learning, as well as reinforcement machine learning

•Conducted in-depth research on statistical machine learning methods, encompassing forecasting, supervised learning, classification, and Bayesian methods, ensuring the incorporation of cutting-edge techniques into the modeling framework

•Advanced the technical sophistication of solutions by incorporating machine learning and other advanced technologies, contributing to the enhancement of overall model performance

•Executed exploratory data analysis and crafted insightful data visualizations using R and Tableau, fostering a deeper understanding of the underlying data patterns

•Collaborated seamlessly with data engineers to implement the ETL process, playing a crucial role in the optimization of SQL queries for efficient data extraction and merging from Oracle databases

•Leveraged a versatile skill set in R, Python, and Spark to develop a wide array of models and algorithms, catering to diverse analytic requirements within the project

•Ensured data integrity through meticulous data integrity checks, proficient data cleaning, exploratory analysis, and feature engineering employing a combination of R and Python to uphold data quality standards

Jul 2012 – Oct 2015 with Lam Research, Fremont, CA

As a Data Scientist

•Engineered a Vectorizing function to embed facial features, enhancing the representation of key facial characteristics

•Developed a specialized algorithm for efficient storage and comparison of vectorized features, streamlining the verification process

•Implemented Convolutional Neural Networks (CNNs) using PyTorch and Python to enhance the depth of image analysis

•Conducted comprehensive data cleaning on both image and tabular datasets to ensure data quality and accuracy

•Applied image augmentation techniques to introduce rotational, motion, and scale invariance for robust model training

•Devised statistical evaluation techniques to assess and validate the performance of the developed models

•Orchestrated deployment using Flask and Pickle, ensuring seamless integration and accessibility of the models

ACADEMIC CREDENTIALS

Bachelors Degree in Computer Science

University of New Orleans

Certifications

Machine Learning by Andrew Ng

Deep Learning Specialization by Deeplearning.io

PERSONAL DETAILS



Contact this candidate