Post Job Free

Resume

Sign in

Principal AI Scientist Mathematician

Location:
Framingham, MA, 01702
Salary:
~ $200K
Posted:
November 22, 2023

Contact this candidate

Resume:

MIKHAIL GORELKIN

Principal AI Scientist Mathematician Engineer

email: ad1df4@r.postjobfree.com

phone: 972-***-****

blog: http://blog.mikhail.gorelkin.com

summary Principal AI Scientist, Mathematician, and Engineer proficient in taming business uncertainty and complexity through artificial intelligence and generative AI, advanced data science, multi- agent and mathematical modeling, and engineering in production. key skills Artificial Intelligence, NLP, Machine Learning & Deep Learning, Reinforcement Learning, Probabilistic Programming, Decision Making

Generative AI: custom AI chatbots & autonomous AI agents using ChatGPT API and open- source Large Language Models, fine-tuning open-source LLMs, integrating them with data and computations, etc.

Advanced Methods in Data Science: transformers and zero-shot transfer learning for big data forecasting and probabilistic predictions in complex systems, quantum-inspired optimizations, recommender systems, etc.

Complex Systems & Multi-Agent Modeling

Mathematical Modeling

Python, C++, C#, Scala, Julia, SQL, scikit-learn, TensorFlow / Keras, PyTorch, spaCy, Hugging Face, LLMs / Transformers, LangChain, Faiss, Pinecone, Weka, PySpark, Akka, Google Colab, Amazon SageMaker, Amazon EC2 / Linux, GitHub Copilot. experience AI & Advanced Data Science Consultancy (Boston, MA) 04.2005 – current Principal AI Scientist Mathematician Engineer

Areas of work

Research using scientific publications

Data analysis and conceptual design of innovative approaches, models, and algorithms

Development of models and algorithms (prototyping and programming for production)

Development of distributed and multi-agent systems, including agent-based modeling Projects (sample list)

Generative AI / LLMs (BridgeCare Technologies Advisory): Researched recent advances in publications and open source. Designed a conceptual framework. Prototyping.

Big data forecasting (Celsius Holdings): Analyzed big data and developed a probabilistic solution for inventory forecasting (PyMC 5, PySpark, Amazon SageMaker).

Intelligent automation (Deloitte / ServiceNow): Researched and developed for production symmetric & asymmetric semantic search on multiple NLP tasks (PyTorch, spaCy, Transformers, Faiss, Dataiku).

AI for complex systems (AES Corp.): Researched and prototyped: several probabilistic forecasting models for gas prices (Prophet, Edward / TensorFlow), a framework for full simulation on the Southern California energy market (multi-agents, Bayesian Nets), a latent reinforcement learning algorithm, and a multi-agent reinforcement learning for competitive games model for the Colombian hydro energy market (PyTorch). Developed for production an automatic gas trader for the Southern California energy market.

Nonparametric probabilistic inference (Scientific Systems Company, Inc. / DARPA): Researched and developed a model for nonparametric inference for simulated trajectories with noise based on a variational autoencoder and probabilistic classifier (TensorFlow Probability).

Intelligent advertising (ADΣXT Corp.): Analyzed data streams from different platforms and restored their integrity with algorithmic constraints. Developed a reinforcement learning algorithm for optimal advertising strategies on FB (TensorFlow). Designed and prototyped a probabilistic multi-agent test framework for simulation and analysis of the users' behavior on FB.

An intelligent virtual agent (agent.ai): Researched automatically generated responses for IVAs. Developed a model and all algorithms based on deep adversarial learning (Torch7 / Lua).

Scalability for an enterprise system (Earley Information Science / Ford): Analyzed, solved, and developed a solution for a scalability problem for Ford's Virtual Assistant for Vehicles project.

Clinical trials (BBCR Consulting): Designed and developed an adaptive computational framework for CTs (Julia).

NLP / NLU (noHold): Developed several NLP / NLU algorithms for production with spaCy and scikit-learn. Prototyped unsupervised deep embedding for text clustering (word2vec, MXNet) and several semantic transformations on the texts based on the dependency parser (spaCy).

Intelligent mortgage (Boston Consulting Group): Researched and designed a framework for mortgage loan automation.

An intelligent online dating platform (Likeli): Researched and designed a conceptual model for the platform. Designed a personality similarity metric between users and implemented in Theano / GPU library. Developed an intelligent multi-agent framework based on Scala, Breeze, Akka, and Redis to scale computations for many thousands simultaneous users in real-time. Developed computing personality profiles for FB users based on the Big Five Personality Traits Model. Developed a real-time personalization system based on users’ feedbacks for several types of bandit algorithms modified for a vector space: Epsilon- Greedy, Softmax, UCB1, Bayesian, and Bayes-UCB1. Improved Bayes-UCB1 for balance multi- factor learning. Solved scalability and performance issues for 1M users and put the system into production.

Marketing (MutualMind): Developed for production various algorithms such as word sense disambiguation, Twitter hashtag decomposition (used Norvig’s algorithm), topic modeling, sentiment analysis, etc. (Python, NLTK, Gensim).

A marketing platform (SocialExtract): Developed many NLP algorithms to clean, transform, and integrate data streams from different sources. Prototyped a library of NLP and ML primitives and a framework for the quick composition of computational pipelines based on Python functional and streaming libraries. Prototyped a relevance engine in this framework. After the client was satisfied with the accuracy, I programmed it in a distributed environment (Hadoop / MapReduce, MongoDB, and Redis) for scalable production with mrjob / Python and Java. Implemented the distributed Sparse Proximal SVM based on publications.

A recommender (IEEE): Researched novelty detection in scientific articles and Deep NLP as a framework for NLP problems. Designed and developed a multi-label classification for a large label set (> 7K) (Mulan, MEKA, scikit-learn) and a similarity metric as a core for the client's recommender system.

Intelligent real-time logistics (Weft): Researched and designed a conceptual framework for a real-time logistics platform based on DARPA’s Cougaar architecture. Developed several heuristics based on ant-colony optimization for adaptive routing optimization (Java).

A social network (PopCircle): Researched, designed, and developed for production a composable approach to collaborative filtering with several social regularizations (Octave).

E-learning (Junyo): Researched and designed a model for adaptive learning. Developed the concept map extraction from documents and a fuzzy matching algorithm (Java, Stanford NLP). Found significant patterns in the data for sales prediction (Weka). Compuware Corp., Technology Department (Detroit, MI) 03.2000 – 11.2004 Software Developer VII

Researched and developed many advanced features for QALoad (automated performance and scalability testing software) (MS C++/STL, all major RDBs, Statistica). Central Transport International, Inc. (Sterling Heights, MI) 03.1998 - 02.2000 Systems Architect

Managed a team of ten engineers to developed an NT-based distributed enterprise architecture for 100+ terminals across the US, Canada, and Mexico using satellite communication. Managed several consulting teams to implement an ERP system on clusters.

Resource Technologies (Troy, MI) 08.1997 – 03.1998 Software Consultant

Developed a scalable architecture for terminal operations based on MTS and MS SQL Server with up to 300 MS-DOS clients / hand-held computers using MS RPC (MS C++). After several months of work, the client hired me as a Systems Architect. Advanced System & Designs, Inc. (Troy, MI) 01.1996 – 05.1997 Software Engineer

Researched and developed the DoES for Windows (the primary product for the Shainin approach to Statistical Design of Experiment) and the ANOVA-TM 2.x for Windows (the primary product for the Taguchi approach to DoE) (MS C++, Statistica). DataNet Technologies, Inc. (Troy, MI) 02.1993 – 06.1994 Software Engineer

Researched and developed all statistics for the WinSPC (statistical process control software), including the non-normal capability analysis for all types of Pearson’s and Johnson’s distributions (MS C++).

education Voronezh State University (Voronezh, USSR) Master of Science, Mathematics

Nonlinear functional analysis. The master's thesis: Diffeomorphisms in Banach spaces. Continuing Education with Certification

Train & Fine-Tune LLMs for Production (Intel Disruptor Initiative) 2023

Quantum Computation using Qiskit v0.2X (IBM) 2022

Quantum Computing (Coursera, Saint Petersburg State University) 2021

Algorithmic Information Dynamics (Santa Fe Institute) 2018

Parallel Programming (Courser, École Polytechnique Fédérale de Lausanne) 2017

Text Mining and Analytics (Coursera, University of Illinois at Urbana-Champaign) 2015

Statistical Learning (Stanford University) 2014

Mining Massive Datasets (Coursera, Stanford University) 2014

Decision Making in a Complex & Uncertain World (University of Groningen) 2014

Data Mining with Weka. Parts 1 & 2 (University of Waikato) 2014

Introduction to Dynamical Systems and Chaos (Santa Fe Institute) 2014

Introduction to Complexity (Santa Fe Institute) 2013

Game Theory (Coursera, Stanford University) 2013

Natural Language Processing (Coursera, Columbia University) 2013

Algorithms: Design and Analysis. Parts 1 & 2 (Coursera, Stanford University) 2013

Machine Learning (Coursera, Stanford University) 2012

Model Thinking (Coursera, University of Michigan – Ann Arbor) 2012



Contact this candidate