Post Job Free
Sign in

Data Science Machine Learning

Location:
Northborough, MA
Posted:
January 25, 2025

Contact this candidate

Resume:

Shujuan (Susan) Ji

Cambridge, MA

857-***-****

*********@*******.***

A resourceful problem-solver with experience in AI product design, Python programming, Machine Learning, and Statistical modeling, complemented with multiple certificates from MIT and Coursera.

Ph.D. in Mathematics from Caltech

12+ years of software engineering experience with a proven record of delivering innovative software solutions.

Seeking opportunities in AI Engineering, Data Science, or Software Engineering, to apply diverse technology skills in solving complex problems.

EDUCATION

MIT edX MicroMasters: Statistics and Data Science

Machine Learning with Python-From Linear Models to Deep Learning Grade: 91/100

Data Analysis: Statistical Modeling and Computation in Applications Grade: 89/100

Fundamentals of Statistics Grade: 85/100

Probability - The Science of Uncertainty and Data Grade: 93/100

Online

May 2024– Sept 2024

May 2024– Sept 2024

Sept 2024–Dec 2024

Sept 2024–Dec 2024

MITxPRO: Designing and Building AI Products and Services Grade: 100/100

California Institute of Technology

Ph. D. in Mathematics – Research Interest: Number Theory

Nanjing University

B.S. in Mathematics

Online

May 2024–July 2024

Pasadena, CA

Sept 1990–June 1995

Nanjing, China

Sept 1984–June 1988

PUBLICATION

Analogs of (z) for triangular Shimura Curves Acta Arithmetica LXXXIV. 2(1998)

EXPERIENCE

Freelance Cambridge, MA

Software Engineer AI and Data Science Specialist JULY 2023–Present

AI Design: Developed blueprints for several AI products, analyzed the strategy of each development process, evaluated the pros and cons of applying different AI algorithms, and conducted cost estimation. One of the projects was recognized as an “Exemplary Assignment” of the MITxPRO class “Designing and Building AI Products and Services ”.

DL, ML, and Data Science Projects in Python:

Implemented Gradiant Descent, CNN, RNN, DNN,LSTM, Transformer both by using plain Numpy, and by TensorFlow, Keras packages. Optimize the algorithms by proper initialization, regularization, minibatches, etc. Applied the algorithms to various applications such as a DNN classifier, face recognition, Neural Style Transfer. Adapted a pre-trained to new data.

Modeled, formed hypotheses, and performed statistical analysis on real data.

Used dimension reduction techniques such as principal component analysis, t-distributed, stochastic neighbor embedding to visualize high-dimensional data and applied this to genomics data.

Analyzed networks and used centrality measures to describe the importance of nodes, and applied this to criminal networks.

Modeled time series using moving average, autoregressive, and other stationary models for forecasting with financial data.

Used Gaussian processes to model environmental data and made predictions.

Communicated analysis results effectively with written reports and graphs created by Python.

Implemented a classifier to use for sentiment analysis of product reviews, compared and analyzed various algorithms, conducted parameter tuning.

Implemented multiple-digit recognition solutions using Neural Networks and other technologies.

Built a mixture model for collaborative filtering.

Implemented Q-learning algorithms to learn control policies for a text-based game.

Prompt Engineering: Prompted mathematical and logical questions to train AI models in step-by-step reasoning.

Web Development: Created web applications using Python, Flask, PostgreSQL, and JavaScript.

Math Tutor: Tutored grade 7-12 students on math enhancement, AMC preparation, and Calculus classes.

Skills: Python, Flask, Pandas, Pytorch, matplotlib, scipy, h5py, sklearn, tensorflow, keras, yad2k,JavaScript, PostgreSQL, Linear Classifier, SVC, Nonlinear Classifier, Kernel Methods, Feature Engineering, PCA, Neural Network, RNN, LSTM, CNN, Gaussian Mixtures, MLE, K-Means, K-medoids, EM algorithm, Reinforcement Learning, MDP, Hypothesis Testing, Unsupervised Learning, Clustering, Graph Centrality, Time Series, Seasonality, Stationarity.

Career Break Taipei

Homemaker Feb 2010– June 2023

Take Care of Family

Teaching online Mathematics enrichment classes with Zoom

Bank of the West Los Angeles, CA

IT Application Engineer II Aug 2006– Jan 2010

Key contributor to the Cash Management workflow automation using Content Management System and Process Management System

Designed and implemented innovative Java and JSP plug-ins, as well as other hybrid programs to customize the Filenet solutions

Skills: Oracle, Java, JSP, Java Database Connectivity (JDBC), OOP, XML, JavaScript, and FileNet

Evercomm Technology Inc. Taipei

Senior Architect Mar 2004– Dec 2004

Spearheaded the development of a business process management system, incorporating key functions such as data maintenance, order processing, inventory control, and reporting.

Architected and Supervised the full development cycle: inception, design, implementation, and deployment.

Skills: MVC design pattern, Struts, Java Servlets, JSP, SQL, XML and XSLT.

Commission Junction (ValueClick) Santa Barbara, CA

Senior Software Engineer Jan 2003– Feb 2004

Developed and maintained an innovative e-marketing system, focusing on both business rules and financial area

Skills: Struts, OJB, Oracle SQL and Eclipse

Wave Three Software San Diego, CA

Senior Software Engineer Nov 2001– Sept 2002

Designed and developed a standard (SIP) based, cross-platform, real-time collaboration software with integrated IP Voice and Video Communications.

Skills: Java, C/C++, XML-RPC, Socket Programming, UML, CodeWarrior.

Idealab Pasadena, CA

Senior Software Engineer June 1999– Nov 2001

Designed and developed a series of web applications, including an online stock advice exchange platform, advertisement-integrated online games, sports data processing and analysis systems, online surveys, and email lottery systems.

Skills: Linux, Apache, Java Servlets, JDBC, Perl, Regular Expression, MySQL, Caching, MVC Design Pattern, Java Applets, JSP, ASP, Http Tunnel, Java Multithreading, Singleton, Factory Design Pattern, JDBC, Oracle, XML, DOM, JavaMail API, SMTP, NNTP, JSP-JavaBeans, FTP, MD5, and Crontab.

VIC High-Tech Corp. Cerritos, CA

Senior Software Engineer Oct 1998– June 1999

Maintained and customized a family of video conference products over the Internet, ISDN, and POTS, using H.323, H.320, and H.324 protocols.

Resolved a critical issue of the product, resulting in seamless interoperability with the products of industry leaders.

Skills: Visual C++, Delphi, Windows API, Microsoft NetMeeting SDK, DirectShow, Lucent AVP Teleconferencing API, Philips SAA 7146 Multimedia Bridge Software.

Department of Mathematics, Columbia University New York, NY

Assistant Professor Sept 1995– July 1998

Taught college Mathematics including Calculus and Differential Equations

Use Mathematica to visualize functions and mathematical objects

Enrolled in CS classes: Data Structure and Algorithms, Computer Networking

CERTIFICATES

Fundamentals of Statistics issued by MITx edX.

Topics: Parametric Statistics models, Confidence Intervals, Methods of Estimation, MLE, EM Algorithm, MSE, Hypothesis Testing, Wald’s test, Likelihood Ratio Test, Chi-Squared Goodness of fit test, Bayesian Statistics, Linear Regression, Hypothesis Test for Linear Regression, Generalized Linear models, Exponential families.

Probability - The Science of Uncertainty and Data issued by MITx edX.

Topics: Discrete Random Variables, Continuous Random Variables, conditioning and independence, Bayesian Inference, limit theorems, Bernoulli and Poisson Processes, Markov Chains.

Machine Learning with Python - From Linear Models to Deep Learning issued by MITx edX.

Topics: Reinforcement Learning, Mixture Models and EM Algorithm, Clustering, CNN, RNN, NLP, Digit Recognition, Recommender Systems, Linear and Nonlinear Classifiers, Linear Regression.

Data Analysis: Statistical Modeling and Computation in Applications issued by MITx edX.

Topics: Visualization, Classification, and Clustering of High-Dimensional Data and Genomics, Network Analysis, Graphical Models, Time Series Analysis, Environmental Data and Gaussian Processes.

Deep Learning Specialization issued by Coursera, DeepLearning.AI

5 courses taught by Dr. Andrew Ng of DeepLearning.AI,

Topics: CNN, RNN, LSTMs, Transformers, Dropouts, BatchNorm and Xavier(He) initialization, Python, TensorFlow, Speech Recognition, Music Synthesis, Chatbots, Machine Translation, NLP

Online

Dec 2024

Online

Dec 2024

Online

Sep 2024

Online

Sep 2024

Online

Jan 2024

Google IT Automation with Python Professional Certificate issued by Coursera

8 courses developed by Google

Topics: Python, Git, IT Automation, Google Cloud, CICD, Configuration Management, Docker, Kubernetes, IaC, Django

Google Data Analytics issued by Coursera, Google

8 courses developed by Google

Topics: Essence of data analytics stages, spreadsheets, SQL, Tableau, and R.

Python for Everybody issued by Coursera, University of Michigan

5-course specialization covering Python language basics, Web development, and Database access using Python.

Sun Certified Business Component Developer

Sun Certified Web Component Developer

Sun Certified Developer

Sun Certified Programmer

Online

May 2024

Online

Nov 2023

Online

Mar 2024

Mar 2005

Oct 2004

Jan 2004

Oct 2003



Contact this candidate