BEI WANG
***********@*****.*** Seattle, WA beiwang bellaforjob
SUMMARY
Data professional with a software engineering background and over 5 years of experience working throughout the software development life cycle. I am skilled in data management, including data integration, modeling, optimization, and data quality. I am building a data-centered career and am currently looking for an opportunity as a data scientist to continue to grow.
SKILLS
DATA COLLECTION: JSON, CSV, API, Web Scraping, Excel
LANGUAGES AND LIBRARIES: C#, Java, JavaScript, NumPy, scikit-learn, SQL, Python, R, NoSQL, datetime
DATA VISUALIZATION: Matplotlib, Seaborn, Plotly
DATA ANALYSIS: Data Wrangling, Statistics, In-Depth Analysis
PREDICTIVE MODELING: Classification, Regression, Clustering, Prophet, Item-Item Recommendation System
MACHINE LEARNING: Hypothesis Testing, A/B Testing, Linear Regression, Logistic Regression, Classification, KNN, Random Forest, Naive Bayes, K-Means Clustering, Neural Networks, Collaborative Filtering, Decision Tree, Time Series, Deep Learning
STATISTICAL METHODS: Bootstrapping, Hypothesis Testing, Bayesian Statistics, Inferential and Descriptive Statistics
PROJECTS
Forecasting Avocado Prices July 2020 - July 2020
Tools: datetime, statsmodels, fbprophet, NumPy, matplotlib, seaborn
Performed hypothesis testing with bootstrapping.
Calculated AIC values with pmdarima to select the best ARIMA model order (p, d, q).
Performed time series prediction with Prophet in R, ARIMA, and EMA, and compared the results of the three models.
Evaluated the Prophet forecast by R-squared value and visualization.
Analyzed the data by regrouping the date-time frame into months of a year and weeks of a month.
Result: Prophet, ARIMA, and EMA produced similar predictions. EMA responded quickly to changes in any factor and performed better than ARIMA in this project. Prophet showed the most detail and the clearest average price trend, and gave the best forecast. Taken together, the three models indicate that the average price of conventional avocados will oscillate frequently while trending upward.
Recommendation of E-commerce Products Dec. 2019 - May 2020
Tools: NumPy, matplotlib, seaborn, sklearn.neighbors, scipy.linalg
Built an item-to-item recommendation engine.
Used the K-Nearest Neighbors (KNN) machine learning algorithm to find the nearest neighbors of each product, based on real shopping mall customer data, and recommended the items among those neighbors.
Visualized statistical and analysis results.
Result: The recommendation system increases purchase numbers and shortens the time customers spend viewing irrelevant products, so that they spend their time on recommended products they may like. Running the engine returns an array of product indexes identifying similar products. When a customer views one product, the remaining products in the array can be recommended to that customer.
EXPERIENCE
BICW - Buddhist Supplies Center, Data Engineer, Seattle, WA 2020 - Current
Created a user-facing order interface with Google Forms.
Created an order processing engine that handles data from the response sheet, including reading data from Excel, cleaning data, pricing orders by matching item names between the price sheet and the response sheet, automatically merging orders from the same customer, automatically sending order confirmation emails, and creating a summary Excel file.
Maintain and update the whole system.
Springboard, Data Engineer, Seattle, WA 2019 - 2020
Completed two major projects: Forecasting Avocado Prices and Recommendation of E-commerce Products.
Completed multiple small projects covering A/B testing, Naive Bayes, Clustering, Linear Regression, Logistic Regression, Spark, Frequentist Statistics, Bootstrapping, APIs, Hypothesis Testing, Bayesian Statistics, and Inferential and Descriptive Statistics.
Teamore LLC, Technical Sales Engineer, New York, NY 2016 - 2019
Designed the website and logos for an e-commerce website using CSS, HTML, and JavaScript.
Completed marketing research to establish purchasing, packaging, and pricing logistics.
Led product development and design; worked with suppliers to design products and write product descriptions.
ioMosaic Corp., Software Engineer, Salem, NH 2012 - 2016
Directed software design and development, focusing on data processing, backend systems, and UI.
Developed ioReport, a software tool that generates reports from data calculation results and is used to estimate process risk.
Administered data conversion projects to convert and augment client data into company databases, including reading data from multiple file formats, data mining, data wrangling, and data analysis.
Analyzed manual processes and designed programs to automate them, including designing optional UI buttons that connect to backend code and generate a report when clicked by the user.
Analyzed, developed, and documented technical requirements.
Utilized NoSQL technologies and oversaw source control using Git.
Collaborated cross-departmentally to test and resolve deliverable issues.
Completed UI design, programming, and testing of a spell-check system for the company's main application.
EDUCATION
Duke University, Durham, NC 2017 - 2017
Certificate
Programming Foundations with JavaScript, HTML, and CSS
Rivier University Sept. 2008 - May 2011
Bachelor of Computer Science