Post Job Free
Sign in

Dat Analyst

Location:
The Bronx, NY
Posted:
March 29, 2020

Contact this candidate

Resume:

Meizhu Wang

adcist@r.postjobfree.com 585-***-**** *25 Schermerhorn St, 11201 New York City, New York EDUCATION

United Nations Executive Office of Secretary General Strategy Analytics Intern Feb 2020 — Present New York City, USA

• Supported analytics on UN system-wide/cross-cutting needs, priorities and risks; developed and tested data models/products with statistics and ML to make creative and data informed strategies; automated powerBI reporting with PowerQuery, Excel; improved ETL pipelines for users across UN agencies, donors, UN units globally to increase accuracy, clarity and timeliness with Python Know Center Data Scientist Intern Sep 2019 — Jan 2020 Graz, Austria

• Worked on research project HiDALGO (https://hidalgo-project.eu/) 13 million euros total funding; used Markov aggregation, agent- based model and ML in Hadoop, Spark to simulate twitter message spreading to detect fraud; supported partner in London to predict refugee fleeting by designing/optimizing road network algorithm extracted from geospatial data in C++ (academic paper pending) University of Rochester Database Analyst Intern May 2019 — Aug 2019 Rochester, USA

• Developed a system for board members to learn patterns of university internal and external contacts by Tableau and SalesForce reports to find business opportunities; coordinated between multiple departments, improved system integration efficiency by 90% with ML in Python and SQL; augmented internal data by external sources by API, web scraping and SPARQL etc Data Science Consortium Data Analyst Intern Jan 2019 – May 2019 Rochester, USA

• Finished full life cycle data project with minimal guidance from senior researcher; developed ML system to quantitatively/ dynamically determine strengths/weakness and causes for top tier universities to support strategy making; utilized Python (Gensim, NLTK) for topic modeling with LDA algorithm to better understand research topics that the universities are recognized for; time series prediction using deep learning in Tensorflow; consolidated feedback from client and crafted operational notes for future work EO Consulting Business Consultant Intern Mar 2018 – Jun 2018 Beijing, China

• Identified business needs with high-tech companies’ stakeholders and analyzed survey and historical data to to make strategic decisions; identified correlation of the types of businesses and areas of the city, and conducted survey to collect consumer level data to demonstrate market demand and potentials for certain types of business China Electronics Technology Group Corporation Data Scientist Intern Oct 2017 – Feb 2018 Beijing, China

• Translated client's business needs to IT product and conducted statistical analysis and signal processing to build machine learning models with ECG/ brainwave signals to classify mental depression; helped to design devices to collect physical data from patients Industrial and Commercial Bank of China Financial Analyst Intern Jul 2017 – Sep 2017 Beijing, China

• Studied global main competitors' strategic decisions and predicted influences for future stock prices; Monitored capital market indicators and trends to provide weekly report for stakeholders; reviewed and reorganized large volume of documents and drafted presentation notes for senior manager, provided assistance to senior manager in stakeholders/partner investment banks meetings SKILLS

• Software: SQL, noSQL, Hadoop, AWS, Git, Docker, Excel (pivot tables, lookups, VBA), Microsoft Offices

• Programming: Python(Beautiful Soup, Scikit-learn, NLTK, NumPy, Pandas, Scipy, Tensorflow, PyTorch, PySpark), C++, Matlab, Bash, R(dplyer, data.table, glmnet), Linux (Ubantu), HTML, PHP

• Statistics: Statistical Inference/Test, Native Bayes, Bayes Net, Law of Large Number, ANOVA, MRF, Gibbs sampling, A/B testing

• Machine Learning: Logistics Regression, SVR, Latent variables, kernel regression, Ensemble(Random Forest, Boosting, AdaBoost), Clustering(K-Means, EM, Hierarchical Clustering), Neural Networks (Backprop, Chain Rule of Derivative), Deep learning(CNN, DNN, RNN), NLP

• Data Visualization: Tableau, PowerBI, Gephi, box-plots, quartiles, scatter plots, heat maps, EDA, ROC PROJECTS

Music Player Log Files Analysis — Data Mining

• Text features extraction by TF-IDF and classification to evaluate business performance; built churn prediction model based on user behavior in Spark and recommendation system by collaborative filtering and metric factorization Insurance Cashflow — Random Process

• Modeled an insurance company’s business operation activities with continuous time Markov chain; simulated 400 months to predict peaks and valleys of cashflow within the company’s product life cycle Enterprise Capital Structure Study — Network Science

• Manipulated data and built network of top 500 Chinese mid-size listed company chain directors

• Analyzed the casual influences between a company’s position in the network and its capital structure Adversary Tic-Tac-Toe AI Program — Artificial Intelligence

• Coded basic and ultimate 9 by 9 Tic-Tac-Toe human computer interaction game in Python

• Implemented AI algorithms Alpha-Beta pruning and heuristic functions to speed up by 300% Complex Networks deployment for Power Grid Analysis — Network Science

• Visualized US power grid stations graph with geography layout in Gephi; simulated blackout effects assuming various graph topology, station loads etc based on percolation theory to give stable power supply suggestions Uncertain Inference with Bayesian Networks — Artificial Intelligence

• Applied XML parser to extract and represent Bayesian network in Python

• Implemented Monte Carlo Markov Chain to simulate joint distribution probabilities and achieved 99.5% accuracy



Contact this candidate