
Python SQL RStudio MATLAB MapReduce Tableau Hadoop Spark Pandas Oracle

Location:
Austin, TX
Posted:
November 08, 2020


Resume:

SHUHENG (MARKUS) MA

*******@******.***

linkedin.com/in/shuhenglonghorn/ • Austin, TX 78705 • 903-***-****

EDUCATION

The University of Texas at Austin

Master of Science, Business Analytics May 2021

Overall GPA: 3.67

Coursework includes: Advanced Predictive Modeling, Data Analytics Programming, Text Analysis, Decision Analysis, Data Management, Supply Chain Analytics, Marketing Analytics, A/B Testing.

The University of Texas at Austin

Bachelor of Science, Mechanical Engineering May 2020

Bachelor of Arts, Economics

Overall GPA: 3.74

EXPERIENCE

Quake Capital Partners – Data Analyst; Austin, Texas May 2019 – August 2019

• Integrated Google Calendar, Docs, Drive, and Gmail with Google Sheets via Google APIs

• Built a local database in MySQL Workbench based on an ER model and connected it to the front end and AWS

• Implemented a database management system to populate, manipulate, and control existing financial data

• Designed A/B tests to optimize the front-end interface

PricewaterhouseCoopers – Management Consulting, Data Analyst; Shanghai, China June 2018 – August 2018

• Scraped top consulting firms’ recruiting information with Python and Selenium

• Tokenized and stemmed the scraped dataset and formatted it as a CSV file

• Used TF-IDF scores, cosine similarity, sentiment analysis, and topic modeling to analyze target firms’ recruiting plans

• Visualized term frequencies, scores, and final topics to support recruiting recommendations

RESEARCH

Oden Institute: Bayesian logic networks for COVID – Researcher; Austin, Texas May 2020 – August 2020

• Collected COVID-19 datasets from sources including Johns Hopkins, the World Health Organization, and the New York Times

• Applied pyAgrum to develop dynamic Bayesian logic networks using temperature, humidity, positive rate, policy, and other factors

• Used entropy for cross-validation and hyperparameter tuning to achieve the best performance

• Plotted how external factors affect the performance of COVID indicators

Schlumberger: Development of Polymer Interaction Prediction Model – Researcher; Austin, Texas Jan 2020 – May 2020

• Collected data through experiments measuring the effects of fluids on two polymers under extreme environmental conditions, including salinity, temperature, and aging time

• Split the data and fit models including random forest, XGBoost, and multilayer perceptron neural networks in RStudio

• Tuned hyperparameters and cross-validated the trained models

• Plotted the lifespan of the target polymers with ggplot2

STUDENT ORGANIZATION

Chinese Students and Scholars Association (CSSA) – Vice President May 2017 – August 2018

• Managed student data in Zoho Creator and coordinated with organizations to assign volunteers to drive 200+ students from the airport to their destination addresses

• Organized volunteers and organizations into committees to run events including Homecoming (300+ guests), Mid-Autumn Festival (400+ guests, with representatives of the Chinese Ambassador), and New Year Celebration (300 guests)

TECHNICAL SKILLS

• Computer Skills: Python (Pandas), R, MATLAB, SQL

• Visualization: Matplotlib, Seaborn, ggplot2

• Computer Software: Tableau, MapReduce, Hadoop, Spark, PyTorch, Jupyter Notebook, RStudio, Stata, Oracle, AWS, MySQL

• Certificate: IBM Data Science Professional Certificate

HONORS

• Presidential Scholarship, University of Texas at Austin August 2016 – Present

• University Honors, University of Texas at Austin May 2017 – Present

• Work Eligibility: Extended eligibility to work in the U.S. due to STEM certification; will require visa sponsorship for long-term employment


