202-***-**** firstname.lastname@example.org LinkedIn
ETL SQL (relational database, Subqueries, Joins), Sqoop, Hive/Impala, Pig (basic) Data Mining Python (NumPy, Pandas, Scikit-learn, Pipeline, TensorFlow, Keras), Spark (MLib), R Cloud AWS (EC2, EMR, SageMaker, RDS)
BI Tableau, Power BI (basic)
Robert H. Smith School of Business, University of Maryland, College Park, MD Dec 2018 Master of Science in Business Analytics (Major: Business Statistics, STEM)
Sun Yat-sen University, China 2017
Master of Business Administration (Major: MBA)
Guangdong University of Foreign Studies, China 2009 Bachelor of Management in International Business Academic Innovation Scholarship
University of Maryland College Park, Maryland
Incomming Research Assistant at Center for Health Information and Decision Systems Nov 2018–Now
Prepare and integrate data from different sources, build machine learning model for healthcare project.
Graduate Teaching Assistant in “Big Data and Artificial Intelligence in Business” Course Aug 2018–Oct 2018
Revised course instructions from Master of Science to MBA focus.
Designed and taught computer labs on Hadoop, MapReduce, Spark with Cloudera Virtual Machine.
Rockpointe Corporation, Maryland Jun 2018–Aug 2018 Data Analyst
Performed ETL on physician payment data (30 GB, 53 million records) in SparkSQL .
Conducted text mining and association rule analysis in Python.
Identified outliers, seasonal pattern and cash flow pattern by data visualization in Tableau.
Findings was presented in “12th Annual Forum on Transparency & Aggregate Spend Conference” in DC.
China Custom Inspection and Quarantine Technology Center, China 2011–2016 Quantitative Analyst, Assistant Branch Manager
Accomplished quantitative analysis and data modeling for 400+ shipments recovering 8-million-dollar loss for clients. Awarded Distinguish Analyst in consecutive two years (2013,2014).
Led information system upgrade by business rule analysis improving efficiency by 40%.
Guangdong Plastics Exchange, China 2009–2011
Analyzed and turned market report, industry updates and company statistics into business stories.
Invented new business model and developed 12 new long-term business partnerships in Asian Pacific region. Awarded Annual Outstanding Business Analyst in 2010.
Built customized CNN for image classification to predict 12 different plant seed classes (97% F1).
Built text analytics model on Reddit comments to predict topics with Naive Bayes (80% accuracy).
Populated business strategies through unsupervised data mining on Yelp (6 GB) with K-mean clustering.
Won 2nd place in predictive modeling competition predicting Airbnb review rating with Random Forest.
Developed ER diagram, relational database, SQL queries on school ranking data with Microsoft SQL server.