Erbo Zhang
Phone: 240-***-**** ■ ****.*****@*******.***.*** ■ LinkedIn: www.linkedin.com/in/erbozhang
EDUCATION
Robert H. Smith School of Business, University of Maryland, College Park, MD December 2016
Master of Science in Information Systems GRE: 326 (V: 159 (81%), Q: 167 (94%)) GPA: 3.28
Central University of Finance and Economics, China 2015
Bachelor of Science, Major: E-Commerce GPA: 3.14
PROFESSIONAL EXPERIENCE
IBM China, Beijing, China 07/2016-08/2016
Presale Data Solution Engineer
■ Piloted proof of concept (PoC) to implement big data solution for insurance company with $126 billion in total assets.
■ Designed big data solution using IBM BigInsights product to process, store and analyze 80TB of data.
■ Administered Hadoop cluster with three worker nodes and installed relevant IBM tools on four separate Linux physical machines for test preparation.
■ Tested queries on structured data using Big SQL, using query duration and accuracy rate as metrics to identify query performance issues.
■ Loaded semi-structured logs into HBase using IBM BigInsights with Apache Hadoop for further analysis.
■ Extracted request time and webpage URL columns from semi-structured webpage log data and transformed them into structured table using IBM BigInsights Text Analytics to analyze frequency of webpage requests.
China Ministry of Education, Beijing, China 03/2014-06/2014
Data Analyst
■ Computed descriptive statistics on research funding and corresponding performance for 800 Chinese universities using R.
■ Classified universities into 36 groups based on regions and categories to enable further data analysis.
■ Built linear regression model using key performance indicators to measure research capability of each university.
■ Ranked top 10 universities by return on investment using SE-DEA (Super-Efficiency Data Envelopment Analysis) model and Malmquist methods.
■ Used ranking algorithm to recommend universities for further targeted investigation and presented recommendations to senior officials.
Huaxia Life Insurance Co., Ltd, Beijing, China 01/2013-02/2013
Data Analyst
■ Applied time series forecasting techniques to compare daily transaction volumes across 9 cities.
■ Created weekly profit reports with Excel graphs and tables to support senior management in strategic decision-making.
RELEVANT SKILLS/PROJECTS
Big Data Analysis of Stack Overflow 2016
Big Data Analyst
■ Cleaned data using Hadoop Pig to produce pre-processed dataset for further analysis.
■ Conducted exploratory and descriptive analysis using Hadoop Pig and Hive to find distribution of tags across topics and how often certain tags appear together.
■ Used time series techniques and comparative analysis of the 14 most frequent tags to identify trends in certain technologies.
■ Loaded dataset into AWS S3 using AWS CLI.
■ Built logistic regression model using Spark on AWS EMR cluster to predict question status.
Data Mining and Predictive Analysis of Stock Market 2016
Data Analyst
■ Built logistic regression, k-nearest neighbors, naïve Bayes, decision tree and neural network models in R, using one day's stock price, volatility index, SPX index and total number of equity puts and calls as predictors of next day's stock price.
■ Calculated accuracy, precision and recall as prediction performance metrics and compared performance of the models to identify the most effective one.
■ Used time series techniques in R to predict next day's stock price based on previous stock returns.
Instructional Software for UMD – Online Project Management System 2016
Java Developer/SQL Developer/Scrum team member
■ Extracted business requirements from client and collaborated with product owner to translate them into technical requirements.
■ Designed system processes using Agile Scrum method and developed user stories to meet client specifications.
■ Developed Java front end in Eclipse and back-end MySQL database on IBM Bluemix cloud.
■ Developed front-end HTML using Bootstrap framework, CSS, JavaScript and jQuery.
■ Wrote SQL scripts to query data.
Technical: Java, C#, C, HTML, JavaScript, JSP, object-oriented programming concepts, Hadoop, Pig, Hive, Flume, Sqoop, Spark, R, SQL, AWS, Salesforce, SPSS, Tableau and advanced Microsoft Excel (Solver, decision trees, StatTools).