Chen Yang
Jersey City, NJ, ***** 646-***-**** ******@********.*** linkedin.com/in/cycy
Education
Columbia University, New York, NY Expected: December 2020
Master of Science in Applied Analytics GPA: 3.89/4.30
Core Courses: Machine Learning, Data Science (R, Python), Applied Analytics, SQL, Anomaly Detection
Southwestern University of Finance and Economics (SWUFE), Sichuan, China July 2019
Bachelor of Business Administration in Finance GPA: 3.60/4.00
Baruch College, New York, NY August 2018
Bachelor of Business Administration in Finance GPA: 3.88/4.00
Professional Experience
Allied Millennial Partners LLC New York, NY
Data Analyst Intern August 2020 – October 2020
• Used R and Excel to process, clean, transform, and normalize over 100 million data records
• Performed exploratory data analysis on market index and stock price data for three companies: Pfizer, Nvidia, and Moderna
• Constructed autoregressive models of 2010-2020 daily prices for the three companies, tested the data for day-of-the-week seasonal effects, and produced a quantitative analysis report (adjusted R^2 of 0.65, p-value of 0.72)
• Analyzed the business strategies behind the data and delivered data-driven solutions to clients' needs, which increased client revenue by 5%
CICC Beijing, China
Quantitative Analyst Intern July 2020 – August 2020
• Summarized and analyzed 2010-2019 SW primary markets data from performance forecasts, performance reports, official financial reports, and consensus expectations
• Carried out multi-space backtesting in Python to evaluate the performance of single industry indicators
• Weighted the best-performing indicators to construct a composite prosperity index with stronger industry selection capability
• Researched the returns and win ratios of 28 SW primary markets and constructed a sector rotation strategy with 12.3% excess returns
Project Experience
Columbia University Capstone Project – IBM NLP Research Analysis Fall 2020
• Consulted with the sponsor, IBM, to understand its objectives and requirements for the project
• Conducted research and validated assumptions and interim results with the sponsor as appropriate
• Built a scoring model pipeline using the Apriori algorithm and a decision tree to identify and quantify relationships between business entities in the news source provided by IBM
• Provided business insights and developed business strategies for brand awareness, competitive analysis, and market research
Anomaly Detection Project Summer 2020
• Cleaned data (outliers, null values) in Python and R, addressed class imbalance, and performed feature engineering
• Applied and compared machine learning algorithms to detect fraudulent behavior in credit card transactions, mortgages, financial markets, and healthcare: Principal Component Analysis (PCA), K-Nearest Neighbors (KNN), Autoencoder, Random Forest, Gradient Boosting (GBM), XGBoost, and GLM
• Generated benchmarks, described outlier clusters, and explained the business insights behind the outliers, achieving a best classification accuracy of up to 73%
Expedia Hotel Booking Revenue Decrease Consulting Analysis Spring 2018
• Communicated with business stakeholders to understand their needs in investigating the decrease in hotel booking revenue
• Designed KPIs and metrics (revenue by country, market, and customer type; revenue growth forecast; average booking price) and created a dashboard in Tableau
• Presented the business insights behind the data and provided recommendations that increased the booking rate by 13%
Skills
• Technical Skills: SQL/NoSQL, Tableau, Python (pandas, NumPy, Matplotlib, scikit-learn), R, SAS, Microsoft Excel (Pivot Table, VLOOKUP, VBA), PowerPoint, Power BI, MongoDB, Hadoop, Hive
• Statistical Skills: Logistic Regression, Cross Validation, K-Nearest Neighbors, Decision Tree, Random Forest, Neural Network, K-means Clustering, Dimensionality Reduction, Natural Language Processing
• Analysis Techniques: Feature Engineering, EDA, Experimental Design, Hypothesis Testing, A/B Testing