Data Analyst Python

Greenbelt, MD
October 21, 2019

Resume: • 240-***-****

**** ********** ****, ********, ********, 21046

Wen Na

Professional Experience

Adsystech, Inc

Database Developer

May 2018 - Present

Designed rich, interactive dashboards and reports that integrate components from multiple sources, using Python Bokeh and SSRS. Supported these dashboards and reports in SQL Server by writing complex SQL queries, stored procedures, and functions, and by performing database tuning and query optimization. Designed databases using ER modeling and normalization. Developed multi-functional REST Web APIs using C#/ASP.NET/MVC/Entity Framework. Developed Windows Services in C#/.NET/SQL Server to perform data conversion.

Paradyme Management, Inc

Technical Analyst Intern

Sep 2017 - Dec 2017

Participated in the development of the Resume Repository Project, built on Python Django. Built keyword extraction and text cluster analysis using Python scikit-learn. Participated in CRM system development in Python, using agile methodology.


2016 - 2017

Master of Science, Major in Information System

University of Maryland, Robert H. Smith School of Business

2012 - 2016

Bachelor of Management in Information Management and Information System

Shanghai University of Finance and Economics

Academic Experience

Steam Recommendation Model

Spark, R, MySQL

Led data cleansing and preprocessing of data on 200K Steam users using R and MySQL. Developed a recommendation model in R based on users' historical behavior, driven primarily by the ALS algorithm.

Developed an interactive UI in R Shiny to visualize the recommendation output.

Expedia Hotel Cluster Prediction

Python pandas, seaborn, MySQL

Collaborated with a team of five to predict the hotel cluster an Expedia user would choose.

Led data cleansing and preprocessing using Python NumPy, pandas, and MySQL. Performed exploratory data analysis using Python seaborn and pandas. Developed data mining models using logistic regression, KNN, classification tree, random forest, and Gaussian Naïve Bayes, increasing model accuracy by 31.5% compared to the basic Naïve Bayes model.

Skills & Certificates

Python, C#, R, SQL, Hadoop Ecosystem.

Data modeling, data mining, data visualization.

Jupyter Notebook.

Probability/Statistics, Machine Learning.


AWS Certified Cloud Practitioner

DataCamp Data Scientist with Python track

