Data Analyst Developer

Location: Union City, NJ
Posted: September 10, 2020

Resume:

Mengdie (Celia) Ji

*.**@********.*** 212-***-**** Union City, NJ
LinkedIn: linkedin.com/in/mengdieji/
GitHub: github.com/celiajmd
Personal Site: mengdieji.shinyapps.io/Personal_Site/

SUMMARY

Customer-centered professional with 5 years of experience in data engineering, product analysis, and management.

EDUCATION

Columbia University, New York, NY Dec 2020
M.S. in Applied Analytics (STEM), Current GPA: 4.0/4.0

University of International Business and Economics, Beijing, China Jul 2014
B.S. in E-Commerce (Minor in Finance)

SKILLS

Languages/Techniques: Python, SQL, R, SPARQL, Django, HTML, CSS, Hive, Machine Learning, NLP, Java
Tools: Jupyter Notebook, GraphDB, OpenRefine, AWS, Hadoop, Docker, MongoDB, PostgreSQL, Tableau
Management: PMP (Certification), Agile/Kanban, Scrum Master, Jira, Git, Design Thinking

EXPERIENCE

KGC, Knowledge Graph Developer Intern, New York, U.S.A. Jun 2020 - Sep 2020
- Conducted data wrangling on source JSON files and reconciled them to Wikidata using OpenRefine and Docker
- Built a food knowledge graph in RDF with Python RDFLib and queried the data in GraphDB with SPARQL
- Retrieved recipe images and Chinese translations from Wikidata with federated queries to enrich the local dataset
- Generated markdown files for cross-referencing 1K+ ingredients and recipes using Python
- Visualized the knowledge graph using the Hugo framework and deployed the website using GitHub Pages

KE Holdings Inc. (NYSE: BEKE), Senior Product Analyst, Beijing, China Feb 2019 - Jul 2019
- Enhanced data quality by enabling real estate deduplication and combination through a data migration project
- Increased housing resources by 127% and active broker users by 50% through a web crawling strategy

JD.com (NASDAQ: JD), Product Analyst, Beijing, China Jul 2017 - Feb 2019
- Initiated a joint-marketing recommendation strategy that clustered 120K+ third-party sellers through data analysis in R and HiveQL, which increased user coverage by 150% and weekly usage by 42%
- Developed a key metrics visualization dashboard using SQL in the data mart to automate product performance reporting

SAP, Data Engineer, Beijing, China Jul 2013 - Jun 2017
- Analyzed large-scale enterprises' operational demands and translated them into system technical requirements
- Delivered an ERP data migration project for 10M+ mortgage loan records using ABAP and SLT
- Shortened data pipeline running time from 20 hours to 2 hours and applied this approach to 5 other projects
- Maintained data consistency through testing, troubleshooting, data analysis, and validation using SQL
- Delivered data services, including ETL from OLTP to OLAP, company code deletion, and data anonymization, to satisfy different business use cases with HANA, Oracle, and MySQL databases
- Analyzed and resolved SAP ETL-related CRM tickets through root cause analysis and business impact evaluation

PROJECT EXPERIENCE

Personal Blog (Django), Full Stack Developer (In progress) Jul 2020
- Design a blog using Python; implement user authentication, posting, drafting, and commenting features
- Develop the back end using Django and the front end using HTML, CSS, JavaScript, and Bootstrap

US Colleges Search App, Full Stack Developer / Project Lead Sep 2019
- Created a dashboard to visualize US college location information and US News rankings using RShiny
- Developed interactive filters to display college selections based on region, state, transportation, and conference

Rental Price Prediction of Airbnb, Data Analyst Nov 2019
- Cleaned 36K+ New York Airbnb rental records with missing value imputation and explored them with ggplot
- Selected features using Lasso and applied linear regression, decision tree, random forest, and boosting (XGBoost) models in R
- Predicted 9K+ Airbnb prices, with an RMSE ranking in the top 20 among 464 teams in a Kaggle competition