Post Job Free
Sign in

Databricks / PySpark Developer

Location:
Princeton, NJ
Posted:
November 01, 2019

Contact this candidate

Resume:

Julia (Li) Wang

609-***-**** • *********@*****.*** • : julia-li-wang-53a3615b • :llwang8

CERTIFICATION

Databricks Certified Associate Developer for Apache Spark 2.4 October 2019 DATA SCIENCE PROJECTS

Predicting House Sale Prices November 2018

Explored approaches to clean data, transform features, and use k-fold cross-validation to train the optimal Linear Regression model for predicting house sale price Titanic Passenger Survival Prediction January 2018 Used scikit-learn machine learning algorithms to predict passenger outcome based on their demographics. Improved model performance score to 0.79425 CodeForGood Challenge at J.P. Morgan Chase & Co. Jersey City, NJ - April 2018 Second Prize winner - team built an algorithm-driven web application to streamline Best Buddies’ administration process using Python, SQL, Flask and HTML PROFESSIONAL EXPERIENCE

Data Engineer Intern - AirisDATA Princeton, NJ - 3/2019 to present

● Work with team building new product of big data solutions using PySpark

● Build Pytest modules for features and integration testing Founder - CherryValley Studio Princeton, NJ - 2002 to 2015

● Built studio to provide comprehensive DVD services to businesses and families

● Managed production, accounting, taxation, marketing and website Associate Programmer - C & A Somerville, NJ - 2000 to 2001

● Contributed to the development of People Profile Management and Essistme application

● Enhanced News Engine for various clients using ASP, JS, SQL, HTML, CSS TECHNICAL SKILLS

Python, PySpark, Pytest, R, Excel, Access, SQL, MongoDB, JSON, JavaScript, HTML EDUCATION AND PROFESSIONAL DEVELOPMENT

Computer Programming Certificate - Chubb Computer Services M.B.A., Accounting - Bryant University

B.S., Biology - Nankai University



Contact this candidate