Data Science Machine Learning

Location:
Brooklyn, NY, 11238
Posted:
April 21, 2024

Resume:

ZICHUN XIA

Contact: +1-858-***-****, Email: ad46bi@r.postjobfree.com, Address: Brooklyn, NY, US, Authorized to work in the US

EDUCATION

NEW YORK UNIVERSITY NY, USA

M.S. in Mathematical Science (3.778 / 4.000) 2022–2024

THE UNIVERSITY OF CALIFORNIA, SAN DIEGO CA, USA

B.S. in Mathematics (3.735 / 4.0), Minor in Data Science, Honors for nine quarters. 2018–2022

● Relevant Coursework: Mathematical Statistics, Advanced Statistical Methods and Machine Learning, Methods of Applied Mathematics, Data Analysis and Inference, Risk and Portfolio Management, Data Science in Practice

WORK EXPERIENCE

FUTU HOLDINGS SHENZHEN, CHINA

Software Management Intern (2 terms) 2020, 2021

● Led a comprehensive data analysis project on hostile-website data, applying data mining and cleaning techniques and creating interactive plots to categorize hostility types and levels.

● Analyzed data and automated a recurring Tableau dashboard displaying the relational factors of hostile websites, revealing hidden factors and patterns among them.

● Redesigned the original data-collection system in Java and SQL to be more efficient and convenient, improving how user data is collected for analysis.

● Designed an emoji interface platform in Java, using Visual Studio Code, to better organize the emojis in the Futu app. Built the platform from scratch, from the design phase through frontend coding and backend rewiring, until it was functional for daily work. Greatly improved the efficiency of emoji updates for the communication design team.

JOBLOGIC-X CORPORATION REMOTE

Data Analyst Intern 05-07/2023

● Refactored and streamlined existing MySQL code, speeding up data collection and updates by 30%.

● Designed and built an ETL pipeline in Python to extract and transform data from a SQL Server data system.

● Built a time series model (ARIMA) to analyze data and reported the results in tables and graphs using Tableau. Predicted future inventory trends, made investment recommendations, and provided insights to support data-driven decision making.

ACADEMIC PROJECTS

Arabic Number Recognition with Bernoulli & K-means Clustering Models:

● Developed an artificial intelligence (AI) system capable of differentiating handwritten numbers.

● Cleaned a handwritten-number dataset in Python and performed feature engineering on the pre-processed data.

● Constructed multiple classification systems using the Single Bernoulli model, K-means clustering model, and Mixture Bernoulli model, and compared their performance and accuracy.

Racial Differences and Results of Complaints on NYPD:

● Conducted hypothesis testing, on Google Cloud Platform, on whether the NYPD responded to complaints differently based on the race of the complainant.

● Cleaned, constructed, and analyzed a data frame in Python, and performed feature engineering on it.

N-gram Language Models:

● Developed a language model that calculates the probability of a given sentence occurring in a language using Python.

● Built naive baseline models (Uniform and Unigram Language Models) and analyzed their limitations.

● Built an N-gram language model that tokenizes a phrase and employs Bayes' theorem to calculate the probability of the tokens occurring in that order.

SKILLS

● Data Science Tools: Python, Java, SQL, Tableau, Power BI, Git, PyTorch, TensorFlow, VS Code, Airflow, AWS, GCP

● DS/Math/Language Skills: Chinese, English, Machine Learning, Time series, Statistical Modeling, ETL, NLP


