Post Job Free
Sign in

Big Data Analysis

Location:
New York City, NY
Posted:
March 31, 2025

Contact this candidate

Resume:

Anzhi (Andrew) Chen

+1-929-***-**** ******@********.*** New York, NY, 10025

EDUCATION

Columbia University New York, NY

M.A. in Statistics Sep 2024 – Dec 2025

North China University of Technology Beijing, China B.E. in Computer Science and Technology Sep 2020 – Jun 2023

• GPA: 85.09/100 (top 25%) Honor: Outstanding Graduation Thesis Award ACADEMIC ACHIEVEMENTS

Publication

• Yuan Li, Xinhao Zhao, Anzhi Chen Guoli Yang and Wei Song, “GERWkNN: GPU-accelerated Exact Random Walk-based kNN Query in Large Graphs”, Proceedings of the 2023 International Conference on Big Data Engineering (BDE 2023). Copyright of Computer Software

• A big data scheduling service management system, Anzhi Chen, Yuan Li, 2023SR1161807, 2023

• An enterprise informatization big data analysis system, Anzhi Chen, Yuan Li, 2023SR1161809, 2023 PROFESSIONAL EXPERIENCES

LinkMedicine Baltimore, MD

Generative AI for Healthcare Intern Jun 2024 – Aug 2024

• Development of DrH@GPT beta, a GPT-based AI assistant for healthcare and patient navigating.

• Implemented hospital and physician recommendations based on user-provided information and hospital ranking algorithm, and implemented interaction of users and backend datasets for MetaRUN events.

• Conducted in-depth UI/UX exploration, and implemented data visualizations based on Plotly and D3.js while collaborated with front-end developers to conduct A/B testing and improved user’s NPS by 26%. TrueHealth Medical Technology Beijing, China

Development Engineer Intern Jun 2020 – Sep 2020

• Enhanced organ nodule detection accuracy to 97.3% by fine-tuning U-Net models in TensorFlow, optimizing hyperparameters, adjusting learning rates, and refining the loss function.

• Built a DICOM image processing pipeline for data cleaning, normalization, and augmentation to enhance model generalizability. PORJECTS & RESEARCH

Columbia University Advisor: Prof. Tri Vi Dang New York, NY Analyses of the Effects of Activist Ownership Oct 2023 – Jan 2024

• Generated descriptive statistics of the cumulative abnormal return (CAR) data and corporate finance variables.

• Conducted ordinary least squares (OLS) and heterogeneous treatment effects (interaction) regression analyses with STATA to assess the impact of activist ownership on firm performance. Dr. Yang Liu’s Research Group Advisor: Dr. Yang Liu, Research Scientist of Duke University Beijing, China Optical Biopsy Data Analysis for Fast and Accurate Cancer Detection Jul 2023 – Aug 2023

• Using Python to integrate and preprocess dual-spectral data (autofluorescence and diffuse reflectance) from 151 lung tissue samples, implementing normalization, denoising, and feature extraction.

• Participated in designing the classification algorithm of spectrum data of lung adenocarcinoma samples and normal lung tissue samples using scikit-learn framework, combining PCA (principal component analysis) and other ML algorithms, achieved 98.4% accuracy in tissue-level prediction results.

North China University of Technology Advisor: Prof. Yuan Li New York, NY Enterprise Big Data Management and Analysis Systems Jan 2023 – Jun 2023

• Developed an enterprise-level big data analysis system, integrating ETL pipelines with Apache Flink to process real-time data, and built visualization dashboards with D3.js to support business intelligence and decision-making, enable trend analysis with a query response time under 500 ms.

• Developed a scalable big data scheduling service management system, utilizing distributed computing frameworks Spark to optimize task scheduling and resource allocation in high-load environments. GPU-Accelerated kNN Query in Large Graphs Nov 2022 – May 2023

• Improved the existing algorithm of finding -nearest-neighbor ( NN) of a query node in a large graph by introducing a new data structure and using GPUs to accelerate the computational speed.

• Conducted extensive experiments on real-world datasets, demonstrating 20% reduction in data operation time with our approach. EXTRACURRICULAR ACTIVITIES

NCUT Department of Recreation and Sports Beijing, China President Sep 2020 – Jul 2021

• Initiated and organized Campus Host Competition in 2020.

• Initiated and organized University Sports Festival in 2021. SKILLS

Coding Skills: Python, C, C++, Java, R, STATA, SQL, HTML, CSS, JavaScript, Latex Software and databases: Power BI, Bloomberg, WRDS, Compustat, CRSP, TAQ, WIND, Worldscope Languages: Mandarin (Native), English (Fluent)



Contact this candidate