Lucy Chen
**.*****@**********.****.*** +1-917-***-**** linkedin.com/in/lucychenbrooklyn Brooklyn, NY EDUCATION
Baruch College, Zicklin School of Business (GPA:3.6/4.0) New York, NY Master of Data Science August 2023 – December 2024 Core courses: Principles of Database Management Systems(A), Programming for Analytics(A), Data Visualization(A) Nanjing Normal University (GPA:3.7/4.0) Nanjing, China Bachelor of Economics, Minor in Computer Science August 2017 – June 2021 Core courses: Microeconomic(A), Econometric(A), Accounting(A), Corporate Finance(A), Data Structure(A), Machine Learning(A) WORK EXPERIENCE
Radical AI New York, NY
Software Engineer Intern May 2024 – August 2024
• Designed and Developed back-end enhancements for Marvel Platform by designing and implementing efficient algorithms using Python and NestJS
• Implemented RESTful APIs for a robust model management system using Python and NestJS, facilitating model creation, management, fine-tuning, and evaluation with detailed logging
• Developed an end-to-end model evaluation system with advanced metrics, leveraging AWS S3 for golden file storage to streamline validation processes
BitMart Jersey City, NJ
Data Analyst Intern, Trading Operations December 2023 – May 2024
• Extracted and tracked weekly data of crypto transaction by analyzing deposit/withdraw users and amount via different channel payment through Excel. Visualized fiat data using SQL and Python through Superset, generating report using Tableau
• Designed promotional activities with payment companies to improve User Experience and increase new users 300%
• ETL Management, Data cleaning & Integrity: Utilized SQL for crypto trading data cleaning, managed ETL processes, and maintained data integrity using Azure Blob Storage
• Designed and implemented systems to monitor and set up alerts for real-time news and announcements with potential market impact and other exchanges like Binance, Coinbase, OKX
NetEase (NASDAQ: NTES) Beijing, China
Data Scientist 2, Strategic Operation February 2023 – August 2023
• Analyzed user data using HiveSQL to identify opportunities for user growth and produced analysis reports; designed core metrics and built user portrait
• Established an automated data processing template for weekly TMK data in Excel through Alteryx to calculate the performance of sales staff, improving data efficiency by 50%
• Building efficient data pipelines using PySpark, collaborating closely with cross-functional teams to ensure timely and accurate delivery of data resulting in more accurate business decisions and improved operational efficiency JD.com (China’s Largest e-commerce, NASDAQ: JD) Beijing, China Data Engineer July 2021 – February 2023
• Develop data collection, storage, and processing models, including sourcing, loading, transformation, and extraction using AWS internal tools such as S3, Redshift, Athena, SageMaker
• Design data gathering, mapping, and processing pipeline and procedures using AWS data technologies with advanced SOL queries
• Perform data analysis and statistical modeling with Python to provide deep-dive analysis into key O2O business metrics and drive business decisions
• Build machine learning model with open source packages to detect anomalous customer usage and recommendation system for customer outreach programs, which improved recommendations accuracy by at least 20% average Data Engineer Intern May 2020 – August 2020
• Develop a data pipeline in PySpark and prepare the data for an model calculating CLV(Customer Life Value).
• Designed and implemented email campaigns and A/B testing in Mailchimp, collaborating with product teams on new features, establishing key metrics, and calculating test parameters, resulting in a significant 23.1% increase in click-through rate, which enhanced ROI by 16.3%
TECHNICAL CAPABILITIES
Programming: SQL((MySQL, PostgreSQL), Python (Numpy, Pandas, Scikit-Learn, PyTorch), R, AWS(DynamoDB, EventBridge, Kinesis, Lambda, SQS), Azure
Data Tools: Tableau, Power BI, Alteryx, Snowflakes, PySpark, Advance MS Excel Data Science Methods: A/B Testing, Data Visualization, Machine Learning, Deep Learning, Data Warehousing, Applied Statistics, Natural Language Processing