Sihui (Rainie) Feng
***********@*****.*** 301-***-**** linkedin.com/in/sihui-rainie-feng-900585185
PROFESSIONAL EXPERIENCE
Digital Infuzion Gaithersburg, MD
Data Analyst 03/2022-Present
●Managed a data pipeline and cleaned 1,000+ datasets with Azure and Power BI, cutting data processing time by 50% and enabling faster insights.
●Collaborate with coworkers and supervisors to analyze project proposals and budgets, providing recommendations, solutions, and guidance to cross-functional stakeholders.
●Ensure data visibility and accessibility by processing data for analysis through National Institute on Aging systems.
●Reduced the error rate from 35% to 10% by troubleshooting system and user errors, ensuring availability for collaboration and data-driven strategic decision-making.
National Institute of Mental Health Bethesda, MD
Data Scientist 08/2021-03/2022
●Managed 100+ projects by creating and maintaining data infrastructure, visualizations, and analysis for large datasets and executing on projects’ scope, timeline, documentation, budgets, and deliverables.
●Increased cost savings by 25% by supporting archived datasets ranging from 100 to 1,000+ lines of data and improving data business processes and best practices.
●Managed stored data services infrastructure to support research using SQL, Python, R, and Azure.
Census Bureau Washington, DC
Data Scientist Project Intern 01/2022-12/2022
●Built an API pipeline to collect large-scale datasets for analyzing Census Bureau website usage and used Brandwatch to identify platforms with the highest volume of user-generated content.
●Performed web scraping and sentiment analysis to better understand public perception of the Census Bureau website.
●Applied Natural Language Processing (NLP) techniques to clean, transform, and analyze text data from scraped sources.
●Developed interactive data visualizations to communicate findings and provide actionable recommendations to guide strategic decisions.
DATA SCIENCE PROJECTS
Customer Behavior Forecasting for Capital Bike Share 08/2021-12/2022
●Developed machine learning models in R, including linear and multiple regression, and performed diagnostic testing to ensure model accuracy.
●Created visualizations and conducted advanced exploratory data analysis (EDA) to identify usage patterns and key business insights.
●Delivered actionable recommendations to improve operational efficiency and customer experience.
Canterra Management Analysis Project 08/2021-12/2022
●Selected and implemented appropriate machine learning techniques and designed a deployment architecture for production use.
●Built models in R using logistic regression, decision trees, bagging, and random forests to predict employee turnover.
●Presented insights and strategic recommendations to improve employee retention, backed by data-driven evidence.
EDUCATION
GEORGETOWN UNIVERSITY, McDonough School of Business
Master of Science in Business Analytics, Cumulative GPA: 3.8/4.0 08/2021-12/2022
UNIVERSITY OF MARYLAND, COLLEGE PARK, College of Information Studies
Bachelor of Science (BS) in Information Science, Data Science 08/2018-12/2020
TECHNICAL SKILLS
●Programming Skills: SQL, Python, R, AWS, NumPy, Microsoft Excel VBA, Wolfram Mathematica
●Additional Skills: FinTech, Data Mining, Modeling, Database Management, Tableau, Time Series Forecasting, Oracle DB, A/B Testing, Analytics (Marketing, Text, Customer, Statistical), Power BI