
Business Intelligence Data Scientist

Location: Rochester, NY
Posted: May 19, 2025


Resume:

Yaxuan (Olivia) Wang

585-***-**** ******.**.****@*****.*** linkedin.com/in/yaxuan-olivia-wang github.com/olivia-xuan

Data Scientist with 2 years of experience in analytics, ML, NLP, and leadership, adept at implementing data-driven solutions and business intelligence strategies. Eager to tackle customer-centric challenges using inferential and statistical modeling.

EXPERIENCE

Data Ops Associate CB Insights Manhattan, NY Dec 2021 – Oct 2022
o Built an ETL pipeline to automate data extraction from 100+ websites using Python (Pandas) and advanced Excel, improving data accuracy and reliability by unifying formats, removing duplicates, and verifying multilingual funding data (e.g., M&As, IPOs).
o Developed comprehensive profiles for startups, patents, and venture capital firms using advanced analysis and Tableau visualizations, improving content quality and increasing client engagement by 30%.
o Streamlined workflows by managing and resolving Jira tickets in cross-functional teams, boosting workflow efficiency and reducing resolution time by 25%.

Project Management Intern Bell Mechanical Contractor Rochester, NY June 2021 – Aug 2021
o Performed financial analysis using SAGE ERP and Excel to identify cost-saving opportunities, optimize budgets, and improve resource allocation, reducing costs by 40%.
o Streamlined workflow procedures and optimized material purchasing processes, increasing team productivity by 20% through data-driven recommendations.
o Tracked and visualized company expenses involving purchase orders, change orders, and subcontracting costs, ensuring data accuracy and adherence to budgets.

Business Analyst Intern IT Services at RIT Rochester, NY May 2019 – Aug 2019
o Led a multidisciplinary team of 5 developers in an Agile environment to refine the Tiger Centre student portal, enhancing user experience for over 19,000 students by integrating user feedback and implementing feature updates.
o Analyzed user data with SQL to identify key usability issues, optimizing feature accessibility and streamlining workflows with Jira & Trello.
o Created 20+ Tableau dashboards to provide actionable insights for over 10 departments, addressing key student needs through ad-hoc and recurring visualizations.
o Facilitated stakeholder discussions to define requirements, create user stories, design solutions, and implement updates within timelines, ensuring optimal functionality and user satisfaction.

PROJECTS

Publications: Analyzing ChatGPT-Developer Conversations for Software Refactoring
o Co-authored a paper presented at the Mining Software Repositories (MSR) conference (ICSE 2024), published by ACM and IEEE.
o Conducted exploratory data analysis (EDA) and data annotation using Python and SQLite on GitHub and HackerNews datasets.
o Performed keyword analysis to uncover key insights into developer-ChatGPT interactions during software refactoring.

Natural Language Processing: Detecting Humor through Reddit Jokes
o Utilized advanced NLP methodologies (Doc2Vec, BERT, and XGBoost) to classify popular jokes with an 80% F1 score from 100k web-scraped Reddit submissions. Identified semantic factors influencing joke virality, providing insights into social media engagement patterns.

Data Analytics and Business Intelligence: Twitter Sentiment Analysis for Popular Trading Platforms
o Mined real-time Twitter data on Coinbase, Robinhood, and Binance using Twitter API, applying R for data collection and noise reduction. Employed NLP techniques with syuzhet and afinn packages for sentiment analysis, tracking investor sentiment in financial markets.
o Utilized Tableau for visualization, including word clouds, to translate sentiment analysis into strategic recommendations for trading companies.

SKILLS

Database and Programming Languages: Python (Pandas, NumPy), SQL, NoSQL, R, Java, C#, HTML, JavaScript
Machine Learning: Scikit-learn, Time Series, Supervised & Unsupervised Learning, Natural Language Processing (NLP)
Business Intelligence & Data Visualization: Tableau, Power BI, Matplotlib
Enterprise & Workflow Tools: Kubernetes, Git, Google Cloud, Docker, SAP (ERP Systems), Minitab, JMP, Jira, Confluence, Microsoft Office Suite (Advanced Excel: Pivot Tables, VLOOKUP, Macros)
Statistical Analysis: ANOVA, T-tests, Statistical Modeling, A/B Testing, Data Cleaning, ETL

EDUCATION

Rochester Institute of Technology Master of Science in Data Science GPA: 3.86/4.00 Dec 2024
Coursework: Explainable AI, Human Factors in AI, Speech Processing, NLP, Software Engineering for Data Science

Rochester Institute of Technology Bachelor of Science in Management Information Systems GPA: 3.64/4.00 Aug 2021

EXTRACURRICULAR ACTIVITIES

President Data Science and Data Engineering Club Rochester, NY Aug 2023 – May 2024
o Grew club membership by 21% by organizing weekly workshops on ML and AI and facilitating hands-on projects, hackathons, and industry talks, all supported by strategic marketing and communication, enriching members’ educational and practical learning experience.

AWARE AI NSF Research Trainee Human Computer Interaction Track Rochester, NY Aug 2023 – May 2024
o Conducted HCI research to enhance ASR technology for deaf and hard-of-hearing older adults, optimizing prototypes for improved accessibility.


