Haiyi Wang
+1-571-***-**** *******@*****.*** www.linkedin.com/in/haiyiwang
Summary
Proficient Data Engineer with extensive experience designing, building, and optimizing data infrastructure at enterprise-scale organizations such as VF Corporation, Microsoft, and Carrefour. Recognized for improving data pipeline efficiency by up to 30% and enhancing backend query performance by 20%. Skilled in data modeling, cloud technologies, ETL, and analytics, with proven ability to translate complex data into actionable insights. Committed to driving innovation and delivering scalable, high-quality solutions.
Work Experience
VF Corporation - Data Engineer Jul 2023 - Present
• Designed and maintained data infrastructure and models for loyalty programs, improving data accessibility and reliability.
• Migrated user profile data between OCP and Cheetah, resolving data discrepancies and ensuring 100% data integrity.
• Managed upstream/downstream data flows across OCP, OMS, CDP, and GDF, increasing reporting accuracy by 15%.
• Developed Power BI dashboards for actionable loyalty insights, supporting data-driven decisions for multiple VF brands.
• Collaborated with product owners and brand controllers using SAS to analyze customer behavior, boosting engagement strategies.
Microsoft - Data Scientist Oct 2022 - Apr 2023 Seattle, WA
• Extracted and processed subscription and usage data from Cosmos DB and Azure Data Lake, enhancing analytical capability.
• Applied XGBoost and Random Forest models to identify “at-risk” users, improving Xbox retention by 0.5–1.2%.
• Deployed ML models on Azure to optimize and personalize marketing campaigns, translating data insights into measurable outcomes.
University of North Texas - Graduate Teaching Assistant Feb 2022 - Aug 2022 Dallas, TX
• Administered exams, graded assignments, and facilitated study sessions, improving student performance in database fundamentals.
• Contributed to teaching material development, increasing student satisfaction and comprehension.
Carrefour - Data Engineer Jan 2016 - Dec 2017 Shanghai, China
• Defined relational tables, primary/foreign keys, and clusters in BigQuery, optimizing data retrieval and storage.
• Developed data pipelines using Python, Scala, Spark, and SQL, improving pipeline efficiency by 30%.
• Extracted and transformed data from Kafka and GCP sources, ensuring reliable data integration.
Shanghai Lujiazui International Financial Asset Exchange - Data Analyst Jul 2012 - Dec 2015 Shanghai, China
• Built ETL pipelines using Presto SQL and Airflow, ensuring timely and accurate data delivery.
• Migrated unstructured test logs to MySQL, Hive, and Scuba, improving data accessibility.
• Created Tableau dashboards for stakeholders, accelerating backend query performance by 30%.
Education
University of North Texas Master of Computer Science May 2023
• Achievements: GPA 4.0
East China Normal University, Shanghai Master of Finance
Shanghai University of Electric Power Bachelor of Computer Science and Technology
Skills
Analysis Tools: Python (Pandas, NumPy, scikit-learn, matplotlib), R, SAS, Tableau, Power BI, advanced MS Excel
Machine Learning: Decision Tree, SVM, KNN, K-Means, XGBoost, Deep Learning, Time Series
Databases: Presto SQL, MySQL, Microsoft SQL Server, Cosmos DB, Oracle, BigQuery, Redshift
Big Data Technologies: Data Lake Storage, Hadoop, HDFS, MapReduce, Sqoop, Hive, Spark, Databricks, Snowflake, dbt
Cloud: Azure, Google Cloud, AWS
Certifications
AWS Certified Data Engineer - Associate
Issued May 2025 Expires May 2028
Credential ID 76e0539ac6ee42e1a534f97fbae3c9cb
https://www.credly.com/badges/2d353536-9127-487e-883c-acfafb8cdf80/public_url