Post Job Free
Sign in

Data Engineer Scientist

Location:
McKinney, TX
Posted:
September 10, 2025

Contact this candidate

Resume:

Haiyi Wang

+1-571-***-**** *******@*****.*** www.linkedin.com/in/haiyiwang

Summary

Proficient Data Engineer with extensive experience designing, building, and optimizing data infrastructure at enterprise-scale organizations such as VF Corporation, Microsoft, and Carrefour. Recognized for improving data pipeline efficiency by up to 30% and enhancing backend query performance by 20%. Skilled in data modeling, cloud technologies, ETL, and analytics, with proven ability to translate complex data into actionable insights. Committed to driving innovation and delivering scalable, high-quality solutions.

Work Experience

VF Corporation - Data Engineer Jul 2023 - Present

• Designed and maintained data infrastructure and models for loyalty programs, improving data accessibility and reliability.

• Migrated user profile data between OCP and Cheetah, resolving data discrepancies and ensuring 100% data integrity.

• Managed upstream/downstream data flows across OCP, OMS, CDP, and GDF, increasing reporting accuracy by 15%.

• Developed Power BI dashboards for actionable loyalty insights, supporting data-driven decisions for multiple VF brands.

• Collaborated with product owners and brand controllers using SAS to analyze customer behavior, boosting engagement strategies.

Microsoft - Data Scientist Oct 2022 - Apr 2023 Seattle, WA

• Extracted and processed subscription and usage data from Cosmos DB and Azure Data Lake, enhancing analytical capability.

• Applied XGBoost and Random Forest models to identify “at-risk” users, improving Xbox retention by 0.5–1.2%.

• Deployed ML models on Azure to optimize and personalize marketing campaigns, translating data insights into measurable outcomes.

University of North Texas - Graduate Teaching Assistant Feb 2022 - Aug 2022 Dallas, TX

• Administered exams, graded assignments, and facilitated study sessions, improving student performance in database fundamentals.

• Contributed to teaching material development, increasing student satisfaction and comprehension. Carrefour - Data Engineer Jan 2016 - Dec 2017 Shanghai, China

• Defined relational tables, primary/foreign keys, and clusters in BigQuery, optimizing data retrieval and storage.

• Developed data pipelines using Python, Scala, Spark, and SQL, improving pipeline efficiency by 30%.

• Extracted and transformed data from Kafka and GCP sources, ensuring reliable data integration Shanghai Lujiazui International Financial Asset Exchange - Data Analyst Jul 2012 - Dec 2015 Shanghai, China

• Built ETL pipelines using Presto SQL and Airflow, ensuring timely and accurate data delivery.

• Migrated unstructured test logs to MySQL, Hive, and Scuba, improving data accessibility.

• Created Tableau dashboards for stakeholders, accelerating backend query performance by 30%. Education

University of North Texas Master of Computer Science May 2023

• Achievements: GPA 4.0

East Normal of University Shanghai Master of Finance Shanghai University of Electric Power Bachelor of Computer Science and Technology, Skills

Analysis Tools: Python (Pandas, NumPy, scikit-learn, matplotlib), R, SAS, Tableau, Power BI, advanced MS Excel Machine Learning: Decision Tree, SVM, KNN, K-Means, XgBoost, Deep-Learning, Time Series Databases: Presto SQL, MySQL, Microsoft SQL Server, COSMOS, Oracle, BigQuery,Redshift. Big Data Technologies: Data Lake Storage, Hadoop, HDFS, MapReduce, Sqoop, Hive, Spark, Data Brick, Snowflake,DBT Cloud: Azure, Google Cloud, AWS

Certifications

AWS Certified Data Engineer - Associate

Issued May 2025 Expires May 2028

Credential ID 76e0539ac6ee42e1a534f97fbae3c9cb

https://www.credly.com/badges/2d353536-9127-487e-883c-acfafb8cdf80/public_url



Contact this candidate