Xiaoyan Xu
435-***-**** *************@*****.***
EDUCATION
Utah State University Jon M. Huntsman School of Business Logan, UT Master of Management Information Systems, GPA: 3.9/4.0 April 2020 Sun Yat-Sen University Guangzhou, China
Master of Professional Accounting, Transferred, GPA: 87/100 March 2018 Guangdong University of Foreign Studies Guangzhou, China Bachelor of Management, Major: Marketing Minor: Accounting June 2012 GPA: 3.57 / 4.0 (Major) 3.49 / 4.0 (Minor)
PROJECT
Twitter Stream Processor (Spark, Python, JavaScript)
• Built a data ingestion layer with external Twitter API
• Processed Twitter hashtags using a real-time processor Spark
• Visualized the results for 1% of all public Tweets using Ajax and JavaScript
• Deployed and configured on Google Cloud Platform to support external viewing Real-time Sentiment Analysis (Spark, Kafka, Python)
• Implemented an event queuing layer to ensure stable data flow and system reliability using Kafka
• Set up a monitor with Spark (60 Tweets per second) for positive/negative sentiments
• Visualized the trends with Python for real-time positive/negative sentiments Synchronize Data from MySql with Hbase (MySql, Hbase, Maxwell, Kafka, Spark)
• Parsed business layer’s binlog and sent data to Kafka using Maxwell
• Built a spark streaming layer to connect Kafka and Hbase
• Enabled dynamic resource allocation on the Cloudera platform
• Defined customized data source based on Hbase with SparkSql PROFESSIONAL EXPERIENCE
Utah State University Logan, UT
Graduate Assistant in Information System Department August 2018—December 2019
• Analyzed data using Natural Language Processing (NLP) techniques from three major credit bureau: Equifax, Experian, and Transunion
• Researched published reports and using NLP to study event impact, presented as conference presentation Industrial and Commercial Bank of China Limited Guangzhou, China Associate Manager July 2012—March 2016
• Lead corporate banking business specializing in corporate liability side including credits, due diligence and compliance
SKILLS
• Languages: Java, Python, C/C++, C#, SQL, NoSQL
• Framework/Tools: Spark, Kafka, Flask, MongoDB, Power BI, RapidMiner, Numpy, Pandas, Hadoop, Linux
• IDE: IntelliJ IDEA, Visual Studio
• Computer Science Courses: Data Structure & Algorithm, Management of Database Systems, Systems and Analytics Programming, Machine Learning, Data Science Incubator, Advanced Topics in Information Security, Advanced Web-Based Management Information Systems Development, Advanced Website Development