Junxian Zhang - BI Developer ***********@*****.*** / 520-***-****
PROFESSIONAL SUMMARY
• IT professional with practical experience in Business Intelligence, Business Analysis, Data Modeling, and Data Points Targeting that improve productivity and cost management
• Excellent Computer Science (CS) background, extensive knowledge & experience in Software Development Lifecycle (SDLC) and Relational Database Management Systems (RDBMS)
• Expertise in developing and programming various Web Applications, Responsive Websites and Dashboards using Programming tools, SQL and Business Intelligence tools (Power BI, D3 & Tableau)
• Expertise in using various tools to handle Enterprise level volume data in ETL, Cleansing, Pre-processing, Transformation, Normalization, Analysis and Reporting
• Strong experience with SQL Server and Microsoft Business Intelligence Suites (SSIS, SSAS & SSRS)
• Extensive experience in Extracting data from various data sources, including text files, csv files, excel files, MySQL database, Oracle SQL database, SQL Server database, PostgreSQL database and Web APIs using PHP, Java and SSIS
• Excellent skills in Transforming, Cleansing, and Processing data into a structured format for querying and analytics purposes using Java, Python, R, and SSIS
• Excellent Data Warehouse/Mart design Knowledge that optimizes Data Analysis
• Strong experience in loading large volume data into target Data Mart (SSIS)
• Excellent skills in creating SQL objects like Tables, Views, Stored Procedures, Triggers and user defined functions
• Expertise in writing SQL queries to gather Business Insights and questionable Data Points
• Extensive experience and knowledge in Data Mining & Machine Learning Algorithms/Modeling including Supervised Learning, Unsupervised Learning, Descriptive Models and Predictive Models
• Extensive experience in Data Visualization, Data Presentation, and Reporting using BI tools and self-programmed tools including Tableau, SSRS, Power BI, R, Java, and D3
• Solid organizational, multi-tasking, time management, attention to detail, and communication skills
TECHNICAL SKILLS
SDLC Methodologies: Waterfall, Agile, Scrum
IDE Tools: Eclipse, PyCharm, R Studio
Reporting Tools: Power BI, Tableau, SSRS, Excel, D3
Operating Systems: Windows, Mac OS, Ubuntu, Cent OS
Database: Database Design, SQL Server, Oracle SQL, MySQL, Dynamo DB
Data Mining Algorithms: Decision Tree, Linear Regression, K-means, Naive Bayes, ANN
Languages: Java, Python, SQL, T-SQL, HTML, JavaScript, CSS, PHP, Bash
Cloud: Amazon EC2, EMR, Azure Machine Learning, Google App Engine
EDUCATION
The University of Arizona – Eller College of Management
Master of Management Information Systems (MIS) 08/2015 – 05/2016
BSBA in Management Information System (MIS) 01/2011 – 05/2015
BSBA in Operations Management (OM)
Minor in Computer Science (CS)
EXPERIENCE
University of Arizona Main Library 03/2014 – 05/2016
The University of Arizona Libraries (UAL), in partnership with the Afghanistan Centre at Kabul University (ACKU), is collaborating on Preserving and Creating Access to Afghanistan Literature from the Jihad Period, a project to catalog, digitize, and create metadata. UAL and ACKU currently fund this project. From 2007 to 2012, the National Endowment for the Humanities (NEH) funded the initial project. About 5,500 titles (over 600,000 pages) are currently available. ACKU and UAL are processing an additional 300,000 pages.
Role: Software Developer, Database Administrator
• Administered the world's largest Afghanistan academic publications database & repository, over 10 TB in size (http://www.afghandata.org)
• Designed and implemented our Pashto-English dictionary website ahead of Google (http://www.pashtoenglish.org/about.php)
• Resolved major compatibility issues caused by a server upgrade that had kept the project on hold for more than 3 months
• Led a team of 3 members, managing and distributing their daily work
• Established work standards and optimized workflow by segmenting tasks
• Initiated and recommended new automated operating processes to increase work efficiency
• Administered and managed the repository application (DSpace) and built an index that allows the public to view and full-text search publications in the database (http://www.afghandata.org:8080/xmlui/)
• Designed, developed and coded new automated operating processes that increased efficiency by at least 500% on each task
• Wrote Java programs to automate the batch publication preparation phase, including backup, validation, error handling, note generation and file moving
• Eased the upload process for non-technical co-workers by aggregating complex Unix command lines into one executable bash script
• Improved publication ingestion system performance by eliminating unnecessary file copying, file moving and inefficient code
• Programmed a variety of tools in shell script and Perl to help co-workers solve common issues during quality assurance
• Coached and trained team to follow standards and use automated tools to solve problems more effectively
• Documented tutorials and guidelines for new hires to understand the project, workflow, processes, standards and the tools I designed and programmed
Environment: Publication Digitization, Cloud Computing, Ubuntu, MySQL, Perl, PHP, XML
PROJECTS
1. Vantage West Credit Union 01/2016 – 05/2016
Vantage West Credit Union, with more than $1.5 billion in assets, is the largest credit union in southern Arizona. It is a full-service credit union with locations near Tucson, Casas Adobes, and Phoenix, offering personal loans, accounts, mortgages, business banking, etc. Over the past few years, its membership growth has plateaued and member churn has been substantial. To address this issue, Vantage West Credit Union initiated a customer retention project.
Role: Customer Retention Consultant, Data Analyst
• Documented and edited the project charter, which clarifies the responsibilities of both parties, project overview, project scope, assumptions, constraints, initial plan, meeting time, timeline, major milestones and expected deliverables
• Proposed main ideas, directions/angles and outlines for the team to analyze
• Preprocessed and cleansed the dataset provided by Vantage West Credit Union, evaluated data sources, data mapping and data quality, and identified questionable data points
• Designed and developed database solutions based on our analytical requirements
• Transformed and integrated various raw data formats from different sources into a structured format for the team DBA to integrate into the data mart
• Created new database objects such as tables, procedures, and views for analytical efficiency and convenience
• Performed analysis on members' product paths, and recommended various cross-selling strategies based on historical statistics to achieve a high customer retention rate
• Transformed data into the desired format for D3 visualization (Weighted Tree) to present members' product paths
• Performed logistic regression analysis to identify credit card members likely to leave within the next 3 months, and provided the marketing department with a list of target members with more than 50% coverage
• Performed regression analysis and correlation analysis during the variable selection phase to best train the selected algorithms (Logistic Regression & Decision Tree)
• Normalized variables to reduce bias and variance as much as possible for the selected data mining algorithms (Logistic Regression & Decision Tree)
• Presented to Vantage West Board Members, provided strategic recommendations, and received excellent feedback
Environment: ETL, OLAP, Oracle SQL, R Studio, Java, D3, Web, Tableau
2. Airbnb 01/2016 – 05/2016
Airbnb is a well-known and trusted community marketplace for people to list, discover, and book unique accommodations in the U.S. Our project goal is to develop a web application that uses data mining algorithms to help hosts list their properties correctly.
Role: Project Lead, Data Analyst, Software Developer
• Created SSIS packages to extract data from various data sources including databases, flat files and Excel files
• Preprocessed and cleansed data from various data sources into structured data, and mapped derived, converted and aggregated data into columns
• Merge-joined data sources into one comprehensive and meaningful table using multiple transformation tools in SSIS
• Analyzed and identified external characteristics that affect property ratings and pricing
• Wrote PHP programs to retrieve nearby-place information for the listed properties in the dataset from the Google Places API
• Increased model accuracy by transforming and normalizing variables with extreme values to reduce bias and variance
• Built a Two-Stage Prediction Model by combining Clustering algorithms and Prediction algorithms
• Predicted rating and price with more than 90% accuracy on average, a 30% improvement over the baseline
Environment: SSIS, SSAS, SSRS, Google Places API, OLAP, PHP, Java, R Studio
3. University of Arizona Main Library 09/2015 – 12/2015
The Digitization Department requires Quality Assurance (QA) staff to fill out a paper form for each publication they review so that supervisors can summarize the forms and generate reports. Our project's goal is to deliver an Online Supervision System that eases the supervision process.
Role: Project Lead, System Architect, Software Developer, BI Developer
• Proposed an automated web solution to eliminate unnecessary work for both supervisors and staff
• Drafted and finalized project proposal, project charter, project scope and timeline
• Helped create and edit the BRD (Business Requirement Document), FSD (Functional Specification Document) and NFR (Non-functional Requirements)
• Designed and created system architecture for better work distribution and time management
• Translated business requirements into an ER (Entity Relationship) Diagram, and created the database architecture, triggers, procedures and data dictionary to meet the FSD and NFR
• Programmed and deployed a website that interacts with the back-end database and file system
• Simplified supervising & monitoring processes by generating automated dashboards and reports
• Increased supervision process speed, saved substantial time, and eliminated most human errors
Environment: SQL, HTML, CSS, JavaScript, PHP, Amazon EC2