Jie Chu
Summary
Five years working experience in cross platform data integration/manipulation, expert in SQL and relational database performance tuning. Systematic knowledge in machine learning, text analytics, noSQL database, distributed system, object oriented design, dimensional/relational modeling, Unix and and software development life cycle. Education
05/2015-05/2016 Master of Information System Management Carnegie Mellon University, USA 09/2005-07/2009 Bachelor of Human Resource Management Northwest University, China Academic Projects
City of Philadelphia Crash Analysis Project (R, SQL, Tableau, Python, Javascript, HTML)
• Build an Oracle database server for data cleaning and exploration
• Analyze root causes and build predictive models to reduce fatal rate on road - decision tree, logistic regression, association rule
• Develop a web application tool with user friendly UI to automate data integration and visualization by django framework California School System Performance Analysis Project(R, Tableau)
• Data cleaning to remove anomalies and incomplete values
• Regression and correlation analysis to identify key determinant factors related to school performance
• Recommendation report to improve school performance Text mining Project(Weka/LightSDE/Python)
• Develop text representation by unigram/bigram/POS features on twitter, iMDB movie comment, Epinion car comment and amazon product review data sets
• Analyze sentiment using categorization algorithms - NaiveBayes, SVM, Logistic Regression Predict amazon market offer response(AWS)
• Prepare data to create training data source
• Create amazon binary machine learning model to generate predictions
• Review the ML Model's prediction performance and set a score threshold
• Develop real-time and batch predictions to identify potential customers for a targeted marketing campaign Database Design Project(SQL, PL/SQL)
• Design and implemented data model for a construction company's website
• Develop database schema, triggers, procedures and packages to implement business rules
• Prepare data warehouse migration guide
Web/Android Application Development(Java)
• Build a responsive web application to search and fetch pictures from Internet using flickr api
• Implement a web service for image searching record manipulation using JAX-WS (with SOAP) and REST
• Develop an Android application for product search using BestBuy api Linux project
• Installed an open source online education intuition management system -Claroline on linux Ubuntu virtual machine Work Experiences
01/2012-05/2015 Genpact US Senior Technical Associate/ ETL Developer
• Lead an offshore team of twenty people for production support
• Develop and optimized ETL jobs for data integration using Informatics reusable transformations like Joiner, Router, SQL qualifier, Rank, Union, Filter, Update Strategy etc.
• Build customized data quality tools to check the data quality of the data warehouse and source system during ETL process
• Implement Push down Optimization to leverage the power of Oracle and achieve maximum ETL throughput.
• Create specifications for ETL processes, finalized requirements and preparing specification document.
• Improve production performance by 30% through determining bottlenecks like implementing database partitioning and increasing block size, data cache size, sequence buffer length and target based commit interval and SQL overrides
• Internal training on implementing development best practices
• Analyze data issue reported by business intelligence users
• Write Oracle PL/SQL Stored Procedures/Functions/Packages/Triggers
• Host weekly status meetings for production monitor. 01/2011-01/2012 Genpact China Technical Associate/ Oracle ERP developer
• Develop customized reports, forms using Oracle Report/form builder, XML Publisher for Oracle EBS
• Build interface by Oracle SQL*Loader, PL/SQL packages to integrate data across business modules
• Data extraction from legacy system to Oracle EBS
• Maintain and build workflows customization to support business process
• Work with business analysts to translate business requirements into technical designs
• Engage Oracle Support to research and resolve issues Professional Skills
Languages: Java, Python, R, SQL, PL/SQL, Shell script, Javascript Databases: Oracle 10g/9i, SQL Server, Redshift
ETL tools: Informatica PowerCenter 9.1/8.6/8.1
Other tools: AWS, Tableau, R Studio, SQL developer, Toad, Eclipse, Pycharm, Android Studio, Oracle data modeler E-mail: *****@******.***.***, ******.*******@*****.*** Mobile 412-***-****
Address: Apt 46, 3 Bayard Road, Pittsburgh, PA, US, 15213