Post Job Free

Resume

Sign in

Data Project

Location:
Pittsburgh, PA
Posted:
August 13, 2016

Contact this candidate

Resume:

Jie Chu

Summary

Five years working experience in cross platform data integration/manipulation, expert in SQL and relational database performance tuning. Systematic knowledge in machine learning, text analytics, noSQL database, distributed system, object oriented design, dimensional/relational modeling, Unix and and software development life cycle. Education

05/2015-05/2016 Master of Information System Management Carnegie Mellon University, USA 09/2005-07/2009 Bachelor of Human Resource Management Northwest University, China Academic Projects

City of Philadelphia Crash Analysis Project (R, SQL, Tableau, Python, Javascript, HTML)

• Build an Oracle database server for data cleaning and exploration

• Analyze root causes and build predictive models to reduce fatal rate on road - decision tree, logistic regression, association rule

• Develop a web application tool with user friendly UI to automate data integration and visualization by django framework California School System Performance Analysis Project(R, Tableau)

• Data cleaning to remove anomalies and incomplete values

• Regression and correlation analysis to identify key determinant factors related to school performance

• Recommendation report to improve school performance Text mining Project(Weka/LightSDE/Python)

• Develop text representation by unigram/bigram/POS features on twitter, iMDB movie comment, Epinion car comment and amazon product review data sets

• Analyze sentiment using categorization algorithms - NaiveBayes, SVM, Logistic Regression Predict amazon market offer response(AWS)

• Prepare data to create training data source

• Create amazon binary machine learning model to generate predictions

• Review the ML Model's prediction performance and set a score threshold

• Develop real-time and batch predictions to identify potential customers for a targeted marketing campaign Database Design Project(SQL, PL/SQL)

• Design and implemented data model for a construction company's website

• Develop database schema, triggers, procedures and packages to implement business rules

• Prepare data warehouse migration guide

Web/Android Application Development(Java)

• Build a responsive web application to search and fetch pictures from Internet using flickr api

• Implement a web service for image searching record manipulation using JAX-WS (with SOAP) and REST

• Develop an Android application for product search using BestBuy api Linux project

• Installed an open source online education intuition management system -Claroline on linux Ubuntu virtual machine Work Experiences

01/2012-05/2015 Genpact US Senior Technical Associate/ ETL Developer

• Lead an offshore team of twenty people for production support

• Develop and optimized ETL jobs for data integration using Informatics reusable transformations like Joiner, Router, SQL qualifier, Rank, Union, Filter, Update Strategy etc.

• Build customized data quality tools to check the data quality of the data warehouse and source system during ETL process

• Implement Push down Optimization to leverage the power of Oracle and achieve maximum ETL throughput.

• Create specifications for ETL processes, finalized requirements and preparing specification document.

• Improve production performance by 30% through determining bottlenecks like implementing database partitioning and increasing block size, data cache size, sequence buffer length and target based commit interval and SQL overrides

• Internal training on implementing development best practices

• Analyze data issue reported by business intelligence users

• Write Oracle PL/SQL Stored Procedures/Functions/Packages/Triggers

• Host weekly status meetings for production monitor. 01/2011-01/2012 Genpact China Technical Associate/ Oracle ERP developer

• Develop customized reports, forms using Oracle Report/form builder, XML Publisher for Oracle EBS

• Build interface by Oracle SQL*Loader, PL/SQL packages to integrate data across business modules

• Data extraction from legacy system to Oracle EBS

• Maintain and build workflows customization to support business process

• Work with business analysts to translate business requirements into technical designs

• Engage Oracle Support to research and resolve issues Professional Skills

Languages: Java, Python, R, SQL, PL/SQL, Shell script, Javascript Databases: Oracle 10g/9i, SQL Server, Redshift

ETL tools: Informatica PowerCenter 9.1/8.6/8.1

Other tools: AWS, Tableau, R Studio, SQL developer, Toad, Eclipse, Pycharm, Android Studio, Oracle data modeler E-mail: acv5uf@r.postjobfree.com, acv5uf@r.postjobfree.com Mobile 412-***-****

Address: Apt 46, 3 Bayard Road, Pittsburgh, PA, US, 15213



Contact this candidate