Post Job Free

Resume

Sign in

Data Engineer

Location:
Buffalo, NY
Posted:
January 26, 2019

Contact this candidate

Resume:

Ketul Patel

+1-716-***-**** ac8ayj@r.postjobfree.com www.linkedin.com/in/patel-ketul

SUMMARY

Data enthusiastic graduate student with professional experience in Data Analytics, Process Improvement, and Leadership

Leveraged data analysis, statistical analysis, machine learning, operation research, and six sigma skills by collaborating with cross-functional teams to support and optimize processes and business decisions

Knowledge in data consolidation, data exploration and visualization using Tableau, Python and SQL

EDUCATION

Master of Science, Industrial Engineering – GPA- 3.97/4.0 Aug 2017 – Dec 2018

University at Buffalo, the State University of New York Buffalo, NY

Coursework: Programming for Analytics, Data Mining, Design of Experiments, Operation Research, Six Sigma, Supply chain

Bachelor of Technology, Mechanical Engineering – GPA- 9.07/10 Jul 2011 – May 2015

Dharmsinh Desai University Gujarat, India

EXPERIENCE

Data Science Intern – Hoerbiger Corporation of America, Miami, FL, USA May 2018 – Dec 2018

Designed database, carried out data wrangling and mined for statistical and business insight by integrating real time data using Python and complex SQL joins for further analysis

Built model for predicting the leakage value using machine learning algorithms in Python to optimize product quality

Utilized findings for process improvement, reduced rework by 18% and saved around $1M annually

Developed novel methods such as Jump plot for sequence and bottleneck analysis, improving operational performance by 25%

Identified key metrics and integrated MS SQL database with Tableau to create interactive dashboards for actionable insights

Designed and automated graphics generation using python data science library for monthly leakage tracking and analysis, improving efficiency by 98%

Project Engineer – KHS GmbH, GJ, India Jun 2015 – Jul 2017

Collaborated with multinational teams for commissioning of fully automated KHS turnkey line at Coca-Cola, PepsiCo, and SABMiller worth $25M each with 99 % efficiency

Conducted design of experiments and performed statistical analysis to prepare possible remedies to the issues and challenges

ACADEMIC PROJECTS

Direct Marketing Campaign Optimization Predictive Analytics (scikit – learn) Oct 2017 – Dec 2017

Strategized projected growth in user subscription using classification approach, predicted likelihood of user subscription

Preprocessed the data, implemented feature engineering and dealt with imbalanced classes using random over sampling

Trained resampled ensemble with gradient boosting, random forest and logistics regression, achieving ROC AUC 0.83

Text Classification On Boston Traffic Complaints - NLP (Python) Feb 2018 - Apr 2018

Predicted the type of complaints to strategize better categorization of data into semantic categories for safer streets in Boston

Preprocessed the data, trained several BOW models and interpreted for modeling mistakes using confusion matrix and F1 score

Improved model using more complex text features including n grams, increasing recall by 10% and precision by 13%

Reassigned the labels using clustering such as K means, NMF and LDA, further improving F1 score by 2%

Twitter Sentimental Analysis (MySQL, Tableau) Feb 2018 - Apr 2018

Extracted NASDAQ 100’ market data during their release earnings, analyzed for trends on Twitter using MySQL

Executed complex queries, created dashboards to identify relation between market data variables and twitter feeds

Time Series Analysis and Forecasting – Quandl dataset (Python) Mar 2018 - May 2018

Extracted GDP data using quandl API and chose the best subset of predictors to achieve 95% model accuracy using multivariate regression and ARIMA analysis

Recommendation System for Purchase Data (Python) Sep 2018 - Dec 2018

Developed collaborative filtering models with cosine and pearson similarity for recommending products to customers using purchase data, evaluated using RMSE, precision and recall

SKILLS

Analytical Tools: R, Minitab, Tableau, CPLEX, Arena Simulation

Programming Languages: Python, Excel VBA, Matlab

Technical Skills: ANOVA, Hypothesis, A/B testing, Optimization, Anomaly Detection, Network flow, AWS EC2, LP, MIP, Hadoop, Neural Network, SVM, Pandas, Numpy, Matplotlib, Seaborn, PCA, Naïve Bayes, CV, SPC, NLTK, Spacy, genism



Contact this candidate