Ketul Patel
+1-716-***-**** ********@*******.*** www.linkedin.com/in/patel-ketul
SUMMARY
Data enthusiastic graduate student with professional experience in Data Analytics, Process Improvement, and Leadership
Leveraged data analysis, statistical analysis, machine learning, operation research, and six sigma skills by collaborating with cross-functional teams to support and optimize processes and business decisions
Knowledge in data consolidation, data exploration and visualization using Tableau, Python and SQL
EDUCATION
Master of Science, Industrial Engineering – GPA- 3.97/4.0 Aug 2017 – Dec 2018
University at Buffalo, the State University of New York Buffalo, NY
Coursework: Programming for Analytics, Data Mining, Design of Experiments, Operation Research, Six Sigma, Supply chain
Bachelor of Technology, Mechanical Engineering – GPA- 9.07/10 Jul 2011 – May 2015
Dharmsinh Desai University Gujarat, India
EXPERIENCE
Data Science Intern – Hoerbiger Corporation of America, Miami, FL, USA May 2018 – Dec 2018
Designed database, carried out data wrangling and mined for statistical and business insight by integrating real time data using Python and complex SQL joins for further analysis
Built model for predicting the leakage value using machine learning algorithms in Python to optimize product quality
Utilized findings for process improvement, reduced rework by 18% and saved around $1M annually
Developed novel methods such as Jump plot for sequence and bottleneck analysis, improving operational performance by 25%
Identified key metrics and integrated MS SQL database with Tableau to create interactive dashboards for actionable insights
Designed and automated graphics generation using python data science library for monthly leakage tracking and analysis, improving efficiency by 98%
Project Engineer – KHS GmbH, GJ, India Jun 2015 – Jul 2017
Collaborated with multinational teams for commissioning of fully automated KHS turnkey line at Coca-Cola, PepsiCo, and SABMiller worth $25M each with 99 % efficiency
Conducted design of experiments and performed statistical analysis to prepare possible remedies to the issues and challenges
ACADEMIC PROJECTS
Direct Marketing Campaign Optimization Predictive Analytics (scikit – learn) Oct 2017 – Dec 2017
Strategized projected growth in user subscription using classification approach, predicted likelihood of user subscription
Preprocessed the data, implemented feature engineering and dealt with imbalanced classes using random over sampling
Trained resampled ensemble with gradient boosting, random forest and logistics regression, achieving ROC AUC 0.83
Text Classification On Boston Traffic Complaints - NLP (Python) Feb 2018 - Apr 2018
Predicted the type of complaints to strategize better categorization of data into semantic categories for safer streets in Boston
Preprocessed the data, trained several BOW models and interpreted for modeling mistakes using confusion matrix and F1 score
Improved model using more complex text features including n grams, increasing recall by 10% and precision by 13%
Reassigned the labels using clustering such as K means, NMF and LDA, further improving F1 score by 2%
Twitter Sentimental Analysis (MySQL, Tableau) Feb 2018 - Apr 2018
Extracted NASDAQ 100’ market data during their release earnings, analyzed for trends on Twitter using MySQL
Executed complex queries, created dashboards to identify relation between market data variables and twitter feeds
Time Series Analysis and Forecasting – Quandl dataset (Python) Mar 2018 - May 2018
Extracted GDP data using quandl API and chose the best subset of predictors to achieve 95% model accuracy using multivariate regression and ARIMA analysis
Recommendation System for Purchase Data (Python) Sep 2018 - Dec 2018
Developed collaborative filtering models with cosine and pearson similarity for recommending products to customers using purchase data, evaluated using RMSE, precision and recall
SKILLS
Analytical Tools: R, Minitab, Tableau, CPLEX, Arena Simulation
Programming Languages: Python, Excel VBA, Matlab
Technical Skills: ANOVA, Hypothesis, A/B testing, Optimization, Anomaly Detection, Network flow, AWS EC2, LP, MIP, Hadoop, Neural Network, SVM, Pandas, Numpy, Matplotlib, Seaborn, PCA, Naïve Bayes, CV, SPC, NLTK, Spacy, genism