Post Job Free
Sign in

Engineering Data

Location:
Houston, TX
Posted:
November 29, 2020

Contact this candidate

Resume:

SAMI KHAN

Cell: 832-***-****, Email: **********@*****.***

LinkedIn: https://www.linkedin.com/in/samikhan6

Github: https://github.com/samikhan66

WORK EXPERIENCE

NthDS, Houston, TX

Data Scientist I April 2019-Present

•Built an Object Detection model that expanded the company’s footprint to the logistics industry. Tracked metrics by running benchmark tests and measuring overlap between predicted and original ROI.

•Increased $200,000 in yearly revenue by developing a supervised page level/file level classifier for oil and gas technical data. Leveraged petroleum engineering knowledge to come up with ground rules for data preparation. Tracked metrics using Pandas, ROC/AUC curves and confusion matrix.

•Improved classification accuracy from 75% to over 90%. Prototyped pre-processing and post-processing steps and built fallback classification using NLP and OCR techniques such as Tesseract, Google Vision and Image Captioning.

•Saved an average of 15 minutes of QC time per file by improving a ML based tabular digitization tool using techniques such as pixel spacing, image morphology and image contours for text detection.

•Saved data collection time from 6 months to 3 days for a logistics project by developing a synthetic image generator using a combination of GANs, image re-construction and feature engineering.

•Collaborated on a curve digitization solution, performed data augmentation and tweaked hyperparameters to improve performance. Eventually this paved way for projects with oil giants such as ExxonMobil, Shell, BHP, Occidental Petroleum and CGG.

•Developed softwares, utilizing version control such as Git and debugging efficiently using IDE like PyCharm in a team. In turn, this improved team interaction, reducing debugging time to half.

Dragon Shale, LLC, Vernal, UT

Process Engineering Intern May 2018-August 2018

•Prototyped a cost-effective and environmentally friendly process to produce oil from oil shale. This process was 20% cheaper and 50% environmentally friendlier.

•Drew mathematical and statistical inference to improve the process. Efficiency increased by 25%.

SKILLS

Python, SQL, R, Matlab, CNNs, RNNs, GANs, Tensorflow, Keras, Power BI, Tableau, Scikit-learn, Sklearn, Image processing, Data Analysis, Data Mining, Google Vision, XML, Dimensionality Reduction, Extreme Gradient Boosting, Random Forests, Support Vector Machine, K-nearest neighbors, K-means clustering, Spanish (Fluent), Urdu (Fluent), Hindi (Fluent), Teamwork, Leadership, Adaptability, Problem-Solving, Creativity, Work Ethic

EDUCATION

University of Houston, Houston, TX August 2015-May 2019

Major: Petroleum Engineering - GPA: 3.49 (Cum Laude)

Minor: Economics

Engineer in Training Certification: Mechanical Engineering

Relevant Courses: Engineering Statistics, Deep Learning Specialization by Andrew Ng (Coursera), Tensorflow 2 Bootcamp by Jose Portilla (Udemy), Convolutional Neural Networks by Fei-Fei Li (Stanford), Computer Vision by Aaron Bobick (Udacity)

LEADERSHIP EXPERIENCE

Vice President, Triangle Fraternity, University of Houston May 2017-May 2018

Recruited 10 new members by increasing accountability of different chair positions and organizing more networking and leadership events. Consequently, this led to a 100% retention rate.

Volunteer Teacher, AIESEC, Recife, Brazil May 2016-August 2016

•Spread environmental awareness in the favellas of the Recife community among 200+ 8-13 year old students guided by reducing, recycling, reusing. Littering decreased by 50% in the NGO within three months.



Contact this candidate