SAMI KHAN
Cell: 832-***-****, Email: **********@*****.***
LinkedIn: https://www.linkedin.com/in/samikhan6
Github: https://github.com/samikhan66
WORK EXPERIENCE
NthDS, Houston, TX
Data Scientist I April 2019-Present
•Built an Object Detection model that expanded the company’s footprint to the logistics industry. Tracked metrics by running benchmark tests and measuring overlap between predicted and original ROI.
•Increased $200,000 in yearly revenue by developing a supervised page level/file level classifier for oil and gas technical data. Leveraged petroleum engineering knowledge to come up with ground rules for data preparation. Tracked metrics using Pandas, ROC/AUC curves and confusion matrix.
•Improved classification accuracy from 75% to over 90%. Prototyped pre-processing and post-processing steps and built fallback classification using NLP and OCR techniques such as Tesseract, Google Vision and Image Captioning.
•Saved an average of 15 minutes of QC time per file by improving a ML based tabular digitization tool using techniques such as pixel spacing, image morphology and image contours for text detection.
•Saved data collection time from 6 months to 3 days for a logistics project by developing a synthetic image generator using a combination of GANs, image re-construction and feature engineering.
•Collaborated on a curve digitization solution, performed data augmentation and tweaked hyperparameters to improve performance. Eventually this paved way for projects with oil giants such as ExxonMobil, Shell, BHP, Occidental Petroleum and CGG.
•Developed softwares, utilizing version control such as Git and debugging efficiently using IDE like PyCharm in a team. In turn, this improved team interaction, reducing debugging time to half.
Dragon Shale, LLC, Vernal, UT
Process Engineering Intern May 2018-August 2018
•Prototyped a cost-effective and environmentally friendly process to produce oil from oil shale. This process was 20% cheaper and 50% environmentally friendlier.
•Drew mathematical and statistical inference to improve the process. Efficiency increased by 25%.
SKILLS
Python, SQL, R, Matlab, CNNs, RNNs, GANs, Tensorflow, Keras, Power BI, Tableau, Scikit-learn, Sklearn, Image processing, Data Analysis, Data Mining, Google Vision, XML, Dimensionality Reduction, Extreme Gradient Boosting, Random Forests, Support Vector Machine, K-nearest neighbors, K-means clustering, Spanish (Fluent), Urdu (Fluent), Hindi (Fluent), Teamwork, Leadership, Adaptability, Problem-Solving, Creativity, Work Ethic
EDUCATION
University of Houston, Houston, TX August 2015-May 2019
Major: Petroleum Engineering - GPA: 3.49 (Cum Laude)
Minor: Economics
Engineer in Training Certification: Mechanical Engineering
Relevant Courses: Engineering Statistics, Deep Learning Specialization by Andrew Ng (Coursera), Tensorflow 2 Bootcamp by Jose Portilla (Udemy), Convolutional Neural Networks by Fei-Fei Li (Stanford), Computer Vision by Aaron Bobick (Udacity)
LEADERSHIP EXPERIENCE
Vice President, Triangle Fraternity, University of Houston May 2017-May 2018
Recruited 10 new members by increasing accountability of different chair positions and organizing more networking and leadership events. Consequently, this led to a 100% retention rate.
Volunteer Teacher, AIESEC, Recife, Brazil May 2016-August 2016
•Spread environmental awareness in the favellas of the Recife community among 200+ 8-13 year old students guided by reducing, recycling, reusing. Littering decreased by 50% in the NGO within three months.