Data Analyst Customer

Houston, Texas, United States
January 25, 2018



Houston, TX ● +1-832-***-****


Data Analyst with 2+ years of experience working on statistical & predictive modeling, exploratory data analysis, database management, ETL, and dashboards.

Strong practical knowledge of data visualization, data mining, data wrangling, and storytelling.

Proficient in Python and its data science libraries, SQL, Tableau, Excel, and Spark.


Languages: Python, SQL, Java, VBA, HTML, CSS, SAS

Tools: Tableau, Spark, Excel, Access, MS SQL Server, Oracle SQL Developer, Hadoop, MongoDB, MATLAB


Data Analyst - Graduate Instructional Assistant, Bauer College of Business Administration Aug 2016 – Jun 2017

Implemented complex ETL procedures on MS SQL Server (SSIS) to create and update student information databases, including admission records, grade records, and surveys, spanning more than 12 years and multiple disciplines.

Performed exploratory data analysis in Python to give the dean’s office insights into the academic impact of sports scholars.

Provided enhanced visibility of admission trends by generating Excel reports utilizing VLOOKUP, pivot tables, and charts.

Evaluated and improved existing registration procedures through automated solutions in Excel using Macros (VBA).

Machine Learning Engineer, Nucleus Health, San Diego, California May 2016 – Aug 2016

Developed a predictive model using linear regression to forecast and compare the performance of the company’s image-sharing web application against its equivalent mobile application, establishing that the web application produces 54% more revenue.

Collaborated with the marketing team to build a predictive churn model and roll out retention strategies such as discount offers and email campaigns, improving customer retention rate by 12%.

Worked with the sales management team to implement a K-means clustering algorithm that reassigned salesmen to client hospitals more effectively, reducing the number of salesmen from 15 to 11.

Built Tableau dashboards and stories by consolidating existing sales and marketing performance reports, providing real-time statistics to senior management and eliminating a turnaround time of 2 days per month.

Data Analyst, Flextronics, India Dec 2014 – Jul 2015

Implemented a customer profiling model using a clustering algorithm to segment about 20,000 customers.

Forecasted demand trends using exponential smoothing and liaised with the supply chain team to minimize stock-outs.

Constructed complex queries in Oracle SQL Developer to extract, load, and maintain employee information in large databases.

Performed routine and ad-hoc reporting to evaluate and track performance metrics, including quality and cost savings.


Sentiment Analysis of Amazon Review Data

Developed an NLP system that applies sentiment analysis to Amazon’s customer review data to provide a feature-based rating, in addition to the existing overall customer rating, for any product with a sizeable number of customer reviews.

Processed large customer review datasets, performing data transformation and feature parsing with Apache Spark and sentiment analysis with OpenNLP.

Movie Recommendation System for MovieLens

Developed two movie recommendation systems: a model-based collaborative filtering system that uses SVD, and a memory-based collaborative filtering system that uses both user-item and item-item filtering.

Evaluated the accuracy of predictions using RMSE and found model-based CF to be 17% more accurate.

Demand Forecasting and Staff Optimization of Concession Stands at Aramark

Forecasted aggregate and product-wise demand in 130+ concession points of sale to address frequent stock-outs.

Optimized the use of cashiers and runners for 23 stands using Excel Solver to manage peak demand.

Segmented products and stands by velocity and provided recommendations.


Master of Science in Electrical & Computer Engineering, University of Houston – Main Campus May 2017

Bachelor of Technology in Electronics & Instrumentation Engineering, Amrita School of Engineering, India May 2015


Probability & Statistics, Database Management Systems, Machine Learning, Business Intelligence, Big Data, Python for Data Science


Participated in the Hurricane Harvey relief effort at NRG Center, supporting 10,000 refugees by managing and distributing donations.

Organized a state-wide two-day robotics competition with participants from over 15 universities.
