Post Job Free
Sign in

data analyst/ scientst

Location:
Plano, TX
Posted:
August 31, 2016

Contact this candidate

Resume:

Plano, Tx ***** http://www.linkedin.com/in/elangovankarthik

682-***-**** https://github.com/elon-

acwe8x@r.postjobfree.com Karthik Elangovan Profile Portfolio

Summary

Experience in Machine learning data analysis, data wrangling, database management, project planning, and implemented ETL in several projects. Excellence in statistic and data visualization with data science Nano degree from udacity and master’s degree in Industrial Engineering.

Education

Udacity- Data Analyst Nanodegree Jun.2015- May. 2016

The University of Texas at Arlington - Masters of Science in Industrial Engineering Jan.2014- Dec. 2015

Anna University, Tamil Nadu, India –Bachelor of Technology, Mechanical Engineering May.2009- Mar. 2013

Certifications:

Excel for Data Analysis and Visualization certified from Microsoft.

Quality Engineering and Management certified from Technische Universität München.

Supply chain and logistic certified from Massachusetts Institute of Technology.

Technical Skills

Languages: Python, R,SQL,No-SQL,Javascrip,D3,Unix.

IDE: Anaconda, Spyder, IPython Notebook, R-Studio,Git (version-control).

Databases / Documentation: MongoDB, Infomatica, MS Word, MS Excel, MS Project Management,google anaytics.

Python statistics library: Pandas, Numpy, Matplotlib, Scikit-learn, ggplot.

Academic Projects

Investigated the Enron Employees using Machine Learning to find who have committed fraud based on public Enron financial and email data. Various algorithms such as Naïve Bayes, Decision tree, PCA, SVM was applied using Sklean Machin Learning package.

Optimized and validated Machine learning algorithms with a precision .60 and recall .30 and cross validated using k-fold.

Prepared open street map for the city of Chicago and Chennai using Data Wrangling tools in Python to parsing data from different file formats like json, xml, csv, etc and assess the quality of the data for validation, accuracy, consistency and uniformity.

Stored the cleaned data into MongoDB and run MongoDB No-SQL query’s and aggregate data for future analysis.

Developed regression model and investigated inference statistics with 95% confidence interval using Python to answer "do more or few people ride the NYC subway when it is raining?”

Developed prediction model using ordinary least squares using Python’s Numpy, Pandas, Statsmodels to find various factors such as weather, time of travel etc. influencing NYC subway ridership to make subway more commuter friendly and help bring down operation cost.

Build interactive graphs to tell a story on how the tourist incoming boomed in India till 2014 using D3 and JavaScript with gulp build tools using Git version-control.

Designed an A/B test, including which metrics to measure and how long the test should be run. I also analyzed the result of an A/B test that was run by Udacity, recommended a decision, and proposed a follow-up experiment.

Investigate hypothesis and generated descriptive statistics outcomes for Stroop Effect using Python to describe qualities of sample.

Analyzed inferences from Stroop experiment samples and draw conclusion based on results with 95% confidence interval.

Prepared exploratory data analysis on the quality of red wine sample using R. Generated HTML doc. Via R markdown to explore the relationship using univariate, bi-variate and multi-variate plots.

Professional Experience

Efftronics System Pvt, Ltd., Data Analyst, India Jun. 2012- Dec. 2013

Developed vendor database network to increase 20% more potential suppliers based on company strategy and cost priority in Asian market.

Prepared purchase orders; generated reports on power plant projects using SAP (Warehouse Management System model); and worked closely with purchase department to implement Process improvement methodologies such as LEAN, Six Sigma, PDCT technology.

Prepared spend data of various modes and trend analysis to support the annual reports using MS Excel.

Experienced creating and publishing reports with Business Intelligence (BI) reporting tool, Tableau.

Leadership and Involvement

Kaggle – Ranked 1072 in Home depot product searched relevance.



Contact this candidate