Data Analyst

Location:
Houston, TX
Posted:
July 23, 2018

Resume:

SAIKIRAN AMBATI

ADDRESS: **** ***** **** ******, ********* 4102, Houston, Texas 77054

MOBILE: 346-***-**** EMAIL: **********@*****.*** LINKEDIN: https://www.linkedin.com/in/ambati-saikiran/ GITHUB: https://github.com/Ambati1304

DATA ANALYST

QUALIFICATIONS PROFILE

Over 3 years of working experience as a Data Analyst across various industries, and 4 years of overall exposure to technologies related to Data Science. Extensive knowledge of languages such as Python, SQL, and R; databases such as SQL Server, Teradata, MySQL, and MongoDB; and tools such as Excel, SAS, and Tableau. Capable of implementing production-level models on Google Cloud Platform (skills: BigQuery, ML Engine, APIs). Experience in developing ML algorithms such as Classification, KNN, Regression, Random Forest, Clustering (K-means), Neural Nets, SVM, Bayesian Algorithm, Social Media Analytics, Sentiment Analysis, Market Basket Analysis, Bagging, and Boosting in Linux/Unix environments. Published 3 research papers in the field of robotics and autonomous navigation. Equipped with excellent communication, interpersonal, and collaborative competencies in building positive work relationships with diverse individuals.

PROFESSIONAL EXPERIENCE

University of Houston, Houston, TX, USA

Graduate Research Assistant Oct 2016–May 2018

• Designed and optimized an automated system for retrieving emotional content from natural images. A deep learning network in TensorFlow extracts high-level semantic features, which are combined with semantic importance obtained from training on semantic categories, using random forest regression to output numerical values of arousal and valence, the scales used to represent emotion.

• Developed a solution to improve data quality and performance using advanced data mining and statistical modeling techniques, including model selection with AIC and BIC, to support studies and visualizations.

• Participated actively in all phases of data mining, data collection, data cleaning, data wrangling, validation, and visualization.

• Completed gap analysis and statistical analysis on manually created databases by working closely with teams.

• Extracted image data from the internet, coordinated deep learning networks, and worked with several data formats, such as JSON, XML, and HTML.

• Designed numerous machine learning algorithms using Pandas, NumPy, Seaborn, Matplotlib, scikit-learn, SciPy, and NLTK in Python.

• Contributed key visualizations for exploratory data analysis with R and Tableau, creating line, scatter, bar, dot, and pie charts, histograms, boxplots, time series, error bars, multiple axes, and subplots.

Environment: Python, Pandas, NumPy, TensorFlow, Neural Networks, R, Tableau, Matplotlib, MATLAB.

University of Houston, Houston, TX, USA

Lab Assistant Feb 2017–Apr 2018

• Examined the existing database MS SQL Server and performed data acquisition tasks.

• Merged user data from multiple data sources by writing SQL queries.

• Used Collaborative Filtering with Latent Factors model to build a recommender engine.

• Performed extensive implicit as well as explicit data collection.

• Performed Exploratory Data Analysis using R and Hadoop HDFS.

• Prototyped a machine learning algorithm for a POC (Proof of Concept).

• Performed data cleaning and handled missing data, outliers, feature scaling, and feature engineering.

• Developed performance metrics to evaluate the algorithm's performance.

• Calculated RMSE, F-score, precision, and recall, and ran A/B tests to evaluate the recommender's performance.

• Addressed overfitting by adding a regularization (lasso/ridge) term to the algorithm.

• Fine-tuned the bias-variance trade-off.

• Performed data visualization on the front end by building R Shiny dashboards.

Environment: MS Excel, Unix, SQL, RStudio, Python.

MARS THERAPEUTICS & CHEMICALS Ltd., Hyderabad, India

Data Analyst (SAS) May 2014–Jun 2016

• Extracted data from Oracle using SQL Pass through facility, Proc Access, Libname Method and generated reports.

• Performed statistical analysis, wrote SAS code for data management and reporting, and performed validation, including testing SAS code.

• Performed program documentation on all programs, files and variables for accurate historical record and for future reference.

• Extracted data from Excel and flat files into SAS datasets.

• Created summary reports and tabular reports using Proc Tabulate and Proc Report.

• Supported other team members in designing and developing programs.

• Created analysis-ready datasets for statisticians as well as sales performance reports for the client.

• Performed extensive QC (Quality Check) on tasks completed by other team members, along with data validation and data cleaning.

• Used Proc SQL extensively to run queries, join tables, and create mockups according to the study protocols.

• Developed annotated CRF pages for the study protocols and gained extensive experience in clinical data analysis and in producing tables and listings.

• Produced outputs in PDF and RTF formats using SAS/ODS.

• Modified existing SAS programs and created new programs using SAS macros.

• Developed numerous SAS programs to create summaries and listings, and generated customized reports using Proc Report.

Environment: Python, SAS, SQL Server, R, Tableau, ETL, Excel.

EDUCATION

Master of Science in Electrical Engineering: Aug 2016–May 2018

University of Houston, Houston, TX

B.Tech in ECE: Sep 2012–May 2016

JNTU, Hyderabad, India

CERTIFICATIONS

Data Analyst Nanodegree

Python for Everybody Specialization

Machine Learning Specialization

Google Cloud with TensorFlow Specialization

TECHNICAL SKILLS

Programming and Scripting Languages: SAS, Python (NumPy, SciPy, scikit-learn, NLTK, gensim, Keras), MATLAB, R (shiny, ggplot2, dplyr, tidyr), C, C++, Java, HTML, JavaScript

Databases: SQL Server, Microsoft Access, MongoDB

Big Data Tools: Apache Hadoop, Spark, Hive, HBase

Reporting Tools: Microsoft Office Applications (Word, Excel, PowerPoint, and Visio), Tableau, Crystal Reports XI, SSRS, Business Objects (5.x/6.x), Cognos (7.0/6.0), SPSS

Version Control: SVN (Apache Subversion), Git

Machine Learning Algorithms: Classification, KNN, Regression, Random Forest, Clustering (K-means), Neural Nets, SVM, Bayesian Algorithm, Social Media Analytics, Sentiment Analysis, Market Basket Analysis, Bagging, Boosting

Cloud Platforms: Google Cloud (Dataset API, Machine Learning API, BigQuery), AWS

THESIS

Ambati, S. (2018). Design of an automated system for the retrieval of emotional content in natural images. (Unpublished master’s dissertation). University of Houston, Houston, TX.
