SAIKIRAN AMBATI
ADDRESS: **** ***** **** ******, ********* 4102, Houston, Texas 77054
MOBILE: 346-***-**** EMAIL: **********@*****.*** LINKEDIN: https://www.linkedin.com/in/ambati-saikiran/ 1 https://github.com/Ambati1304
DATA ANALYST
QUALIFICATIONS PROFILE
3 plus years of Working experience as Data Analyst at various industries, and an overall 4 years of exposure on technologies related to Data Science. Extensive knowledge of languages like Python, SQL, R, and Databases like SQL Server, Teradata, MySQL. MongoDB and Tools like Excel, SAS, Tableau. Capable of implementing production level Models on Google Cloud Platform (Skills: BIG Query, ML Engine, API's). Experience in Developing ML algorithms such as Classification, KNN, Regression, Random Forest, Clustering(K-means), Neural Nets, SVM, Bayesian Algorithm, Social Media Analytics, Sentiment analysis, Market Base Analysis, Bagging, Boosting in Linux/Unix Environments. Published 3 research papers in field of Robotics and autonomous navigation Equipped with excellent communication, interpersonal, and collaborative competencies in building positive work relationships with diverse individuals.
PROFESSIONAL EXPERIENCE
University of Houston, Houston, TX, USA
Graduate Research Assistant Oct 2016–May 2018
• Designed and optimized an automated system for emotional content retrieval in natural images using deep learning network in TensorFlow to extract high-level semantic features combined with semantic importance obtained from training on semantic categories using random forest regression to give out numerical values of arousal and valence which are scales to represent emotion.
• Development of a solution for the improvement of quality and performance of data by utilizing advanced data mining and statistical modeling techniques such as AIC and BIC in facilitating studies and visualizations
• Active participation in all phases of data mining, data cleaning, data collection, data wrangling, validation, and visualization
• Completion of gap analysis and statistical analysis on manually created databases by working closely with teams
• Demonstration of skills in extracting image data from the internet; coordinating deep learning networks; and working on several data formats, such as JSON, XML, and HTML
• Systematic design of numerous machine learning algorithms using Pandas, NumPy, Seaborn, Matplotlib, scikit- learn, SciPy, and NLTK in Python
• A key contribution in various visualizations for exploratory data analysis with R and Tableau to create the line, scatter, bar, dot and pie charts, histograms, boxplots, time series, error bars, multiple axes, and subplots Environment: Python, Pandas, NumPy, TensorFlow, Neural Networks, R, Tableau, Matplotlib, MATLAB. University of Houston, Houston, TX, USA
Lab Assistant Feb 2017– Apr 2018
• Examined the existing database MS SQL Server and performed data acquisition tasks.
• Merged user data from multiple data sources by writing SQL queries.
• Used Collaborative Filtering with Latent Factors model to build a recommender engine.
• Performed extensive implicit as well as explicit data collection.
• Performed Exploratory Data Analysis using R and Hadoop HDFS.
• Prototype machine learning algorithm for POC (Proof of Concept).
• Performed Data Cleaning, handled missing data, outliers, features scaling and features engineering.
• Developed Performance metrics to evaluate Algorithm's performance.
• Calculated RMSE score, F-score, PRECISION, RECALL, and A/B testing to evaluate recommender's performance.
• Addressed the over-fitting by adding regularization (lasso/ridge) term in the algorithm.
• Fine-tuned low bias and high variance trade-off.
• Performed data visualization on the front end by building R shiny dashboards. Environment: MS- Excel, Unix, SQL, R Studio, Python. SAIKIRAN AMBATI
ADDRESS: 2111 Holly Hall Street, Apartment 4102, Houston, Texas 77054 MOBILE: 346-***-**** EMAIL: **********@*****.*** LINKEDIN: https://www.linkedin.com/in/ambati-saikiran/ 2 https://github.com/Ambati1304
MARS THERAPEUTICS & CHEMICALS Ltd., Hyderabad, India Data Analyst (SAS) May 2014–Jun 2016
• Extracted data from Oracle using SQL Pass through facility, Proc Access, Libname Method and generated reports.
• Performed statistical analysis, wrote SAS code for data management and reporting, and performed validation, including testing SAS code.
• Performed program documentation on all programs, files and variables for accurate historical record and for future reference.
• Extract data from Excel and Flat files into SAS datasets.
• Created summary reports and tabular reports using Proc Tabulate and Proc Report.
• Supporting other Team members in designing and developing programs
• Created analysis ready datasets for statistician as well as sales performance reports for client.
• Performed QC (Quality Check) extensively on tasks performed by other team members and Performed Data Validation and Data Cleaning.
• Extensive use of Proc SQL to perform queries, join tables and created Mockups for the according to the Study Protocols.
• Develop Annotated CRF Pages for the study protocols and Extensive Experience in Clinical Data Analysis, Producing Tables, and Listings.
• Experience in producing outputs in PDF and RTF formats using SAS/ODS.
• Modification of existing SAS programs and creation of new programs using SAS Macros
• Developed numerous SAS programs to create summaries and listings and Generated customized reports using PROC REPORT.
Environment: Python, SAS, SQL Server, R, Tableau, ETL, Excel. EDUCATION
Master of Science in Electrical Engineering: Aug 2016 – May 2018 University of Houston, Houston, TX B.Tech in ECE Sep 2012 – May 2016 JNTU, Hyderabad, India CERTIFICATIONS
Data Analyst Nano Degree
Python for Everybody Specialization
Machine Learning Specialization
Google Cloud with Tensor Flow Specialization
TECHNICAL SKILLS
Programming and
Scripting Languages
SAS Python (NumPy, SciPy, scikit-learn, NLKT, gensim, keras) MATLAB R (shiny, ggplot2, dplyr, tidyr), C C++ JAVA HTML JavaScript Databases SQL Server Microsoft Access MongoDB
Big Data Tools Apache Hadoop Spark Hive HBase Reporting Tools Microsoft Office Applications (Word, Excel, PowerPoint, and Visio) Tableau Crystal reports XI SSRS Business Objects (5.x/ 6.x) Cognos (7.0/6.0) SPSS Version Control
Machine Learning
Algorithms
Cloud Platforms
SVN (Apache Subversion) GIT
Classification, KNN, Regression, Random Forest, Clustering(K-means), Neural Nets, SVM, Bayesian Algorithm, Social Media Analytics, Sentiment analysis, Market Base Analysis, Bagging, Boosting.
Google Cloud: Dataset API, Machine Learning API, BIGQuery, AWS THESIS
Ambati, S. (2018). Design of an automated system for the retrieval of emotional content in natural images. (Unpublished master’s dissertation). University of Houston, Houston, TX.