SWETCHA CHOWDARY KARNATI
Des Moines, IA ***** 346-***-****
**************@*****.*** https://www.linkedin.com/in/swetcha-chowdary/ Skilled data/statistical analyst with 3+ years of professional experience in analyzing, interpreting and visualizing data for business and research purposes. Proficient knowledge in statistics and machine learning for data modeling and predictive analysis. Highly analytical and process-oriented with in-depth knowledge of analyzing large data sets, fixing the data issues and applying machine learning practices. Energetic presenter and confident communicator with the ability to circulate information in a way that is clear, efficient, and beneficial for end users using informative visualizations generated in R, Tableau and Spotfire. Creative in finding solutions to problems and determining modifications for optimal use of organizational data.
TECHNICAL SKILLS
Programming Languages: C, Java, Python, R, JavaScript, HTML, CSS, LaTeX Databases: MySQL, Oracle, NoSQL, MongoDB, PostgreSQL, T-SQL, MS SQL, SQL, SSIS Big Data Ecosystems: Hadoop, MapReduce, HDFS, HBase Tools: Jupyter, GitHub, Visual Studio, Spotfire, Tableau, AWS. Python Data Stacks: Pandas, Numpy, Matplotlib, NLTK, Scikit-Learn, Tensor Flow Data Visualization: MatplotLib, Spotfire, Tableau, ggplot2, Sea born, Excel Operating Systems: Windows, Mac, Linux.
PROFESSIONAL EXPERIENCE
Jr Developer - University of Houston – Houston, TX Jan’17– May’18
• Worked in a team to develop a Python Flask based web application using agile principles.
• Responsible for front end changes including visualizations using D3js.
• Built Data pipelines to streamline data extraction and cleaning process.
• Developed back-end modules, web App and REST APIs using Python, Flask and conducted unit testing with PyUnit.
• Worked on various stages of application development along with managing the requirements among various teams. Technologies/Platforms used: Flask, d3.js, REST API, PyUnit Data Analyst - University of Houston – Houston, TX Aug’16 – Dec’16
• Organized, updated and maintained large database of undergraduate student details enrolled in CASA for online examination.
• Developed Database Triggers to enforce Data integrity and additional Referential Integrity.
• Used joins and sub-queries to simplify complex queries involving multiple tables.
• Tuned queries by altering the database design and analyzing various query options.
• Optimized data retrieval time using techniques like DB indexing.
• Created and updated weekly report with more than 1500 entries and produced them visually using Tableau. Technologies/Platforms used: Python, SQL Server, Tableau, R. Technologies/Platforms used: Python, SQL Server, Tableau, R. Data Analyst/Statistical Analyst - Nagarjuna Agri Chem Pvt. Ltd - Hyderabad, India May’15 -May’16
• Designed and created an oracle database with all the processed data.
• Created Tables, Views and maintained the table performance by following the tuning tips like Normalization using SQL based on the requirements.
• Created indexes on tables to improve the performance by eliminating the full table scans and views for hiding the actual tables and to eliminate the complexity of the large queries
• Conducted statistical modeling and analysis such as hypothesis testing and linear multi variate regression on the sales data obtained from different branches of the company.
• Retrieved the weekly agricultural sales related data in real time and cleaned the data using python.
• Responsible for creating visualizations in the web application using d3.js.
• Managed the production server on AWS EC2 cluster with regular builds. Technologies/Platforms used: Python, Oracle, AWS EC2, d3.js PROJECT EXPERIENCE
Statistical Analysis of Microsurgery Research data using R programming
• Used microsurgery research data to retrieve important patterns in the relationship of sympathetic arousal and performance in learning micro-surgical tasks using R programming.
• Created multivariate linear and mixed regression, dummy variable statistical models for the given data and perform ANOVA testing to outline the significant contributors to the dependent variable of performance and analysis of residual plots for possible errors in the model.
• Conduct Mann-Whitney U test to identify the difference in the performance between the two gender of Male and Female.
• Visualization of the Trait Psychometric, Biographic, Performance and Perinasal Perspiration data obtained from the study using ggplot2 in R to analyze the relationship of the micro surgical task performance accuracy and speed on the stress signal level varying with the characteristics and session of the task involved. Training and Testing of Electricity and GDP data using R programming
• Modeled time series electricity and GDP data using ARIMA and TBATS models.
• Checked residuals and accuracy for the two models to determine the model efficiency.
• Visualized the data trends and model fitting to the data. Text Learning using python
• Web Scrapped 7000 company’s employee’s data and their company news from Reuters and used machine learning techniques to determine sector of the company and employee job type.
• Separated nouns from the news acquired from company pages using Natural Language Processing (NLP) and employed python package Natural Language Tool Kit(NLTK) to predict sentiment and separating entities.
• Checked the strength of relationship between the companies with the help of frequency chart. Open Street Mapping
• Chosen a desired location from https://www.openstreetmap.org.
• Used data wrangling techniques to check for the quality of the data for validity, accuracy, completeness, consistency and uniformity using python programming.
• Created a database of the wrangled data and using SQL to query the data and clean the inconsistent data like zip codes, street names etc.
EDUCATION
MASTER’S IN COMPUTER & SYSTEMS ENGINEERING – University of Houston – Houston, Texas - May 2018 GPA 3.83 BACHELOR’S IN ELECTRONICS & COMMUNICATION ENGINEERING – Jawaharlal Nehru Technological University – Hyderabad, India – May 2016 GPA 3.65