SONAL SHIRISH PORWAL
LinkedIn Profile *************@*****.*** 919-***-**** 24 Union Square Apt 536, Union City, CA 94587 EDUCATION
Database and Data Analytics Certification Spring 2015 – Present UCSC Silicon Valley, CA GPA – 3.92
Major Courses: Intro to Data Analysis, Predictive Analysis, Data Modeling, Dashboards and Data Visualization, Relational Database Design and SQL Programming, Business Statistics, Principles of Business Analysis. Master of Engineering in Computer Science Fall 2011 – Fall 2013 Mumbai University, India GPA – 4.0
Major Courses: Algorithm & Complexity, Advanced Database Management, Data Warehousing & Mining, Neural Networks & Fuzzy Logic.
WORK EXPERIENCE
Content Analyst Intern, Agylytyx July 2015- Sep 2015
•! Client data cleaning and transformation into a dataset model (using Excel/Python), built a ‘construct library’ for Funding Profile Generator Application.
•! Generated frameworks to build data visualization charts using the constructs library on the client data set. Professor, Fr. Conceicao Rodrigues College of Engineering, India July 2012 –Nov 2013
•! Taught courses related to Database, Machine Learning, and Software Engineering.
•! Mentor for senior student’s project to implement Performance Analysis in Education Sector using Data Mining.
•! Conducted lab sessions aimed at teaching Software Project Management using tools like Excel.
•! Facilitated Hadoop learning sessions as the coordinator of the ‘Faculty Development Program’. Commodity Trader & Analyst, Ariston Capitals Services Pvt. Ltd Jun 2009–Jun 2010
•! Collected and analyzed Commodity & Currency Market data to develop actionable recommendations.
•! Analyzed technical charts and drew inferences by observing different trends for different commodities. TECHNICAL SKILLS
Languages: R, SQL, UML, Python, Java
Databases: MySQL, SQL Server
Tools and Technologies: R Studio, Tableau, MS Excel, Weka, Eclipse, ER Studio, MySQL Workbench ACADEMIC PROJECTS
Predict Survival on the Titanic, (R Language) Summer 2015
•! Predicted the survivors of the tragedy based on their characteristics using Logistic Regression Analysis and Decision Tree techniques.
•! The prediction accuracy was further improved up to 81% by using Feature engineering process and Ensemble learning approach.
Statistical Analysis of a Social Networking Website, (R Language) Spring 2015
•! Identified the demographic factors contributing to different Facebook Habits of people.
•! Performed statistical analysis on the data obtained from an omnibus survey of the people’s Facebook habits.
•! The correlation between the characteristics of the people and their Facebook habits was found using exploratory data analysis(EDA), data grouping, data visualization and hypothesis testing using chi-square test. Regression Analysis of Housing Prices in New York City, (MS Excel) Spring 2015
•! Performed Multiple Regression analysis and hypothesis testing on the dataset of the real estate prices to determine the influence of various house characteristics (area, no. of bedrooms etc.) on the price of home.
•! The significance testing was done using F-statistics and t-statistics. Relational Database Design for President Campaign System, (SQL) Spring 2015
•! Analyzed the problem statement, identified the business rules and constraints in the system.
•! Designed an ER model, created multiple tables and views, granted access privileges using SQL queries. A Naïve Gain Approach to Intrusion Detection System, (Java) Fall 2013
•! Implemented an Anomaly-Based IDS using modified classification approach for data mining.
•! A two phase Naive Gain Classifier system was proposed, where NSL KDD dataset was served as an input, the first phase involved entropy based feature selection, followed by a Naïve Bayes classification algorithm. ACHIEVEMENTS AND PUBLICATIONS
•! Published: ”A Comparative Analysis of Data Cleaning Approaches to Dirty Data”, International Journal of Computer Applications (IJCA), 2013.
•! Published: ”A Naive Gain Approach to Intrusion Detection Systems”, International Journal of Computer Applications (IJCA), 2013.
•! Gold Medalist, for earning the highest grade in Masters in Computer Engineering First Year.
•! Training and Recruitment Coordinator at Fr. Conceicao Rodrigues College of Engineering, India.
•! Ranked 239 amongst 2862 competing teams in Kaggle Competition for predicting survival on Titanic.