SIDDHI SANJAY KAWLE
Boston, MA ***** 617-***-**** *************@*****.*** https://www.linkedin.com/in/siddhi-kawle
EDUCATION
Northeastern University May 2020
Master of Science in Engineering Management, specialization in Data Analytics
University of Mumbai, Mumbai, India June 2018
Bachelor of Technology – Electronics Engineering
TECHNICAL SKILLS
Programming Languages: R, T-SQL, Python (NumPy, Pandas, Matplotlib, BeautifulSoup), Spark
Databases: SQL Server 2017, Oracle, MySQL, PostgreSQL, MongoDB, Cassandra
Business Intelligence Tools: Tableau, Power BI, QlikView, SSIS, SSRS, Talend, Alteryx
Other Tools: Jira, Trello, Google AdWords, Google Analytics, MATLAB, Advanced Excel, Visual Studio
Certifications: KPMG Virtual Internship, Analyzing and Visualizing Data with Power BI, Tableau Data Analyst
WORK EXPERIENCE
Squark Aug 2020-Present
Research Assistant, Boston, USA
• Research texts and case studies on visualizing bias to compile an informational e-book on understanding, detecting, and addressing bias in AI.
• Determine statistical methods and tools to identify bias in visualization and present effective solutions.
• Evaluate fairness metrics for various bias mitigation methods.
• Manage weekly tasks and meetings for the team using Jira.
Vora Trading Co. Jan 2017-Dec 2017
Data Analyst Intern, Mumbai, India
• Analyzed transaction and behavioral data in Excel and R to identify trends in customer purchasing and business revenue for a B2B retail company.
• Created more than 20 ad hoc data analysis reports in Microsoft SQL Server to provide insights into annual sales and stakeholder data and to predict project timelines.
• Automated data cleaning and wrangling tasks using R, improving operational efficiency by over 85%.
• Built dashboards in Tableau and Excel to give Product Managers insight into customer behavior over specified periods.
PROJECTS
ETL on School Database Talend, Alteryx, Tableau, Power BI Oct 2019-Dec 2019
• Designed a centralized data warehouse and built data pipelines from diverse data sources using Talend.
• Engineered an efficient integration of datasets containing 5 million rows by tuning the ETL mappings, implementing slowly changing dimensions, reject codes, and various performance tuning methods.
• Built interactive dashboards in Tableau and Power BI to surface business insights.
Social Media Database Management System PL/SQL, Oracle, Tableau Jan 2019-Apr 2019
• Developed SQL queries to maintain and manage a social media database of more than 20 tables normalized to third normal form.
• Built a dynamic database by collecting raw data and creating ER diagrams, queried it using SQL Developer and SSMS, and visualized the output in Tableau.
• Configured user access, privileges, and user-defined functions to prevent the exploitation of personal data by individuals on various platforms.
Mental Health Prediction Using Supervised Learning R Jan 2019-Apr 2019
• Designed a Naïve Bayes model to predict the mental health of an employee based on certain indicators.
• Improved the usability of the dataset by normalizing features, replacing missing values with the mean, and applying PCA to reduce the number of variables.
• Implemented logistic regression and support vector machine models for better classification of data.
Exploratory Data Analysis for Online Shopping Python (Pandas, Seaborn, BeautifulSoup) Mar 2019-Apr 2019
• Scraped data from various websites, identifying search parameters from the URL, and imported the results into Python as a .csv file.
• Visualized customer trends with the Seaborn library, helping business owners forecast future sales based on the products with the highest gross sales.