ANUJ GOLESAR
Boston, MA 857-***-**** *******.*@************.*** www.linkedin.com/in/anujgolesar
IMMEDIATELY AVAILABLE
EDUCATION
Northeastern University, Boston, MA Mar 20
Master of Science in Data Analytics
Relevant Courses: Predictive Analytics, SQL and Data warehousing, Data Mining, Intro to Cloud Computing, Relational Database
University of Pune, India May 18
Bachelor of Engineering in Electronics and Telecommunication
Relevant Courses: Computer Networks (VPC, OSI and TCP/IP, SSH, DNS, IPsec, VPN, Load Balancing)
TECHNICAL SKILLS PORTFOLIO: https://golesara.wixsite.com/anujgolesar
Languages Python (Pandas, Keras, Scikit-Learn, Seaborn), R, SQL, NoSQL, JSON, T-SQL
Databases MySQL, MS SQL Server, Snowflake
BI/ETL Tableau, Power BI, Alteryx, MS-Excel, SSIS, ETL, SSRS, AWS Redshift, SSMS, Looker
Cloud Technologies AWS, Azure, EC2, S3 Bucket, Lambda, HDInsight, Databricks, VPC, IAM, Load Balancing
IDE & Tools Anaconda, Jupyter, KNIME, Azure VM, Spotfire, MATLAB, Google Data Studio
Certifications Tableau Desktop Specialist, AWS Solutions Architect - Associate (Scheduled)
Operating Systems Linux, Windows
PROFESSIONAL EXPERIENCE
Graduate Research Assistant, Northeastern University, Boston, MA Sept 19 – Apr 20
Analyzed and filtered US census data (20 columns, 220k records) to build a data pipeline in Python, with data stored in an S3 bucket.
Created a Python application to convert CSV files to JSON and created a GUI using the tkinter package
Researched ways to visualize a single household element in each state; developed Excel forms using VBA macros.
Collected data from OLTP systems and flat files using Alteryx tools such as Join, Scatter Plot, and Find and Replace
Derived insights by processing OLAP data cubes in SSAS and generated reports using SSRS and Looker
Developed visualizations in Tableau to represent the clustering results and patterns in households
Data Analyst Intern, Softric Solutions, Pune, India Nov 17 – Aug 18
Worked closely with clients to understand data on graduate research admissions at overseas universities
Cleaned the data in R using dplyr and tidyverse, and developed T-SQL and R scripts to analyze trends in the admissions process and propose solutions to optimize it. Analyzed numerical data using pivot tables to gain insights.
Designed a Data Warehouse to store data from different sources, performed ETL using SSIS and created BI reports in SSRS
Developed visualizations in Tableau to present analysis and recommend strategies for improving student consulting.
Developed easy-to-consume Power BI dashboards that show clients key metrics and provide actionable insights.
ACADEMIC PROJECTS Repository Link: https://github.com/anujsanjay
Business Intelligence and Data Warehousing - AdventureWorks2017 (Power BI, Tableau, Talend) Dec 19
Designed a Data Warehouse consolidating data from four different geographic locations and various data sources
Developed strategic BI Dashboards in Microsoft Power BI and Tableau based on product store sales, online sales and rejects analysis
Pipelined large-scale business data from multiple database sources and integrated it into SCD using Talend
Snowflake Data Warehousing – Twitter Tweets Analysis (Snowflake, Tableau, Python) Mar 20
Developed a Data Warehouse for Twitter data (2.9 million rows) in Snowflake, created a staging area, and used the COPY command to load data from local disk into the schema; connected the DW instance to Tableau for analysis.
Performed feature selection and developed dashboards in Tableau to analyze sentiment about the US presidential election.
PsychSignal Analysis using Power BI (RStudio, Power BI) Mar 20
Processed and analyzed PsychSignal data using R, converted timestamps into xts objects for financial data analysis
Integrated R scripts with MS Power BI to develop dashboards that help investors make decisions about stock market investments
Identified a new stock sentiment signal (PANIC-EXCITE) to help investors anticipate the nature of the stock signal in the near future
AWS - MVP Application for Digital Transformation (S3, Lambda, DynamoDB) Dec 19
Deployed a serverless web application using Lambda, S3 and API Gateway
Developed an interactive UI using HTML and CSS to simplify registration and data upload for users
Achieved a 30% efficiency gain in the process by authenticating users with Cognito and provided digital transformation services
Viacom Demographics Analysis (Excel, Tableau, Python) Nov 19
Analyzed Viacom demographic data to develop strategies to help in launching new products and services for Viacom
Developed ETL scripts using Python for feature selection and implemented a Random Forest model to predict CPM values
UK Road Accident Analysis (Power BI, MS Azure, HDInsight, PySpark) Feb 19
Performed analysis on UK road accident data (2.3M rows) stored in Azure Blob Storage, cleaned it in HDInsight using PySpark, and extracted data using Spark SQL on data frames.
Developed dashboards using Power BI to show the correlation between casualties and different parameters