Data Analyst Engineer

Location:
Boston, MA
Posted:
February 27, 2021

Resume:

HARSH SINDHWA

Data Analyst | Data Scientist | Data Engineer

Boston, MA 617-***-****

adkja0@r.postjobfree.com | https://github.com/Harsh-Sindhwa | www.linkedin.com/in/harsh-sindhwa/

EDUCATION

Northeastern University, Boston, USA Expected: April 2021

Master’s in Enterprise Artificial Intelligence (Specialization in Finance) GPA: 3.80/4

Relevant Courses: Fundamentals of AI, Applications of AI, Data Management & Big Data, Accounting Fundamentals in Finance

University of Mumbai, India June 2018

Bachelor’s in Electronics & Telecommunications GPA: 8.15/10

TECHNICAL SKILLS

Languages: Python, MATLAB, Java, R, C, C++.

Database: PostgreSQL, MongoDB, HBase, DynamoDB.

Big Data: Hadoop, MapReduce, Spark, Apache Pig, Hive, Azure.

BI & Visualization: Tableau, Power BI, SQL, Excel, Matplotlib, Plotly, MS PowerPoint.

Data Science & ML: Keras, TensorFlow, Supervised Learning, Unsupervised Learning, Reinforcement Learning, XGBoost, NLTK.

Deep Learning: RNN, CNN, DBN, DBM, LSTM.

Certification: Machine Learning, Deep Learning, Tableau Data Analyst, Data Analytics Fundamentals (AWS).

ACADEMIC PROJECTS

Movie Recommendation System – Link (R, SQL, Tableau) Sep 2020

• Preprocessed a dataset of about 6,000 users, covering more than 100,000 ratings of 9,000 movies.

• Created functions to find popular genres, ratings, and the most active users, and plotted visualizations to draw insights from them.

• Implemented content-based filtering in a function that recommends movies based on genre.

• Evaluated the model using KNN and achieved an accuracy of 89%.

Big Data Medical Care Fraud Detection – Link (Apache Spark, Azure, Google BigQuery, Tableau, Power BI) Jun 2020

• Preprocessed three Medicare datasets containing 30 million rows using Apache Spark and Google BigQuery.

• Constructed the final table by joining the datasets in PostgreSQL so that an ML algorithm could be applied.

• Applied a Random Forest algorithm with cross-validation on key features, resulting in 95% accuracy.

• Visualized the data in Power BI to gain insights and show trends in Medicare.

Time Series Analysis of Supermarket – Link (ARIMA, Tableau, Time Series) April 2020

• Built an interactive Tableau dashboard with filters and manual functions to gain insights into the results.

• Wrote a function to find optimal p, d, q parameter values for the ARIMA model and visualized diagnostic plots for those parameters to refine the model further.

• Forecasted sales for various categories up to 6 years ahead with MSE less than 0.5.

Document Classifier & Text Extractor – Link (Google Vision API, ResNet, VGG, Neural Network) Mar 2020

• Tested a feedforward neural network, ResNet, and VGG on the training data set to capture features from images and classify unseen data accurately.

• Refined the tkinter UI, which classifies a given document and extracts the text from the file.

• Developed an algorithm that takes the coordinates of bounding boxes returned by the Google Vision API and appends each word to a database as an attribute, resulting in efficient extraction of text into CSV format.

Uber Reviews Sentiment Analysis – Link (PySpark, NLTK) Feb 2020

• Created a PySpark session and passed the data through it.

• Improved accuracy by tokenizing each review and removing stop words with a custom-made stop word list.

• Achieved an accuracy of 89% using CountVectorizer to convert inputs into word frequencies and logistic regression for prediction.

WORK EXPERIENCE

Tata Consultancy Services (TCS), Pune, India Dec 2018 – Aug 2019

Assistant System Trainee Engineer

• Built scripts using MATLAB and C and handled client calls to understand requirements.

• Developed new projects for Nissan as part of the Renault-Nissan development team.

• Played a vital role in the team developing scripts and received the TCS Gem Award for best employee of the month.


