Tampara Venkata Shwetanjali Dora
*******************@*****.*** 408-***-**** linkedin.com/in/shwetanjalidora/ EDUCATION:
Northeastern University, Silicon Valley
Master of Professional Studies (M.P.S.) in Analytics) Relevant Coursework: Probability and Statistics, Enterprise Analytics, Analytic Systems and Technology (Python), Data Warehousing and SQL, Data Visualization and Business Communication, Predictive Analytics, Machine learning, Big Data
GPA: 3.88
San Jose, CA
Jan 2019 – Jun 2020
Jawaharlal Nehru Technological University
Bachelor of Technology (B.Tech.) in Informational Technology Relevant Coursework: Mathematical Foundation of Computer Science, Computer Organization, Advanced Data Structures, Database Management Systems, Operating Systems, Data Warehousing and Data Mining, Hadoop and Big Data, Management Science
GPA: 4.0
Kakinada, India
Jun 2014 – Apr 2018
SKILLS:
Programming and Design: MySQL, PostgreSQL, Python, R Java, HTML, CSS, Apache Spark, Microsoft Excel Business Intelligence Tools: Tableau Desktop
Additional Computer Skills: Microsoft Office (Excel, Word and PowerPoint) ML Models: KNN, Decision Trees, Random Forest, Naïve Bayes, Logistic Regression CERTIFICATIONS:
Tableau Desktop Specialist issued by Tableau Software (No expiration) Tableau Analyst issued by Tableau Software (No Expiration) EXPERIENCE:
Data Analyst
Riviera Partners
• Extracted data from Amazon RDS and Amazon Redshift using SQL to interpret and analyze key metrics and transform raw data into meaningful, actionable information
• Expertise in using SQL joins, analytical functions and window functions and optimizing complex SQL queries
• Developed standard and custom reports in Tableau that summarize business, financial, or economic data for review by executives, managers, clients, and other stakeholders
• Maintained and updated Tableau dashboards
• Created Datamarts in Amazon Redshift for the email interaction process
• Worked with Type 1 and 2 dimensions and fact tables, star and snowflake schema design
• Developed scripts using Python for calculating relevancy scores using NDCG metric for identifying and improving search accuracy of the recruiting application (Sutro)
• Automated scripts that refresh multiple times a day/week/month depending on the use case
• Worked directly with other product owners, system engineers, developers, testers, and customers to define features and technical user stories
• Executed regression test suites of the application, performed functionality testing on new features, filed detailed bug reports and worked with developers for resolution
• Supported incident tickets raised after the deployment in every release San Francisco, CA
Sept 2019 – Present
Junior Data Analyst
Sresta Natural Bioproducts
• Developed complex SQL queries to leverage large and potentially messy data sets to derive insights
• Analyzed trends in 2-3 years’ worth of historical sales data using MS Excel to identify potential customers and collaborated with the Marketing team to understand different customer retention strategies
• Generated advanced Tableau dashboards with quick/context/global filters, parameters and calculated fields that allowed to track MTD, YTD, MoM, YoY sales and business revenue
• Assisted the Marketing team in performing A/B testing and evaluated the results San Jose, CA
Aug 2018 – Jul 2019
Graduate Data Scientist
Norcal Cannabis (Practicum Project)
Scope: Develop a model to match or deduplicate Cannabis products Tools/Packages used: Python, NLTK (Word2Vec), Sklearn (K-means clustering, KNN Classification, cosine similarity), Flask, HTML, AWS, Tableau
• Built data pipelines for NLP pre-processing of Cannabis products & performed high dimensional data visualization using PCA, t-SNE
• Optimized the model with 85% accuracy by applying a cascading approach of classification, clustering & a distance metric
• Deployed the model on Pyxeda AWS platform & created a website using Flask API - http://capstoneproject.pythonanywhere.com/
San Jose, CA
Apr 2020 – Jun 2020
ACADEMIC PROJECTS:
Customer Churn analysis, Northeastern University
• Used a sample data set from Kaggle shared by IBM on Telecom customer data to analyze and predict the customers who are most likely to churn.
• By analyzing the customer behavior to predict churn, the Telecom company can develop customer retention programs to retain these customers, thus preventing churn.
• Solved the problem using the following machine learning algorithms: Logistic Regression, Naïve Bayes, KNN, Random Forest
Sentiment Analysis on Twitter Data, JNT University
• Extracted streaming data from twitter (10,000 tweets) of two political leaders and performed preprocessing of tweets using text preprocessing techniques like Lemmatization, Tokenization, etc. to apply Machine learning models.
• Predicted election result by conducting Sentiment Analysis using Naïve Bayes and SVM Algorithms using R programming.
• Compared performance of both the algorithms
Bank Marketing Analysis with Artificial Neural Networks, Northeastern University
• Used a sample data set from Kaggle on Bank Marketing Campaign to analyze the best strategies to improve the future marketing campaign
• Predicted the customers who are likely to subscribe for a term deposit with the bank
• Solved the problem using the following machine learning algorithms: Logistic Regression, Random Forest, Decision Trees and Artificial Neural Networks. GITHUB LINKS:
https://github.com/shwetanjalidora/sentimentAnalysisSVM https://github.com/shwetanjalidora/Supervised-Classification-Algorithm-Kickstarter-Dataset https://github.com/shwetanjalidora/Exploratory-Data-Analysis-of-Google-Play-Store-Applications https://github.com/shwetanjalidora/Forecast-
https://github.com/shwetanjalidora/Admission-prediction-using-KNN TABLEAU PUBLIC PROFILE:
https://public.tableau.com/profile/shwetanjali.dora newProfile=&activeTab=0 ACTIVITIES AND HONORS
• Attended a 2-day National Level workshop on CISCO Networking and a 3-day workshop on Cloud computing organized by Computer Society of India.
• A club member at INTACH (Indian National Trust for Art and Cultural Heritage) a non-profit charitable trust.
• Took part in seminar on the topic Big Data held in ANITS College of Engineering under the event SRISHTI -2K16 and secured Runner’s up.