Sneha Sharma
****************@*****.*** +1-405-***-****
Summary:
Motivated and detail-oriented Data Scientist with 1 year of experience in data visualization, dashboard development, and data integration.
Skilled in Python, SQL, and Tableau to generate actionable insights and enhance business decisions.
Proficient in handling large datasets, ensuring data accuracy, and thriving in collaborative environments.
Passionate about solving problems using data-driven approaches.
Experience in Data mining with large datasets of Structured and Unstructured data, Data Acquisition, Data Validation, Predictive modeling, Data Visualization.
Experience in migration from heterogeneous sources including Oracle to MS SQL Server.
Hands on experience in design, management and visualization of databases using Oracle, MySQL and SQL Server.
Experience in Descriptive Analysis Problems like Frequent Pattern Mining, Clustering, Outlier Detection.
Theoretical foundations and practical hands - on projects related to supervised learning (linear and logistic regression, boosted decision trees, Support Vector Machines, neural networks, NLP), unsupervised learning (clustering, dimensionality reduction, recommender systems), probability & statistics, experiment analysis, confidence intervals, A/B testing, algorithms and data structures.
Extensive knowledge on Azure Data Lake and Azure Storage.
Experience in migration from heterogeneous sources including Oracle to MS SQL Server.
Hands on experience in design, management and visualization of databases using Oracle, MySQL and SQL Server.
Technical Skills:
Programming Languages: Python, SQL
Data Visualization Tools: Tableau, QlikView
ETL Tools: SSIS (SQL Server Integration Services)
Database Management: SQL Server 2012 R2
Data Analysis: Pandas, NumPy, Matplotlib
Version Control: Git
Other Tools: Microsoft Excel
Professional Experience
Data Scientist – Panasonic Nov 2024 – Present
Responsibilities:
Developed and deployed machine learning models to analyze and optimize business operations.
Designed and implemented data pipelines for collecting, processing, and transforming large datasets.
Built interactive dashboards and reports using Tableau to visualize key business metrics.
Utilized SQL and Python for data extraction, cleaning, and exploratory analysis.
Collaborated with cross-functional teams to identify and solve data-related challenges.
Performed A/B testing and statistical analysis to support data-driven decision-making.
Optimized ETL processes to improve data accuracy and reduce processing time.
Worked with cloud storage solutions like Azure Data Lake for efficient data management.
Data Scientist - Amazon Jan 2024 - Nov 2024
Responsibilities:
Developed interactive dashboard reports to visualize Key Performance Indicators (KPIs) for executive management, enabling data-driven decision-making.
Published interactive workbooks and visualizations using Tableau Desktop and Server.
Designed and maintained relational databases in SQL Server 2012 R2 to support data retrieval and update processes.
Tuned Extract, Transform, Load (ETL) processes for optimized performance.
Utilized SSIS to implement data integration workflows and create custom views for varied reporting needs.
Analyzed source-to-target mappings and implemented logic designs for efficient data flow.
Migrated and transformed datasets from multiple sources to target destinations, ensuring high data quality.
Conducted ad-hoc analyses and automated daily reports using QlikView.
Designed and implemented cross-validation and statistical tests including Hypothetical Testing, ANOVA, Autocorrelation to verify the models significance.
Designed an A/B experiment for testing the business performance of the new recommendation system.
Supported MapReduce Programs running on the cluster.
Evaluated business requirements and prepared detailed specifications that follow project guidelines required to develop written programs.
Configured Hadoop cluster with Name node and slaves and formatted HDFS.
Used Oozie workflow engine to run multiple Hive and Pig jobs.
Participated in Data Acquisition with Data Engineer team to extract historical and real-time data by using Hadoop MapReduce and HDFS.
system. A highly immersive Data Science program involving Data Manipulation & Visualization, Web Scraping, Machine Learning, SQL, GIT, Unix Commands, NoSQL, MongoDB.
Provide expertise and recommendations for physical database design, architecture, testing, performance tuning and implementation.
Transformed Logical Data Model to Erwin, Physical Data Model ensuring the Primary Key and Foreign Key relationships in PDM, Consistency of definitions of Data Attributes and Primary Index Considerations.
Collaborated with Database engineers to implement ETL process.
Experience in GCP Dataproc, GCS, Cloud Functions, Big Query.
Experience in moving data between GCP and Azure using Azure Data Factory
FINANCEPEER: Senior Marketing Manager, (Fintech- B2B, B2B2C) May 2021 - Dec2022
The Bombay Digital Company (Agency): Marketing Manager- B2B May 2018 - Apr 2021
Education
Master of Business Administration NYIT, New York Jan 2023- Dec 2023
Masters in Management Science Welingkar Institute, India. Aug 2016- May 2018
Bachelor of Technology in Electronics Engineering Mumbai University, India. Jul 2011-May 2015
Academic Projects:
Harold and Lantern, Team Leader, NYIT
Created various predictive and descriptive models to help identify factors affecting participant attrition and resource utilization.
The recommendations helped improve participant attrition by 11% and resource utilization by 8%.