SAMJHANA SHAKYA
** ***** ***** **** *** Britain CT 06053 512-***-**** ********.******@*****.*******.*** SUMMARY
Data scientist with strong statistics analysis and programming background with proficient experience in data mining, and management of the architectural team. Detail-oriented, and quick learner with the ability to work with diverse people and situations. SOFTWARE SKILLS
RStudio, R Shiny, python (TensorFlow, Pandas, NumPy, Seaborn, SciKit-Learn, SciPy), SAS programming, SQL (SQL Server, PostgreSQL, SQLite MySQL, NoSQL), Hadoop, PySpark, HDFS, Hive, Apache, IBM Cognos Insight, Weka, ORACLE, JMP, MS Suite, Power BI, Tableau, TensorFlow, MongoDB, Git, ERDPlus, AWS, MS Access, Visual Studio. EDUCATION
Master of Science in Data Science May 2019-May 2020 South Dakota State University, Brookings, SD 4.0/4.0 GPA Master of Science in Operations Management Aug 2018- Dec 2020 South Dakota State University, Brookings, SD 3.875/4.0 GPA COURSEWORK
Statistical Programming, Modern Applied Statistics, Programming Data Analytics, Big Data Analytics, Time Series Analysis, Predictive Analytics, Data Warehousing, Nonparametric Statistics, Deep learning, Project Management, and Business Intelligence. WORK EXPERIENCE
Data Scientist, GDM Solutions, Brookings, South Dakota 57006, USA May 2020 – Aug 2020 Ø Review the SQL database and offer improvements to database structure using SQL Server and Compare it! 4.2. Ø Create SQL scripts for client database and update the database to the latest database structure. Ø Create Data Analysis queries, did data mining, and reports utilizing SQL server, MS Excel, ARM, Tableau, and Power BI. Ø Visualized the Agriculture data in Tableau and Power BI, created the dashboard and published the result. Teaching Assistant, Constructional and Operational Dept. SDSU, Brookings, SD Aug 2018 – May 2020 Ø Taught Microsoft Office (Excel, PowerPoint, Word) for Management tools and analysis to 25 student’s labs for 3 semesters. Ø Guided the student in Land Surveying for 4 semesters. Student Volunteer, Nepalese Society for Earthquake Technology, Nepal Dec. 2015 -Jul. 2016 Ø Did Data collection, data entry, data analysis, mapping, and reporting the existing condition due to Gorkha Earthquake 2015. Ø Inspected the damaged building and categorized their damaged condition state. PROJECTS
Statistical Analysis Projects:
• Did Exploratory analysis to visualize the corn data set of 6718 observations and built machine learning model of structured data set to interpret result using R Markdown, SAS programming and Python.
• Developed the predictive model to predict the default probability of customer using logistic regression, decision tree, and random forest and validate the result in R Markdown, and SAS.
• Demonstrated skill of unsupervised learning analysis such as clustering, neural network, decision tree, correlation, and did market basket analysis by using association rules models with Apriori by using R and Python.
• Performed logistic regression, random forest, and LASSO and selected the best model with higher accuracy rate to predict the unspecified 199 observation of Microtus data using R-Studio, and Python.
• Explored customer behavior, analyzed the customer loyalty, and organized the customer by using Power BI and Python. Timeseries Analysis Projects:
• Determined stationary or non- stationary nature of timeseries and transformed the data to make stationary in R-studio and JMP.
• Checked the auto correlation and smooth the data by using the exponential and double exponential smoothing.
• Calculated one step ahead forecast and forecast error and confidence and prediction interval of time series data that have trend, high variance, and seasonal pattern in R-studio and JMP. Big Data and Data warehousing Project:
• Maintained a Hadoop environment, explored, viewed, and manipulated files in HDFS
• Compiled Java files created a JAR and run a MapReduce Jobs in Hadoop.
• Did missing value imputation, selected the variables using forward and backward and built logistic regression model in SAS.
• Created relational model, data analysis sheet, built multidimensional, star/snowflake schema in ERDPlus.
• Updated and queried the data, did sorting and counting and constructed data warehousing model in SQL.
• Did data visualization and built the cross tabular table in IBM Cognos Insight.
• Fitted a logistic regression model, neural network and SVM model by testing and validating the result in Weka.