ROOPA SHASTRI
** ********* ******, #**, *******, MA 02452 *************@*****.*** 857-***-**** www.linkedin.com/in/roopashastri EDUCATION
Southern New Hampshire University, Manchester, NH Aug 16 - Aug 17 Master of Science in Information Technology
TECHNICAL SKILLS
ETL and BI: SSIS, Custom Coded ETL, Control-M(scheduling), Tableau, SSRS, Qlikview (Server and Developer), Excel, SSAS Data Science: Machine Learning (Regression, Classification, Clustering, Dimensionality Reduction(PCA), Neural Networks, Association Analysis), Statistics (Hypothesis Inference), Time Series Modeling. Programming: UNIX, PL/SQL, Java, Python, R
Databases: Oracle, MS SQL Server, Sybase
Big Data Analytics: Apache Spark, HDFS, Hive, Pig
PROFESSIONAL EXPERIENCE
Australia and New Zealand Banking Group, Bangalore, India System Analyst Sep 13 – Mar 16
Interacted with Business Users to understand their Analytical and Reporting requirements and deliver BI solutions using QLIK
Analyzed source systems, transformed data into desired format using Qlikview and performed Data Modeling for the Dashboards.
Scheduled jobs on QLIK Server and used NPRINT to publish Dashboards to the users and performed User Access Management on the server.
Implemented Section Access in Qlikview for protecting the data from unauthorized access.
Performed Data Munging and Data Preparation on large datasets using Pandas-Python for Qlikview Reports and scheduled Python/UNIX scripts using Control-M
Mentored team members on QLIK and BI best practices for designing reports and dashboards. TESCO, Bangalore, India
Senior Software Engineer May 11 – Sep 13
Responsible for creating Database Objects - Tables, Indexes, Views, Stored Procedures and User defined functions as per the requirements.
Designed SSIS packages for loading the data coming from various interfaces and used custom transformations to achieve desired output.
Developed SSIS packages for Data Reconciliation between two systems and performed Data Visualization using SSRS Reports.
Responsible for creating Subscriptions and Managing the Report Manager and Report Servers of SSRS.
Performed Data Profiling and Created Source to Target Mapping documents and Used SSIS Packages to build Data warehouse using SCD’s
Designed SSRS Reports using MDX and DAX queries and also built SSAS Cubes defining dimensions, measures and adding hierarchy.
Performed Data/Process Flow design reviews, code reviews of team members to ensure high quality of deliverables. Infosys, Bangalore, India
System Engineer Nov 08 – Apr 11
Designed SSRS,Power pivot and Crystal reports using different parameters and involved in their Installation, Configuration and Deployment.
Designed SSIS packages for loading the data coming from various interfaces and used custom ETL transformations to achieve desired output.
Optimized queries and enhanced database performance using Execution Plan, SQL Profiler and Database Engine Tuning Advisor. ACADEMIC PROJECTS
Data Analytics:
Designed Dashboard and Stories in Tableau using Bar, Line, Pie and various other graphs focusing on the hospital management needs
Build a Regression Model for Diabetes Data Analysis using Apache-Spark R in a distributed environment on Amazon AWS. Accessed data from csv's, json, hdfs, and S3 and performed distributed modeling using GLM.
Performed Exploratory Data Analysis to decide on an approach to design and build a Book Recommender System. Applied several algorithms to find similarities between users, used Validation techniques for evaluating the model and created Confusion Matrix for Sensitivity Analysis.
Build a predictive model to determine the timing of purchase of a new eReader by the registered customers and cancer data analysis to classify the tumors. Applied Decision Tree with Pruning techniques, Random Forest algorithms, Artificial Neural Networks, PCA in R and chose the best approach with less error rate.
Developed a console based Hotel Reservation System using OOP features like Abstraction, Polymorphism, Encapsulation and Inheritance.
Built data warehouse schema in Hive for NYC Taxi trip data with a data volume of 100 million+ records. Created Hive scripts to load Dimensions and Facts after uploading the data into HDFS.
Developed forecasting model using ARIMA to forecast tractor sales. Used ACF and PACF plot to identify AR and MA components