POOJA
Data Analyst
Contact: 469-***-**** Email: *****.****@*****.***
PROFESSIONAL SUMMARY:
Around 7 years of experience as a Data Analyst with Data Science skills, capable in all phases of the Software Development Life Cycle (SDLC).
Create Analysis Datasets, Summary Tables, Listings and Plots according to specifications of the study and statistical analysis.
Expertise in Data Modeling and Data Analysis; proficient in gathering business requirements and handling requirements management.
Experience with Statistical Analysis, Data Mining and Machine Learning Skills using R, Python and SQL.
Proficient in tuning SQL queries to improve database performance and availability.
Experience in developing Business Intelligence assets using tools like Tableau and Oracle.
Very good understanding and experience in User Acceptance Testing, Regression testing, Performance Testing and Functional Testing.
Expertise in software development life cycle models such as Waterfall and Agile.
Act as a liaison between the client and technical solutions/support groups, using advanced communication skills to document, analyze and validate client requirements.
Provide support on production issues including troubleshooting, coordinating with IT, and end user communication related to data issues.
Prepare, develop, and analyze daily, weekly, monthly, and quarterly reports.
Hands-on experience with Python and libraries like NumPy, Pandas, Matplotlib, Seaborn, NLTK, scikit-learn, and SciPy.
Strategic planning for improving the organization's efficiency.
Self-starter and able to handle multiple tasks based on priorities.
Experience with data cleansing and manipulation.
Experience in Univariate and Multivariate Analysis, model testing, problem analysis, model comparison and validation, ANOVA, and Regression Analysis.
Experience in Machine Learning, data mining, structured and unstructured data analysis, and image data analysis, including feature extraction, pattern recognition, algorithm development, text mining, computer simulation, data modeling, database design, model evaluation and deployment.
Advanced level skill translating concepts into compelling visuals and Microsoft PowerPoint slides.
Build internal data quality measures and processes to ensure the accuracy of data models, reports, and dashboards
Create documentation for technical and non-technical teams on reporting definitions and how to use the reports.
Experienced in writing functional specifications, translating business requirements into technical specifications, and creating, maintaining, and modifying database design documents with detailed descriptions of logical entities and physical tables.
Excellent Knowledge of Health Insurance Portability and Accountability Act (HIPAA) Standards and Compliance issues.
Comfortable with R, Python, SAS, Weka, MATLAB, and relational databases.
Strong business sense and ability to communicate data insights to both technical and nontechnical clients.
Able to prioritize, organize, and work on multiple tasks simultaneously.
Education
Bachelor's in Health Informatics with a Minor in Clinical Applications
Texas Woman’s University - Denton, TX
Technical Skills:
SQL Tools
SQL Server, Oracle 10g and 11g, SQL, MS Office (Word, Excel, PowerPoint, Access, Project)
SAS Tools
SAS V9.4, SAS/GRAPH, SAS/STAT, SAS/Report, SAS log checker
Analytical Languages
R, SAS, SQL, and Python
Visual Analytical Tools
Tableau 7/8.x Desktop, Server, Reader.
Statistical Methods
Hypothesis Testing, ANOVA, Confidence Intervals, Bayes' Law, Principal Component Analysis (PCA), Dimensionality Reduction, Cross-Validation, Autocorrelation
Databases
Oracle, Microsoft SQL Server, MS Access, and MySQL.
Artificial Intelligence/Machine Learning
Regression Analysis, Bayesian Methods, Decision Trees, Random Forests, Support Vector Machines, Neural Networks, Sentiment Analysis, K-Means Clustering, KNN, and Ensemble Methods
Operating Systems
UNIX, Linux, and Windows
Reporting Tools
Tableau Suite of Tools 10.x, 9.x, 8.x (Desktop, Server, and Online), SQL Server Reporting Services (SSRS)
Client: McKesson Corporation Jan 2021 - Present Location: Irving, TX
Role: Clinical Data Analyst
Responsibilities:
Ensuring collection, integration, management, and availability of clinical data.
Analyzing clinical data using Excel (spreadsheets, formulas, VLOOKUPs, pivot tables).
Conducting database searches.
Gathered, analyzed, documented, and translated application requirements into data models, and supported standardization of documentation and the adoption of standards and practices related to data and applications.
Created the Test plan, Test Strategy and Test scenarios based on scope of the project.
Participated in Data Acquisition with the Data Engineering team to extract historical and real-time data using Pig, Flume, Hive, MapReduce, and HDFS.
Developing and implementing strategies to improve treatment outcomes and prevent medication- and hospital-related errors for patients in long-term healthcare facilities.
Preparing reports, statistical comparisons, data charts, and other presentation materials.
Maintaining SharePoint.
Maintaining and updating patient records.
Participated in all phases of data mining, data collection, data cleaning, model development, validation, and visualization, and performed gap analysis.
Working with the compliance team to ensure adherence to HIPAA guidelines.
Developed comprehensive data visualizations in Tableau dashboards to illustrate complex ideas to various stakeholder levels on KPIs like cost analysis and trend analysis.
Implemented Random Forest classification algorithms in R to identify customer behavior and improve customer engagement.
Developing plans to provide targeted care to special needs and vulnerable populations.
Looking at data sets as a whole and analyzing them for trends as well as discrepancies, outliers, and possible errors.
Worked with teams using Agile development methodologies.
Prepared meeting agendas, minutes, and requirements-gathering documents upon receipt of client feedback.
Created test scenarios and test plans and drove UAT (user acceptance testing) in development and validation environments for the team.
Interfaced with other technology teams to extract, transform, and load (ETL) data from a wide variety of data sources.
Good knowledge of normalizing and denormalizing tables and maintaining referential integrity using triggers and Primary and Foreign Keys.
Client: The Vanguard Group, Inc August 2019 - Jan 2021 Location: Wayne, PA
Role: Data Analyst
Responsibilities:
Used Agile methodology throughout the project, with daily scrum meetings and bi-weekly sprint planning and backlog meetings.
Extensively involved in almost all phases of the Project Life Cycle (SDLC), from requirements gathering through testing, implementation, and reporting.
Created checklists for coding, testing, and release to keep the project flow smooth and error-free.
Extensive knowledge in Business Intelligence and Data Warehousing concepts with emphasis on ETL and System Development Life Cycle (SDLC).
Experience in gathering and writing detailed business requirements and translating them into technical specifications and design.
Extracted data from a Hive database on Hadoop using PyHive and HiveQL as well as with Spark through PySpark.
Utilized a broad variety of OLAP functions like COUNT, SUM, and CSUM, and worked in MS Excel using pivot tables and graphs.
Hands-on experience creating solution-driven views and dashboards with chart types including Heat Maps, Geo Maps, Pie Charts, Bar Charts, Tree Maps, Gantt Charts, Circle Views, Line Charts, Scatter Plots, and Histograms in Tableau Desktop versions 8.0, 8.1, and 8.2.
Organized meeting structures with project teams, and planned and facilitated internal team meetings.
Interacted professionally with a diverse group of professionals in the organization, including managers and executives.
Participated in strategic Digital Marketing team meetings to help coordinate initiatives to improve processes.
Coordinated and set up Google Analytics, Tag Manager, and Retargeting for client websites.
Built deep learning neural network models from scratch using GPU-accelerated libraries like PyTorch; used scikit-learn and XGBoost to build and evaluate the performance of different models.
Provided production support.
Wrote various queries using SAS/Base, SAS/SQL for creating reports according to the user’s requirements.
Conducted Regression, Correlation, and Analysis of Variance (ANOVA) studies using PROC REG, PROC CORR, and PROC ANOVA.
Leveraged existing code to process, develop, maintain, and analyze data more efficiently.
Worked with SAS presentation and reporting procedures like FORMAT, REPORT, PRINT, and SORT.
Communicated with development and QA teams regularly to ensure accurate understanding and interpretation of requirements.
Client: Southwest Airlines Dec 2017–Aug 2019
Location: Dallas, TX
Role: Data Analyst
Responsibilities:
Collected, cleansed and provided analyses of structured and unstructured data for major business initiatives. (R, Python)
Manipulated and processed large data sets using Excel, Access and SQL.
Responsible for loading, extracting, and validating client data.
Coordinated with the front-end design team to provide them with the necessary stored procedures and packages and the necessary insight into the data.
Participated in requirements definition, analysis and the design of logical and physical data models.
Created multiple visualization reports and dashboards using Dual-Axis charts, Histograms, Bubble charts, Bar charts, Line charts, Tree maps, Box and Whisker Plots, and Stacked Bar charts.
Generated Tableau Dashboards with quick, context, and global filters, parameters, and calculated fields.
Created Tableau Dashboards with interactive views, trends and drill downs along with user level security.
Used Pandas Data Frame for structuring, cleansing and manipulation of data.
Published Tableau Workbooks with user filters so that only the appropriate teams can view them.
Designed and developed various analytical reports from multiple data sources by blending data on a single worksheet in Tableau Desktop and created Tree Map, Heat maps and background maps.
Generated dual-axis bar charts, Pie charts, and Bubble charts with multiple measures, using data blending when merging various sources.
Conducted workflow, process diagram, and gap analysis to derive requirements for existing systems enhancements. Responsible for developing monthly reports using Tableau.
Conducted key analytics (data sourcing, analysis, testing) required to inform strategic recommendations (e.g., competitor benchmarking, market assessment, and financial modeling).
Performed data mining on the client data through various clustering techniques using SPSS to identify structures within the data or homogeneous groups of data based on variable types.
Analyzed data points and trends and generated reports for each business region.
Segmented the clustered data and built a predictive model to forecast each region's performance and identify potential markets for revenue generation and investment.
Developed visualizations of the forecasted trend for profiling of data through R and Tableau.
Used SPSS to mine, alter, code and retrieve data from a variety of sources and perform statistical analysis on them.
Coordinated with customers and the support, sales, and development teams to resolve issues.
Organized and facilitated sprint planning, daily stand-up meetings, Sprint review, Sprint retrospectives, and other Scrum-related meetings. Ensured high quality data collection maintaining the integrity of the data.
Client: JPS Health Network, Fort Worth, TX Jan 2016-Dec 2017
Role: Data Analyst
Description: JPS Health Network provides transformative healthcare solutions throughout the Fort Worth community, operating several specialty clinics and institutes to provide care tailored toward each individual. The network specializes in behavioral health, cancer, cardiology, dental, primary care, stroke, surgical and additional fields of research, leading to solutions that help people live healthier lives.
Responsibilities:
Extracted, manipulated and analyzed health care and retail data using Teradata from multiple sources to derive and visualize actionable insights for decision making.
Analyzed report requirements and developed reports by writing Teradata SQL queries and using MS Excel, PowerPoint, and UNIX.
Conducted analysis to measure financial and clinical impacts and outcomes of programs and interventions using descriptive statistics, correlation tests, linear/logistic regression and ANOVA.
Built dynamic dashboards using Tableau to track patient adherence and other user-centered metrics.
Created forecasts using ARIMA models to track performance of the Operations team for budget forecasting/variance analysis.
Designed summary data layers by integrating and manipulating data from disparate sources to support planned analyses, using SAS and SQL.
Tailored descriptive analytic solutions to specific client needs and formulated recommendations to the operations team based on findings.
Gathered and analyzed business requirements, interacted with various business users, project leaders, and developers, and took part in identifying different data sources.
Involved in Data Analysis, primarily identifying Data Sets, Source Data, Source Metadata, Data Definitions, and Data Formats.
Generated new data mapping documentation and redefined the requirements in detail.