Post Job Free
Sign in

Data Analyst - 5+ Years in Python, SQL, SAS Tools

Location:
Plainsboro Center, NJ, 08536
Posted:
November 19, 2025

Contact this candidate

Resume:

SUMMARY

Sai Ruchitha Babu

Data Analyst

New Jersey, USA +1-551-***-**** *************@*****.*** LinkedIn

• Data Analyst with 5+ years of experience leveraging Python, SQL, and SAS to transform structured/unstructured datasets into actionable insights, enabling efficiency, accurate reporting, and improved strategic business outcomes.

• Designed and automated scalable ETL pipelines using SSIS, Informatica, and Python, integrating multi-source cloud/on-premises data, improving reliability, accessibility, analytics readiness, and eliminating redundancy.

• Engineered optimized SQL queries across MySQL, PostgreSQL, Oracle, and enterprise cloud databases (AWS, Azure), reducing query times drastically, enabling real-time reporting, and driving agile operational decision-making.

• Built and deployed Tableau, Power BI, and Excel dashboards applying interactive data storytelling, empowering leadership to monitor KPIs, visualize trends, and accelerate mission-critical business decisions with clarity.

• Applied machine learning, predictive modeling, clustering, and statistical techniques (regression, A/B testing) using Scikit-learn, TensorFlow, and Alteryx, generating forecasts and actionable insights to boost profitability and retention.

• Delivered high-quality analytics solutions within Agile, Scrum, and SDLC frameworks, leveraging Git, Bitbucket, and CI/CD pipelines, fostering collaboration, scalability, and timely delivery aligned with stakeholder requirements. SKILLS

Methodologies: SDLC, Agile, Scrum, Waterfall

Programming Language: Python, SQL, Scala

Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn, TensorFlow, Seaborn, dplyr, ggplot2 Visualization Tools: Tableau, Power BI, Advanced Excel (Pivot Tables, VLOOKUP, VBA, Macros) IDEs: Visual Studio Code, PyCharm, Jupyter Notebook, IntelliJ Database: MySQL, PostgreSQL, SQL Server, EHR, Oracle Cloud Platform: Amazon Web Services (Athena, Redshift, Sagemaker, Glue, Lambda, Kinesis), Microsoft Azure(ADF, Synapse Analytics, Databricks), Google Cloud Platform(GA4, BigQuery, Dataflow) Data Engineering Tools: SSIS, SSRS, Informatica, Apache Airflow, Docker, Kubernetes, ServiceNow, Databricks, Delta Lake, Hadoop, Spark, Snowflake, MapReduce, Alteryx, Dataiku Machine Learning Tools: Regression, Classification, Clustering, Time Series Forecasting (ARIMA, Prophet, LSTM), A/B Testing, Hypothesis Testing, Model Metrics, Confidence Intervals, ANOVA Other Tools & Skills: Data Cleaning, Data Wrangling, Data Transformation, Data Mining, Data Warehousing, Data Storytelling, Data Interpretation, Data Pipelines, OLAP & OLTP, Data Quality, Data Governance, KPI/OKR Tracking, Statistical Analysis, Git, GitHub, Bitbucket, MLflow, SAS, SPSS, MS Office Suite, Linear Algebra, Probability Distributions

WORK EXPERIENCE

Data Analyst Cigna HealthCare, USA Jan 2024 – Present

• Built a predictive analytics solution using Python and Scikit-learn that reduced patient readmission risks by enabling real-time, proactive care through seamless integration with enterprise healthcare systems.

• Created interactive dashboards in Tableau and Matplotlib to visualize outcomes and metrics, applying data storytelling to help leadership identify care gaps and improving patient satisfaction by 15% across multiple facilities.

• Streamlined ETL processes using Apache Airflow and Docker on AWS, enabling the secure handling of HIPAA- compliant claims data, reducing manual workload by 40%, and improving overall system reliability.

• Projected seasonal healthcare utilization trends using Pandas and Time Series models, which optimized clinic staffing strategies, improving operational efficiency, and reducing patient wait times during high-demand periods.

• Conducted A/B testing on patient outreach campaigns, using statistical analysis to optimize engagement methods, enhance patient outcomes, increase communication response rates, and improve treatment adherence program-wide

• Led bi-weekly Agile sprints with cross-functional teams, collaborating with product owners and stakeholders to prioritize deliverables, streamline data workflows, and ensure they meet healthcare compliance requirements.

• Established data quality frameworks using SQL Server and advanced queries by implementing validation layers and governance standards, which improved the accuracy and trustworthiness of clinical reporting systems.

• Analyzed claims data in AWS Redshift to identify high-risk patients, helped launching targeted wellness programs that reduced hospitalization which improved patient outcomes, and optimized care delivery costs.

• Developed advanced predictive models using machine learning techniques to identify high-risk patient cohorts, enabling targeted interventions that reduced emergency department visits by 20% and improved patient outcomes. Data Analyst Deloitte, India Sep 2018 – Dec 2022

• Managed 45% of platform development by designing advanced SQL queries and conducting detailed EDA on credit portfolio data using PostgreSQL and Spark, enhancing accuracy for time-sensitive credit risk reporting.

• Developed dynamic Power BI dashboards and Advanced Excel techniques like Pivot Tables and VLOOKUP, empowering executives to visualize key credit risk indicators and promptly detect anomalies across lending portfolios.

• Designed and deployed ETL pipelines on Azure Kubernetes Service using automation and containerization, optimizing cloud infrastructure for scalable data ingestion, high performance, and real-time analytics delivery.

• Directed the integration of Databricks and Delta Lake to unify OLAP queries with OLTP loan data, improving reporting speed by 40%, enhancing data integrity, and enabling faster, more accurate credit risk insights.

• Validated predictive models using clustering, regression, and hypothesis testing on Hadoop-based datasets for patterns and anomalies, uncovering key financial risk indicators, and supporting forecasting strategies.

• Built time series forecasting models using ARIMA and Prophet to predict cash flow and repayment trends, enabling risk teams to plan and proactively manage potential financial risks.

• Implemented SDLC best practices by managing iterative sprint cycles and comprehensive documentation, delivering compliant, scalable analytics solutions aligned with evolving business needs and regulatory requirements.

• Led 20% of data governance initiatives by designing advanced SQL validation frameworks across Databricks and Azure, improving data accuracy by 37% and strengthening enterprise reporting.

• Enhanced reporting efficiency by implementing SSIS and SSRS solutions, streamlining data integration processes, improving operational efficiency, and aligning analytics workflows across business and technical teams. EDUCATION

Master of Science in Data Science Jan 2023 – Aug 2024 University of Massachusetts Dartmouth, USA

Bachelor of Engineering in Information Science and Engineering Apr 2016 – Aug 2020 The Oxford College of Engineering, Bangalore, India CERTIFICATIONS

• Core Designer Certificate from Dataiku

• Google Advanced Data Analytics Specialization.

• Data Science Tools Certification from Cognitive Class.

• Data Visualization with Python Certification from Cognitive Class.

• Data Analytics and Visualization job simulation certification from Accenture.

• AWS Solution Architect Certificate.



Contact this candidate