Bhavana Sajja
Data Analyst
Arlington, Texas +1-940-***-**** ****************@*****.*** LinkedIn
PROFESSIONAL SUMMARY
Data Analyst with over 3 years of experience designing and delivering robust data solutions that support business decision-making. Skilled in SQL, Python (Pandas, Scikit-learn), and cloud platforms including Azure and AWS for building automated ETL pipelines and managing large-scale datasets. Experienced in developing interactive dashboards and reports in Power BI and Tableau to generate actionable insights. Expertise includes statistical analysis, machine learning modeling, and data validation, with a strong focus on data governance, HIPAA and GDPR compliance, and role-based access control (RBAC). Works effectively in Agile environments, collaborating across teams and using version control and DevOps tools to streamline workflows and improve reporting efficiency.
TECHNICAL SKILLS
Programming & Querying: Python (Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn), SQL (T-SQL, PostgreSQL, MySQL), Shell Scripting (Bash, PowerShell), PySpark
Big Data & Streaming: PySpark, Apache Spark, Apache Kafka (stream processing fundamentals), AWS Kinesis
Cloud Platforms & Services: Azure (Data Factory, Synapse Analytics, Data Lake Storage, and Blob Storage), AWS (S3, Redshift, Glue, SageMaker, and Lambda for lightweight automation), and Azure DevOps basics
ETL & Orchestration: dbt (modular transformation pipelines), Azure Data Factory (pipeline orchestration), Apache Airflow (workflow automation, DAGs), SQL-based ETL pipelines
Data Warehousing & Databases: Snowflake, Azure Synapse Analytics, AWS Redshift, PostgreSQL, MySQL (RDBMS fundamentals, indexing, query tuning)
Analytics & Modeling: Statistical Analysis, Machine Learning (Scikit-learn, basic modeling pipelines), Forecasting, Churn Modeling, Fraud Detection, A/B Testing, Data Profiling & Validation
BI & Visualization Tools: Power BI (DAX, Power Query, dashboard design), Tableau, Excel (PivotTables, VLOOKUP, INDEX-MATCH, VBA macros)
DevOps & Infrastructure Awareness: Familiarity with containerization basics (Docker fundamentals), version control with Git/GitHub, CI/CD pipeline concepts
Data Security & Compliance: Understanding of HIPAA, RBAC, GDPR basics, data governance principles, role-based access controls in cloud data environments
Collaboration & Methodologies: Agile & Scrum methodologies, SDLC awareness, version control best practices, cross-functional communication, documentation in Jira and Confluence
PROFESSIONAL EXPERIENCE
Data Analyst Humana Texas, USA Dec 2024 – Present
• Developed and maintained automated ETL pipelines using Azure Data Factory and Synapse Analytics, integrating large-scale healthcare claims and patient data while ensuring HIPAA compliance.
• Designed and optimized Power BI dashboards with DAX and Power Query for real-time KPIs related to claims processing, patient risk stratification, and care management.
• Built predictive models for churn and fraud detection using Python (Scikit-learn), improving early identification of at-risk members and reducing fraudulent claims.
• Collaborated cross-functionally with clinical teams, data engineers, and stakeholders following Agile methodologies, utilizing Jira and Confluence for project tracking and documentation.
• Applied rigorous data profiling and validation processes to maintain data quality and support audits aligned with GDPR and role-based access controls (RBAC).
• Enhanced automation and deployment processes with Azure DevOps and GitHub, improving pipeline reliability and reducing manual intervention by 45%.
• Achieved a 15% reduction in claim denials by providing actionable insights through interactive Power BI dashboards.
Data Analyst Hexaware Technologies India May 2021 – Jul 2023
• Built scalable SQL-based ETL pipelines on Snowflake and AWS Redshift to aggregate and transform healthcare and insurance data from legacy systems.
• Automated data cleansing and batch processing using Python (Pandas) and PySpark, ensuring high data integrity across multi-terabyte datasets.
• Created and delivered Tableau and Excel reports featuring PivotTables, INDEX-MATCH, and VBA macros to support claims lifecycle monitoring and financial reconciliation.
• Orchestrated data workflows with Apache Airflow to automate pipelines and ensure timely analytics delivery.
• Supported near-real-time data ingestion projects leveraging Apache Kafka and AWS Kinesis streaming fundamentals.
• Implemented data governance best practices, adhering to GDPR and HIPAA compliance with secure data access through role-based controls.
• Optimized SQL queries and indexing on Snowflake, reducing report generation time by 30%.
• Developed data validation frameworks that improved data accuracy by 25%, enhancing regulatory compliance.
• Automated recurring reporting processes, increasing analyst productivity by 20%.
EDUCATION
Master’s in Business Analytics University of Texas at Arlington Arlington, TX May 2025
Bachelor’s in Computer Science in Business Systems SRM Institute of Science and Technology India May 2023
CERTIFICATIONS
• Microsoft Power BI Data Analyst (PL-300)
• Google Data Analytics Professional Certificate
PROJECT
Electricity Demand Forecasting Under Climate Scenarios Treeni, US
• Developed time-series models (ARIMA, SARIMA, and ETS) in Python and R to forecast electricity usage through 2040 with a 6% MAPE across 10+ commercial facilities.
• Modeled climate-based scenarios using RCP pathways and integrated simulation constraints (capacity, growth rates), supporting policy-driven energy planning initiatives.
• Delivered insights that improved client forecasting accuracy by 22% and supported execution of 30% of the organization’s sustainability roadmap.