SUMMARY
ANKITHA SAJJAN
Data Analyst
Austin, TX ***********@*****.*** +1-913-***-**** LinkedIn Data Analyst with 5+ years of experience transforming complex datasets into actionable insights using SQL, Python, R, and Excel across cloud platforms like AWS, Azure, and GCP. Experienced in designing and optimizing ETL/ELT pipelines, data modeling, and predictive analytics to improve decision-making efficiency. Skilled in Power BI, Tableau, and Looker dashboards, KPI tracking, A/B testing, and data governance for accurate and scalable reporting. Adept at trend analysis, anomaly detection, and customer segmentation to drive growth and operational improvements. SKILLS
Programming & Databases: SQL (Advanced), Python, R, MySQL, PostgreSQL, Oracle, Excel (Advanced), Data Modeling, JSON/JSONB
Cloud & Data Platforms: AWS (S3, Redshift, EMR), Azure (Data Factory, Synapse, ML), GCP (BigQuery), Snowflake, Hadoop, Spark, ETL/ELT, Data Integration, PowerShell
Data Science & Analytics: Predictive Modeling, Machine Learning (scikit-learn, XGBoost, LightGBM), Data Mining, Descriptive Statistics, Trend/Pattern Analysis, A/B Testing, Data Cleaning & Validation, Customer Segmentation, Anomaly Detection Visualization & BI Tools: Power BI, Tableau, Looker, Dashboard Design, Reporting, Data Visualization Methodologies & Governance: Agile, Scrum, Requirement Gathering, Data Governance, KPI Tracking, Documentation, Stakeholder Engagement
WORK EXPERIENCE
Nordstrom Austin, TX Sr. Data Analyst Oct 2023 – Present
• Designed and optimized ETL pipelines using Azure and MySQL with Snowflake Schema models, powering real-time Power BI dashboards that cut reporting latency by 6 hours per refresh and served 200+ business users across operations.
• Leveraged Python (Pandas, Matplotlib) for EDA on 5TB+ Hadoop data, uncovering data quality issues that improved analytical accuracy by 28% and reduced reprocessing workload.
• Architected and built scalable ELT pipelines integrating Azure Data Lake Storage (ADLS) Gen2 and Azure HDInsight, automating multi-source reporting for 5M+ records daily and reducing refresh cycles by 86% (from 3 hours to 25 minutes)
• Developed predictive analytics models with scikit-learn in python, integrated into Excel dashboards versioned in Git, boosting forecast accuracy by 32% and eliminating 80% manual effort.
• Partnered with finance and inventory teams to translate insights into retail strategies, improving restock accuracy and reducing out-of-stock events by 15%.
Murphy Gas Irving, TX Data Analyst Aug 2022 – Sept 2023
• Deployed automated ETL workflows using R and PostgreSQL on AWS, leveraging JSON/JSONB for unstructured data; powered Tableau dashboards serving 100+ daily users with zero downtime.
• Engineered Spark–Snowflake data integration pipelines in R and scikit-learn, cutting data ingestion time by 45% and scaling data flow 3 without added cost.
• Implemented Wrapper Methods, XGBoost and LightGBM models, enhancing KPI prediction accuracy (0.71 0.91 ROC- AUC) and accelerating deployment cycles by 40%.
• Collaborated with marketing and product teams to convert analytics insights into campaign optimizations, increasing customer engagement by 22% and ROI by 18%.
• Designed Anomaly Detection and A/B Testing dashboards using Tableau and PostgreSQL, improving issue identification from days to minutes and raising test reliability by 33%. Hexaware Technologies India Data Analyst March 2019 – July 2021
• Integrated MySQL and CSV datasets into unified Looker Studio dashboards, streamlining analytics across 6 departments and saving 15+ reporting hours weekly.
• Built ELT frameworks on Google BigQuery (GCP) processing 1M+ records monthly, delivering analytical datasets 30% faster than legacy systems
• Automated ML workflows using Python (Pandas) and LookML, reducing manual data wrangling by 60% and cutting model training time by 3 hours per iteration.
• Optimized KPI focused data cleaning in BigQuery, improving data reliability by 35% and enabling 100% verified insights for leadership reports.
• Built Excel dashboards using Power Query, Pivot Tables, and INDEX-MATCH, cutting manual reporting time by 75% and improving forecast accuracy for sales and operations. EDUCATION
UNIVERSITY OF CENTRAL MISSOURI Missouri, USA
Master of Science in Computer Science Aug 2021 - Dec 2022