Roshini Padmanabha
******@****.*** +1-619-***-**** Open to Relocate in US linkedin.com/in/roshini-p21 Portfolio Github PROFESSIONAL EXPERIENCE
Data Engineer Intern
RESMED
•Engineered high-volume ETL pipelines with Apache Airflow, DBT & Python to ingest and transform terabyte-scale healthcare data.
•Migrated healthcare SaaS data from Azure MySQL onto AWS S3 & Snowflake warehouse, reducing query processing times by 30%.
•Developed 10+ BI Analytical dashboards using Power BI & DAX, driving business-critical analytics on patient metrics, operational costs, and SLA adherence, supporting decision-making for senior product managers using KPI's.
•Implemented data models for SaaS products, configuring Snowflake to optimize ETL processes, improving data pipeline efficiency by 15%. Analytics Engineer Intern
CLEANTECH SAN DIEGO - SDSU Research Foundation
•Streamlined site expansion processes for EV startups by creating 5 Tableau dashboards leveraging data from GCP, BigQuery and SQL.
•Automated ETL pipelines using DataProc, integrating data from multiple APIs into a centralized database, improving data accessibility and reducing manual effort by 30%. Delivered product risk analysis and go-to-market strategies using A/B testing methods. Data Analyst & Software Developer
TATA CONSULTANCY SERVICES
•Developed intuitive analytical dashboards and comprehensive retail reports using SQL, Databricks, and Excel, clearly visualizing key metrics such as sales performance, inventory turnover, and pricing effectiveness, directly enhancing strategic media marketing insights.
•Engineered and optimized complex SQL queries, data schemas, and models, reducing analytics pipeline processing time by 30%, enabling timely delivery of accurate reports focused on media engagement and customer behavior insights.
•Automated and streamlined data workflows leveraging scripting, ETL orchestration, and process automation tools, significantly reducing manual intervention by 40%. Delivered accurate, and error-free media performance reporting and promotional analytics insights.
•Managed and enhanced data warehousing solutions with AWS Redshift and S3, seamlessly integrating diverse data (CSV, JSON, XML) from E- Commerce, ERP, and marketing channels, resulting in a 25% improvement in data processing speed and quality assurance.
•Collaborated cross-functionally with CRM, Brand Marketing, Finance, & Technology teams, translating complex data into actionable insights, performing detailed data validation in UAT and PROD environments, achieving 98%+ system availability to meet strict SLA standards.
•Conducted rigorous root-cause analysis for data-flow and reporting issues by systematically analyzing logs and resolving tickets via Zendesk, ensuring high data accuracy, reliability, and quick turnaround times for internal stakeholders. Data Analyst - Graduate Assistant
SAN DIEGO STATE UNIVERSITY - Digital Innovation Lab
•Built a scalable database, data integration solutions using SSIS, Python & MySQL database improving performance optimization & data quality.
•Improved research workflows by 15%, through data preprocessing & statistical analysis using SQL & Excel for Qualtrics survey data. EDUCATION
San Diego State University
Master of Science in Information Systems, GPA - 3.8/4.0 Visvesvaraya Technological University
Bachelor of Engineering in Computer Science, GPA - 3.6/4.0 Achievements • Founder - TOCE Cult • Vice-Chair, IEEE Computer Society • SDSU S3 Symposium Winner • Volunteer - TCS Purpose4Life SKILLS
Programming Languages - Python, SQL, PL/SQL, R, Java, HTML, CSS, JavaScript, PHP, Linux, UNIX, Bash Databases - Oracle, Snowflake, PostgreSQL, MySQL, NoSQL, BigQuery, DynamoDB, MongoDB Foundations: Agile, SDLC, CI/CD, Restful API Tools/Software - AWS, Azure, GCP, Github, Toad, Looker, Apache Flink, Kafka, Databricks, Hadoop, Docker, Terraform, Kubernetes Libraries/Frameworks - React, NumPy, Pandas, Scikit-learn, Spark, TensorFlow Soft Skills/Project Management - Jira, Confluence, Organizational, Leadership, Problem Solving, Analytical Thinking, Teamwork, Communication PROJECTS
AWS Real-Time Data Streaming Pipeline and Visualization : Github
•Deployed an AWS pipeline and end-to-end real-time solution for data ingestion of player game leveraging (Kinesis, Firehose), stream analytics
(Apache Flink), storage (S3), cataloging (Glue), querying (Athena), and built 5+ visualization dashboards on QuickSight. Warehouse Inventory Data Analytics - Tableau Dashboard : Github
•Built Tableau dashboards, used Python for data cleaning & Kaggle API to extract datasets, enabling real-time inventory tracking & decision-making.
•Developed key metrics - Product turnover rates, Stock levels, and Inventory aging, optimizing stock replenishment & inventory costs by 15%. New York Taxi Dataset - Real-Time Data Analytics Pipeline & Machine Learning : Github
•Built a real-time ETL pipeline using Kafka, Spark, ElasticSearch, & HDFS, streaming 1M+ NYC Taxi trip records daily, & created a live dashboard in Power BI, reducing reporting delays. Also, implemented ML-based ETA predictions with 90% accuracy. CERTIFICATES
AWS Cloud Foundations, AWS Data Engineering, Advanced Databases, Apache Airflow, Tableau, Data Structures & Algorithms May 2024 – Aug 2024 San Diego, United States
Aug 2024 – Dec 2024 San Diego, United States
Aug 2021 – Jul 2023 Bengaluru, India
Sep 2023 – Present San Diego, United States
Aug 2023 – May 2025 San Diego, United States
Aug 2017 – Aug 2021 Bengaluru, India