Vaishnavi Peddada
979-***-**** Austin, TX *******************@*****.*** LinkedIn
SKILLS
Programming: Python, Java, R, Scala, SQL, DAX, HTML, CSS, JavaScript, Shell Scripting
Data & Visualization: SQL (PostgreSQL, MySQL, T-SQL, SQL Server), NoSQL (MongoDB, Redis, DynamoDB), Power BI, Tableau
Big Data & DevOps: Apache Spark, Apache Airflow, Hadoop, dbt, Kafka, Alteryx, Docker, Kubernetes, Terraform, Git, GitLab, Jenkins
Cloud: AWS (Redshift, Glue, Lake Formation, S3, EC2), Azure (Synapse, Data Factory, Data Lake), Databricks
Libraries: NumPy, Pandas, Matplotlib, Scikit-learn, TensorFlow, Keras, SciPy, Seaborn, PyTorch, OpenCV, BeautifulSoup, NLTK
Certifications: AWS Solutions Architect Associate, Tableau Desktop Specialist, Salesforce AI Associate, Professional Scrum Master
PROFESSIONAL EXPERIENCE
Texas A&M University, Graduate Assistant College Station, TX August 2024 – Present
• Developed Tableau dashboards and Snowflake analytics, delivering real-time student insights and AI forecasting to improve retention
• Built Airflow-based ETL workflows integrating Canvas and exam sites, cutting manual ingestion by 60% and ensuring instant updates
• Refactored PostgreSQL queries and Python scripts, reducing reporting time by 45% and enabling faster identification of at-risk students
Crane CPE, Data Analyst Houston, TX May 2024 – August 2024
• Automated ETL processes by introducing data governance policies and quality checks for Salesforce and 24 ERPs with T-SQL, Python, and Power BI (DAX, Power Query), reducing manual data cleaning time by 50%
• Constructed 5+ data pipelines to integrate structured and unstructured data into Azure Synapse, increasing data consistency and accelerating analytics workflows by 35%
• Created interactive dashboards in the Gold Layer of the BI project, boosting data accessibility by 30% and enabling real-time data monitoring and anomaly detection
• Enhanced leadership reports for Sales, Finance, Manufacturing, Marketing, and Supply Chain by establishing 10+ KPIs for 50+ key data quality metrics in a Microsoft Fabric BI project involving Azure Synapse Analytics, Azure Data Factory, and Power BI
Oracle, Applications Developer Bengaluru, India January 2021 – July 2023
• Developed and optimized Spark-based ETL pipelines to process large-scale data for integration between Oracle products and AWS, SAP, Workday, enhancing data transformation and analytics performance
• Implemented Hadoop-based batch processing for logging, improving system diagnostics and reducing troubleshooting time by 35%
• Deployed Databricks notebooks for business intelligence workflows, automating data cleansing and real-time KPI monitoring
• Automated data pipelines utilizing Java, SQL, Python, and Apache Airflow, visualizing KPIs for system performance, error tracking, interface status, tenant usage, and business reporting
• Refined SQL queries with window functions, CTEs, stored procedures, and indexing, reducing query execution times by 200%
• Migrated tenant data from Oracle DB and SharePoint to AWS S3 and Redshift, cutting storage costs by $800K and boosting scalability
• Implemented CI/CD pipelines with Jenkins & Git, increasing release frequency by 400%, and integrated Terraform for automation
• Led a team of 5 for PIF upgrade from Grails to Java Spring Boot for modular architecture, faster deployments and API responsiveness
Oil and Natural Gas Corporation, Data Intern Mumbai, India June 2019 – July 2019
• Performed ETL using MySQL to analyze factors influencing oil recovery for analytical reports improving decision making by 200%
• Built a predictive analytics model with Python (Scikit-learn, NumPy, Pandas) to predict optimal drilling locations with 80% accuracy
ACADEMIC EXPERIENCE/PROJECTS
Dominick’s Fast Food Supply Chain Optimization, (SSMS, SSIS, SSAS, SSRS, OLAP, Data Warehousing)
• Constructed an end-to-end data warehouse using Kimball’s methodology to solve supply chain inefficiencies
• Designed and implemented ETL pipelines utilizing SSIS and SQL Server, loading data into a star-schema data model
• Created SSRS and Power BI reports for real-time decision-making, enabling at least a 40% increase in supply chain efficiency
Point of Sale System on AWS EC2, (MySQL, MariaDB, MongoDB)
• Built a MariaDB instance using MySQL triggers, materialized views, stored procedures, and ETL workflows for 100K+ sales records
• Migrated transactional data from MariaDB to MongoDB, restructuring data into JSON for scalability and real-time analytics
Machine Learning based Predictive Maintenance, (Python, Pandas, NumPy, Scikit-learn, Matplotlib, Jupyter Notebook)
• Developed Pandas, NumPy, and Scikit-learn models to predict equipment failures across 10K records with 14 parameters
• Optimized classification using operational parameters, improving predictive maintenance as measured by accuracy, precision-recall, and F1-score, while addressing dataset limitations
EDUCATION
Texas A&M University, College Station, TX May 2025
MS in Management Information Systems GPA: 4/4
Coursework: Advanced Data Management, Big Data, Data Warehousing, Machine Learning and Data Analytics
Vellore Institute of Technology, India June 2021
BTech in Electronics and Communication Engineering GPA: 3.94/4