BASANTH PERIYAPATNA ROOPA KUMAR
San Jose, CA +1-408-***-**** ***********@*****.*** linkedin.com/in/basanth-p-r/ github.com/BasanthPR EDUCATION
Master of Science in Data Analytics and Applied Data Science, San Jose State University, San Jose, CA Aug 2024 - Present Relevant Coursework – Math Models for Data Analytics, Data Warehouse & Pipelines, BI & Data Visualization, Machine Learning, Deep Learning, Generative AI Applications, Big Data Technologies, Distributed Systems for Data Engineering. Bachelor of Engineering in Information Science and Engineering, PESCE, Mandya, India Aug 2017 – Jul 2021 Relevant Coursework – DBMS, Python, Data Science, Machine Learning, Cloud Computing, Software Project Management, Operating Systems, Computer Architecture, Discrete Mathematics, Data Structures, Analysis & Design of Algorithms. WORK EXPERIENCE
Machine Learning Research Assistant, San Jose State University, San Jose, CA Jan 2025 – Present
• Design and implement machine learning models (Random forest, Support Vector Machines, and k-nearest neighbors) for pixel-level classification and object detection using ArcGIS Pro, and enhance wildfire hazard identification accuracy.
• Leverage deep learning methods (Mask R-CNN & Segment Anything Model via segment-geospatial Python package) for precise segmentation, automating delineation of defensible spaces and vulnerable structures. Data Analyst, AtkinsRealis, Bangalore, India Apr 2023 – Jun 2024
• Spearheaded design and execution of scalable ETL process for a Resource Usage Dashboard utilizing Azure Data Factory, Databricks, & SQL Server, enabling efficient cloud data orchestration.
• Optimized resource allocation in cloud environments by analyzing CPU & Virtual Machine metrics via SQL window functions, CTEs, and subqueries for performance insights, reducing overall computational costs by 55%..
• Crafted interactive dashboards in Power BI with embedded KPIs, DAX measures, and calculated columns, fostering decision- making and improving data analysis turnaround time by 40%.
• Built a GUI tool with PyQt5, combining Selenium, BeautifulSoup, and requests for efficient web data extraction, improving data retrieval by 65%, with Kubernetes deployed for workload orchestration and automation. Business Operations Analyst, 6D Technologies, Bangalore, India Sep 2021 – Sep 2022
• Fine-tuned ML models on the MAGIK CVM platform to predict churn and recommend next-best actions, leading to a 15% reduction in churn and accelerated ARPU, leveraging insights from SMSC & USSD..
• Developed predictive models for customer behavior utilizing Python, automating workflows in MAGIK’s Data Scientist Workbench, and integrated Kafka for continuous data streaming, boosting customer retention by 28%.
• Engineered Tableau dashboards by unifying SQL-based ETL pipelines with Apache NiFi and Dockerized deployments, enhancing visibility into network performance, NPS/SAT scores, and campaign effectiveness.
• Tested and deployed AI-driven segmentation models on the Digital BSS platform via REST APIs & Postman, improving customer classification & campaign execution through CI/CD pipelines, reducing latency time by 35%. Software Development Intern, Arohaka Technologies Pvt. Ltd, Mysore, India Jan 2021 – May 2021
• Strengthened company’s logistics website applying HTML, CSS, JavaScript & React, collaborating with UI/UX team to maintain a consistent, user-friendly interface, leading to a 30% increase in traffic and boosted engagement. SKILLS
Programming: Python, C, C++, JavaScript, Linux, Shell Scripting, Node.js. Software Engineering: Data Structures & Algorithms, Object-Oriented Programming, RESTful API Development, Redux, React, Git, CI/CD, SDLC, Microservices, Docker, Kubernetes. Data Science & Machine Learning: Statistics, Linear Algebra, Calculus, PyTorch, TensorFlow, Neural Networks, Regression Analysis, Decision Trees, Time Series Analysis, Gradient Boosting, EDA, Natural Language Processing. Data Engineering: Kafka, Spark, Flink, Data Warehouse (Snowflake, Redshift, BigQuery, Microsoft Fabric), ETL Tools. Data Analysis: Statistical Analysis, Predictive Analytics, Clustering Analysis, Hypothesis Testing, Data Cleaning. Database Management: SQL, MySQL, MongoDB, PostgreSQL, SQL Optimization, NoSQL, Redis. PROJECT EXPERIENCE
Big Data Pipeline for Multi-Source Breast Cancer Risk Analysis: Engineered an extensible end-to-end data engineering pipeline ingesting heterogeneous healthcare data (EHR, medical imaging, wearable sensor logs) into HDFS and AWS S3. Orchestrated batch processing workflows with Hadoop MapReduce and Spark MLlib for feature engineering and risk modeling and built a sub-second streaming pipeline with the aid of Apache Kafka and Spark Streaming. Trained Random Forest, XGBoost, and CNN (ResNet, EfficientNet) models with SHAP, LIME, and Grad-CAM explainability, delivering real-time insights at scale. Predictive Modeling of Alzheimer’s Risk Using Behavioral and Seasonal Indicators: Utilized CDC’s Healthy Aging dataset to model Alzheimer’s risk among adults 65+ across U.S. states. Performed EDA applying Pandas and NumPy, reducing nulls by 37%. Built classification models (Random Forest, XGBoost) achieving 91% accuracy, identifying key features like depression scores, sleep hours, and seasonal mood shifts. Visualized temporal trends and model outcomes in Power BI, enabling regional and time-based filtering for health policy decision-making. Data-Driven Strategies for Reducing Customer Churn in Auto Insurance: Constructed a scalable data warehouse on Amazon Redshift, assimilating structured customer data for churn analysis. Devised automated ETL pipelines harnessing Apache Airflow to streamline data ingestion and transformation, optimizing query performance. Configured star schema modeling to amplify analytical capabilities and visualized key churn metrics and customer demographics on Power BI, driving strategic retention initiatives.
UberEATS Prototype Full-Stack Application: Developed a full-stack UberEATS prototype with Node.js, Express, and MySQL for the backend, and React with Bootstrap for the frontend. Implemented session-based authentication with bcrypt and designed robust RESTful APIs for customer and restaurant workflows. Used Redux for modular, maintainable state management and ensured data integrity adopting MySQL foreign and composite keys. A Cloud-Based Approach to Soybean Traceability in Agricultural Supply Chain employing Blockchain: Modeled a cloud- based soybean traceability system to refine transparency in agricultural supply chain. Structured a secure backend employing Blockchain, C#, ASP.NET, GlusterFS, and MySQL, enabling immutable data recording and real-time tracking of soybean lifecycle events.
RESEARCH EXPERIENCE
Inspecting Residential Wildfire Hazards using UAVs and Object-Oriented Machine Learning Methods: Contributing to CalFire funded study leveraging high resolution UAV imagery and object oriented machine learning to automate detection of residential wildfire hazards at sub meter scale. Processing drone ortho mosaics into four band rasters and deriving DSM/DTM height models in ArcGIS Pro. Trained and benchmarked Random Forest, SVM, and k NN classifiers to classify six land cover classes and quantify 0–5 ft defensible space hazards with >92% accuracy. LEADERSHIP
• Secretary & Director of Social Media, ISO and Kannada Sangha, SJSU: Led social media strategy and community engagement, driving a 40%+ increase in event turnout and expanding digital reach through targeted content, collaborations, and consistent branding.
• Youth President, International Lingayat Youth Forum: Directed the strategic vision and execution of nationwide youth led social impact programs, mobilizing cross functional volunteer teams, forging partnerships with NGOs and educational institutions, and significantly boosting program visibility and stakeholder collaboration.
• Founder & Chief of Social Media and College Affairs, Office of Alumni Affairs Club (OAAC), PESCE: Pioneered OAAC’s launch securing alumni funding to amplify peer opportunities, and managed social media and college affairs to strengthen constituent relations.
• Fundraiser, Make A Difference (MAD): Mobilized and guided a team of volunteers to plan and execute public fundraising initiatives, raising over $5,000 for educational resources and increasing donor participation by 60%. CERTIFICATIONS & COURSES
1. Master Data Science and Advanced Programming: Completed a professional Data Science certification from IIT Madras and GUVI in collaboration with NASSCOM, mastering advanced Data Science skills through hands-on projects. 2. Microsoft Certified: Azure Fundamentals (AZ-900), Microsoft Certified: Azure Data Fundamentals (DP-900) and Microsoft Certified: Power Platform Fundamentals (PL-900).
COMMUNITY SERVICE
• MAD (Make a difference) Top NGO in India – Teaching Volunteer, Mysore May 2019 – Oct 2019 Volunteered extensively to mentor foundational skills for underprivileged children. Conducted engaging educational programs, encouraging a positive learning environment. Contributed to the academic and personal development of children, leaving a lasting impact on their lives.
• Amanuensis for Physically Challenged Senior: Apr 2014 Provided one on one academic support as a volunteer scribe for a physically challenged student, enabling successful completion of 10th grade (sophomore year) coursework and fostering an inclusive learning environment. AWARDS AND ACHIEVEMENTS
• #WOW Award (Star performer in a project), AtkinsRealis Oct 2023 Recognized for exceeding client expectations by resolving persistent issues and creating a robust GUI tool for efficient data extraction.
• IEEE Tech speaker, PESCE Mandya: Recognized for delivering appealing presentations on state-of-the-art technology, contributing to knowledge dissemination and nurturing a tech-savvy community.
• State Level Quiz Winner (10th Grade): Demonstrated excellence by clinching the title of State Level Quiz Winner. EXTRACURRICULARS
• Photography, Blogging, Hiking, Travelling, Volunteering and active participation in Hackathons.