Srujan Shekar Shetty
Chicago, IL 312-***-**** *********************@*****.*** linkedin.com/in/srujanshetty GitHub Portfolio YouTube
WORK EXPERIENCE
ITM Brigade, Illinois Institute of Technology Chicago, Illinois
Big Data Volunteer Jan 2025 - Present
Configured and optimized KVM switch systems for a lab environment, enhancing server performance by 30% and supporting the deployment of 15 rack-mounted servers, which streamlined the Smart Lab setup process.
Developed and executed 7 robust end-to-end Spark applications utilizing PySpark and Scala, achieving a data processing efficiency improvement of 40% while analyzing datasets from notable projects such as the SF Fire Department and Divvy Trips.
Implemented advanced data compression techniques in PySpark applications on a UNIX server using Unix Shell Scripting, comparing LZ4, Snappy, and Brotli to reduce storage requirements by 25%, while ensuring version control through GitHub for seamless collaboration across development teams.
NetConnect Global Bengaluru, India
Business Intelligence Analyst May 2022 - Jul 2023
Spearheaded the development of advanced analytics frameworks utilizing SQL and Python, transforming over 20K data points into actionable HR insights that increased decision-making efficiency by 30%.
Executed the design and implementation of data validation pipelines with SQL and Power Query, ensuring a 99% accuracy rate in critical business metrics across enterprise reporting dashboards, which enhanced stakeholder confidence in decision-making processes.
Enhanced Power BI report performance through strategic optimization of DAX calculations and query folding techniques, resulting in a 15% reduction in load times for dashboards used by over 50 users daily, thereby accelerating access to key insights for operational strategies.
Aspire Tele-Solutions Bengaluru, India
Data Analyst Sep 2021 - Apr 2023
Spearheaded the development of over 15 Tableau dashboards, transforming intricate business requirements into actionable visual analytics solutions, resulting in a 40% reduction in reporting time for clients like OLA and enhancing decision-making capabilities.
Engineered more than 10 interactive Excel reports utilizing pivot tables, advanced formulas, and VBA scripting to streamline HRIS analytics processes, increasing workforce performance tracking efficiency by 30%.
Implemented comprehensive data integrity checks across 5 operational datasets using SQL and Power Query, ensuring a consistent accuracy rate of over 98%, which significantly improved reliability for analytical reporting and strategic initiatives.
PROJECT EXPERIENCE
Lakehouse ETL for Sales & Ops Analytics
Data Engineering & Analytics
Built a scalable ETL pipeline (100K+ records/day) using Azure Data Lake and Snowflake to unify sales, inventory, and transaction data; reduced processing time from 24 hours to 1 hour via Bronze/Silver/Gold layers and incremental updates with Tasks and Streams, enabling timely, accurate reporting across departments.
Sound Trends: What Makes a Hit Song?
Advanced Power BI Analytics
Developed a full-scale ETL pipeline in Python to extract and enrich Spotify streaming data, including album art via API integration, and designed an interactive Power BI dashboard using advanced DAX and custom visuals such as Deneb and HTML to showcase KPIs, heatmaps, and trend analysis, delivering actionable insights to artists by visualizing performance metrics and engagement trends across albums and tracks.
EDUCATION
Illinois Institute of Technology Chicago, IL, United States
Master of Science in Data Analytics & Management GPA: 3.66/4.0 Graduation Date: May 2025
SKILLS
Programming Languages: Python, R, Java, C/C++, JavaScript, TypeScript, HTML/CSS, SQL, Unix Shell Scripting, Object-Oriented Programming (OOP).
Data Engineering & Cloud: Apache Spark, PySpark, Kafka, AWS (S3, Glue, Lambda, Athena, QuickSight), GCP (BigQuery, Cloud Storage, Compute Engine), Azure Data Lake, Snowflake, MongoDB, Terraform, Docker, Kubernetes.
Machine Learning & Libraries: Scikit-Learn, TensorFlow, PyTorch, Pandas, NumPy, Beautiful Soup, Pinecone.
Visualization & BI Tools: Power BI, Tableau, Matplotlib, Seaborn, Jupyter Notebook.
Development & DevOps Tools: Git, VS Code, Mage, GitHub, CI/CD fundamentals.