SAI ROHITH KONGARI
Dekalb, IL, USA +1-872-***-**** ****************.***@*****.*** LinkedIn GitHub Summary
Results-oriented Computer Science graduate student with 2+ years of experience building data-driven applications using Python, SQL, and JavaScript. Delivered 5+ end-to-end projects in machine learning, NLP, and data visualization. Passionate about deriving insights from data and building intelligent solutions.
Education & Certification
Lewis University Aug 2024 - May 2026
Master of Computer Science(GPA: 4.0/4.0)
B V Raju Institute of Technology Aug 2020 - Jun 2024 Bachelor of Bachelor of Technology in Electronics and Communication Engineering (GPA: 7.66/10)
• Achievements: Appointed Lab Assistant - mentored 50+ students in C Programming fundamentals Professional Work Experience
Rebecca Everlene Trust Company Apr 2025 - Jul 2025 Data Analyst/Engineer Chicago, IL
• Improved reporting accuracy by 25% by analyzing educational cost data from 500+ U.S. schools using SQL, Excel, and Python to identify trends across school types and demographics.
• Reduced effort by 40% by automating ETL pipelines on 500k+ records and building real-time Tableau/Power BI dashboards, boosting stakeholder engagement and executive decision-making.
• Collaborated with UI/UX team to design interactive, user-friendly dashboards and teen-focused visuals, increasing engagement by 35% and improving clarity for diverse student audiences. Javatpoint Aug 2023 - Feb 2024
Technical Content Writer India, Uttarpradesh
• Increased Data Science site traffic by 37% by authoring 30+ technical articles on Python, machine learning, and neural networks, reaching 50K+ monthly readers.
• Translated complex AI/NLP topics into 15+ practical tutorials and code walkthroughs, resulting in a 35% boost in average session duration among applied learners.
• Redesigned content structure to improve clarity and navigation within the Data Science section, increasing returning user rate by 25% and enhancing engagement with machine learning topics. Academic Projects
End-to-End Big Data Pipeline AWS Apache Kafka Python
• Engineered high-performance distributed streaming system using Apache Kafka to process 10,000+ records per second with 99.9% uptime, implementing fault-tolerant message processing architecture.
• Developed data cleaning algorithms for COVID-19 dataset processing, reducing data outliers by 15% and improving analysis accuracy through statistical validation and anomaly detection techniques. From Screens to Streams: An Interest-Based OTT Recommendation System Python PostgreSQL AWS
• Built an interest-based OTT recommendation engine by designing an ETL pipeline to ingest and transform 1500+ titles from 7 major OTT platforms, categorizing content by genre, language, format, and rating to support personalized recommendations
• Engineered and optimized a MySQL database with indexed queries and views, reducing content retrieval time by 90% and enabling sub-2s personalized recommendations across multiple user categories.
• Delivered a provider-facing analytics dashboard that revealed top-performing genres and languages, enabling 35% more targeted content strategy based on real-time user engagement and preference trends. Image Classification and Image-Based Product Filtering API Scikit-Learn MLP Python
• Engineered a FastAPI-based RESTful API delivering 20+ real-time image classifications per second with sub-100 ms latency on standard hardware.
• Trained a Multi-Layer Perceptron (MLP) on a curated dataset of 10,000+ real product images, achieving 88.5% test accuracy through extensive preprocessing and model tuning.
• Automated a scalable preprocessing pipeline handling 10,000+ diverse real product images, improving data ingestion throughput by 50% and reducing errors by 30%, enabling efficient downstream model training and inference. Skills
• Programming : Python, SQL, C++, JavaScript, HTML/CSS, NoSQL
• Distributed Systems & Cloud: AWS (EC2, S3, CloudFront, Athena, CloudWatch, Lambda, Glue, LighSail, EMR), GCP, Microsoft Azure, Apache Kafka, Hadoop, Spark, Docker
• Database Technologies: MySQL, PostgreSQL, DynamoDB, MongoDB, ER Modeling, ETL Pipelines, Hbase, Hive
• Data Analysis & Visualization: NumPy, Pandas, Excel, Tableau, Power BI, Matplotlib, Seaborn, Data Modeling, Google BigQuery, Performance Optimization
• Machine Learning: Scikit-learn, XGBoost, MLP, SVM, Logistic Regression, Linear Regression, Random Forest Classifier, Decision Trees, PCA, Cross-Validation, Hyperparameter Tuning, GridSearchCv
• Al & Deep Learning: TensorFlow, Keras, PyTorch, Computer Vision, CNN, NLP (SpaCy, NLTK, Hugging Face), Generative Al
• Other Tools: Flask, Django, REST APIs, Shell Scripting, Google Colab, Git, Jupyter Notebook, VS Code