SRIJA CHERUKU
+1-947-***-**** ***************@*****.*** www.linkedin.com/in/srijacheruku
SUMMARY
Recent graduate with a Master's in Big Data Analytics and Information Technology from the University of Central Missouri. Proficient in Python and machine learning, with hands-on experience from an undergraduate internship focused on "Machine Learning with Python." Published a research paper leveraging machine learning and artificial intelligence. Advanced knowledge of AWS gained through specialized coursework in big data solutions and data resource management. Skilled in data visualization using Tableau, with project experience showcasing these capabilities. Competent in data cleaning, analysis, and representation. Seeking a full-time role as a Data Engineer to apply my expertise and drive data-driven insights.
EDUCATION
Master of Science, Big Data Analytics, and Information Technology Jan 2023 – Present
University of Central Missouri (UCM), Warrensburg, Missouri CGPA: 3.66/4.0
Bachelor of Technology, Mechanical Engineering Aug 2018 – May 2022
CVR College of Engineering, Hyderabad, TS, India CGPA: 8.75/10
SKILLS AND COMPETENCIES
Programming – Python, Java, C
Web technologies – HTML, CSS, Java Script
Data Management and Analysis – SQL, Oracle, MongoDB, Machine Learning, MS Office, Task Execution, Quantitative Analytics, ETL.
Tools and Technologies – Eclipse, AWS, Azure, Apache Spark, Data Analytics, Google Cloud Platform, Data Extraction, Business Requirements, Tableau, Hadoop
Soft Skills – Teamwork, Time Management, Leadership, Analytical Skills, Business Requirements, presenting skills, Problem Solving
WORK EXPERIENCE
Machine Learning with Python Intern, Verzeo Edutech Private Limited Sep 2021 –Nov2021
Utilized Python libraries such as NumPy, Pandas, and Scikit-Learn for data manipulation and analysis.
Assisted in the development of data pipelines for ETL processes.
Implemented normalization and standardization techniques for data preprocessing.
Worked with SQL databases for data storage and retrieval.
Collaborated with the team to design and optimize data models.
Participated in machine learning projects focusing on data engineering aspects.
Developed Supervised Regression Model and analyzed data to obtain key features, calculated coefficient of determination value to evaluate the performance of the model.
PROJECTS
MedInsure Analytics: Enhancing Medical Insurance with Data-Driven Insights May 2024 – July 2024
This project aims to develop a comprehensive data processing and analysis system for a medical care insurance company. It includes modules for data ingestion, validation, transformation, storage, and analysis to support informed business decisions.
The project's goal is to create data pipelines for analyzing customer behaviors and developing revenue-enhancing business strategies through targeted offers and royalties for a medical care insurance company.
Use Sqoop to import data from RDBMS to Hive for further processing and analysis.
Visualizations created using Matplotlib and Seaborn.
Successfully collected and processed third-party data using Big Data tools, providing insights to help the medical care insurance company develop strategies to attract and retain customers, enhancing revenue and engagement.
Data Visualization Using Tableau- Dataset - Car Sales Jan 2024-May 2024
Cleaned, formatted, and transformed the relevant dataset as needed to ensure it was suitable for analysis. This involved tasks like handling missing values, removing duplicates, and creating calculated fields.
Created interactive visualizations in Tableau, incorporating charts, graphs, maps, and dashboards to highlight key trends and insights from the dataset, facilitating clear communication of findings.
Examined visualizations to extract actionable insights for data-informed decisions that encompassed pinpointing high-performing products, comprehending customer behavior, and enhancing marketing strategies using the Tableau data.
Data Pipeline Development- AWS Project Jan 2023 – May 2023
Developed an end-to-end data pipeline using AWS services including S3, Lambda, and Glue.
Designed and implemented data ingestion processes to gather and process large datasets.
Utilized AWS Code Pipeline for continuous integration and deployment (CI/CD).
Integrated data storage solutions like RDS for efficient data management.
Implemented monitoring and logging solutions for pipeline health and performance.
Collaborated with cross-functional teams to ensure the scalability and reliability of the data pipeline.
PUBLICATIONS
Paturi, U. M. R., Cheruku, S., Pasunuri, V. P. K., Salike, S., Cheruku, S., and Reddy, N. S., 2021, “Machine Learning and Statistical Approach in Modeling and Optimization of Surface Roughness in Wire Electrical Discharge Machining,” Mach. Learn. with Appl., 6(June), p. 100099. https://doi.org/10.1016/j.mlwa.2021.100099
Paturi, U. M. R., Vanga, D. G., Cheruku, S., Palakurthy, S. T., and Jha, N. K., 2023, “Estimation of Abrasive Wear of Nanostructured WC-10Co-4Cr TIG Weld Cladding Using Neural Network and Fuzzy Logic Approach,” Mater. Today Proc., 78, pp. 449–457. https://doi.org/10.1016/j.matpr.2022.10.266
CERTIFICATIONS
Participation in Technical Paper Contest at CVR College of Engineering. Nov’21 – Feb’22
Certified course on Machining Learning with Python in Verzeo Edutech private Limited. Sep’21 – Nov’21
Certified course on Programming for Everybody (Getting Started with Python) in Coursera.
March’20 – May’20