Post Job Free
Sign in

Data Analyst Machine Learning

Location:
Louisville, KY
Salary:
60000
Posted:
March 20, 2025

Contact this candidate

Resume:

Rahul Bathula

Louisville, KY •*************@*****.*** • +1-551-***-****

PROFESSIONAL SUMMARY:

Over 3+ years of experience as a Data Analyst specializing in designing, developing, and optimizing Big Data solutions using tools like Hadoop, Spark, and Python.

Proficient in building scalable data pipelines for batch and real-time data processing on cloud platforms, including Google Cloud Platform (GCP), AWS, and Azure.

Extensive hands-on experience with GCP services like Big Query, Cloud Dataflow, Pub/Sub, Cloud Storage, and Cloud Composer to develop and orchestrate data workflows.

Expertise in creating real-time data streaming pipelines using Kafka and Cloud Pub/Sub, integrating with diverse data sources to ensure seamless data ingestion.

Skilled in implementing ETL workflows and automating complex data pipelines with orchestration tools like Airflow and Cloud Composer.

Strong knowledge of data analytics, including optimizing queries, implementing data partitioning, and clustering for efficient processing in platforms like Big Query and Snowflake.

A highly analytical problem-solver proficient in Python, SQL, and Scala, combined with a robust understanding of containerization technologies like Kubernetes and Docker for deploying scalable applications. Committed to data security and compliance, with experience in IAM and DLP tools.

Results-driven Data Analyst & Data Scientist with expertise in Big Data, Machine Learning, ETL, and Cloud Computing. Experienced in managing large-scale databases, designing ETL pipelines, and developing predictive analytics models to drive data-driven decision-making. Adept at leveraging Python, SQL, Hadoop, Spark, and cloud platforms (AWS, GCP, Azure) to optimize data processing and visualization.

Proven track record of delivering impactful insights, including student admission trends analysis at the University of Louisville, fraud detection, and predictive maintenance models at Cognizant, and customer segmentation strategies at Miracle. Strong proficiency in Tableau, Power BI, and RPA automation for data visualization and workflow optimization.

Skilled in CI/CD (Jenkins, Docker, Kubernetes) and Agile methodologies to ensure efficient data pipeline deployment. Passionate about data science, machine learning, and business analytics, with a commitment to optimizing operational efficiency and delivering actionable insights.

Focused on enhancing accuracy and reducing manual interventions through RPA automation.

SKILLS

Big Data Tools

Hadoop, HDFS, Spark, Hive, Sqoop, Kafka, Airflow, Zookeeper.

Programming Languages

Python, C++, C, HTML, JSON, CSS, SQL, JavaScript, Scala, Java, R, Pig Latin, HiveQL, Shell Scripting, and R for data science. RPA

Software Methodologies

Agile, SDLC Waterfall.

Databases

MySQL, PostgreSQL, DynamoDB, Snowflake, MongoDB, Cassandra.

ETL/BI

Power BI, Tableau, Talend, Informatica, Visio

Containerization and CI/CD

Kubernetes, Docker, Jenkins, Git, Bitbucket.

Operating Systems

Windows (XP/7/8/10), Linux (Unix, Ubuntu), Mac OS.

Cloud Technologies

Google Cloud Platform (Big Query, Dataflow, Pub/Sub, Cloud Composer, GKE, Cloud Storage), AWS (EC2, S3, RDS, EMR, Redshift, Lambda), Azure (Data Factory, Data Lake, Databricks).

Machine Learning

Models

linear regression, Logistic Regression, LASSO, Decision Tree, Random Forest, Gradient Boosting, SVM, regression models SKLearn, XGBoost, TensorFlow, Pytorch, MLlib, and core machine learning frameworks

WORK EXPERIENCE

September 2023-june2024

Data Analyst University of Louisville

Managed and optimized the student admissions database, ensuring data accuracy and integrity across multiple academic terms.

Utilized SQL, HiveQL, and Python to extract, clean, and analyse large datasets, providing insights into student enrolment trends, retention rates, and admission patterns.

Designed and implemented ETL pipelines using Talend, Informatica, and Apache Spark to automate data ingestion and transformation from multiple sources.

Developed interactive Power BI and Tableau dashboards to visualize admissions trends, helping stakeholders make data-driven decisions.

Integrated Big Data tools such as Hadoop, HDFS, and Kafka for efficient storage and real-time processing of student admission data.

Worked with Google Cloud Platform (Big Query, Dataflow, Pub/Sub) to handle large-scale admissions data and ensure cloud-based scalability.

Applied machine learning models such as logistic regression, decision trees, and random forest to predict student acceptance rates and dropout risks.

Collaborated with cross-functional teams, including faculty and IT staff, to improve data collection and reporting methodologies.

Ensured compliance with data privacy regulations (FERPA) while managing student admission records.

Followed Agile and SDLC methodologies for project management, ensuring timely delivery of data analytics solutions.

Cognizant Pvt Ltd India September 2022 – May 2023

Data Scientist

Established a predictive maintenance model for equipment in manufacturing, reducing downtime by 22% and saving operational costs through proactive maintenance scheduling.

Implemented a fraud detection algorithm, resulting in a 15% reduction in fraudulent transactions and enhancing overall financial security for the organization.

Orchestrated the development of a supply chain optimization model, reducing excess inventory levels by 20% and minimizing stockouts, leading to improved operational efficiency.

Led the creation of a customer lifetime value (CLV) prediction model, contributing to a 21% increase in the precision of marketing budget allocation and customer acquisition strategies.

Miracle India September 2021– August 2022

Data Analytics Intern

•Executed SQL queries to extract crucial information such as customer segments, average transaction values, and popular product categories.

•Conducted customer segmentation analysis using Python, implementing clustering algorithms (e.g., K-means) to categorize customers based on purchasing behavior. Utilized Python's visualization libraries to depict distinct customer segments and their characteristics.

•Connected Tableau to the SQL database to develop interactive dashboards for visualizing customer segments.

•Designed visualizations in Tableau to illustrate customer preferences, popular products, and trends within each segment.

•Proposed personalized promotional campaigns, product recommendations, and engagement initiatives for customer satisfaction and loyalty.

•Used Python to develop machine learning models to predict future purchasing trends and customer preferences.

•Specialized in automating processes using Robotic Process Automation (RPA) tools.

•Implemented automatic invoice processing systems utilizing Optical Character Recognition (OCR) technology.

•Worked on automation projects tailored to health firms, streamlining operations and improving efficiency.

ACADEMIC EXPERIENCE

PROJECTS March 2022 – May 2022

TITLE: Student Performance Analysis in a University

•Gathered and cleaned student performance data using Python, SQL, and Tableau.

•Created an SQL database for storing processed data and conducting statistical analysis for actionable insights.

•Connected Tableau to visualize key insights and designed interactive dashboards showcasing trends.

•Provided recommendations for interventions based on analysis findings.

Certificates

DevOps Practices and Tools

Continuous Integration and Continuous Deployment (CI/CD): Implementing and automating pipelines using tools like Jenkins, GitHub Actions, and Bitbucket.

Containerization: Proficient use of Docker and Kubernetes for scalable application deployment.

Infrastructure as Code Hands-on experience with Terraform and Ansible to automate infrastructure provisioning.

Cloud Platforms: Deployment and monitoring of applications in Google Cloud, AWS, and Azure environments.

Monitoring and Logging: Setting up robust observability systems using tools like Prometheus, Grafana, and ELK stack.

EDUCATION

UNIVERSITY OF LOUISVILLE Louisville, Kentucky

Master of Science in Business Analytics GPA: 3.5/4.0 August 2023-August 2024

Andhra University Visakhapatnam, India

bachelor’s in computer systems GPA: 3.8/4.0 June 2019-August 2022



Contact this candidate