Post Job Free
Sign in

Data Analyst Machine Learning

Location:
Athens, GA, 30605
Salary:
70000
Posted:
September 10, 2025

Contact this candidate

Resume:

SUMMARY

Sherin Sebastian

DATA ANALYST

TX, USA 480-***-**** *********.********@*****.*** LinkedIn

Data Analyst with almost 3+ years of experience in healthcare and financial industries, leveraging advanced analytics to drive business insights and process improvements.

Proven expertise in designing and implementing ETL workflows using tools like Informatica PowerCenter, Alteryx, and custom Python scripts to process millions of records daily.

Skilled in developing interactive dashboards and reports using Tableau, Power BI, and SAP BusinessObjects, increasing data accessibility and supporting data-driven decision-making.

Proficient in SQL, Python, and R programming, with a strong background in statistical analysis, machine learning, and predictive modeling.

Experience in big data technologies and cloud platforms, including AWS, Azure, and Hadoop, managing and analyzing large-scale datasets exceeding 10TB.

Strong background in data warehousing, data lake architecture, and database management using various systems including Teradata, MySQL, PostgreSQL, and MongoDB. SKILLS

Methodologies: SDLC, Agile, Waterfall

Programming Language: Python, SQL, R, Scala, SAS

Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn, TensorFlow, Seaborn, dplyr, ggplot2 Visualization Tools: Tableau, Power BI, Advanced Excel (Pivot Tables, VLOOKUP) IDEs: Visual Studio Code, PyCharm, Jupyter Notebook, IntelliJ Database: MySQL, PostgreSQL, MongoDB, SQL Server, Oracle Cloud Platform: Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform Other Technical Skills: SSIS, SSRS, Machine Learning Algorithms, ETL\ELT Tools, Statistics, ServiceNow, Hadoop, Spark, MapReduce, Alteryx, Google Big Query, Power Query, Probability distributions, Mathematics, Confidence Intervals, ANOVA, Advance Analytics, Hypothesis Testing, Regression Analysis, Linear Algebra, Advance Analytics, Data Mining, Big Data, Data Integration, Data Interpretation, Data Pipeline, Data Visualization, Data warehousing, Data transformation, Data Governance, Data Storytelling, Association rules, Clustering, Classification, Regression, A/B Testing, Forecasting & Modelling, Data Cleaning, Data Wrangling, Descriptive analytics, Git, GitHub, JIRA, Talend, Informatica

Soft Skills: Time Management, Leadership, Strategy Planning, Problem-Solving, Negotiation, Decision-Making, Documentation and Presentation, Analytical Thinking, Attention to Detail, verbal and written communication

Operating Systems: Windows, Linux

EXPERIENCE

Data Analyst Cigna, TX Jul 2023 – Present

Worked on a data analytics project aimed at reducing patient readmissions and improving hospital operational efficiency by analyzing clinical data across multiple facilities within the Cigna’s network.

Utilized SQL to extract, clean, and transform over 1 million patient records from Electronic Health Records (EHRs) and other internal databases, ensuring high-quality, accurate data for further analysis.

Designed and deployed interactive Tableau dashboards to visualize key performance indicators (KPIs), including patient throughput, readmission rates, and bed occupancy, providing real-time insights to hospital administrators and improving data accessibility by 30% which significantly reduced the time spent on manual reporting and enabled faster decision-making for operational adjustments.

Developed predictive models (Logistic Regression, Random Forest) with an 85% accuracy rate, enabling the identification of patients at high risk of readmission within 30 days of discharge, leading to targeted interventions.

Applied Python (Pandas, Scikit-learn) for advanced data analysis and to develop predictive models that identified high- risk patients for readmission, enhancing clinical decision-making and resource allocation.

Optimized the patient discharge process, reducing the average discharge time by 20% by identifying bottlenecks and implementing automation, leading to improved hospital throughput and better bed management.

Conducted Exploratory Data Analysis (EDA) using statistical techniques in Python to uncover patterns and correlations, such as patient demographics, comorbidities, and discharge practices, influencing policy changes for patient care management.

Achieved a 15% reduction in patient readmissions within 6 months by proactively flagging at-risk patients and implementing care coordination strategies, directly improving patient outcomes and reducing hospital costs.

Leveraged Amazon S3 for secure and scalable storage of over 1 million patient records, ensuring data is accessible for analysis while maintaining compliance with HIPAA regulations.

Used PySpark (Python API for Spark) to clean and preprocess data from diverse healthcare sources (EHRs, pharmacy data, and claims data), reducing data inconsistencies by 25% and ensuring a higher quality of input for analysis.

Collaborated closely with clinical teams, care coordinators, and hospital administrators to interpret data insights and translate them into actionable strategies for improving patient care and reducing operational inefficiencies.

Actively participated in sprint planning meetings to define clear goals and deliverables, ensuring alignment with project objectives and stakeholder needs for healthcare data analysis. Data Analyst Softage Group, India Oct 2019 – Jul 2021

Developed sophisticated predictive models to identify customers at risk of churn, resulting in a 10% reduction in churn rates through proactive customer retention strategies.

Conducted extensive data cleaning and transformation processes, leading to an 18% improvement in data quality and analysis reliability.

Implemented A/B testing methodologies to evaluate and optimize marketing strategies, contributing to a 12% improvement in campaign effectiveness.

Employed Python programming to streamline data manipulation and analysis workflows, boosting overall data management effectiveness.

Utilized MySQL and SQL Server databases for data extraction, transformation, and analysis, supporting various analytical tasks.

Implemented data lake solutions on AWS, enabling scalable storage and efficient data retrieval for advanced analytics and machine learning models.

Developed interactive data visualizations using Matplotlib to communicate key insights to stakeholders, enhancing decision-making processes effectively.

Defined and tracked KPIs across marketing campaigns, leveraging data insights to optimize strategies and improve ROI by 15%.

Utilized advanced Excel functions including VLOOKUP and pivot tables to perform complex data analysis, create dynamic reports, and streamline data aggregation processes for non-technical stakeholders.

Created complex SQL queries to extract, manipulate, and analyze data from multiple sources, supporting various analytical and reporting needs.

Developed and automated ETL pipelines with SSIS, streamlining data processing workflows and reducing processing time by 20%.

EDUCATION

Master of Science in Electrical Engineering : Arizona State University, Arizona, USA Bachelor of Engineering in Electronics and Communication Engineering – Anna University, Chennai, India



Contact this candidate