SABAVAT SAIKIRAN
United States **************@*****.*** 860-***-**** LinkedIn
SUMMARY
Data Analyst with 5 years of experience delivering data-driven insights and machine learning solutions across healthcare, technology, and e-commerce domains.
Proficient in Python, R, SQL, SAS, and advanced statistical modeling techniques, with expertise in ETL pipelines, cloud-based analytics (AWS, Snowflake), and BI tools such as Power BI, Tableau, and Looker.
Experienced in designing end-to-end machine learning workflows, from data ingestion to deployment, leveraging Scikit-learn, TensorFlow, Keras, and NLP frameworks for predictive modeling and text analytics.
Adept at translating complex datasets into actionable business recommendations by developing interactive dashboards, optimizing reporting processes, and collaborating with cross-functional teams in Agile environments.
Skilled in data quality improvement, process automation, and business intelligence strategy, leading to measurable operational efficiencies and improved decision-making outcomes.
SKILLS
Languages: Python, R, SQL, SAS
Data Analysis & Statistical Techniques: Descriptive & Inferential Statistics, Hypothesis Testing, A/B Testing, Regression Analysis, Classification, Clustering, Time Series Forecasting, Data Wrangling, Data Cleaning
Machine Learning & AI: Scikit-learn, TensorFlow, Keras, PyTorch, NLP (spaCy, NLTK), RAG (Retrieval-Augmented Generation), KNN, CNN
Data Visualization: Power BI, Tableau, Looker, Microsoft Excel (Pivot Tables, VLOOKUP, VBA, KPIs), Google Data Studio
Databases & Data Warehousing: MS SQL Server, MySQL, PostgreSQL, MongoDB, NoSQL, Google BigQuery
ETL Process & Cloud: SSIS, SSRS, Apache Kafka, Apache Spark, Snowflake, AWS (S3, EC2)
Libraries: NumPy, Pandas, Matplotlib, Seaborn, SciPy, Plotly
EXPERIENCE
Syneos Health, United States August 2024 – Present
Data Analyst
Designed and deployed patient enrollment forecasting models using Python (Scikit-learn, Pandas) and time series methods (ARIMA, Prophet), improving trial recruitment accuracy by 22% and reducing site underutilization.
Built 10+ clinical operations dashboards in Power BI and Tableau, integrating Snowflake and SQL Server data to monitor trial progress, patient enrollment, and site performance.
Automated ETL pipelines using SSIS and AWS S3 for daily ingestion of EDC (Electronic Data Capture) and lab data, reducing manual data preparation by 40%.
Applied NLP (spaCy, NLTK) to extract and classify adverse event descriptions from unstructured text, improving pharmacovigilance reporting accuracy by 18%.
Collaborated with data governance teams to implement role-based security and data quality checks, achieving a 95% compliance rate with GCP (Good Clinical Practice) data standards.
Conducted A/B testing for patient engagement strategies, identifying a messaging variant that improved response rates by 12% in decentralized clinical trials.
Sage SoftTech, India January 2019 – November 2022
Data Analyst
Created sales and performance dashboards in Power BI and Looker using MySQL and Google BigQuery, providing executives with near real-time business insights.
Developed SQL-based reporting solutions for customer retention, sales pipeline tracking, and operational KPIs, improving reporting efficiency by 30%.
Built and maintained ETL pipelines with Apache Spark and Kafka for transactional data, reducing data refresh intervals from 24 hours to under 2 hours.
Conducted trend analysis and segmentation on customer purchase behavior, identifying opportunities that led to a 15% increase in targeted sales campaigns.
Implemented data cleaning and transformation processes in Python (Pandas, NumPy) to unify data from multiple CRM systems, reducing reporting discrepancies by 25%.
Delivered weekly executive summaries combining key business metrics, visualizations, and recommendations to support faster decision-making.
EDUCATION
Lewis University, Romeoville, Illinois January 2023 – December 2024
Master of Science in Computer Science
CERTIFICATIONS
Google Data Analytics by Google
Data Analytics Essentials by Cisco