Akash Baidya
Atlanta, GA, ***** 404-***-**** *****.*********@*****.*** Github LinkedIn
WORK EXPERIENCE
Data Analyst Jan 2024 – Dec 2024
Georgia State University Atlanta, GA
• Implemented a comprehensive data model in Power BI, establishing relationships between fact and dimension tables, capturing a 360-degree view of the healthcare operations.
• Created Power BI dashboards to highlight top companies, sponsors, and students capstone project completion trends.
• Engineered a regression model to analyze the relationship between game features and player ratings for Amazon Games, leading to targeted enhancements that improved average ratings by 30% and boosted business outcomes.
• Developed complex DAX measures and calculated columns in Power BI, resulting in time intelligence reports and dashboards across Patient Management Systems, Affiliates, and Facilities, leading to a 15% increase in operational efficiency. Data Engineering Analyst Dec 2022 – Dec 2023
Accenture Mumbai, IN
• Performed sequential A/B testing frameworks for a beta app, analyzing user behavior across feature variations, resulting in a 2x increase in user engagement and a 15% improvement in retention rates.
• Developed Python scripts to automate ETL pipelines and data transformations, ensuring accurate and timely report generation.
• Designed an AWS Glue-based CDC (Change Data Capture) process, enabling real-time synchronization of 2TB+ of transac- tional data daily between S3 and Snowflake, reducing data lag by 40% and ensuring up-to-date analytics.
• Reduced deployment time and resource utilization by 50% by integrating CI/CD pipelines using AWS CodePipeline, CodeBuild, and Bitbucket to automate build, test, and deployment processes.
• Introduced multidimensional analytics through star and snowflake schema models, driving a 30% increase in reporting efficiency for finance and marketing teams.
Data Engineer Feb 2020 – Dec 2022
IBM Hydrebad, IN
• Built batch ETL pipelines with Spark and Hadoop, cutting data latency by 40% while managing the ingestion of 20 TB of raw data into a data lake.
• Boosted database performance by 40% by resolving bottlenecks through indexing, query rewriting, table partitioning, and buffer pool optimization in collaboration with Warehouse Operations leadership.
• Optimized complex ETL workflows using IBM DataStage and SQL, reducing data transformation and processing times by 35%, enabling real-time analytics for enterprise-wide decision-making.
• Developed an automated data pipeline using Python and Apache Airflow, reducing data processing time by 75% and enhancing accuracy through shell script-based data validation checks. Data Analyst Intern Jul 2019 – Feb 2020
Zhypility Technologies Pvt. Ltd. Mumbai, IN
• Developed a system to analyze large-scale logs in real-time using Hadoop’s distributed computing, uncovering patterns and providing actionable insights that enhanced system reliability. PROJECTS
Health Insurance Fraud Detection (Data Science, Classification) —Github) Mar 2024 Crime Prevention via Social Media Analysis (NLP, Machine Learning, Forecasting—Github) Sep 2024 AI Powered StyleGenie (Generative AI, Stable Diffusion, LLM, RAG —Github) - Fashion App Nov 2024 Real-Time Firearm Detection System (Hyper parameter Tuning, Deep Learning) —Github) Apr 2024 TECHNICAL SKILLS
Programming Languages: Python, R, SQL, Java, Pyspark, Shell Scripting, Prompt Engineering Big Data Tools: Apache Spark, Hadoop, Map Reduce, Kafka, AWS EMR. Database: SQL Server, MongoDB, MS Access, Redshift, DynamoDB, Snowflake, Mysql Cloud Platforms: AWS(Glue, Kinesis, Athena), GCP( Data Flow, Data Proc) Visualization and Sheduling Tools: Power BI, Excel, Airflow, Step Functions, Cloud Fusion Devops and Machine Learning : Docker, Kubernetes, Seaborn, SciKit Learn, Keras, Tensorflow, GitHub Certifications: AWS Data Analytics, GCP Data Engineering(skills boost), Microsoft Azure Fundamentals. EDUCATION
Master of Computer Information Systems, Georgia State University Jan 2024 - Dec 2024 Relevant Coursework: Data Science, Distributed Computing, Machine Learning, AI for Business Bachelor in Electronics and Telecommunication, University of Mumbai May 2015 - May 2019 Relevant Coursework: Signal processing, Power Systems, Networking, Calculus, Applied Mathematics