Harshitha Babu
United States +1-516-***-**** ***************@*****.*** LinkedIn GitHub Portfolio SUMMARY
Data Analyst with hands-on experience transforming complex datasets into actionable business insights using Python, SQL, Power BI, and Tableau. Proven ability to build scalable data workflows - including an automated pipeline processing 3M+ records - that eliminate bottlenecks and accelerate decision-making. Skilled in executive dashboard development, KPI tracking, and translating analytical findings into clear recommendations for stakeholders.
EDUCATION
SUNY Binghamton - Bachelor of Science, Computer Science Engineering May 2025 PROFESSIONAL EXPERIENCE
MarketMaker CRE Aug 2025 – Present
Data Science Intern · Florida, USA (Remote)
• Engineered a high-performance URL validation engine utilizing Python's asyncio and aiohttp for concurrent processing, reducing manual data cleanup time by 90% while validating 3M+ records in minutes.
• Refined AI-driven scoring API leveraging OpenAI LLMs, raising automation efficiency 80% and improving actionable intelligence.
• Standardized technical documentation and version control protocols using Git and Azure DevOps, ensuring maintainability and streamlined handoffs for cross-functional teams.
• Developed Power BI reports to surface data-driven insights for leadership decision-making. Splash Scripts May 2024 – Aug 2024
Data Analyst Intern · India
• Engineered automated data cleaning and EDA workflows using Python and SQL, processing 90,000+ rows of raw data to ensure 100% data integrity for downstream reporting.
• Developed interactive, executive-level dashboards in Power BI to track KPIs, enabling stakeholders to identify and act on business trends 15% faster.
• Streamlined recurring reporting by migrating manual Excel trackers to automated SQL-based workflows, saving 5 hours of manual effort per week.
SKILLS
Languages: Python, SQL
Data Analysis: Pandas, NumPy, Excel (Power Query)
Data Engineering: ETL Pipelines, Data Modeling, Async Processing AI/ML: OpenAI API, Prompt Engineering, Claude AI
Visualization: Power BI, Tableau
PROJECTS
Marketing Data ETL Pipeline GitHub
• Built an end-to-end ETL pipeline using Python and SQL to consolidate malformed marketing data into a star schema SQLite warehouse, enabling campaign performance analysis across key metrics - reducing data prep time and delivering analytics-ready datasets to power downstream dashboards. OTT Platform Content Analysis GitHub
• Built an interactive Tableau dashboard comparing Netflix and Amazon Prime across 18,477 titles to surface genre performance and market expansion opportunities. HR Analytics Dashboard GitHub
• Designed a Tableau dashboard surfacing a 16.12% attrition rate across 1,470 employees, with actionable breakdowns by department and education field to inform retention strategy. CERTIFICATIONS
• Google Data Analytics (2026)
• Databricks Generative AI Fundamentals
• Databricks Fundamentals