
Business Intelligence Data Scientist

Location:
Raleigh, NC
Posted:
June 15, 2025

Resume:

Harikrishnan Girikumar

+1-657-***-**** *********************@*****.*** LinkedIn

Summary

I am a data enthusiast with 8+ years of experience and exceptional abilities in data engineering, business intelligence development and data science. Successful at promoting company advancements and boosting analytical performance. Consistently reach revenue targets by leveraging business intelligence, data and AI/ML technologies to produce well-executed recommendations. Hands-on with Snowflake (Snowgrid, Fivetran, Snowpipe), Azure (Synapse, Data Factory, Virtual Machines, Functions, Data Lake Storage), Databricks (Unity Catalog, Mosaic AI, Delta Tables, Delta Caching) & AWS (Redshift, EMR, Lambda, Kinesis, Glue, S3), SQL & NoSQL databases, Spark & Spark SQL, Airflow, Kafka.

Major Experience

Toshiba Global Commerce Solution – Data Scientist/Engineer IV. May 2023 – Present

Led the data migration for software and hardware solutions using technologies like Synapse, Spark, pipelines & ADLS2, which reduced costs by 70%.

Orchestrated multiple retailer inventory pipelines to extract data from ServiceNow tables using REST API and Spark data transformations, which decreased reporting latency by 60%.

Planned and executed model serving using Databricks Mosaic AI for both live and batch inference, deploying models to proactively predict and reduce the SLAs by 18%.

Extracted OBIEE reports to blob storage and then created Snowflake OLAP data marts that delivered abstracted data using Snowgrid, reducing data leakage by 100%.

Spearheaded the team's development of a RAG- and MCP-based AI agent, built on an LLM, to create a recommendation system that helps technicians solve Service Requests.

Orchestrated ETL pipelines using Databricks Workflows, Auto Loader and Delta Live Tables to load live events into Unity Catalog, providing real-time data analysis for clients like Walmart, Kroger and BJ's.

Optimized overall cloud spend by planning distributed systems, delta caching and incremental loads, reducing cost by 25%.

Mentored the team to develop and deliver a stream processing system for live dashboards with change data capture, which eased data consumption for customers.

Played a key role in the development of the data product Proactive Service Availability, which resulted in a reliable, scalable and maintainable application compliant with GDPR & CCPA.

Designed the data model for hardware inventory optimization with source & sink data validation to ensure high data quality which improved fault tolerance by 90%.

Mentored the team to develop a BERT-based classification model for retailer hardware service requests which improved the operational capability of the maintenance team.

Fractal Analytics Inc (Humana & Regions Bank) – Data Engineer. October 2021 – May 2023

Built the data pipeline for the Decision Science team by accessing the multi-dimensional Essbase OLAP cube via MDX and REST API to solve the lack of smaller data marts.

Designed and deployed the data into the Hive table from the Essbase cube using PySpark. Aggregated and transformed the data structure of financial data to provide a smooth pipeline for the Tableau dashboards.

Recreated the SQL stored procedures in Spark SQL to access Parquet data from Azure Data Lake Storage, migrating 50 major reports from Netezza SQL to the Azure environment.

Partnered across the Humana team to migrate their stored procedures, views and SQL data to ADLS2 and Dedicated SQL warehouse-oriented architecture using Apache Airflow with Data Governance in 6 months.

Implemented the CI/CD pipeline using GitHub and Azure DevOps to test and deploy machine learning models and Spark notebooks.

Zifo RnD Solutions, Boston, MA - Associate Project Manager. July 2021 - October 2021

Collaborated with senior management and the group of data scientists to develop a product to convert Microsoft project dashboards to more interactive Power BI dashboards.

Helped design the process for the overall execution of the project from scratch using Agile.

Internet Brands, Los Angeles, CA - Technical Project Manager Intern. May 2020 - August 2020

Collaborated with senior project managers to assist with day-to-day functions, including SCRUM meetings, designing workflows, user stories & scope using Agile methodologies.

Prepared and filed user requirement documents in Confluence and collected data from the content management system that helped to track performance and efficiency for more than 100 websites.

Coordinated internal resources and client Carsdirect.com using JIRA Kanban boards to manage tasks & update status.

Atomium Labs Pvt Ltd, Pune - Product Analyst. June 2018 - August 2019

●Functioned as a liaison between product and marketing teams to understand customer requirements and turn them into workflows, tasks, and product roadmaps.

●Led the team of developers through the Software Development Life Cycle to develop an Android application and an automated web-based analytics dashboard, which formed the strong foundation of the business.

●Built robust forecasting using parameters, trend lines and reference lines that helped clients and stakeholders to improve their ad campaigns by 40%.

●Used stored procedures, views, and triggers in SQL to provide structured data to clients from different sources and real-time insights and metrics into business KPIs.

●Created Tableau dashboards of the campaign results to present to the clients, which helped them to better understand the product’s performance and improved retention by 24%.

Cabby Tabby Technologies Pvt Ltd, Pune – Product Analyst. April 2017 - May 2018

●Built monthly ad-hoc reports in Tableau to identify data usage from each region, which helped the marketing department to plan next month's data budget.

●Designed and implemented A/B testing for advertisements on Android applications, which led to a 17% increase in the conversion rate.

●Introduced the use of an AWS S3 data lake and ETL in Glue for JSON logs generated from more than 4000 devices every hour.

●Evangelized the recommendation engine using collaborative filtering and cosine similarity, which improved the click rate of users by 5%.

●Identified and implemented the use of Kafka to process the online event/activity data, which enhanced customer experience through real-time data analysis.

●Owned the planning and scheduling of two-week sprints and clearly articulated the product vision to the engineering team, resulting in hitting 97% of the goals defined for the 2017-2018 term.

Spaceship Technologies Pvt Ltd, Pune - Software Developer. October 2015 - May 2017

●Administered various backend projects in NodeJS, Python (Flask), SQL and MongoDB to cater services for consumers, which included enterprise, e-commerce, and web solutions.

●Functioned as the prime contact between clients and the development team to solve issues related to SaaS and CRM products.

●Collaborated cross-functionally with sales and management teams to present the services provided by the organization, which led to acquisition by Cabby Tabby Technologies.

●Played a key role in transitioning the team from Waterfall to Agile methodology, which improved client feedback executions by 80%.

●Created user story acceptance criteria to get buy-in from stakeholders and refined it with the SCRUM team.

Education

University of Pune: B.E - Information Technology (May 2015)

California State University Fullerton: MS - Information Systems – Business Analytics (May 2021)

Azure Certified Data Engineer (January 2021)

Skills

●Programming/Databases: Python (Pandas, NumPy, PySpark, TensorFlow, scikit-learn, PyTorch), R, MySQL, MongoDB, JavaScript, Node, Angular, PostgreSQL, Vector Search

●Data Visualization: Google Motion Charts, Tableau, Matplotlib, ggplot2, Seaborn, D3.js

●Product/Project Management: Heuristic Evaluation, Hypothesis Test, A/B Testing, JIRA, Confluence, Agile methodologies, Waterfall, SCRUM, Context Inquiry, UAT, SDLC, Lean

●Data Engineering: S3, EC2, Lambda, CloudWatch, Glue, Git, REST API, Redshift, Athena, CRM, Azure Databricks, Azure Synapse Analytics, ADLS2, Airflow, Data Factory, Azure Fabric, Snowflake

●AI: Transformers, LLMs, RAG systems, MCP, Hugging Face, LlamaIndex, LangChain, Predictive Modeling, Predictive Analytics, Principal Component Analysis, KNN, Decision Tree, Multiple Linear Regression, Random Forest, Collaborative Filtering, Cosine Similarity, ANOVA, Hypothesis Test, A/B Testing
