Data Analyst Analytics

Location:
Houston, TX
Posted:
April 23, 2025

Resume:

Vaibhav Salonia

Spring, TX (open for relocation) *******.*@************.***, **********@*****.*** 346-***-**** LinkedIn

Education

Northeastern University Boston, MA

Master of Professional Studies (MPS) in Analytics December 2024

Guru Gobind Singh Indraprastha University Delhi, India

Bachelor of Technology in Computer Science Engineering August 2019

Publications: 4-Way Controlled Robot, October 2018; Land and Air Drone for Surveillance, March 2019

Work Experience

HP Inc. Jul 2024 – Sep 2024

4P Data Analytics & ML Intern

• Designed a multi-layered data architecture (raw, refined, and curated layers) to streamline data flow across platforms, improving query performance by 60% and reducing data duplication issues.

• Automated ETL workflows using Python and Airflow, covering 95% of recurring data jobs, cutting manual effort by 80+ hours/month, and reducing pipeline failures by 70%.

• Developed a gradient boosting model for sales and market share optimization across 8 global markets, boosting forecasting accuracy and driving more targeted decision-making.

• Conducted competitive analysis using product-market data to identify patterns and risks, delivering insights with 95% accuracy that informed go-to-market strategy.

• Evaluated costs, margins, and pricing models across 150+ SKUs to assess growth opportunities, influencing the product roadmap and uncovering $1M+ in potential revenue.

• Applied BigQuery ML for customer segmentation, enabling hyper-targeted marketing strategies and leading to a 15% increase in campaign effectiveness.

Pacific Global, Inc. Oct 2022 – Aug 2023

Senior Data Analyst

• Automated 80% of ETL jobs using Apache Airflow, significantly reducing errors and improving data availability for real-time reporting.

• Applied adaptive query execution and broadcast joins in distributed Spark workloads, reducing memory consumption by 20% and accelerating performance.

• Implemented Spark Streaming pipelines capable of processing 300K+ events per second, enabling near real-time analytics for event data.

• Developed claim tracking reports using RStudio and Power BI, streamlining milestone monitoring and improving executive decision-making.

• Designed efficient star and snowflake schemas, optimizing data modelling strategies to enhance query performance across large datasets.

EPROSIGN Technical Solution Pvt Ltd Apr 2019 – Sep 2022

Executive Data Analyst

• Processed unstructured claims data to improve the accuracy of ASA code classifications, enhancing compliance and billing precision.

• Built data visualizations and cleaned datasets by removing outliers, enabling more reliable and actionable analytics.

• Created scheduled queries and automated data pipelines, cutting manual reporting time by 50% and boosting team productivity.

• Developed materialized views and partitioned tables in SQL Server, reducing query costs by 40% and improving dashboard responsiveness.

• Managed big data operations with MS SQL Server, optimizing database performance and ETL workflows for large-scale systems.

• Created analytical datasets by integrating relational and non-relational databases, supporting advanced modelling and business intelligence use cases.

• Delivered strategic insights by analysing financial data with time series forecasting, informing quarterly planning and resource allocation.

Projects

DataWorksAI Text-to-SQL LLM chatbot

• Worked in a team of four Data Engineers, collaborating with Red Hat, to improve the accuracy of the text-to-SQL chatbot's human-to-text querying using the OpenAI platform and GPT LLM transformer technologies.

• Applied the LangChain and LlamaIndex frameworks and a RAG pipeline for efficient results.

Skills

Analytics; Data Visualization; Data Warehousing; ERP; Hypothesis Testing; Correlation, Regression, and Clustering Analysis; K-Fold Cross Validation; Risk Management; Data Manipulation; Algorithms; Deep & Machine Learning; Business Intelligence & Administration; AI; Data Structures and Algorithms; EDA; ETL

Tools: VS Code, Llama 3, Tableau, Power BI, RStudio, Jupyter Notebook, Excel, MySQL, NLP, Python, SQL, R, C++, HTML, Pandas, Point Cloud Library, Janitor, RODBC, Tidyverse, Matplotlib, MS SQL Server, Visual Studio, Google Cloud Platform
