Darshil Vora
623-***-**** • ************@*****.*** • LinkedIn • GitHub
PROFESSIONAL EXPERIENCE
Cintana Education LLC, Tempe, AZ: Data Analyst 07/2024 – Present
● Developed and maintained data models using Excel, enhancing data accuracy and reducing reporting time by 20%.
● Built a cross-functional Power BI KPI dashboard with Power Query, partnering with Operations and Products teams to pinpoint bottlenecks, improving partner performance tracking and speeding program launches by 27% globally.
● Streamlined key business functions by managing stakeholder projects in Asana, automating HubSpot sales workflows with Power Automate, and leveraging Google Analytics to enable real-time analysis across 22+ marketing campaigns.
● Automated ETL data pipelines by integrating landing pages with Google Sheets using n8n workflow APIs and custom JavaScript, culminating in a dashboard that used Google Apps Script triggers to streamline stakeholder reporting.
Arizona State University, Tempe, AZ: Math Grader Instruction Aide 08/2023 – 05/2024
● Collaborated with the mathematics faculty by preparing materials, grading, leading discussions, and using Excel to analyze the performance of over 200 students across multiple sections of calculus, algebra, and statistics classes.
YoShops, Remote: Data Science Intern 03/2022 – 05/2022
● Redesigned a Python script using Selenium to automate the extraction of over 300 e-commerce website product listings, improving market analysis efficiency and revealing insights into consumer trends and pricing strategies.
● Transformed over 5,000 rows of raw data by writing advanced SQL queries (CTEs, subqueries) in SSMS and applying machine learning, enabling data-driven strategic recommendations and improved decision-making.
● Engineered interactive Power BI dashboards from five years of company order history, revealing critical purchasing trends that guided new marketing and sales strategies, which contributed to a 10% increase in repeat customers.
Feynn Labs, Remote: Machine Learning & Data Analytics Intern 12/2021 – 02/2022
● Optimized data quality and operational processes by cleaning and preprocessing large datasets with SQL, Excel, and Python, resulting in a 5% increase in data reliability and refining product direction for a predictive modeling application.
● Collaborated on a real estate project by analyzing trends and strategy and implementing a Random Forest classifier with Python (Scikit-Learn, Pandas), boosting segmentation accuracy by 12% and providing research-driven insights.
The Sparks Foundation, Remote: Data Science Intern 10/2021 – 11/2021
● Achieved 87% model accuracy by engineering features in Pandas/NumPy and training Scikit-Learn models.
● Applied k-means clustering (unsupervised learning) to analyze over 10,000 data records, uncovering key customer segments and highlighting trends, correlations, and anomalies to inform marketing and improve decisions.
TECHNICAL SKILLS
Programming Languages & Software: Python, SQL, C, C++, HTML, VBA, Google Analytics, Git, Salesforce, Jupyter Notebook
Databases & Visualization: Power BI Dashboard, Matplotlib, Seaborn, Tableau, SAS, SPSS, PostgreSQL, MySQL, SSMS, Fabric
Project Management: Agile, Documentation, Data Management, Communication, A/B Testing, Data Quality, Business Insights
Machine Learning: Pandas, Scikit-Learn, NumPy, TensorFlow, Random Forest, LLMs, Data Warehousing, Data Mining, ETL
Data Analytics: Data Modeling, Data Visualization, EDA, Statistical Analysis, Trend Forecasting, Pattern Recognition, KPIs, DAX
Big Data & Cloud Platforms: Apache Spark, Hadoop, AWS, Google Cloud Platform, Azure, Apache Airflow, DBT, Snowflake
EDUCATION
Master of Science in Computer Science 05/2024
Arizona State University, Tempe, AZ 3.60 CGPA
Courses: Statistical Machine Learning, Data Processing at Scale, Data Mining, Data Visualization, Big Data, Software Testing
Bachelor of Engineering in Computer Engineering 06/2022
Kadi Sarva Vishwavidyalaya, Gandhinagar, GJ 3.93 CGPA
Courses: Neural Networks & Deep Learning, Database Management, Python, C++, Data Structures & Algorithms, Data Analytics
RELEVANT PROJECTS
Time Series Analysis for Artificial Pancreas System (Data Analysis, Data Mining, Data Viz, Python) 01/2023 – 05/2023
● Analyzed over 50,000 data points with machine learning models (K-Means, DBSCAN) to categorize glucose patterns and developed custom visualizations (Matplotlib, Seaborn) to simplify complex patient metrics.
● Guided the development of targeted treatment strategies, enhancing patient treatment plans and resulting in a 13% boost in patient outcome efficiency and an 8% improvement in clinical decision-making accuracy.
NYC Taxi Hotspot Analysis (Apache Spark, Scala, Hadoop) 01/2024 – 05/2024
● Performed large-scale spatial analysis on NYC taxi trip data using Apache Spark and Scala, applying statistical models to identify and visualize significant hotspots for urban planning.
● Processed 48 months of NYC taxi trips in Spark/Scala to pinpoint the top 50 monthly hotspots across the city.