Avi Kumar Talaviya Data Scientist
Tucson, AZ 520-***-**** *********@*******.*** https://github.com/avikumart PROFESSIONAL SUMMARY
Data Scientist with a strong background in advanced analytics and machine learning, experienced in driving business enablement through a Centre of Excellence (COE) model. Proven track record in leading complex analytical projects, building scalable predictive models, and optimising high-performance data workflows. Expert in statistical modelling, MLOps, and big data technologies, with a passion for coaching teams and delivering data-driven insights to improve business outcomes. TECHNICAL SKILLS
● Programming & Databases: Python (Pandas, NumPy, Scikit-learn), R, SQL (Advanced Querying), MySQL, PostgreSQL, NoSQL.
● Data Science & AI: Machine Learning (GLM, Random Forest, XGBoost, Clustering, Anomaly Detection), Deep Learning (Transformers, CNNs, RNNs), Statistics, Text Mining, NLP, Time-Series Forecasting.
● Cloud & Big Data: Spark, PySpark, Hadoop, High-Performance Computing (HPC), Azure, Cloud AI Platforms, Containerization.
● MLOps & Tools: MLflow, Prefect, ETL Pipelines, Git, GitHub, Linux/Bash, Model Deployment & Monitoring.
● Visualisation: Tableau (Preferred), Power BI, Matplotlib, Seaborn, Plotly.
● Core Competencies: Data Management, Data Governance, Stakeholder Communication, Agile Methodologies.
WORK EXPERIENCE
University of Arizona - ECE Research Associate Aug 2025 – Present Tucson, AZ
● Execute complex analytical projects in the healthcare domain using high-performance computing (HPC) environments to drive research-based decision-making.
● Develop scalable, interpretable transformer-based models in PyTorch for multi-modal signal classification, ensuring adherence to data governance standards.
● Collaborate with cross-functional stakeholders to translate complex technical findings into actionable business/research insights.
Tops Technologies Pvt. Ltd. Data Scientist Sep 2024 – May 2025 Surat, India
● Served as a subject matter expert in advanced analytics, developing repeatable and dynamic data products for large-scale training stacks.
● Optimized advanced SQL queries and database designs, increasing data retrieval efficiency by 70% to support business enablement.
● Coached and upskilled junior team members in Python and SQL, fostering a culture of data science excellence within the organization.
Omdena Local Chapter Data Science Project Lead Mar 2023 – May 2023 Mumbai, India
● Led a team of 25+ members in an end-to-end AQI prediction project, establishing long-term processes and frameworks for ETL pipelines.
● Communicated analytics approaches and model performance (80%+ RMSE accuracy) to stakeholders, bridging the gap between technical execution and business objectives.
● Integrated disparate internal and external datasets to build robust time-series forecasting models using Scikit-learn.
The Machine Learning Company Data Science Trainee Nov 2022 – Feb 2023 Pune, India
● Performed exploratory data analysis (EDA) and feature engineering for road traffic severity classification, achieving an 88% F1 score.
● Applied Principal Component Analysis (PCA) to reduce dimensionality by 60%, optimizing model training time and scalability.
EDUCATION
● MS, Information Science University of Arizona GPA: 4.0/4.0 Jul 2024 – May 2026
● Bachelor’s, Data Science and Analytics Jain University GPA: 3.7 Aug 2021 – Jul 2024
ACHIEVEMENTS & PROJECTS
● Humyn.ai Award: Won AUD 1000 for identifying data-driven trends influencing project cost overruns.
● GenAI/SQL App: Built a Natural Language-to-SQL tool using Llama Index to automate query generation and improve data accessibility.
● Road Traffic Project: Implemented Chi-squared tests and statistical modeling to optimize classification accuracy.