Post Job Free
Sign in

Data Scientist with ML, NLP, and SQL Expertise

Location:
Tucson, AZ
Posted:
January 29, 2026

Contact this candidate

Resume:

Avi Kumar Talaviya Data Scientist

Tucson, AZ 520-***-**** *********@*******.*** https://github.com/avikumart PROFESSIONAL SUMMARY

Data Scientist with a strong background in advanced analytics and machine learning, experienced in driving business enablement through a Centre of Excellence (COE) model. Proven track record in leading complex analytical projects, building scalable predictive models, and optimising high-performance data workflows. Expert in statistical modelling, MLOps, and big data technologies, with a passion for coaching teams and delivering data-driven insights to improve business outcomes. TECHNICAL SKILLS

● Programming & Databases: Python (Pandas, NumPy, Scikit-learn), R, SQL (Advanced Querying), MySQL, PostgreSQL, NoSQL.

● Data Science & AI: Machine Learning (GLM, Random Forest, XGBoost, Clustering, Anomaly Detection), Deep Learning (Transformers, CNNs, RNNs), Statistics, Text Mining, NLP, Time-Series Forecasting.

● Cloud & Big Data: Spark, PySpark, Hadoop, High-Performance Computing (HPC), Azure, Cloud AI Platforms, Containerization.

● MLOps & Tools: MLflow, Prefect, ETL Pipelines, Git, GitHub, Linux/Bash, Model Deployment & Monitoring.

● Visualisation: Tableau (Preferred), Power BI, Matplotlib, Seaborn, Plotly.

● Core Competencies: Data Management, Data Governance, Stakeholder Communication, Agile Methodologies.

WORK EXPERIENCE

University of Arizona - ECE Research Associate Aug 2025 – Present Tucson, AZ

● Execute complex analytical projects in the healthcare domain using high-performance computing (HPC) environments to drive research-based decision-making.

● Develop scalable, interpretable transformer-based models in PyTorch for multi-modal signal classification, ensuring adherence to data governance standards.

● Collaborate with cross-functional stakeholders to translate complex technical findings into actionable business/research insights.

Tops Technologies Pvt. Ltd. Data Scientist Sep 2024 – May 2025 Surat, India

● Served as a subject matter expert in advanced analytics, developing repeatable and dynamic data products for large-scale training stacks.

● Optimized advanced SQL queries and database designs, increasing data retrieval efficiency by 70% to support business enablement.

● Coached and upskilled junior team members in Python and SQL, fostering a culture of data science excellence within the organization.

Omdena Local Chapter Data Science Project Lead Mar 2023 – May 2023 Mumbai, India

● Led a team of 25+ members in an end-to-end AQI prediction project, establishing long-term processes and frameworks for ETL pipelines.

● Communicated analytics approaches and model performance (80%+ RMSE accuracy) to stakeholders, bridging the gap between technical execution and business objectives.

● Integrated disparate internal and external datasets to build robust time-series forecasting models using Scikit-learn.

The Machine Learning Company Data Science Trainee Nov 2022 – Feb 2023 Pune, India

● Performed exploratory data analysis (EDA) and feature engineering for road traffic severity classification, achieving an 88% F1 score.

● Applied Principal Component Analysis (PCA) to reduce dimensionality by 60%, optimizing model training time and scalability.

EDUCATION

● MS, Information Science University of Arizona GPA: 4.0/4.0 Jul 2024 – May 2026

● Bachelor’s, Data Science and Analytics Jain University GPA: 3.7 Aug 2021 – Jul 2024

ACHIEVEMENTS & PROJECTS

● Humyn.ai Award: Won AUD 1000 for identifying data-driven trends influencing project cost overruns.

● GenAI/SQL App: Built a Natural Language-to-SQL tool using Llama Index to automate query generation and improve data accessibility.

● Road Traffic Project: Implemented Chi-squared tests and statistical modeling to optimize classification accuracy.



Contact this candidate