Post Job Free
Sign in

Data Science Analyst

Location:
College Park, IN, 46268
Posted:
May 22, 2025

Contact this candidate

Resume:

Yash Pandey

501-***-**** *******@**.*** linkedin.com/in/yashp15 github.com/yash-pandey24

Education

Indiana University Bloomington Bloomington, IN

Master of Science in Data Science GPA: 3.5/4.00 Aug. 2022 – Dec 2024 SRM University KTR Chennai, India

Bachelor of Technology in Computer Science Engineering Aug. 2018 – May 2022 Technical Skills

Languages & Libraries: Python, SQL, R, pandas, NumPy, matplotlib, seaborn, plotly, BeautifulSoup Data & Dev Tools:: MySQL, Snowflake, dbt, Power BI, Tableau, Jupyter Notebook, Google Colab, VS Code, Git, GitHub, Node.js, React.js, MERN Stack

ML & Software Engineering: Supervised Learning, Unsupervised Learning, Deep Learning, Neural Networks, CNN, BERT, Transformers, Predictive Modeling, GitHub, OOP, REST APIs, Modular Code Design, Testing and Debugging, Statistics: Probability, Hypothesis Testing, Linear Algebra. Experience

AI Software Engineer Feb 2025 – present

Tecsource International LLC Little Rock,AR

• Spearheading the design and development of an AI-powered SaaS platform to automate PTE (Pearson Test of English) coaching, enhancing training scalability and accessibility.

• Implementing AI avatars and NLP-based chatbots using MERN stack architecture to deliver real-time class simulations and automated evaluation across all PTE modules (Speaking, Writing, Reading, Listening).

• Building intelligent dashboards for trainer insights, performance tracking, and class supervision.

• Upon completion, the platform will emulate human instructors, significantly increasing student capacity per trainer and transforming the online language coaching industry. Data Engineer Feb 2025 – present

Harken Data Somerset,NJ

• Designed and deployed ETL pipelines using dbt for modular SQL-based data transformation.

• Integrated dbt with GitHub for version control, testing, and collaborative workflow management.

• Implemented scalable Snowflake warehousing solutions for secure and efficient data storage.

• Streamlined data workflows and optimized performance to enable faster, reliable analytics. Data Science Intern May 2023 – Aug 2023

Globtier Noida,India

• Improved coding skills by 70% by creating Python-based mini-projects, including interactive games and simulations.

• Led the creation of a Resume Parser using Python, NumPy, and resume parse libraries; increased hiring efficiency by 40%.

• Streamlined recruitment workflow by implementing an AI-driven applicant tracking system, reducing time-to-hire by 40% and a 30% increase in qualified candidates for technical roles. Projects

IMDb Sentiment Analysis Python, Scikit-learn, TensorFlow, Gensim, NLTK, BERT Aug 2024 – Dec 2024

• Developed sentiment classification models using Logistic Regression, Decision Tree, Naive Bayes, and BERT on 50K IMDb reviews.

• Applied TF-IDF, Word2Vec, GloVe, and BERT embeddings; achieved up to 90% accuracy and 0.98 AUC.

• Visualized performance using confusion matrices, ROC curves, and word clouds for interpretability.

• Compared traditional ML and deep learning models; presented findings in a technical report. Sales Data Analysis Dashboard Python, Power BI, DAX, pandas, numpy, plotly July 2024 – Aug 2024

• Built an interactive sales dashboard using Power BI and Python for data cleaning and visualization.

• Wrote custom DAX queries to track metrics like Total Profit and Avg. Discount.

• Used pandas and numpy for advanced preprocessing of large datasets.

• Designed dynamic visuals with Plotly and Power BI to highlight trends and regional performance. PUBLICATION

Customer Churn Analysis in Telecom Organization W Journal Of Positive School Psychology (JPSP)



Contact this candidate