Post Job Free
Sign in

Machine Learning Data Scientist

Location:
Columbus, OH
Posted:
May 16, 2025

Contact this candidate

Resume:

Vishnu Tallapareddy

Data Scientist (AI/ML Engineer)

+1-380-***-**** ******.*@*****.***

SUMMARY

With 4 years of experience in Data Engineering and AI/ML engineering, I specialize in building scalable cloud-based solutions and intelligent systems. Leveraging AI/ML frameworks like TensorFlow, PyTorch, and scikit-learn, I have developed advanced systems for medical image analysis, recommendation engines, and predictive analytics. My expertise includes designing and optimizing data pipelines for efficient ETL workflows and real-time processing, as well as implementing robust MLOps practices to ensure seamless deployment, monitoring, and lifecycle management of machine learning models on platforms such as AWS and GCP. I have achieved significant reductions in model size and latency through techniques like quantization, pruning, and A/B testing. In healthcare IT, I have integrated systems adhering to HL7 standards, ensured HIPAA compliance, and developed secure APIs to manage sensitive medical data. My experience in Agile environments includes leading cross- functional teams, driving iterative development, and delivering scalable, high-quality solutions that align with business objectives.

SKILLS

●Programming/Libraries: Python (Pandas, NumPy, Scikit-learn, TensorFlow, Hugging face Jupyter), PySpark

●Databases: SQL, MongoDB, Oracle, MySql

●Machine Learning: Predictive Modeling, Supervised and Unsupervised Learning, Anomaly Detection, Feature Engineering, LLM’s

●Algorithms: KNN, Regression (Linear, Logistic, Multiple), Naive Bayes, Random Forest, SVM, NLP, K- Means

●Cloud Platforms: AWS (Glue, Redshift, SageMaker, Lambda, Athena, S3, EMR, Kinesis, Firehose, IAM)

●Big Data & Workflow Automation: Apache Airflow, Spark, Hadoop, Kafka, ETL/ELT Pipelines

●Data Engineering: Data Extraction, Data Validation, Dimensional Modeling, Data Warehousing

●Data Visualization: Tableau, Looker Studio, Power BI

●Project Management: Workflow orchestration, Agile, Software Development Life Cycle, Work Breakdown Structure, Slack, Jira

●Version Control Tools: Git, GitHub, GitLab, Bitbucket

CERTIFICATES:

●AWS Machine Learning Specialty (In progress)

●AWS Data Engineer Associate

●AWS Educate Introduction to Generative AI

●Celonis EMS Technical Expert

WORK EXPERIENCE:

Data Scientist (AI/ML Engineer) Broadcom July 2024 – Present

●Conducted statistical analysis and developed predictive models, reducing feature engineering workflows’ runtime by 20% and improving machine learning model training efficiency.

●Extracted and analyzed large datasets using PySpark, SQL, and AWS tools, generating actionable insights that led to a 15% increase in product performance and revenue optimization.

●Designed and implemented GPT-like large language model prototypes, achieving 10% efficiency gains over baseline transformer models and enabling scalable deployment for real-world applications.

●Applied transfer learning and fine-tuning techniques to LLMs, improving performance for domain-specific tasks like conversational AI, sentiment analysis, and document summarization.

●Conducted prompt engineering experiments to optimize LLM outputs, saving 15% in content generation costs.

●Leveraged Kafka to process over 10,000 events/second, building scalable and fault-tolerant data pipelines that reduced data latency by 25% and supported real-time analytics.

●Implemented natural language processing (NLP) pipelines for text classification, sentiment analysis, and recommendation systems.

●Stayed up to date with the latest AI/ML trends and technologies, integrating emerging techniques like deep learning, reinforcement learning, and transfer learning into projects.

●Applied statistical and machine learning techniques like regression, clustering, classification, and deep learning to drive business outcomes.

●Developed a cloud-native MLOps platform on AWS for scalable AI/ML deployment and management using Python, TensorFlow ensuring 99.9% availability while handling petabytes of medical imaging data.

●Utilized TensorFlow and PyTorch for deep learning analysis of medical images, achieving 90% accuracy on 100,000 X-ray/MRI scans. Conducted A/B testing to compare model performance for data-driven decision-making.

●Employed advanced optimization techniques such as quantization and pruning (TensorFlow-Model-Optimization) to reduce ML model size by 60% without sacrificing accuracy.

●Established a scalable data ingestion pipeline on AWS (S3, Glue, Athena) for processing and storing terabytes of medical data from diverse sources, enduring reliability and scalability.

●Utilized pre-built machine learning algorithms (scikit-learn, XGBoost) for predictive analytics on medical data, resulting in an 18% improvement in diagnosis accuracy.

●Coordinated cross-functional teams to resolve critical system issues, achieving 99.9% uptime and reducing 50% incident resolution time.

Programmer Analyst Cognizant, India February 2022 – June 2023

●Developed an AI-driven recommendation engine for a leading e-commerce conglomerate with Cognizant, used microservices architecture with Java and Spring Boot.

●Implemented advanced machine learning algorithms to personalize product recommendations based on user behavior and preferences, leading to a 35% reduction in response times.

●Employed advanced AI techniques like NLP and collaborative filtering to enhance product recommendations, leading to a 20% increase in customer engagement and conversion rates.

●Preparation of Understanding documents, creating knowledge repository for the application process and end to end flow docs.

●Implemented reinforcement learning techniques to optimize product recommendations over time, allowing the system to adapt and improve based on user feedback and interactions.

●Integrated Elasticsearch and Apache Kafka for real-time data processing, enabling the recommendation engine to dynamically adapt to changing user interactions and market trends.

●Perform test estimation, test planning, requirement analysis, test design, test execution, defect management and test closure activities.

●Utilized unsupervised learning algorithms such as clustering and dimensionality reduction to segment customers based on browsing and purchasing behavior, enabling more targeted and personalized recommendations.

●Conduct daily review meetings for the entire team to track and streamline the workflow.

●Leveraged machine learning models for sentiment analysis and trend forecasting to provide actionable insights to the e- commerce client, enabling data-driven decision-making and strategic planning.

Celonis Data Engineer TCS India April 2020 – January 2022

●Utilized SQL to query and analyze large datasets, generating actionable insights that supported process optimization and strategic decision- making.

●Assisted in identifying operational inefficiencies and recommending process improvements to enhance performance.

●Assisted in developing resource allocation models by performing data analysis with SQL, contributing to a 15% improvement in project delivery timelines.

●Used tools like Excel, Python, R, and statistical software to perform quantitative analysis and modeling.

●Designed and maintained dashboards to monitor key performance indicators (KPIs) using SQL and data visualization tools, enabling real- time decision-making for operations teams.

●Operated financial market research, collected data and performed analysis by using statistical software

●Performed sensitivity analysis and risk assessment using SQL and statistical techniques to evaluate the impact of variable changes on business outcomes, enhancing decision-making accuracy under uncertainty.

EDUCATION

●Master’s in Data Science from Franklin University, Columbus, OH – USA.

●Bachelor’s in Computer Science and Engineering from Narsimha Reddy Engineering College - INDIA.



Contact this candidate