Machine Learning Data Scientist

Location:
Hampton, NJ
Salary:
75000
Posted:
April 28, 2025

Resume:

Ishwar Girase

Data Scientist Machine Learning Engineer

+1-551-***-**** ******.*@***********.*** Oakwood Blvd, New Jersey LinkedIn GitHub

SUMMARY

● Dynamic Data Scientist with 5 years of experience specializing in data warehousing, extraction, and modeling, leveraging tools like Python, SQL, AWS, Power BI, and machine learning algorithms for efficient data processing and deriving actionable insights.

● Expertise in developing and optimizing machine learning algorithms, including regression, classification, and time series models, boosting prediction accuracy and improving business outcomes, enabling more informed decision-making and strategic growth.

● Strong background in statistical modeling and analysis, utilizing Python, R, and advanced techniques to extract actionable insights from complex datasets, driving data-driven decision-making and enhancing problem-solving capabilities across various domains.

● Proven track record in collaborating with cross-functional teams to implement data-driven solutions, enhancing decision-making, optimizing operational efficiencies, and delivering measurable results across a wide range of industries and business sectors.

EXPERIENCE

Unum, USA Data Scientist Feb 2024 - Current

● Created and fine-tuned machine learning algorithms with Scikit-learn, such as Logistic Regression, Random Forest, and XGBoost, improving prediction accuracy by 9%, enhancing model performance, and boosting forecasting reliability.

● Used Python and SQL to manage large datasets and conduct complex statistical analysis, providing data-driven insights that enabled more informed decision-making and optimized processes across different business units and operations.

● Developed a GenAI system workflow using Retrieval-Augmented Generation (RAG) and LLMs to process insurance policy documents and patient records, delivering real-time, context-aware risk summaries for underwriters.

● Analyzed both structured and unstructured data, improving data processing workflows and translating complex insights into clear, actionable findings, resulting in a 30% improvement in decision-making efficiency across multiple departments.

● Utilized advanced machine learning models, SQL joins, and window functions to address business-specific financial challenges, enhancing predictive capabilities and operational efficiencies through complex queries and data manipulation techniques.

● Deployed machine learning models to cloud environments using AWS SageMaker with CI/CD pipelines, integrating production-ready code into real-time applications and ensuring smooth execution for continuous performance improvements.

● Worked on financial models, conducting detailed analysis with Tableau to visualize trends and patterns, helping to inform strategic decisions in finance projects and improving the accuracy of business forecasting.

● Collaborated with cross-functional teams to present findings and communicated technical concepts to both technical and non-technical stakeholders, demonstrating strong communication and problem-solving skills while driving project success.

Infinite Infolab, India Machine Learning Engineer Mar 2019 - Jul 2022

● Leveraged Power BI to develop interactive dashboards that visualized real-time model results, allowing stakeholders to easily validate insights and ensure alignment with business objectives prior to full-scale deployment.

● Collaborated with data engineers to design and optimize ETL processes, leveraging SQL to extract, clean, and transform data for analysis, ensuring efficient data flow, improved data quality, and streamlined processes for data-driven decision-making.

● Utilized NLP techniques and NLTK (Natural Language Toolkit) for text preprocessing, feature extraction, and text mining tasks, transforming unstructured data into structured datasets, enabling more effective analysis and actionable insights across projects.

● Built and validated machine learning models, including Logistic Regression, Decision Trees, and Neural Networks, to predict key business outcomes such as customer churn and product recommendations, driving strategic decisions and business growth.

● Applied time series analysis techniques to analyze sequential data, identify trends, and implement both supervised and unsupervised learning algorithms, improving forecasting accuracy and uncovering hidden patterns in the data.

● Managed and optimized data workflows in Amazon Redshift, utilizing its data warehousing capabilities to efficiently process and analyze large datasets, improving query performance and significantly reducing data processing times across operations.

● Deployed and managed cloud-based infrastructure on AWS, including EC2 instances, S3 buckets, RDS databases, and Lambda functions, ensuring system scalability, high availability, and enhanced performance across the environment.

TECHNICAL SKILLS

Language: Python, R, SQL, Shell Scripting

IDEs: Visual Studio Code, PyCharm, Jupyter Notebook, Google Colab

Statistical Methods: Hypothesis Testing, ANOVA, Time Series, A/B Testing

Machine Learning: Regression analysis, Bayesian Method, Decision Tree, Random Forests, Neural Network, Gen AI, RAG, Sentiment Analysis, K-Means Clustering, KNN, Classification, SVM, Natural Language Processing (NLP), LLM, CNN, XGBoost

Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-Learn, PyTorch, TensorFlow, Keras, Spark, Seaborn

Visualization Tools / Database: Tableau, Power BI, Microsoft Excel, MySQL, SQL Server, Oracle, MongoDB, Vector DB

Cloud Technologies: AWS, Azure

Software/Other Skills: Jira, Data Cleaning, Data Wrangling, Critical Thinking, Communication Skills, Presentation Skills, Problem-solving, Decision-Making, EDA, Databricks, Data Visualization, Predictive Analytics, Pattern Recognition, Data Integrity, Quantitative Data, Data Science, Statistics, Statistical Analysis, Data Analytics, Data Modeling, BigQuery, Snowflake, SDLC, Agile, Waterfall, Figma

Operating System: Windows, Linux

CERTIFICATION

AWS Cloud Practitioner

Applied Machine Learning by UTD

Python for Data Science and AI Development by IBM

EDUCATION

University of Texas at Dallas, Dallas, TX Aug 2022 – May 2024

Master of Science in Business Analytics – Machine Learning

Dr Babasaheb Ambedkar Technological University, India Aug 2017 – Jun 2021

Bachelor of Technology in Computer Engineering


