Post Job Free
Sign in

Machine Learning Data Scientist

Location:
Georgetown, TX
Posted:
January 10, 2025

Contact this candidate

Resume:

Summary

Skills

Data Scientist with *+ years of experience in Python, SQL, and big data technologies for data processing, transformation, and engineering.

Actively learning and exploring AI methodologies, including large language models (LLMs), deep learning frameworks, and advanced machine learning techniques to enhance problem-solving capabilities.

Developed advanced algorithms to optimize system efficiency and effectiveness through sensor data analysis.

Applied data science and machine learning to build predictive models and address complex business challenges.

Skilled in data manipulation, statistical analysis, and data visualization for actionable insights and informed decision-making.

Proficient in Python, R, and SQL for in-depth analysis and visualization of complex datasets.

Leveraged predictive modelling and personalization algorithms to streamline operational processes.

Experienced with AWS solutions, including Redshift, Lambda, DynamoDB, and S3, and data warehouse platforms like Snowflake and Teradata.

Proficient in big data and data processing tools, including Linux, Spark, and NoSQL databases, to support data engineering and analytics.

Utilized analytical libraries (NumPy, Pandas, Matplotlib, Scikit-Learn) for advanced data analytics.

Created impactful visualizations with Tableau, Power BI, and Excel for data-driven decisions.

Experienced with databases like Oracle DB, MySQL, and SQL Server.

Strong problem-solving skills with excellent communication, critical thinking, and presentation abilities.

Proven ability to collaborate effectively with cross-functional teams. University of North Texas TX, USA 2022-2023

Master of Science - Business Analytics

Jawaharlal Nehru Technological University Hyderabad, India 2014-2018 Bachelor of Technology – Electronics and Communication Engineering Languages: Python, R, SQL, and SAS

Databases: MS SQL, SQLite, Oracle, NoSql, DynamoDB, Redshift, Snowflake, AWS S3 Libraries/Frameworks/

Algorithms:

NumPy, Pandas, Seaborn, TensorFlow, Ggplot, Random Forest, ARIMA, AdaBoost, Linear regression, Logistic regression, Decision Tree, Scikit-Learn, NLP, KNN, SVM, PySpark, Splunk, PyTorch, Large Language Models (LLMs), BERT, GPT, Transformer Models Data Engineering &

Workflow Orchestration:

ETL. Airflow, Kafka, dbt

Visualization Tools: Tableau, QlikView, Power BI

Platform/IDE Jupyter Notebook, PySpark, R-Studio, R Shiny Version Control Tools: GIT, GitHub and JIRA

Data Analysis Techniques: Data Visualization, Data Mining, and Data Warehousing Cloud Platforms: AWS, Microsoft Azure

Methodologies: Agile, SDLC, and Waterfall

Operating System: Windows, Linux, and macOS

Container/Orchestration: Docker, Kubernetes

Education

Sankeerth Poojala

Austin, TX Mobile: 513-***-**** Email: ****************@*****.*** Logicera Inc, Austin, TX Aug 2023 - Present

Data Scientist

Designed and optimized ETL workflows using SQL to extract, transform, and load large-scale data efficiently while ensuring data integrity.

Conducted A/B testing, hypothesis testing, and regression analyses using statistical packages like Python, R, and SAS to derive actionable insights and support data-driven decision-making.

Developed and deployed machine learning models using Python (NumPy, Pandas, SciKit Learn, LightGBM) and Spark MLlib, delivering scalable solutions tailored to business needs.

Applied predictive modeling techniques and personalization algorithms to solve complex problems, with exposure to deep learning frameworks like PyTorch and TensorFlow.

Created visually compelling dashboards and reports in Tableau to communicate insights effectively to stakeholders, enabling informed decision-making.

Leveraged Snowflake and Oracle databases for data retrieval and analysis, ensuring robust and efficient handling of large- scale datasets.

Expert in using Tableau and Snowflake for building data visualizations and supporting advanced analytics tasks.

Extensive use of Python and R for data analysis and statistical modeling, including tools like Matplotlib, Seaborn, and Plotly for visualizations.

Demonstrated expertise in object-oriented programming, with a focus on developing maintainable and efficient data science solutions.

Deployed machine learning pipelines on GCP and AWS, leveraging Docker and orchestration tools such as Airflow for efficient model lifecycle management.

Identified and implemented technical solutions to improve data processes, showcasing a proactive problem-solving approach.

Partnered with cross-functional teams to frame problems and design AI/ML solutions, leveraging large language models

(LLMs) like GPT and BERT for text classification, and sentiment analysis, driving measurable impact and innovation in natural language processing projects.

Intex Technologies Ltd, India Oct 2018 - Nov 2021

Data Science Analyst

Proficient in leveraging AWS and Azure cloud services to integrate, orchestrate, and automate data workflows, enhancing data processing efficiency and scalability.

Applied advanced data science, machine learning, and deep learning algorithms to develop enterprise-scale solutions and derive actionable insights.

Designed dashboards and tracked success metrics to measure financial results, customer satisfaction, and engagement using Tableau, Power BI, and Microsoft Excel.

Expert in data cleaning, validation, and wrangling techniques to ensure accuracy and reliability of datasets for analysis and modeling.

Crafted optimized SQL queries for ETL processes and advanced data manipulation, including window functions, CTEs, and performance tuning for large-scale datasets.

Utilized Python libraries like NumPy, Pandas, Matplotlib, Seaborn, SciPy, and SkLearn for exploratory data analysis, advanced data analysis, predictive modeling, and visualization.

Developed Python scripts for data manipulation, process automation, and integration of disparate datasets, increasing operational efficiency.

Skilled in methodologies like Agile and SDLC for successful project execution and delivery within defined timelines.

Extensive experience creating impactful visualizations in Tableau and Power BI, effectively communicating complex data insights to stakeholders.

Proficient in working with relational databases such as MySQL, Oracle, and SQL Server to manage and analyze large-scale datasets.

Worked with Excel for complex functions, pivot tables, advanced data visualization, and VBA automation to streamline repetitive tasks.

Strong problem-solving and critical thinking skills, coupled with excellent communication and collaboration abilities, fostering data-driven decisions.

Comfortable working in diverse technological environments, including Windows, Linux, and cloud-based platforms. Experience



Contact this candidate