Naveen Morla
• 419-***-**** • ******.*******@*****.*** • https://www.linkedin.com/in/Naveen-Morla/ • https://github.com/naveenmorla1901
WORK EXPERIENCE
Data Scientist / Software Engineer, THK Manufacturing of America, Inc. Jan 2024 – Present
● Engineered high-performance APIs using FastAPI with advanced caching for ingestion, processing, and retrieval of machine and inventory datasets; developed and fine-tuned predictive models with PyTorch and TensorFlow to improve lead-time forecasting and optimize inventory management.
● Executed comprehensive data preprocessing, feature engineering, and data wrangling to improve model accuracy and robustness; leveraged Polars and PySpark to accelerate processing of large datasets.
● Architected and managed containerized microservices with Kubernetes and Docker for scalable big data processing pipelines, implementing MLOps practices to streamline the deployment and monitoring of machine learning models.
● Created dynamic Tableau visualizations to support data-driven decision-making, and developed responsive frontend interfaces with React and JavaScript to improve the user experience.
● Utilized SQL for complex backend data management and querying, implemented Git version control, and established CI/CD pipelines for streamlined code management, automated testing, and continuous deployment.
Data Scientist, Pago Analytics, India May 2019 – Apr 2021
● Utilized Python and deep learning libraries to design resume parsing techniques for a Talent Acquisition Management System (TAMS). Developed advanced AI features that made the hiring process 30% more efficient.
● Engineered an automated hiring process and facilitated its deployment using AWS SageMaker. Collected 10,000+ resumes and labeled them both automatically and manually as training data, using NLP libraries such as NLTK and spaCy to analyze the data.
● Normalized SQL databases, improving data handling and system performance by 2x. Coordinated with cross-functional teams throughout the development and deployment process, following Agile methodologies.
● Applied strong analytical skills and creativity in data mining and in defining essential KPIs/metrics. Communicated complex data points clearly to stakeholders using Power BI. Fostered lasting stakeholder relationships that supported smooth project execution, and participated in code reviews.
Data Analyst, Knowledge Matrix Pvt Ltd, India Mar 2018 – Feb 2019
● Applied a structured data analysis workflow and built an ETL pipeline with Python and SQL to gather, clean, and transform data, uncovering insights for informed decision-making and saving 5+ hours when processing 10 million records.
● Used Tableau and Power BI to create dashboards and reports that presented key findings to stakeholders and supported strategic planning.
● Applied statistical techniques and data visualization to identify patterns, correlations, and anomalies, providing actionable insights and recommendations for process improvement and business growth.
Languages: Python, R, JavaScript, SQL, NoSQL.
Data Science Technologies: Machine Learning, Big Data, OpenCV, LLM Agents, RAG, MLOps, A/B Testing, Fine-tuning Models.
Packages/Tools/Frameworks: Scikit-Learn, Graph Neural Networks, NumPy, SciPy, Pandas, NLTK, LangChain, Matplotlib, Keras, Jupyter Notebook, TensorFlow, PyTorch, Git, JIRA, FastAPI, React, Cloud Computing, Hugging Face Transformers, Indexing, BERT.
Databases: MySQL, MongoDB, Cassandra, Elasticsearch, PostgreSQL, DynamoDB, AWS RDS, Vector Databases.
Statistics/ML: Statistical Analysis, Predictive Analytics, Statistical Modelling, Linear/Logistic/Ridge/Lasso Regression, K-means Clustering, Classification, Data Mining, Decision Trees, Recommender Systems, Random Forests, XGBoost, LightGBM, SVM.
EDUCATION
● BOWLING GREEN STATE UNIVERSITY, OHIO Aug 2021 – May 2023
M.S. in Data Science
Coursework: Data Science Programming, Database Management Systems, Regression Analysis, Data Mining, Time Series Analysis, Artificial Intelligence Methods, Probability Theory I, Mathematical Statistics II, Linear and Integer Programming, Data Science Project.
ACCOMPLISHMENTS/CERTIFICATIONS
Certifications:
● Coursera: Python and Machine Learning, Introduction to Data Analytics.
● Udemy: Tableau, Power BI, Django.
● DataCamp: Introduction to R Programming and Statistics in Spreadsheets and Excel.