Post Job Free
Sign in

Machine Learning Data Analyst

Location:
Florham Park, NJ
Salary:
90000
Posted:
July 02, 2024

Contact this candidate

Resume:

SANISHA KOLANU

+1-571-***-**** Morristown, NJ

**************@*****.*** linkedin.com/sanisha-kolanu/ github.com/sanisha OBJECTIVE

Highly motivated Data Analyst with expertise in machine learning, ETL processes, and database management. Ex- perienced in extracting strategic insights from large datasets to drive data-driven solutions. Committed to delivering precise and impactful results.

EDUCATION

Master in Data Analytics Engineering, George Mason University Aug 2022 - May 2024 Bachelor of Computer Science, Jawaharlal Nehru Technological University 2017 - 2021 SKILLS

Technical Skills: Python, SQL, Tableau, Power BI, R, Machine Learning, Weka, Excel, SSMS, MySQL, Data Modeling, Data Cleaning and Preparation, Microsoft Office, Microsoft Azure Cloud Services, AWS Cloud Services, Hadoop, Hive, Apache Spark

Soft Skills: Agile Framework, Regulatory Compliance, Strategic Leadership, Collaborative, Stakeholder Manage- ment, Analytical Problem Solving, Effective Communication, Customer Experience Strategy EXPERIENCE

Associate Software Engineer July 2021 - Aug 2022

Optum (United Health Care) Hyderabad, Telangana,India

• Managed syndicated lending transactions, ensuring smooth data processing and integration, which increased processing speeds by 50% and reduced costs by 30%.

• Handled Medicare user data sets, working on updates, scheduling, incremental processing, and data management to ensure seamless integration with the Wellmed Portal, improving client data accessibility and interaction.

• Spearheaded the migration of large-scale client datasets from on-premises servers to cloud-based storage, utilizing advanced API development for dynamic data updates and efficient retrieval.

• Optimized data management workflows by designing and managing SQL job schedules through SSMS, supporting precise incremental data processing, timely decision-making, and regulatory compliance. PROJECTS

PySpark ETL Project for Real-Time Data Processing Developed a real-time ETL data pipeline using PySpark, leveraging Apache Kafka for data ingestion and Apache Spark for processing. Utilized AWS EC2 and Docker, implementing Spark Structured Streaming and fault tolerance mechanisms, enhancing data quality and decision- making.

Loan Prediction Model Using H2O.ai . Developed a predictive model using Python and H2O.ai to determine loan approval based on historical data of over 100,000 records. Implemented multiple machine learning algorithms

(GBM, XGBoost) with performance optimization. Created a Flask web application for real-time predictions and containerized it using Docker. Conducted data cleaning, feature engineering, and utilized visualization libraries for data exploration and model evaluation..

RAG-Enhanced AI Chatbot for George Mason University’s Student Services (MSSC). Developed an AI chatbot using AWS technologies and large language models to provide 24/7 support and improve user interac- tion. Utilized AWS Kendra, AWS Lambda, and Retrieval Augmented Generation (RAG) for efficient, context-aware responses. Enhanced user engagement and ensured scalability and multilingual support. CERTIFICATIONS

• Microsoft Certification: Fabric Analytics Engineer Associate (Certification Link)



Contact this candidate