Pranav Sharma
MS Syracuse University, Alumni from BITS PILANI. # 646-***-**** ****************@*****.*** Degrees/Qualifications obtained Colleges/Universities attended Year of Passing MS. Applied Data Science (GPA : 3.8) Syracuse University 2023 Bachelor of Engineering (Hons.) Birla Institute of Technology and Science Pilani 2018 Countries of Work Experience: India, USA.
Professional Highlights:
Experienced AI/ML and Data Engineering professional with over 4 years of experience across the U.S. and India, building production-grade machine learning pipelines, designing scalable data architectures, and deploying NLP-based solutions in healthcare, banking, and academic settings. Proven track record in leading end-to-end ML model development, cross-functional collaboration with clinicians and business stakeholders, and supporting data-driven decision-making using deep learning, text analytics, and semantic modelling techniques. Adept at cloud-native development and deployment using AWS, RedCap, Cerner, and modern MLOps tools. Technical Skills:
Category Skills
Programming Languages Python, R, SQL/PLSQL, Java, JavaScript, Shell Scripting, OWL, RDF, SPARQL. Software Tools TensorFlow, PyTorch, Scikit-learn, PySpark, FastAPI, Tableau, Power BI, Qlik Sense, Figma, Rally, ServiceNow, Excel, Git, Docker, Kubernetes. Cloud Platforms AWS (S3, Lambda, Glue, SageMaker), Azure (Machine Learning, Databricks, Cognitive Services), Cerner EMR, RedCap, ELSO Registry, BPM (Business Process Management) Detailed Work Experience:
Keck Medicine, USC, CA
Clinical Data Scientist ( Second Tenure ) 2025 - Present
• Developed ML model to classify cardiac procedures from unstructured Operative Notes using NLP and benchmarking techniques.
• Engineered structured datasets from Cerner for post-operative outcome analysis and health cost insights.
• Led mapping of 1,200+ fields between legacy and new clinical systems, ensuring seamless data integration.
• Performed data analysis to identify high-impact surgical procedures based on frequency, insurance coverage, and hospital payments to provide actionable insights for increasing revenue of Keck Medicine of USC.
• Partnered with cross-functional surgical teams to improve readmission detection using SQL/PLSQL logic. Data Scientist (First Tenure) 2023- 2023
• Extracted and transformed data from the Cerner database into a structured data model, adhering to process specifications and coding best practices.
• Identified and resolved discrepancies in Electronic Health Data through root cause analysis based on defined medical logic.
• Analyzed Electronic Health Records for the Department of Surgery as part of the Data Analytics Service Team, contributing to data quality and integration efforts.
Capgemini, India
Associate Consultant (SDE III) 2018 - 2021
• Supported 13 banking applications with SQL triggers and backend components ensuring 99.9% uptime.
• Reduced production incidents by 50% through proactive monitoring and stakeholder collaboration.
• Acted as a liaison between business clients and the technical team, translating requirements and ensuring aligned execution. Syracuse University
Data Engineer/ Research Associate 2022 - 2024
• Built scalable NLP pipelines with PySpark to cluster 70k users and process Reddit data for LLM bias detection.
• Created TFIDF and time-series models to predict sentiment and user engagement across 7M+ records.
• Designed a semantic architecture leveraging RDF/SPARQL to boost accessibility and performance analytics. Selected Accomplishments:
• Executed 30+ production deployments with zero downtime.
• Maintained and scaled 17 enterprise banking applications.
• Engineered EMR data transformation for ELSO Registry RedCap Cloud.
• Developed a semantic model to query terabytes of Reddit text data efficiently. Certifications: AWS Academy Cloud Foundations