Bhoj Raj Bhatt
Data Engineer
940-***-****, *************@*****.***
Irving, Texas
PROFESSIONAL SUMMARY:
Data Engineer with over 12 years of experience in building data systems, including ETL pipelines, big data workflows, machine learning models, and cloud-based solutions using AWS and Azure. Specialized in database design and development, with strong academic credentials including a Master’s degree in Data Science. Experienced in data modeling, real-time analytics, and delivering scalable, reliable, and secure data solutions using Python, SQL, and ML algorithms across diverse projects and industries.
TECHNICAL SKILLS:
Programming Languages: Python, PHP, SQL, C, C++, Basic (Java, R, C#)
Libraries/Frameworks: Pandas, NumPy, Seaborn, SciPy, Matplotlib, Scikit-learn, TensorFlow, NLTK, PyTorch, Keras
Data Modeling & ETL Tools: Apache Airflow, MySQL Workbench, RapidMiner, Pandas
Databases: MySQL, SQLite, MS SQL Server, Oracle Cloud RDBMS
Cloud Technologies: Amazon Web Services (EC2, S3, Lambda, RDS), Azure
AI/ML Tools: Gen AI, LLMs, NLP, Deep Learning, Machine Learning Algorithms (Linear Regression, Logistic Regression, KNN, k-means, Support Vector Machine, Decision Trees, Random Forest, XGBoost, PCA)
Visualization: Tableau, Power BI, MS Excel, RapidMiner, Octoparse
Web Development: HTML, CSS, JavaScript, jQuery, AJAX, JSON, WordPress, Laravel, Flask, etc.
Versioning Tools: Git
SDLC: Agile, Scrum, Waterfall, Rally, Jira
Web Scraping and Data Extraction: Octoparse, BeautifulSoup, Selenium
EDUCATION & CERTIFICATIONS:
Master of Science in Data Science, University of North Texas
Master of Computer Information System - NCIT (Pokhara University)
Bachelor in Computer Science and Information Technology - Tribhuvan University
PROFESSIONAL EXPERIENCE:
Oberon IT/V-Care America, Texas, USA May 2024 – Present Data Engineering
Responsibilities:
Designed and implemented automated ETL pipelines using Apache Airflow to extract, clean, and transform high-volume healthcare data from LIS and EMR systems into cloud storage (AWS S3, RDS, Azure) and BI platforms.
Developed and validated machine learning models using Python, SQL, and R to predict patient outcomes and improve operational efficiency.
Performed exploratory data analysis and identified diagnostic trends by analyzing structured and semi-structured clinical datasets.
Built and maintained dynamic dashboards in Tableau and Power BI to visualize healthcare KPIs for stakeholders across departments.
Applied statistical and ML techniques (regression, classification, survival analysis) to uncover actionable insights from patient data.
Documented reusable feature engineering workflows and supported knowledge sharing across the analytics team.
Enforced HIPAA-compliant data protocols, ensuring secure handling and access control of sensitive patient information.
Collaborated with infrastructure and clinical teams to align data engineering practices with institutional goals and compliance standards.
Designed and developed relational databases in MySQL to support an internal operations management system, ensuring reliable and optimized data storage for healthcare workflows.
Environment: Python, SQL, R, Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn, Tableau, Power BI, Git, HL7, EMR/LIS, AWS (EC2, S3, Lambda, RDS), Azure, Apache Airflow, PHP, HTML, CSS, JavaScript, Agile, Rally, Jira.
UNT, Data Visualization and Extreme Reality Lab Oct 2023 - Dec 2023 Research Assistant
Responsibilities:
Built interactive time series data pipelines and visualization tools for geospatial datasets using Python and SQL.
Developed and deployed interactive data applications using C# and Unity for real-time and immersive visualization.
Designed and managed SQLite databases to structure and store geospatial and temporal data efficiently.
Integrated GIS components into Unity-based VR/AR visualizations to enhance user interaction with spatial data.
Conducted data cleaning, transformation, and feature engineering for downstream analytics.
Collaborated with researchers and developers to test and improve immersive VR/AR data visualization prototypes.
Environment: Python, PHP, C#, R, SQL, SQLite, Tableau, Unity, Git, JSON, HTML, CSS, GIS, JavaScript, Scikit-learn, Confluence, Agile Scrum
GON, National Reconstruction Authority, Nepal Jan 2017 - Aug 2021 Data Engineer
Responsibilities:
Designed and implemented a centralized data warehouse using SQL to integrate government databases, satellite imagery, and field reports, improving data accessibility.
Built and maintained ETL pipelines and real-time data workflows to support large-scale analytics across petabytes of structured and unstructured data.
Applied data governance frameworks to ensure consistency, quality, and security across the platform.
Led the end-to-end analytics process for earthquake housing reconstruction, including data collection, cleaning, transformation, model development, validation, and prediction, integrating datasets from diverse field and geospatial sources.
Developed predictive and deep learning models using Scikit-learn, TensorFlow, and Keras; automated data cleaning pipelines in Python (Pandas, NumPy).
Created Tableau dashboards and custom visualizations (web application) to monitor reconstruction KPIs and support decision-making.
Designed and deployed a cron-based mass mailer system to notify beneficiaries about their grant distribution status.
Conducted socioeconomic impact analyses and implemented geospatial/time-series forecasting to optimize resource planning.
Delivered training on analytics workflows, managed Agile project timelines, and documented processes for knowledge transfer.
Created JSON data files to transfer structured information from the web application to mobile and tablet applications, supporting field data access and synchronization.
Environment: Python, PHP, C#, R, SQL, MySQL, Oracle, Tableau, Power BI, Git, RESTful APIs, Cron Job, JSON, HTML, CSS, GIS, JavaScript, TensorFlow, Scikit-learn, Agile
Recent IT Solution and Research Center, Kathmandu, Nepal Jan 2014 - Jan 2017 Software Engineer - Data
Responsibilities:
Led the design and development of data-centric, scalable web applications and data pipelines, with a focus on optimizing data accessibility, processing, and storage.
Managed the full lifecycle of data-focused application development, from concept through deployment, with an emphasis on integrating and managing large datasets for e-commerce and content management solutions.
Collaborated with cross-functional teams to align data solutions with client needs, leading to successful project outcomes and enhanced client satisfaction.
Designed and optimized relational databases (MySQL) to support scalable web applications, and utilized data analysis techniques and scripting (Python) to extract insights for improving user engagement and system performance.
Developed a mass mailer integrated into a B2B e-commerce site, using cron jobs to send promotional emails to customers.
Directed comprehensive training programs for development staff in data handling, database management, and data analysis, improving project delivery efficiency and technical capabilities.
Projects: E-commerce Websites (B2B, B2C), News Portals, Job Portals, Real Estate, Mass Mailer, Institute Management System, School Management System, Library Management, Cargo Billing System, Hospital Management System, and many more.
Environment: Python, MySQL, JavaScript, PHP, HTML, CSS, JSON, AJAX, jQuery, Bootstrap, Laravel, Joomla, WordPress, Opencart, CPanel, Apache, Git, Data Analysis (including scripting in Python), Database Management, Training and Development.
Oknepal Inc, Kathmandu, Nepal July 2011 – Jan 2014 Database Developer
Responsibilities:
Designed, implemented, and maintained relational databases to support web and business applications.
Worked with developers and system administrators to ensure reliable and efficient data storage and access.
Modified database structures and optimized queries to improve performance and reliability.
Created and maintained documentation, including technical specifications and user guides.
Developed and enforced database best practices, including indexing, backup strategies, and normalization.
Supported development of KPIs and performance metrics to monitor data and system health.
Responded to database-related issues and resolved bugs in a timely manner.
Assisted in evaluating and recommending upgrades or changes to current database technologies.
Contributed to the design and integration of back-end databases with PHP-based websites and APIs.
Participated in Agile product development cycles and cross-functional team collaboration.
Environment: PHP, MySQL, SQL, JavaScript, HTML, CSS, AJAX, jQuery, Bootstrap, Laravel, WordPress, Joomla, Opencart, Git, Waterfall
CONTINUED RESEARCH DEVELOPMENT AND SCHOLARLY CONTRIBUTION
Conducted and led applied research on immersive virtual reality, mobile analytics, and spatial data visualization.
Authored two research papers and one academic poster based on original system development and experimental studies.
Served as corresponding author for a major conference submission and led the writing, coordination, and documentation process.
Awarded First Place Poster Award by the College of Information, UNT, for innovation in spatial data visualization.
Publications and Presentations:
Bhatt, B. R., Summitt, A., & Sharma, S. (2025). Immersive Virtual Reality Data Visualization for Urban Parking Management. Paper accepted for SERA 2025 (Software Engineering Research and Applications Conference).
Bhatt, B. R., & Sharma, S. (2025). Mobile Application for Conducting Time Series Analysis on Location-Based Spatial Data. In Data Science (pp. 333–346). Springer, Cham. https://doi.org/10.1007/978-3-031-85856-7_25.
Bhatt, B. R. (2023). Mobile App for Object Tracking and Location-Based Data for Time Series Analysis [Poster presentation]. College of Information, University of North Texas. (Awarded 1st Place Poster Award)