2024Lowering Patient Wait Times

BASC Thesis

Built with
01 / Overview

A machine learning framework for predicting medication administration in emergency departments using the MIMIC-IV dataset, with focus on Urinary Tract Infections and Brain Trauma Injuries.

Emergency department workflows for complex conditions like UTIs and Brain Trauma Injuries present significant challenges due to variable severity and time-consuming assessment processes. This undergraduate thesis develops predictive models using patient vitals (blood pressure, heart rate), acuity scores, and clinical history to anticipate medication needs before arrival. Using the MIMIC-IV dataset from Harvard Medical School containing 299,712+ patient records from Beth Israel Deaconess Medical Center, the project implements Random Forest and Decision Tree classifiers with process mining techniques (pm4py) to identify patterns in medication administration and optimize pre-arrival treatment protocols.
02 / Process
01

Data Preprocessing & Event Log Construction

Cleaned MIMIC-IV dataset by removing blank entries and nonsensical values, standardizing column names, and ensuring consistent value representations. Filtered medications based on key ingredients and standardized names for similar medications across different doses. Formatted complex patient data into structured event logs suitable for process mining analysis.

02

Feature Engineering & Model Development

Analyzed key clinical parameters including patient vitals (blood pressure, heart rate), acuity scores, and injury severity indicators. Trained Random Forest and Decision Tree models to visualize variable interactions and predict medication administration patterns. Implemented class weighting strategies to address imbalance in medication frequency distributions.

03

Process Mining & Performance Evaluation

Utilized pm4py and Graphviz to visualize patient care pathways and medication administration workflows. Evaluated model performance against test datasets, identifying that Random Forests outperformed Decision Trees in multi-classification tasks. Documented limitations due to data encryption constraints and insufficient patient-specific information, providing recommendations for future data collection improvements.

03 / Impact
  • Developed predictive models for medication administration using Random Forests, demonstrating improved accuracy over Decision Trees for multi-classification tasks in emergency department workflows.

  • Implemented comprehensive data preprocessing pipeline including medication filtering by key ingredients, standardization of drug names, and structured event log formatting using pm4py and Graphviz for process mining visualization.

  • Identified limitations in prediction accuracy due to class imbalance and insufficient patient-specific data, providing recommendations for enhanced data collection and stratified sampling to improve model performance in clinical settings.

"

"This thesis project was a challenging and rewarding experience. It allowed me to apply my skills in simulation modeling and data analysis to a real-world problem with a significant impact on patient care. The project provided valuable insights into the complexities of healthcare systems and the importance of data-driven decision-making."