Article Abstract

Objective: To develop an electronic medical record (EMR) data processing tool that confers clinical context to machine learning (ML) algorithms for error handling, bias mitigation, and interpretability.

Materials and Methods: We present Trust-MAPS, an algorithm that translates clinical domain knowledge into high-dimensional, mixed-integer programming models that capture physiological and biological constraints on clinical measurements. EMR data are projected onto this constrained space, which brings outliers into a physiologically feasible range. We then compute the distance of each data point from the constrained space that models healthy physiology, to quantify deviation from the norm. These distances, termed "trust-scores," are integrated into the feature space for downstream ML applications. We demonstrate the utility of Trust-MAPS by training a binary classifier for early sepsis prediction on data from the 2019 PhysioNet Computing in Cardiology Challenge, using the XGBoost algorithm and applying SMOTE to address class imbalance.
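The projection and trust-score steps can be illustrated with a deliberately simplified sketch. It assumes independent box constraints per vital sign, so the projection reduces to clipping; the mixed-integer models and cross-vital dependencies described in the abstract are not reproduced here. The variable names and numeric ranges are hypothetical placeholders, not values from the paper.

```python
import math

# Hypothetical ranges for illustration only (not taken from the paper).
# FEASIBLE: values outside these bounds are treated as recording errors.
# HEALTHY: values inside these bounds are treated as normal physiology.
FEASIBLE = {"heart_rate": (20.0, 300.0), "temp_c": (25.0, 45.0), "sbp": (30.0, 300.0)}
HEALTHY = {"heart_rate": (60.0, 100.0), "temp_c": (36.0, 38.0), "sbp": (90.0, 120.0)}


def project_onto_box(record, bounds):
    """Euclidean projection onto a box: clip each coordinate to its interval."""
    return {k: min(max(v, bounds[k][0]), bounds[k][1]) for k, v in record.items()}


def trust_score(record, bounds):
    """Euclidean distance between a record and its projection onto the box."""
    projected = project_onto_box(record, bounds)
    return math.sqrt(sum((record[k] - projected[k]) ** 2 for k in record))


def preprocess(record):
    """Clean a record, then append its distance from the healthy region."""
    cleaned = project_onto_box(record, FEASIBLE)  # error handling step
    features = dict(cleaned)
    features["trust_score"] = trust_score(cleaned, HEALTHY)  # deviation from norm
    return features


# A heart rate of 1200 is treated as an entry error and pulled back to 300.
raw = {"heart_rate": 1200.0, "temp_c": 37.0, "sbp": 85.0}
features = preprocess(raw)
print(features)
```

In this sketch the vitals are compared on their raw scales, so heart rate dominates the distance; a practical version would likely normalize each variable first, and the constraint sets would be richer than boxes.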

Results: The Trust-MAPS framework shows desirable behavior in handling potential errors and boosting predictive performance. We achieve an area under the receiver operating characteristic curve of 0.91 (95% CI, 0.89-0.92) for predicting sepsis 6 hours before onset, a marked 15% improvement over a baseline model trained without Trust-MAPS.
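For readers less familiar with the reported metric, the area under the receiver operating characteristic curve equals the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The following is a minimal sketch of that pairwise definition; the labels and scores are made-up toy values, not data from the study.

```python
def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Toy example: 3 of the 4 positive/negative pairs are ordered correctly.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = auroc(toy_labels, toy_scores)
print(toy_auc)  # 0.75
```

The quadratic loop is chosen for clarity; rank-based implementations compute the same quantity faster on large datasets.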

Discussion: Downstream classification performance improves after Trust-MAPS preprocessing, highlighting the bias-reducing capabilities of the error-handling projections. Trust-scores emerge as clinically meaningful features that not only boost predictive performance for clinical decision support tasks but also lend interpretability to ML models.

Conclusion: This work is the first to translate clinical domain knowledge into mathematical constraints, model cross-vital dependencies, and identify aberrations in high-dimensional medical data. Our method allows for error handling in EMR and confers interpretability and superior predictive power to models trained for clinical decision support.


Source
http://dx.doi.org/10.1093/jamia/ocaf058

Publication Analysis

Top Keywords

clinical decision: 12
decision support: 12
error handling: 12
machine learning: 8
emr data: 8
clinical domain: 8
domain knowledge: 8
constrained space: 8
predictive performance: 8
clinical: 6

Similar Publications

Introduction: The role of imaging in radiotherapy is becoming increasingly important. Verification of imaging parameters prior to treatment planning is essential for safe and effective clinical practice.

Methods: This study described the development and clinical implementation of ImageCompliance, an automated, GUI-based script designed to verify and enforce correct CT and MRI parameters during radiotherapy planning.


Background: Recent advances in high-throughput sequencing technologies have enabled the collection and sharing of a massive amount of omics data, along with its associated metadata, that is, descriptive information that contextualizes the data, including phenotypic traits and experimental design. Enhancing metadata availability is critical to ensure data reusability and reproducibility and to facilitate novel biomedical discoveries through effective data reuse. Yet, incomplete metadata accompanying public omics data may hinder reproducibility and reusability and limit secondary analyses.


Background: Gastric cancer is one of the most common cancers worldwide, with its prognosis influenced by factors such as tumor clinical stage, histological type, and the patient's overall health. Recent studies highlight the critical role of lymphatic endothelial cells (LECs) in the tumor microenvironment. Perturbations in LEC function in gastric cancer, marked by aberrant activation or damage, disrupt lymphatic fluid dynamics and impede immune cell infiltration, thereby modulating tumor progression and patient prognosis.


Background: Escherichia coli ST131 and clade H30Rx are the most prevalent extended-spectrum β-lactamase-producing E. coli (ESBL-EC) causing bacteremia and urinary tract infections globally and in Sweden. Previous studies have linked ST131-H30Rx with septic shock and mortality, as well as prolonged carriage.


Background: Current scoring systems for the severity of hypertriglyceridaemia-induced acute pancreatitis (HTG-AP) are few and lack reliability. This work screened predictive factors for HTG-SAP, then constructed and validated a visualization model of HTG-AP severity by combining relevant metabolic indexes.

Methods: Between January 2020 and December 2024, retrospective clinical information for HTG-AP inpatients from Weifang People's Hospital was examined.
