Med-MGF: multi-level graph-based framework for handling medical data imbalance and representation.

BMC Med Inform Decis Mak

SingHealth-Duke NUS Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore, 169857, Singapore.

Published: September 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Modeling patient data, particularly electronic health records (EHR), is one of the major focuses of machine learning studies in healthcare, as these records provide clinicians with valuable information that can potentially assist them in disease diagnosis and decision-making.

Methods: In this study, we present a multi-level graph-based framework called MedMGF, which models both patient medical profiles extracted from EHR data and their relationship network of health profiles in a single architecture. The medical profiles consist of several layers of data embedding derived from interval records obtained during hospitalization, and the patient-patient network is created by measuring the similarities between these profiles. We also propose a modification to the Focal Loss (FL) function to improve classification performance in imbalanced datasets without the need to imputate the data. MedMGF's performance was evaluated against several Graphical Convolutional Network (GCN) baseline models implemented with Binary Cross Entropy (BCE), FL, class balancing parameter , and Synthetic Minority Oversampling Technique (SMOTE).

Results: Our proposed framework achieved high classification performance (AUC: 0.8098, ACC: 0.7503, SEN: 0.8750, SPE: 0.7445, NPV: 0.9923, PPV: 0.1367) on an extreme imbalanced pediatric sepsis dataset (n=3,014, imbalance ratio of 0.047). It yielded a classification improvement of 3.81% for AUC, 15% for SEN compared to the baseline GCN+ FL (AUC: 0.7717, ACC: 0.8144, SEN: 0.7250, SPE: 0.8185, PPV: 0.1559, NPV: 0.9847), and an improvement of 5.88% in AUC and 22.5% compared to GCN+FL+SMOTE (AUC: 0.7510, ACC: 0.8431, SEN: 0.6500, SPE: 0.8520, PPV: 0.1688, NPV: 0.9814). It also showed a classification improvement of 3.86% for AUC, 15% for SEN compared to the baseline GCN+ BCE (AUC: 0.7712, ACC: 0.8133, SEN: 0.7250, SPE: 0.8173, PPV: 0.1551, NPV: 0.9847), and an improvement of 14.33% in AUC and 27.5% in comparison to GCN+BCE+SMOTE (AUC: 0.6665, ACC: 0.7271, SEN: 0.6000, SPE: 0.7329, PPV: 0.0941, NPV: 0.9754).

Conclusion: When compared to all baseline models, MedMGF achieved the highest SEN and AUC results, demonstrating the potential for several healthcare applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11367907PMC
http://dx.doi.org/10.1186/s12911-024-02649-2DOI Listing

Publication Analysis

Top Keywords

compared baseline
12
auc
10
multi-level graph-based
8
graph-based framework
8
medical profiles
8
classification performance
8
baseline models
8
sen
8
classification improvement
8
auc 15%
8

Similar Publications

BackgroundTherapeutic plasma exchange (TPE) with albumin replacement has emerged as a potential treatment for Alzheimer's disease (AD). The AMBAR trial showed that TPE could slow cognitive and functional decline, along with changes in core and inflammatory biomarkers in cerebrospinal fluid.ObjectiveTo evaluate the safety and effectiveness of TPE in a real-world setting in Argentina.

View Article and Find Full Text PDF

Importance: Increasingly, strategies to systematically detect melanomas invoke targeted approaches, whereby those at highest risk are prioritized for skin screening. Many tools exist to predict future melanoma risk, but most have limited accuracy and are potentially biased.

Objectives: To develop an improved melanoma risk prediction tool for invasive melanoma.

View Article and Find Full Text PDF

Importance: Cannabis is the most commonly used illicit drug, with 10% to 30% of regular users developing cannabis use disorder (CUD), a condition linked to altered hippocampal integrity. Evidence suggests high-intensity interval training (HIIT) enhances hippocampal structure and function, with this form of physical exercise potentially mitigating CUD-related cognitive and mental health impairments.

Objective: To determine the impact of a 12-week HIIT intervention on hippocampal integrity (ie, structure, connectivity, biochemistry) compared with 12 weeks of strength and resistance (SR) training in CUD.

View Article and Find Full Text PDF

Importance: This study represents a first successful use of a genetic biomarker to select potential responders in a prospective study in psychiatry. Liafensine, a triple reuptake inhibitor, may become a new precision medicine for treatment-resistant depression (TRD), a major unmet medical need.

Objective: To determine whether ANK3-positive patients with TRD benefit from a 1-mg and/or 2-mg daily oral dose of liafensine, compared with placebo, in a clinical trial.

View Article and Find Full Text PDF

Purpose: We aimed to evaluate the impact of day- and night-time pad wetness on 2yrs-QoL after Radical Cystectomy (RC) with Orthotopic Neobladder (ON) from a Randomized Controlled Trial (RCT) aimed at comparing open RC (ORC) and Robot-Assisted RC (RARC) with intracorporeal (i) ON.

Methods: Between January 2018 and September 2020, 116 patients were enrolled. Data from self-assessed questionnaires (EORTC-QLQ-C30 and QLQ-BLM30) were collected.

View Article and Find Full Text PDF