Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Objective: To challenge clinicians and informaticians to learn about potential sources of bias in medical machine learning models through investigation of data and predictions from an open-source severity of illness score.

Methods: Over a two-day period (total elapsed time approximately 28 hours), we conducted a datathon that challenged interdisciplinary teams to investigate potential sources of bias in the Global Open Source Severity of Illness Score. Teams were invited to develop hypotheses, to use tools of their choosing to identify potential sources of bias, and to provide a final report.

Results: Five teams participated, three of which included both informaticians and clinicians. Most (4/5) used Python for analyses, the remaining team used R. Common analysis themes included relationship of the GOSSIS-1 prediction score with demographics and care related variables; relationships between demographics and outcomes; calibration and factors related to the context of care; and the impact of missingness. Representativeness of the population, differences in calibration and model performance among groups, and differences in performance across hospital settings were identified as possible sources of bias.

Discussion: Datathons are a promising approach for challenging developers and users to explore questions relating to unrecognized biases in medical machine learning algorithms.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12250157PMC
http://dx.doi.org/10.1371/journal.pdig.0000932DOI Listing

Publication Analysis

Top Keywords

medical machine
12
machine learning
12
potential sources
12
sources bias
12
biases medical
8
severity illness
8
raising awareness
4
potential
4
awareness potential
4
potential biases
4

Similar Publications

This Letter to the Editor responds to the recent publication by Patel et al. (J Robot Surg. Jul 11;19(1):370, 2025), which outlines a framework and recommendations for telesurgery.

View Article and Find Full Text PDF

Background And Objectives: Older adults living with dementia are a heterogeneous group, which can make studying optimal medication management challenging. Unsupervised machine learning is a group of computing methods that rely on unlabeled data-that is, where the algorithm itself is discovering patterns without the need for researchers to label the data with a known outcome. These methods may help us to better understand complex prescribing patterns in this population.

View Article and Find Full Text PDF

Purpose: To build computed tomography (CT)-based radiomics models, with independent external validation, to predict recurrence and disease-specific mortality in patients with colorectal liver metastases (CRLM) who underwent liver resection.

Methods: 113 patients were included in this retrospective study: the internal training cohort comprised 66 patients, while the external validation cohort comprised 47. All patients underwent a CT study before surgery.

View Article and Find Full Text PDF

 Keloid scarring and Metabolic Syndrome (MS) are distinct conditions marked by chronic inflammation and tissue dysregulation, suggesting shared pathogenic mechanisms. Identifying common regulatory genes could unveil novel therapeutic targets. Methods.

View Article and Find Full Text PDF