Article Abstract

Background The ScreenTrustCAD trial was a prospective study that evaluated the cancer detection rates for combinations of artificial intelligence (AI) computer-aided detection (CAD) and two radiologists. The results raised concerns about the tendency of radiologists to agree with AI CAD too much (when AI CAD made an erroneous flagging) or too little (when AI CAD made a correct flagging).
Purpose To evaluate differences in recall proportion and positive predictive value (PPV) related to which reader flagged the mammogram for consensus discussion: AI CAD and/or radiologists.
Materials and Methods Participants were enrolled from April 2021 to June 2022, and each examination was interpreted by three independent readers: two radiologists and AI CAD, after which positive findings were forwarded to the consensus discussion. For each combination of readers flagging an examination, the proportion recalled was calculated, and the PPV was calculated by dividing the number of pathologic evaluation-verified cancers by the number of positive (recalled) examinations.
Results The study included 54 991 women (median age, 55 years [IQR, 46-65 years]), among whom 5489 were flagged for consensus discussion and 1348 were recalled. For examinations flagged by one reader, the proportion recalled after flagging by one radiologist was larger (14.2% [263 of 1858]) than after flagging by AI CAD (4.6% [86 of 1886]) (P < .001), whereas the PPV of breast cancer was lower (3.4% [nine of 263] vs 22% [19 of 86]) (P < .001). For examinations flagged by two readers, the proportion recalled after flagging by two radiologists was larger (57.2% [360 of 629]) than after flagging by AI CAD and one radiologist (38.6% [244 of 632]) (P < .001), whereas the PPV was lower (2.5% [nine of 360] vs 25.0% [61 of 244]) (P < .001). For examinations flagged by all three readers, the proportion recalled was 82.6% (400 of 484) and the PPV was 34.2% (137 of 400).
Conclusion A larger proportion of participants were recalled after initial flagging by radiologists compared with those flagged by AI CAD, with a lower proportion of cancer. ClinicalTrials.gov Identifier: NCT04778670 © RSNA, 2025 See also the editorial by Grimm in this issue.
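
The recall proportions and PPVs reported in the Results can be reproduced from the counts given in the abstract. The following is a minimal sketch, not the authors' analysis code: the dictionary keys and function names are illustrative, and the pooled two-proportion z-test is an assumption, since the abstract does not state which statistical test produced the reported P values.

```python
from math import erfc, sqrt

# (flagged, recalled, cancers) per flagging combination, copied from the abstract.
# The labels are illustrative names, not terms defined by the study.
COUNTS = {
    "one radiologist": (1858, 263, 9),
    "AI CAD only": (1886, 86, 19),
    "two radiologists": (629, 360, 9),
    "AI CAD + one radiologist": (632, 244, 61),
    "all three readers": (484, 400, 137),
}


def recall_proportion(flagged, recalled):
    """Share of flagged examinations that were recalled after consensus."""
    return recalled / flagged


def ppv(recalled, cancers):
    """Share of recalled examinations with a pathology-verified cancer."""
    return cancers / recalled


def two_proportion_p(x1, n1, x2, n2):
    """Two-sided P value from a pooled two-proportion z-test.

    Assumption: used here only for illustration; the abstract does not
    name the test behind its reported P values.
    """
    pooled = (x1 + x2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (x1 / n1 - x2 / n2) / se
    return erfc(abs(z) / sqrt(2))


for name, (flagged, recalled, cancers) in COUNTS.items():
    print(
        f"{name}: recalled {recall_proportion(flagged, recalled):.1%}, "
        f"PPV {ppv(recalled, cancers):.1%}"
    )

# Example comparison: recall after one radiologist vs after AI CAD only.
p_value = two_proportion_p(263, 1858, 86, 1886)
print(f"recall, one radiologist vs AI CAD only: P = {p_value:.3g}")
```

Keeping the three counts per group together makes it clear that the PPV denominator is the number of recalled examinations, not the number flagged.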

Source
http://dx.doi.org/10.1148/radiol.242566

Publication Analysis

Top Keywords

proportion recalled: 16
flagging cad: 12
consensus discussion: 12
examinations flagged: 12
cad: 10
screentrustcad trial: 8
proportion: 8
recall proportion: 8
proportion positive: 8
positive predictive: 8

Similar Publications

Background: As a common postoperative neurological complication, postoperative delirium (POD) can lead to poor postoperative recovery in patients, prolonged hospitalization, and even increased mortality. However, POD's mechanism remains undefined and there are no reliable molecular markers of POD to date. The present work examined the associations of cerebrospinal fluid (CSF) sTREM2 with CSF POD biomarkers, and investigated whether the effects of CSF sTREM2 on POD were modulated by the core pathological indexes of POD (Aβ42, tau, and ptau).

The relationship between gut microbiota, diet, and cardiovascular-kidney-metabolic (CKM) health has attracted attention. However, the relationship between the dietary index for gut microbiota (DI-GM) and CKM syndrome has not yet been studied. Patients diagnosed with CKM syndrome from the NHANES 2007-2018 data were included.

Background and Aim: Granulosa cells (GCs) are crucial mediators of follicular development and oocyte competence in goats, with their gene expression profiles serving as potential biomarkers of fertility. However, the lack of a standardized, quantifiable method to assess GC quality using transcriptomic data has limited the translation of such findings into reproductive applications. This study aimed to develop a hybrid deep learning model integrating one-dimensional convolutional neural networks (1DCNNs) and gated recurrent units (GRUs) to classify GCs as fertility-supporting (FS) or non-fertility-supporting (NFS) using single-cell RNA sequencing (scRNA-seq) data.

Vegetarian and vegan diets are increasingly popular in Germany due to ethical considerations, perceived health and environmental benefits. Regionally representative data, particularly for Bavaria, remain scarce. This study updates the prevalence, demographics and eating motives of vegetarians and vegans using data from the 3rd Bavarian Food Consumption Survey (BVS III; 2021-2023), a repeated, population-based, representative study.

Background: Recent studies suggest that large language models (LLMs) such as ChatGPT are useful tools for medical students or residents when preparing for examinations. These studies, especially those conducted with multiple-choice questions, emphasize that the level of knowledge and response consistency of the LLMs are generally acceptable; however, further optimization is needed in areas such as case discussion, interpretation, and language proficiency. Therefore, this study aimed to evaluate the performance of six distinct LLMs for Turkish and English neurosurgery multiple-choice questions and assess their accuracy and consistency in a specialized medical context.
