Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The development of digital cancer twins relies on the capture of high-resolution representations of individual cancer patients throughout the course of their treatment. Our research aims to improve the detection of metastatic disease over time from structured radiology reports by exposing prediction models to historical information. We demonstrate that Natural language processing (NLP) can generate better weak labels for semi-supervised classification of computed tomography (CT) reports when it is exposed to consecutive reports through a patient's treatment history. Around 714,454 structured radiology reports from Memorial Sloan Kettering Cancer Center adhering to a standardized departmental structured template were used for model development with a subset of the reports included for validation. To develop the models, a subset of the reports was curated for ground-truth: 7,732 total reports in the lung metastases dataset from 867 individual patients; 2,777 reports in the liver metastases dataset from 315 patients; and 4,107 reports in the adrenal metastases dataset from 404 patients. We use NLP to extract and encode important features from the structured text reports, which are then used to develop, train, and validate models. Three models-a simple convolutional neural network (CNN), a CNN augmented with an attention layer, and a recurrent neural network (RNN)-were developed to classify the type of metastatic disease and validated against the ground truth labels. The models use features from consecutive structured text radiology reports of a patient to predict the presence of metastatic disease in the reports. A single-report model, previously developed to analyze one report instead of multiple past reports, is included and the results from all four models are compared based on accuracy, precision, recall, and F1-score. The best model is used to label all 714,454 reports to generate metastases maps. Our results suggest that NLP models can extract cancer progression patterns from multiple consecutive reports and predict the presence of metastatic disease in multiple organs with higher performance when compared with a single-report-based prediction. It demonstrates a promising automated approach to label large numbers of radiology reports without involving human experts in a time- and cost-effective manner and enables tracking of cancer progression over time.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8924403PMC
http://dx.doi.org/10.3389/frai.2022.826402DOI Listing

Publication Analysis

Top Keywords

radiology reports
20
reports
17
metastatic disease
16
structured radiology
12
metastases dataset
12
consecutive structured
8
consecutive reports
8
subset reports
8
reports included
8
structured text
8

Similar Publications

Importance: Multiparametric magnetic resonance imaging (MRI), with or without prostate biopsy, has become the standard of care for diagnosing clinically significant prostate cancer. Resource capacity limits widespread adoption. Biparametric MRI, which omits the gadolinium contrast sequence, is a shorter and cheaper alternative offering time-saving capacity gains for health systems globally.

View Article and Find Full Text PDF

Thirty years of SPM-BrainMap synergy: making and mining coordinate-based literature.

Cereb Cortex

August 2025

Research Imaging Institute, University of Texas Health Science Center at San Antonio, 8403 Floyd Curl Drive, San Antonio, TX 78229, United States.

Statistical Parametric Mapping (SPM) adheres to rigorous methodological standards, including: spatial normalization, inter-subject averaging, voxel-wise contrasts, and coordinate reporting. This rigor ensures that a thematically diverse literature is amenable to meta-analysis. BrainMap is a community database (www.

View Article and Find Full Text PDF

Atypical proximal tibial fractures in adolescents are rare, particularly when linked to hormonal therapy for short stature. This case series reports the clinical and imaging features of atypical proximal tibial and distal femoral physeal fractures in male adolescents undergoing combined growth hormone (GH) and aromatase inhibitor (AI) therapy for idiopathic short stature. We report three cases of skeletally immature male adolescents (ages 12-16) treated with GH and anastrozole who presented with acute leg pain following low-energy trauma during soccer.

View Article and Find Full Text PDF

The increasing complexity and volume of radiology reports present challenges for timely critical findings communication. To evaluate the performance of two out-of-the-box LLMs in detecting and classifying critical findings in radiology reports using various prompt strategies. The analysis included 252 radiology reports of varying modalities and anatomic regions extracted from the MIMIC-III database, divided into a prompt engineering tuning set of 50 reports, a holdout test set of 125 reports, and a pool of 77 remaining reports used as examples for few-shot prompting.

View Article and Find Full Text PDF