Automated Detection of Cancer-Suspicious Findings in Japanese Radiology Reports with Natural Language Processing: A Multicenter Study.

Kento Sugimoto , Shoya Wada , Shozo Konishi , Junya Sato , Katsuki Okada , Shoji Kido , Noriyuki Tomiyama , Yasushi Matsumura , Toshihiro Takeda

J Imaging Inform Med

Department of Medical Informatics, Osaka University Graduate School of Medicine, 2-2 Yamadaoka, Suita, 565-0871, Osaka, Japan.

Published: January 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Missed critical imaging findings, particularly those indicating cancer, are a common issue that can result in delays in patient follow-up and treatment. To address this, we developed a rule-based natural language processing (NLP) algorithm to detect cancer-suspicious findings from Japanese radiology reports. The dataset used consisted of chest and abdomen CT reports from six institutions. Reports from our institution were used for algorithm development and internal evaluation, while reports from the other five institutions were used for external evaluation. To create the gold standard, reports were annotated by two experienced physicians. Data were statistically analyzed using precision, recall and F1 score with 1000 bootstrap iterations. BERT was used as a baseline deep learning model, and its performance was compared with the proposed rule-based method. At the report level of detection, the overall precision, recall, and F-1 score were 0.886, 0.886, and 0.883, respectively, for the rule-based algorithm, which were higher than those of the deep learning algorithm (0.851, 0.679, and 0.733). The overall results include both internal and external validation data. For the internal validation set, the precision, recall, and F-1 score were 0.929, 0.929, and 0.927, respectively. For the external validation set, the precision, recall, and F-1 score were 0.875, 0.879, and 0.873, demonstrating generalizability. In conclusion, we show the rule-based NLP algorithm exhibited a high performance in detecting cancer-suspicious findings from multi-institutional CT reports.

Download full-text PDF	Source
http://dx.doi.org/10.1007/s10278-024-01338-w	DOI Listing

Publication Analysis

Top Keywords

precision recall

cancer-suspicious findings

recall f-1

f-1 score

findings japanese

japanese radiology

radiology reports

natural language

language processing

nlp algorithm

Similar Publications

A hybrid 1DCNN-GRU deep learning framework for classifying caprine granulosa cell fertility potential using single-cell transcriptomics.

Vet World

July 2025

Department of Veterinary Science, Faculty of Veterinary Medicine, Rajamangala University of Technology Tawan-OK, Chonburi, Thailand.

Thanida Sananmuang , Denis Puthier , Kaj Chokeshaiusaha

Background And Aim: Granulosa cells (GCs) are crucial mediators of follicular development and oocyte competence in goats, with their gene expression profiles serving as potential biomarkers of fertility. However, the lack of a standardized, quantifiable method to assess GC quality using transcriptomic data has limited the translation of such findings into reproductive applications. This study aimed to develop a hybrid deep learning model integrating one-dimensional convolutional neural networks (1DCNNs) and gated recurrent units (GRUs) to classify GCs as fertility-supporting (FS) or non-fertility-supporting (NFS) using single-cell RNA sequencing (scRNA-seq) data.

View Article and Find Full Text PDF

Similar Publications

Two step approach for detecting and segmenting the second mesiobuccal canal of maxillary first molars on cone beam computed tomography (CBCT) images via artificial intelligence.

BMC Oral Health

September 2025

Oral and Maxillofacial Radiology Department, Cairo university, Cairo, Egypt.

Sally Mansour , Enas Anter , Ali Khater Mohamed , Mushira M Dahaba , Arwa Mousa

Aim: The purpose of this study was to assess the accuracy of a customized deep learning model based on CNN and U-Net for detecting and segmenting the second mesiobuccal canal (MB2) of maxillary first molar teeth on cone beam computed tomography (CBCT) scans.

Methodology: CBCT scans of 37 patients were imported into 3D slicer software to crop and segment the canals of the mesiobuccal (MB) root of the maxillary first molar. The annotated data were divided into two groups: 80% for training and validation and 20% for testing.

View Article and Find Full Text PDF

Similar Publications

TPC-GCN: Deep learning for pulse pattern classification in traditional Chinese medicine.

Med Eng Phys

October 2025

College of Basic Medical Science, Shanxi University of Chinese Medicine, Jinzhong, 030619, Shanxi, China.

Hui Li , Yuetang Li , Zhidong Zhang , Chenyang Xue , Zhenhua Li

Pulse diagnosis holds a pivotal role in traditional Chinese medicine (TCM) diagnostics, with pulse characteristics serving as one of the critical bases for its assessment. Accurate classification of these pulse pattern is paramount for the objectification of TCM. This study proposes an enhanced SMOTE approach to achieve data augmentation, followed by multi-domain feature extraction.

View Article and Find Full Text PDF

Similar Publications

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomed Phys Eng Express

September 2025

electrical engineering department, Indian Institute of Technology Roorkee, Research wing, electrical department, Roorkee, uttrakhand, 247664, INDIA.

Meenakshi Bisla , Radhey Shyam Anand

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments.

View Article and Find Full Text PDF

Similar Publications

Enhancing fake news detection with transformer-based deep learning: A multidisciplinary approach.

PLoS One

September 2025

Department of Computer Science, COMSATS University Islamabad, Sahiwal, Pakistan.

Nabeel Raza , Said Jadid Abdulkadir , Yawar Abbas Abid , Sami S Albouq , Ayed Alwadain

The widespread dissemination of fake news presents a critical challenge to the integrity of digital information and erodes public trust. This urgent problem necessitates the development of sophisticated and reliable automated detection mechanisms. This study addresses this gap by proposing a robust fake news detection framework centred on a transformer-based architecture.

View Article and Find Full Text PDF

Similar Publications