Objective: Correct automatic analysis of a medical report requires identifying negations and their scopes. Since most available training data comes from medical texts in English, applying existing methods to non-English languages usually takes additional work. Here, we introduce a supervised learning method for automatically identifying negation cues and their scopes in French medical reports using language models based on BERT.
Methods: Using a new private corpus of French-language chest CT scan reports with consistent annotation, we first fine-tuned five available transformer models on the negation cue and scope identification task. Subsequently, we extended the methodology by adapting the best-performing model to cover a wider range of clinical notes and reports (not limited to radiology reports) and more heterogeneous annotations. Lastly, we tested the resulting model on its original mask-filling task to check for catastrophic forgetting.
Results: On a corpus of thoracic CT scan reports annotated by four annotators within our team, our method reaches an F1-score of 99.4% for cue detection and 94.5% for scope detection, thus equaling or improving on state-of-the-art performance. On more generic biomedical reports, annotated with more heterogeneous rules, the quality of the automatic analysis decreases, as expected, but our best model still performs well, with F1-scores of 98.2% (cue detection) and 90.9% (scope detection). Moreover, we show that fine-tuning the original model for the negation identification task preserves or even improves its performance on its initial fill-mask task, depending on the lemmatization.
Conclusion: Given the performance of our fine-tuned model at detecting negation cues and scopes in French medical reports, and its robustness to the diversity of annotation rules and types of biomedical data, we conclude that it is suited for use in a real-life clinical context.
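As an illustration of how the reported F1-scores can be read, the following minimal sketch treats negation detection as per-token labeling and computes a token-level F1 for each label. The label names (`CUE`, `SCOPE`, `O`), the example sentence, and the token-level evaluation are assumptions made for this sketch; the abstract does not specify the labeling scheme or whether scoring was done per token or per span.

```python
from typing import Sequence


def token_f1(gold: Sequence[str], pred: Sequence[str], label: str) -> float:
    """Token-level F1 for one label (hypothetical helper, not from the paper)."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    # Count true positives, false positives, and false negatives for `label`.
    tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
    fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
    fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Illustrative French sentence: "Pas d'épanchement pleural."
# (assumed labels: CUE = negation cue, SCOPE = negated span, O = outside)
tokens = ["Pas", "d'", "épanchement", "pleural", "."]
gold = ["CUE", "SCOPE", "SCOPE", "SCOPE", "O"]
pred = ["CUE", "O", "SCOPE", "SCOPE", "O"]  # one scope token missed

cue_f1 = token_f1(gold, pred, "CUE")
scope_f1 = token_f1(gold, pred, "SCOPE")
print(f"cue F1 = {cue_f1:.3f}, scope F1 = {scope_f1:.3f}")
```

In this toy case the cue is found exactly while one scope token is missed, so the scope score is lower than the cue score, which mirrors the gap between cue and scope detection reported above.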
Source: http://dx.doi.org/10.1016/j.compbiomed.2025.110795 (DOI listing)
JMIR Med Inform
September 2025
Departments of Radiology, The Third Affiliated Hospital, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, Guangdong, 510630, China.
Background: Despite the Coronary Artery Reporting and Data System (CAD-RADS) providing a standardized approach, radiologists continue to favor free-text reports. This preference creates significant challenges for data extraction and analysis in longitudinal studies, potentially limiting large-scale research and quality assessment initiatives.
Objective: To evaluate the ability of the generative pre-trained transformer (GPT)-4o model to convert real-world coronary computed tomography angiography (CCTA) free-text reports into structured data and automatically identify CAD-RADS categories and P categories.
PLoS One
September 2025
Sterile Processing Department, Sichuan GEM Flower Hospital, North Sichuan Medical College, Chengdu, China.
Background: Luminal instruments are characterized by their slender internal lumens, which make them particularly challenging to clean and dry. A common drying method used by Sterile Processing Department (SPD) technicians involves blowing high-pressure air into one end of the lumen to expel moisture. However, this process generates a significant amount of aerosols that may contain bacteria, viruses, and other microorganisms.
PLoS One
September 2025
Department of Smart Manufacturing, Industrial Perception and Intelligent Manufacturing Equipment Engineering Research Center of Jiangsu Province, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, China.
In the field of quality control, metal surface defect detection is an important yet challenging task. Although YOLO models perform well in most object detection scenarios, metal surface images under operational conditions often exhibit coexisting high-frequency noise components and spectral aliasing background textures, and defect targets typically exhibit characteristics such as small scale, weak contrast, and multi-class coexistence, posing challenges for automatic defect detection systems. To address this, we introduce concepts including wavelet decomposition, cross-attention, and U-shaped dilated convolution into the YOLO framework, proposing the YOLOv11-WBD model to enhance feature representation capability and semantic mining effectiveness.
Ann Acad Med Singap
August 2025
Dementia Research Centre (Singapore), Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore.
Introduction: Interpretation and analysis of magnetic resonance imaging (MRI) scans in clinical settings comprise time-consuming visual ratings and complex neuroimage processing that require trained professionals. To combat these challenges, artificial intelligence (AI) techniques can aid clinicians in interpreting brain MRI for accurate diagnosis of neurodegenerative diseases but they require extensive validation. Thus, the aim of this study was to validate the use of AI-based AQUA (Neurophet Inc.
Cereb Cortex
August 2025
Department of Psychology, University of Milano-Bicocca, Milan, Italy.
Semantic composition allows us to construct complex meanings (e.g., "dog house", "house dog") from simpler constituents ("dog", "house").