Automatic analysis of negation cues and scopes for medical texts in French using language models.

S Sadoune, A Richard, F Talbot, T Guyet, L Boussel, H Berry

Comput Biol Med

Inria, Lyon Research Center, F-69603, Villeurbanne, France; AIstroSight, Inria, Université Claude Bernard Lyon 1, Hospices Civils de Lyon, Villeurbanne, F-69603, France.

Published: August 2025

Objective: Correct automatic analysis of a medical report requires identifying negations and their scopes. Since most of the available training data come from medical texts in English, additional work is usually needed to apply existing methods to other languages. Here, we introduce a supervised learning method that automatically identifies negation cues and determines their scopes in French medical reports using language models based on BERT.

Methods: Using a new private corpus of French-language chest CT scan reports with consistent annotation, we first fine-tuned five available transformer models on the negation cue and scope identification task. We then extended the methodology by adapting the best-performing model to a wider range of clinical notes and reports (not limited to radiology reports) with more heterogeneous annotations. Lastly, we tested the resulting model on its initial mask-filling task to check that fine-tuning had not caused catastrophic forgetting.
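
As an illustration of how cue and scope identification can be framed as a token-level labeling task, here is a minimal Python sketch. The label names (`O`, `CUE`, `SCOPE`), the helper `label_tokens`, and the example sentence are assumptions made for this illustration; the abstract does not specify the annotation scheme or tokenization actually used.

```python
from typing import List, Tuple

# Hypothetical label scheme (not taken from the paper):
#   "CUE"   = token belongs to the negation cue
#   "SCOPE" = token is inside the scope of the negation
#   "O"     = token is outside both
def label_tokens(
    tokens: List[str],
    cue_span: Tuple[int, int],
    scope_span: Tuple[int, int],
) -> List[str]:
    """Return one label per token from half-open [start, end) index spans."""
    labels = ["O"] * len(tokens)
    for i in range(*scope_span):
        labels[i] = "SCOPE"
    for i in range(*cue_span):
        labels[i] = "CUE"  # the cue label wins if the two spans overlap
    return labels

# Illustrative sentence: "Pas d'épanchement pleural." (no pleural effusion)
tokens = ["Pas", "d'", "épanchement", "pleural", "."]
labels = label_tokens(tokens, cue_span=(0, 1), scope_span=(1, 4))
for token, label in zip(tokens, labels):
    print(f"{token}\t{label}")
```

A model fine-tuned for token classification would be trained to predict such a label sequence from the token sequence.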

Results: On a corpus of thoracic CT scan reports annotated by four annotators within our team, our method reaches an F1-score of 99.4% for cue detection and 94.5% for scope detection, thus equaling or improving on state-of-the-art performance. On more generic biomedical reports, annotated with more heterogeneous rules, the quality of the automatic analysis decreases, but our best-in-class model still delivers very good performance, with F1-scores of 98.2% (cue detection) and 90.9% (scope detection). Moreover, we show that fine-tuning the original model for the negation identification task preserves or even improves its performance on its initial fill-mask task, depending on the lemmatization.
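
For readers unfamiliar with the metric, the following Python sketch shows how an F1-score for one label can be computed from gold and predicted token labels. It assumes token-level evaluation and the hypothetical `CUE`/`SCOPE`/`O` labels; the abstract does not state whether the reported scores are computed per token or per span, and the toy data below are not from the paper.

```python
from typing import Sequence

def f1_score(gold: Sequence[str], pred: Sequence[str], positive: str) -> float:
    """Token-level F1 for one label: harmonic mean of precision and recall."""
    tp = sum(g == positive and p == positive for g, p in zip(gold, pred))
    fp = sum(g != positive and p == positive for g, p in zip(gold, pred))
    fn = sum(g == positive and p != positive for g, p in zip(gold, pred))
    if tp == 0:
        return 0.0  # no correct positive prediction: F1 is defined as 0 here
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Toy example: the prediction misses one of the three scope tokens.
gold = ["CUE", "SCOPE", "SCOPE", "SCOPE", "O"]
pred = ["CUE", "SCOPE", "SCOPE", "O", "O"]

cue_f1 = f1_score(gold, pred, "CUE")      # precision 1, recall 1
scope_f1 = f1_score(gold, pred, "SCOPE")  # precision 1, recall 2/3
print(f"cue F1 = {cue_f1:.3f}, scope F1 = {scope_f1:.3f}")
```

In this toy case the scope prediction has precision 1 and recall 2/3, giving F1 = 2 · (1 · 2/3) / (1 + 2/3) = 0.8.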

Conclusion: Considering the performance of our fine-tuned model for the detection of negation cues and scopes in medical reports in French and its robustness with respect to the diversity of the annotation rules and the type of biomedical data, we conclude that it is suited for use in a real-life clinical context.

DOI: http://dx.doi.org/10.1016/j.compbiomed.2025.110795

Similar Publications

Leveraging GPT-4o for Automated Extraction and Categorization of CAD-RADS Features From Free-Text Coronary CT Angiography Reports: Diagnostic Study.

JMIR Med Inform

September 2025

Departments of Radiology, The Third Affiliated Hospital, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, Guangdong, 510630, China, 86 18922109279, 86 20852523108.

Youmei Chen, Mengshi Dong, Jie Sun, Zhanao Meng, Yiqing Yang

Background: Despite the Coronary Artery Reporting and Data System (CAD-RADS) providing a standardized approach, radiologists continue to favor free-text reports. This preference creates significant challenges for data extraction and analysis in longitudinal studies, potentially limiting large-scale research and quality assessment initiatives.

Objective: To evaluate the ability of the generative pre-trained transformer (GPT)-4o model to convert real-world coronary computed tomography angiography (CCTA) free-text reports into structured data and automatically identify CAD-RADS categories and P categories.

Self-activating air filtration device for aerosol control during luminal instruments drying in the sterile processing department.

PLoS One

September 2025

Sterile Processing Department, Sichuan GEM Flower Hospital, North Sichuan Medical College, Chengdu, China.

Wei Zheng, Ying He, Ping Gui, Xiaoxue Sun

Background: Luminal instruments are characterized by their slender internal lumens, which make them particularly challenging to clean and dry. A common drying method used by Sterile Processing Department (SPD) technicians involves blowing high-pressure air into one end of the lumen to expel moisture. However, this process generates a significant amount of aerosols that may contain bacteria, viruses, and other microorganisms.

YOLOv11-WBD: A wavelet-bidirectional network with dilated perception for robust metal surface defect detection.

PLoS One

September 2025

Department of Smart Manufacturing, Industrial Perception and Intelligent Manufacturing Equipment Engineering Research Center of Jiangsu Province, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, China.

Li Guan, Haitao Zhang, Yijun Zhou, Xinyu Du, Mingxuan Li

In the field of quality control, metal surface defect detection is an important yet challenging task. Although YOLO models perform well in most object detection scenarios, metal surface images under operational conditions often exhibit coexisting high-frequency noise components and spectral aliasing background textures, and defect targets typically exhibit characteristics such as small scale, weak contrast, and multi-class coexistence, posing challenges for automatic defect detection systems. To address this, we introduce concepts including wavelet decomposition, cross-attention, and U-shaped dilated convolution into the YOLO framework, proposing the YOLOv11-WBD model to enhance feature representation capability and semantic mining effectiveness.

Automatic brain segmentation in cognitive impairment: Validation of AI-based AQUA software in the Southeast Asian BIOCIS cohort.

Ann Acad Med Singap

August 2025

Dementia Research Centre (Singapore), Lee Kong Chian School of Medicine, Nanyang Technology University, Singapore.

Ashwati Vipin, Rasyiqah Binte Shaik Mohamed Salim, Regina Ey Kim, Minho Lee, Hye Weon Kim

Introduction: Interpretation and analysis of magnetic resonance imaging (MRI) scans in clinical settings comprise time-consuming visual ratings and complex neuroimage processing that require trained professionals. To combat these challenges, artificial intelligence (AI) techniques can aid clinicians in interpreting brain MRI for accurate diagnosis of neurodegenerative diseases but they require extensive validation. Thus, the aim of this study was to validate the use of AI-based AQUA (Neurophet Inc.

Compositionality in the semantic network: a model-driven representational similarity analysis.

Cereb Cortex

August 2025

Department of Psychology, University of Milano-Bicocca, Milan, Italy.

Marco Ciapparelli, Marco Marelli, William Graves, Carlo Reverberi

Semantic composition allows us to construct complex meanings (e.g., "dog house", "house dog") from simpler constituents ("dog", "house").
