Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Aim: This study aimed to develop and evaluate an automated large language model (LLM)-based system for assessing the quality of medical imaging guidelines and consensus (GACS) in different languages, focusing on enhancing evaluation efficiency, consistency, and reducing manual workload.

Method: We developed the QPC-HASE-GuidelineEval algorithm, which integrates a Four-Quadrant Questions Classification Strategy and Hybrid Search Enhancement. The model was validated on 45 medical imaging guidelines (36 in Chinese and 9 in English) published in 2021 and 2022. Key evaluation metrics included consistency with expert assessments, hybrid search paragraph matching accuracy, information completeness, comparisons of different paragraph matching approaches, and cost-time efficiency.

Results: The algorithm demonstrated an average accuracy of 77%, excelling in simpler tasks but showing lower accuracy (29%-40%) in complex evaluations, such as explanations and visual aids. The average accuracy rates of the English and Chinese versions of the GACS were 74% and 76%, respectively (p = 0.37). Hybrid search demonstrated superior performance with paragraph matching accuracy (4.42) and information completeness (4.42), significantly outperforming keyword-based search (1.05/1.05) and sparse-dense retrieval (4.26/3.63). The algorithm significantly reduced evaluation time to 8 min and 30 s per guideline and reduced costs to approximately 0.5 USD per guideline, offering a considerable advantage over traditional manual methods.

Conclusion: The QPC-HASE-GuidelineEval algorithm, powered by LLMs, showed strong potential for improving the efficiency, scalability, and multi-language capability of guideline evaluations, though further enhancements are needed to handle more complex tasks that require deeper interpretation.

Download full-text PDF

Source
http://dx.doi.org/10.1111/jebm.70020DOI Listing

Publication Analysis

Top Keywords

medical imaging
12
imaging guidelines
12
hybrid search
12
paragraph matching
12
large language
8
language model
8
guidelines consensus
8
qpc-hase-guidelineeval algorithm
8
matching accuracy
8
average accuracy
8

Similar Publications

Clinicopathological features of dermal clear cell sarcoma: A series of 13 cases.

Pathol Res Pract

September 2025

Department of Pathology, Xijing Hospital and School of Basic Medicine, Fourth Military Medical University, Xi'an, China. Electronic address:

Background: Dermal clear cell sarcoma (DCCS) is a rare malignant mesenchymal neoplasm. Owing to the overlaps in its morphological and immunophenotypic profiles with a broad spectrum of tumors exhibiting melanocytic differentiation, it is frequently misdiagnosed as other tumor entities in clinical practice. By systematically analyzing the clinicopathological characteristics, immunophenotypic features, and molecular biological properties of DCCS, this study intends to further enhance pathologists' understanding of this disease and provide a valuable reference for its accurate diagnosis.

View Article and Find Full Text PDF

Leveraging GPT-4o for Automated Extraction and Categorization of CAD-RADS Features From Free-Text Coronary CT Angiography Reports: Diagnostic Study.

JMIR Med Inform

September 2025

Departments of Radiology, The Third Affiliated Hospital, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, Guangdong, 510630, China, 86 18922109279, 86 20852523108.

Background: Despite the Coronary Artery Reporting and Data System (CAD-RADS) providing a standardized approach, radiologists continue to favor free-text reports. This preference creates significant challenges for data extraction and analysis in longitudinal studies, potentially limiting large-scale research and quality assessment initiatives.

Objective: To evaluate the ability of the generative pre-trained transformer (GPT)-4o model to convert real-world coronary computed tomography angiography (CCTA) free-text reports into structured data and automatically identify CAD-RADS categories and P categories.

View Article and Find Full Text PDF

Background: Circumcision is a widely practiced procedure with cultural and medical significance. However, certain penile abnormalities-such as hypospadias or webbed penis-may contraindicate the procedure and require specialized care. In low-resource settings, limited access to pediatric urologists often leads to missed or delayed diagnoses.

View Article and Find Full Text PDF

Salivary Duct Carcinoma Spread to the Internal Auditory Canal Along the Facial Nerve: A Case Report.

J Craniofac Surg

September 2025

Department of Otolaryngology-Head and Neck Surgery, Xijing Hospital, Air Force Military Medical University, Xi'an, China.

Salivary duct carcinoma (SDC) is a rare high-grade parotid malignancy prone to perineural spread. However, perineural spread of SDC has rarely been reported. The case of a 46-year-old male with SDC spread along the facial nerve (FN) is presented here.

View Article and Find Full Text PDF

Scalp masses are common scalp lesions, most of which are benign, with a small proportion being malignant. Scalp sarcomas constitute one category of malignant tumors, primarily including fibrosarcoma, liposarcoma, rhabdomyosarcoma, and leiomyosarcoma. Among these, scalp leiomyosarcoma is exceedingly rare.

View Article and Find Full Text PDF