98%
921
2 minutes
20
Large language models (LLMs) have demonstrated remarkable potential in medical applications. However, they still face critical challenges such as hallucinations, knowledge inconsistency, and insufficient integration of domain-specific medical expertise. To address these limitations, we introduce MedKA, a novel knowledge graph-augmented approach for fine-tuning and evaluating medical LLMs. Our approach systematically transforms structured knowledge from a medical knowledge graph into a high-quality QA corpus, cMKGQA, by clustering multiple fields around clinically meaningful scenarios (e.g., diagnosis, treatment planning). This grouping strategy enables comprehensive and use-case-specific data construction and supports one-stage training of the LLM, ensuring better alignment with structured medical knowledge. This transformation process ensures the comprehensive integration of domain-specific knowledge while maintaining factual consistency. To evaluate the factuality of LLM-generated response, we further propose the Knowledge Graph-based Auxiliary Evaluation Metrics (KG-AEMs)-a novel benchmarking framework that compares LLM outputs with fine-grained, attribute-level ground truth from knowledge graph. Experimental results demonstrate that MedKA achieves state-of-the-art performance, significantly outperforming existing models, including LLaMA-3.1-8B-Chinese-Chat, HuatuoGPT2-7B, and Apollo2-7B. On the cMKGQA dataset, MedKA achieves 44.63 BLEU-1 and 17.62 BLEU-4 scores, with particularly strong performance in areas such as medication recommendations and diagnostic tests as measured by KG-AEMs. Our approach highlights the potential of integrating knowledge graphs into LLM fine-tuning to improve the accuracy and reliability of medical AI systems. It advances factual accuracy in medical dialogue systems and provides a comprehensive framework for evaluating the integration of medical knowledge into LLMs. This work is publicly available on Github: https://github.com/Yai017/MedKA.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.jbi.2025.104871 | DOI Listing |
JACC Heart Fail
September 2025
Université de Lorraine, Inserm, Centre d'Investigations Cliniques Plurithématique 1433, Centre Hospitalier Régional Universitaire de Nancy, Nancy, France.
JMIR Ment Health
September 2025
National Institute of Health and Care Research MindTech HealthTech Research Centre, School of Medicine, University of Nottingham, Nottingham, United Kingdom.
Background: Cross-sector collaboration is increasingly recognized as essential for addressing complex health challenges, including those in mental health. Industry-academic partnerships play a vital role in advancing research and developing health solutions, yet differing priorities and perspectives can make collaboration complex.
Objective: This study aimed to identify key principles to support effective industry-academic partnerships, from the perspective of industry partners, and develop this into actionable guidance, which can be applied across sectors.
JMIR Res Protoc
September 2025
School of Rehabilitation Science, University of Saskatchewan, Saskatoon, SK, Canada.
Background: In Canada, the Indigenous population is the youngest and fastest growing, yet ongoing health disparities for Indigenous peoples are widely recognized. There is a concerning lack of research on childhood disabilities and health conditions in Indigenous populations in Canada. For children with disabilities and chronic health conditions, ongoing access to rehabilitation services, such as occupational therapy, physical therapy, speech-language pathology, and audiology, is critical in promoting positive health and developmental outcomes.
View Article and Find Full Text PDFNeuro Endocrinol Lett
September 2025
Department of Neurosurgery, PLA 960th Hospital, Jinan, Shandong, 250031, China.
Objective: To analyze the hotspots and frontiers in the field of subarachnoid hemorrhage using the bibliometrics method and providing references for academic research.
Methods: All published studies related to subarachnoid hemorrhage published in the Web of Science core database from 1 January 2016 to 25 September 2021 were retrospectively identified using VOSviewer and CiteSpace software. Visualization VOSviewer and CiteSpace software were used to perform statistical and cluster analyses on authors, countries, institutions, keywords, and co-cited documents.
JCO Precis Oncol
September 2025
Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Napoli, Italy.
Purpose: Tumor comprehensive genomic profiling (CGP) may detect potential germline pathogenic/likely pathogenic (P/LP) alterations as secondary findings. We analyzed the frequency of potentially germline variants and large rearrangements (LRs) in the RATIONAL study, an Italian multicenter, observational clinical trial that collects next-generation sequencing-based tumor profiling data, and evaluated how these findings were managed by the enrolling centers.
Patients And Methods: Patients prospectively enrolled in the pathway-B of the RATIONAL study and undergoing CGP with the FoundationOne CDx assays were included in the analysis.