MedKA: A knowledge graph-augmented approach to improve factuality in medical Large Language Models.

J Biomed Inform

Department of Artificial Intelligence, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China; Jiangsu Key Laboratory of Intelligent Medical Image Computing, Nanjing, Jiangsu, China. Electronic address:

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Large language models (LLMs) have demonstrated remarkable potential in medical applications. However, they still face critical challenges such as hallucinations, knowledge inconsistency, and insufficient integration of domain-specific medical expertise. To address these limitations, we introduce MedKA, a novel knowledge graph-augmented approach for fine-tuning and evaluating medical LLMs. Our approach systematically transforms structured knowledge from a medical knowledge graph into a high-quality QA corpus, cMKGQA, by clustering multiple fields around clinically meaningful scenarios (e.g., diagnosis, treatment planning). This grouping strategy enables comprehensive and use-case-specific data construction and supports one-stage training of the LLM, ensuring better alignment with structured medical knowledge. This transformation process ensures the comprehensive integration of domain-specific knowledge while maintaining factual consistency. To evaluate the factuality of LLM-generated response, we further propose the Knowledge Graph-based Auxiliary Evaluation Metrics (KG-AEMs)-a novel benchmarking framework that compares LLM outputs with fine-grained, attribute-level ground truth from knowledge graph. Experimental results demonstrate that MedKA achieves state-of-the-art performance, significantly outperforming existing models, including LLaMA-3.1-8B-Chinese-Chat, HuatuoGPT2-7B, and Apollo2-7B. On the cMKGQA dataset, MedKA achieves 44.63 BLEU-1 and 17.62 BLEU-4 scores, with particularly strong performance in areas such as medication recommendations and diagnostic tests as measured by KG-AEMs. Our approach highlights the potential of integrating knowledge graphs into LLM fine-tuning to improve the accuracy and reliability of medical AI systems. It advances factual accuracy in medical dialogue systems and provides a comprehensive framework for evaluating the integration of medical knowledge into LLMs. This work is publicly available on Github: https://github.com/Yai017/MedKA.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jbi.2025.104871DOI Listing

Publication Analysis

Top Keywords

medical knowledge
12
knowledge
10
medical
9
knowledge graph-augmented
8
graph-augmented approach
8
large language
8
language models
8
integration domain-specific
8
knowledge graph
8
medka achieves
8

Similar Publications

Principles of Industry-Academic Partnerships Informed by Digital Mental Health Collaboration: Mixed Methods Study.

JMIR Ment Health

September 2025

National Institute of Health and Care Research MindTech HealthTech Research Centre, School of Medicine, University of Nottingham, Nottingham, United Kingdom.

Background: Cross-sector collaboration is increasingly recognized as essential for addressing complex health challenges, including those in mental health. Industry-academic partnerships play a vital role in advancing research and developing health solutions, yet differing priorities and perspectives can make collaboration complex.

Objective: This study aimed to identify key principles to support effective industry-academic partnerships, from the perspective of industry partners, and develop this into actionable guidance, which can be applied across sectors.

View Article and Find Full Text PDF

Background: In Canada, the Indigenous population is the youngest and fastest growing, yet ongoing health disparities for Indigenous peoples are widely recognized. There is a concerning lack of research on childhood disabilities and health conditions in Indigenous populations in Canada. For children with disabilities and chronic health conditions, ongoing access to rehabilitation services, such as occupational therapy, physical therapy, speech-language pathology, and audiology, is critical in promoting positive health and developmental outcomes.

View Article and Find Full Text PDF

Objective: To analyze the hotspots and frontiers in the field of subarachnoid hemorrhage using the bibliometrics method and providing references for academic research.

Methods: All published studies related to subarachnoid hemorrhage published in the Web of Science core database from 1 January 2016 to 25 September 2021 were retrospectively identified using VOSviewer and CiteSpace software. Visualization VOSviewer and CiteSpace software were used to perform statistical and cluster analyses on authors, countries, institutions, keywords, and co-cited documents.

View Article and Find Full Text PDF

Germline Findings From Tumor-Only Comprehensive Genomic Profiling in the RATIONAL Study: A Missed Opportunity?

JCO Precis Oncol

September 2025

Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Napoli, Italy.

Purpose: Tumor comprehensive genomic profiling (CGP) may detect potential germline pathogenic/likely pathogenic (P/LP) alterations as secondary findings. We analyzed the frequency of potentially germline variants and large rearrangements (LRs) in the RATIONAL study, an Italian multicenter, observational clinical trial that collects next-generation sequencing-based tumor profiling data, and evaluated how these findings were managed by the enrolling centers.

Patients And Methods: Patients prospectively enrolled in the pathway-B of the RATIONAL study and undergoing CGP with the FoundationOne CDx assays were included in the analysis.

View Article and Find Full Text PDF