MedKA: A knowledge graph-augmented approach to improve factuality in medical Large Language Models.

Yiyan Deng , Shen Zhao , Yongming Miao , Junjie Zhu , Jin Li

J Biomed Inform

Department of Artificial Intelligence, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China; Jiangsu Key Laboratory of Intelligent Medical Image Computing, Nanjing, Jiangsu, China. Electronic address:

Published: August 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Large language models (LLMs) have demonstrated remarkable potential in medical applications. However, they still face critical challenges such as hallucinations, knowledge inconsistency, and insufficient integration of domain-specific medical expertise. To address these limitations, we introduce MedKA, a novel knowledge graph-augmented approach for fine-tuning and evaluating medical LLMs. Our approach systematically transforms structured knowledge from a medical knowledge graph into a high-quality QA corpus, cMKGQA, by clustering multiple fields around clinically meaningful scenarios (e.g., diagnosis, treatment planning). This grouping strategy enables comprehensive and use-case-specific data construction and supports one-stage training of the LLM, ensuring better alignment with structured medical knowledge. This transformation process ensures the comprehensive integration of domain-specific knowledge while maintaining factual consistency. To evaluate the factuality of LLM-generated response, we further propose the Knowledge Graph-based Auxiliary Evaluation Metrics (KG-AEMs)-a novel benchmarking framework that compares LLM outputs with fine-grained, attribute-level ground truth from knowledge graph. Experimental results demonstrate that MedKA achieves state-of-the-art performance, significantly outperforming existing models, including LLaMA-3.1-8B-Chinese-Chat, HuatuoGPT2-7B, and Apollo2-7B. On the cMKGQA dataset, MedKA achieves 44.63 BLEU-1 and 17.62 BLEU-4 scores, with particularly strong performance in areas such as medication recommendations and diagnostic tests as measured by KG-AEMs. Our approach highlights the potential of integrating knowledge graphs into LLM fine-tuning to improve the accuracy and reliability of medical AI systems. It advances factual accuracy in medical dialogue systems and provides a comprehensive framework for evaluating the integration of medical knowledge into LLMs. This work is publicly available on Github: https://github.com/Yai017/MedKA.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.jbi.2025.104871	DOI Listing

Publication Analysis

Top Keywords

medical knowledge

knowledge

medical

knowledge graph-augmented

graph-augmented approach

large language

language models

integration domain-specific

knowledge graph

medka achieves

Similar Publications

Transendocardial Injection of Expanded Autologous CD34+ Cells After Myocardial Infarction: Results of the EXCELLENT Trial.

JACC Heart Fail

September 2025

Université de Lorraine, Inserm, Centre d'Investigations Cliniques Plurithématique 1433, Centre Hospitalier Régional Universitaire de Nancy, Nancy, France.

Jerome Roncalli , François Roubille , Yves Cottin , Lionel Leroux , Anthony Mathur

View Article and Find Full Text PDF

Similar Publications

Principles of Industry-Academic Partnerships Informed by Digital Mental Health Collaboration: Mixed Methods Study.

JMIR Ment Health

September 2025

National Institute of Health and Care Research MindTech HealthTech Research Centre, School of Medicine, University of Nottingham, Nottingham, United Kingdom.

Sophie S Hall , Olivia Hastings , Kelly Marie Prentice , Beverley Brown , Jacob Andrews

Background: Cross-sector collaboration is increasingly recognized as essential for addressing complex health challenges, including those in mental health. Industry-academic partnerships play a vital role in advancing research and developing health solutions, yet differing priorities and perspectives can make collaboration complex.

Objective: This study aimed to identify key principles to support effective industry-academic partnerships, from the perspective of industry partners, and develop this into actionable guidance, which can be applied across sectors.

View Article and Find Full Text PDF

Similar Publications

Developing an Interprofessional Pediatric Rehabilitation Model of Care in Northern Cree First Nation Communities: Protocol for a Needs Assessment and Codeveloped Intervention With a Qualitative and Participatory Action Approach.

JMIR Res Protoc

September 2025

School of Rehabilitation Science, University of Saskatchewan, Saskatoon, SK, Canada.

Katie Crockett , Hailey Dunn , Rosalie Dostie , Karin Diedrich-Closson , Laureen McIntyre

Background: In Canada, the Indigenous population is the youngest and fastest growing, yet ongoing health disparities for Indigenous peoples are widely recognized. There is a concerning lack of research on childhood disabilities and health conditions in Indigenous populations in Canada. For children with disabilities and chronic health conditions, ongoing access to rehabilitation services, such as occupational therapy, physical therapy, speech-language pathology, and audiology, is critical in promoting positive health and developmental outcomes.

View Article and Find Full Text PDF

Similar Publications

Visual analysis of subarachnoid hemorrhage research hotspots based on CiteSpace and VOSviewer: A retrospective study.

Neuro Endocrinol Lett

September 2025

Department of Neurosurgery, PLA 960th Hospital, Jinan, Shandong, 250031, China.

Zhenqiang Hao , Peigang Lu , Bo Li

Objective: To analyze the hotspots and frontiers in the field of subarachnoid hemorrhage using the bibliometrics method and providing references for academic research.

Methods: All published studies related to subarachnoid hemorrhage published in the Web of Science core database from 1 January 2016 to 25 September 2021 were retrospectively identified using VOSviewer and CiteSpace software. Visualization VOSviewer and CiteSpace software were used to perform statistical and cluster analyses on authors, countries, institutions, keywords, and co-cited documents.

View Article and Find Full Text PDF

Similar Publications

Germline Findings From Tumor-Only Comprehensive Genomic Profiling in the RATIONAL Study: A Missed Opportunity?

JCO Precis Oncol

September 2025

Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Napoli, Italy.

Riziero Esposito Abate , Alessandro Morabito , Michele Milella , Fabrizio Tabbò , Valentina Guarneri

Purpose: Tumor comprehensive genomic profiling (CGP) may detect potential germline pathogenic/likely pathogenic (P/LP) alterations as secondary findings. We analyzed the frequency of potentially germline variants and large rearrangements (LRs) in the RATIONAL study, an Italian multicenter, observational clinical trial that collects next-generation sequencing-based tumor profiling data, and evaluated how these findings were managed by the enrolling centers.

Patients And Methods: Patients prospectively enrolled in the pathway-B of the RATIONAL study and undergoing CGP with the FoundationOne CDx assays were included in the analysis.

View Article and Find Full Text PDF

Similar Publications