A Comparative Study of Five Large Language Models' Response for Liver Cancer Comprehensive Treatment.

J Hepatocell Carcinoma

Department of Liver Transplantation Center and HBP Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, People's Republic of China.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Introduction: Large language models (LLMs) are increasingly used in healthcare, yet their reliability in specialized clinical fields remains uncertain. Liver cancer, as a complex and high-burden disease, poses unique challenges for AI-based tools. This study aimed to evaluate the comprehensibility and clinical applicability of five mainstream LLMs in addressing liver cancer-related clinical questions.

Methods: We developed 90 standardized questions covering multiple aspects of liver cancer management. Five LLMs-GPT-4, Gemini, Copilot, Kimi, and Ernie Bot-were evaluated in a blinded fashion by three independent hepatobiliary experts. Responses were scored using predefined criteria for comprehensibility and clinical applicability. Overall group comparisons were conducted using the Fisher-Freeman-Halton test (for categorical data) and the Kruskal-Wallis test (for ordinal scores), followed by Dunn's post-hoc test or Fisher's exact test with Bonferroni correction. Inter-rater reliability was assessed using Fleiss' kappa.

Results: Kimi and GPT-4 achieved the highest proportions of fully applicable responses (68% and 62%, respectively), while Ernie Bot and Copilot showed the lowest. Comprehensibility was generally high, with Kimi and Ernie Bot scoring over 98%. However, none of the LLMs consistently provided guideline-concordant answers to all questions. Performance on professional-level questions was significantly lower than on common-sense ones, highlighting deficiencies in complex clinical reasoning.

Conclusion: LLMs demonstrate varied performance in liver cancer-related queries. While GPT-4 and Kimi show promise in clinical applicability, limitations in accuracy and consistency-particularly for complex medical decisions-underscore the need for domain-specific optimization before clinical integration.

Trial Registration: Not applicable.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12375359PMC
http://dx.doi.org/10.2147/JHC.S531642DOI Listing

Publication Analysis

Top Keywords

liver cancer
12
clinical applicability
12
large language
8
comprehensibility clinical
8
liver cancer-related
8
kimi ernie
8
ernie bot
8
clinical
7
liver
5
comparative study
4

Similar Publications

RAB25/GCN1 Signaling Promotes ER Stress to Mediate Alcohol-associated Liver Disease Progression.

Clin Mol Hepatol

September 2025

Department of Endoscopy, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Guangzhou, China.

Background/aims: Endoplasmic reticulum (ER) stress in hepatocytes plays a causative role in alcohol-associated liver disease (ALD). The incomplete inhibition of ER stress by targeting canonical ER stress sensor proteins suggests the existence of noncanonical ER stress pathways in ALD pathology. This study aimed to delineate the role of RAB25 in ALD and its regulatory mechanism in noncanonical ER stress pathways.

View Article and Find Full Text PDF

Adiponectin as a Predictor of Metabolic Dysfunction-Associated Steatotic Liver Disease and Non-Alcoholic Fatty Liver Disease: A 17-Year Korean Cohort Study.

Diabetes Metab J

September 2025

Department of Epidemiology and Health Promotion, Institute for Health Promotion, Graduate School of Public Health, Yonsei University, Seoul, Korea.

Background: This study aimed to investigate the association between adiponectin levels and the incidence of metabolic dysfunction- associated steatotic liver disease (MASLD) and nonalcoholic fatty liver disease (NAFLD), and to explore the predictive value of adiponectin in the onset of these conditions.

Methods: A 17-year follow-up of 35,026 individuals from the Korean Cancer Prevention Study-II biobank cohort (2004-2021) was conducted. Adiponectin levels were categorized into quintiles.

View Article and Find Full Text PDF

Pyroptosis is a lytic and pro-inflammatory regulated cell death pathway mediated by pores formed by the oligomerization of gasdermin proteins on cellular membranes. Different pro-inflammatory molecules such as interleukin-18 are released from these pores, promoting inflammation. Pyroptotic cell death has been implicated in many pathological conditions, including cancer and liver diseases.

View Article and Find Full Text PDF

Peroxisome proliferator-activated receptor γ (PPARγ) is a nuclear receptor abundantly expressed in the fatty liver of type 2 diabetic ob/ob mice. Herein, we investigated how PPARγ regulates the expression of the interferon alpha-inducible protein 27-like 2b (lfi27l2b) gene in the mouse liver. High expression of lfi27l2b was observed in the fatty liver of ob/ob mice, and the expression was further upregulated by PPARγ ligands; however, liver-specific Pparg knockout ameliorated this increase.

View Article and Find Full Text PDF

Lipidomic Profiling in Cancer: Phospholipid Alterations and their Role in Tumor Progression.

Curr Cancer Drug Targets

September 2025

Department of Biotechnology, Institute of Applied Sciences &Humanities, GLA University, 17km Stone, NH-19, Mathura, Delhi Road, P.O. Chaumuhan, Mathura, 281 406, U.P. India.

Phospholipids play a crucial role in various aspects of cancer biology, including tumor progression, metastasis, and cell survival. Recent studies have highlighted the signifi-cance of phospholipid metabolism and signaling in multiple cancer types, such as breast, cer-vical, prostate, bladder, colorectal, liver, lung, melanoma, mesothelioma, and oral cancer. Al-terations in phospholipid profiles, particularly in phosphatidylcholine and phosphatidylethan-olamine, have been identified as potential biomarkers for cancer diagnosis and prognosis.

View Article and Find Full Text PDF