Comparative Analysis of Large Language Models for Answering Cancer-Related Questions in Korean.

Yonsei Med J

Department of Radiation Oncology, International St. Mary's Hospital, Catholic Kwandong University, Incheon, Korea.

Published: July 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Purpose: Large language models (LLMs) have shown potential in medicine, transforming patient education, clinical decision support, and medical research. However, the effectiveness of LLMs in providing accurate medical information, particularly in non-English languages, remains underexplored. This study aimed to compare the quality of responses generated by ChatGPT and Naver's CLOVA X to cancer-related questions posed in Korean.

Materials And Methods: The study involved selecting cancer-related questions from the National Cancer Institute and Korean National Cancer Information Center websites. Responses were generated using ChatGPT and CLOVA X, and three oncologists assessed their quality using the Global Quality Score (GQS). The readability of the responses generated by ChatGPT and CLOVA X was calculated using KReaD, an artificial intelligence-based tool designed to objectively assess the complexity of Korean texts and reader comprehension.

Results: The Wilcoxon test for the GQS score of answers using ChatGPT and CLOVA X showed that there is no statistically significant difference in quality between the two LLMs (>0.05). The chi-square statistic for the variables "Good rating" and "Poor rating" showed no significant difference in the quality of responses between the two LLMs (>0.05). KReaD scores were higher for CLOVA X than for ChatGPT (=0.036). The categorical data analysis for the variables "Easy to read" and "Hard to read" revealed no significant difference (>0.05).

Conclusion: Both ChatGPT and CLOVA X answered Korean-language cancer-related questions with no significant difference in overall quality.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12206594PMC
http://dx.doi.org/10.3349/ymj.2024.0200DOI Listing

Publication Analysis

Top Keywords

cancer-related questions
16
chatgpt clova
16
responses generated
12
generated chatgpt
12
difference quality
12
large language
8
language models
8
quality responses
8
national cancer
8
llms >005
8

Similar Publications

Cancer caregiver information needs: communication experiences with health providers - a qualitative study.

BMJ Support Palliat Care

August 2025

Centre for Quality and Patient Safety, Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia.

Purpose: We sought to explore the lived experience of caregivers when communicating with doctors and nurses about their cancer-related information needs.

Methods: Semistructured interviews (n=20) were conducted with cancer caregivers. Data were collected as part of a broader study on caregiver health literacy needs.

View Article and Find Full Text PDF

The role of immunotherapy in resectable non-small-cell lung cancer.

Ther Adv Med Oncol

August 2025

Trinity St. James's Cancer Institute, Trinity College Dublin, Dublin, D08 XF38, Ireland.

Non-small-cell lung cancer (NSCLC) accounts for 80%-85% of all lung cancer cases, being the leading cause of cancer-related mortality worldwide. Historically, outcomes for patients with resectable disease have trailed those with other solid organ malignancies. Advances in treatment strategies, particularly in immunotherapy (IO), have revolutionised the landscape of lung cancer care.

View Article and Find Full Text PDF

A Comparative Study of Five Large Language Models' Response for Liver Cancer Comprehensive Treatment.

J Hepatocell Carcinoma

August 2025

Department of Liver Transplantation Center and HBP Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, People's Republic of China.

Introduction: Large language models (LLMs) are increasingly used in healthcare, yet their reliability in specialized clinical fields remains uncertain. Liver cancer, as a complex and high-burden disease, poses unique challenges for AI-based tools. This study aimed to evaluate the comprehensibility and clinical applicability of five mainstream LLMs in addressing liver cancer-related clinical questions.

View Article and Find Full Text PDF

Genomic instability and amplification as emerging biomarkers for stratifying high-grade serous ovarian cancer.

Front Oncol

August 2025

Department of Diagnostic Innovation, Pathology Unit 2, Fondazione IRCCS Istituto Nazionale Dei Tumori, Milano, Italy.

Introduction: Ovarian cancer (OC) is one of the leading causes of cancer-related death in women worldwide. Treatment with PARP-inhibitors has significantly improved survival in patients with high-grade serous cancer (HGSC) bearing mutations (~22% of the cases), and/or homologous recombination deficiency (HRD, ~50%). Unfortunately, limited therapeutic alternatives are available for wild type/HR proficient HGSC patients, who usually exhibit resistance to standard treatments and poor prognosis.

View Article and Find Full Text PDF

Bone marrow adipose tissue (BMAT) is physiologically linked to bone and energy metabolism, endocrine regulation, hematopoiesis and cancer-related processes. A key challenge in the field is that methods for isolating BMAT or bone marrow adipocytes (BMAds) are variable because there are no widely adopted standardized protocols. To generate awareness of this challenge and to establish uniformity in experimental approaches requiring isolation, storage and characterization of BMAT and BMAds, the Biobanking Working Group of the international Bone Marrow Adiposity Society (BMAS) has previously recommended experimental standards.

View Article and Find Full Text PDF