98%
921
2 minutes
20
Importance: Artificial intelligence (AI) large language models (LLMs) demonstrate potential in simulating human-like dialogue. Their efficacy in accurate patient-clinician communication within radiation oncology has yet to be explored.
Objective: To determine an LLM's quality of responses to radiation oncology patient care questions using both domain-specific expertise and domain-agnostic metrics.
Design, Setting, And Participants: This cross-sectional study retrieved questions and answers from websites (accessed February 1 to March 20, 2023) affiliated with the National Cancer Institute and the Radiological Society of North America. These questions were used as queries for an AI LLM, ChatGPT version 3.5 (accessed February 20 to April 20, 2023), to prompt LLM-generated responses. Three radiation oncologists and 3 radiation physicists ranked the LLM-generated responses for relative factual correctness, relative completeness, and relative conciseness compared with online expert answers. Statistical analysis was performed from July to October 2023.
Main Outcomes And Measures: The LLM's responses were ranked by experts using domain-specific metrics such as relative correctness, conciseness, completeness, and potential harm compared with online expert answers on a 5-point Likert scale. Domain-agnostic metrics encompassing cosine similarity scores, readability scores, word count, lexicon, and syllable counts were computed as independent quality checks for LLM-generated responses.
Results: Of the 115 radiation oncology questions retrieved from 4 professional society websites, the LLM performed the same or better in 108 responses (94%) for relative correctness, 89 responses (77%) for completeness, and 105 responses (91%) for conciseness compared with expert answers. Only 2 LLM responses were ranked as having potential harm. The mean (SD) readability consensus score for expert answers was 10.63 (3.17) vs 13.64 (2.22) for LLM answers (P < .001), indicating 10th grade and college reading levels, respectively. The mean (SD) number of syllables was 327.35 (277.15) for expert vs 376.21 (107.89) for LLM answers (P = .07), the mean (SD) word count was 226.33 (191.92) for expert vs 246.26 (69.36) for LLM answers (P = .27), and the mean (SD) lexicon score was 200.15 (171.28) for expert vs 219.10 (61.59) for LLM answers (P = .24).
Conclusions And Relevance: In this cross-sectional study, the LLM generated accurate, comprehensive, and concise responses with minimal risk of harm, using language similar to human experts but at a higher reading level. These findings suggest the LLM's potential, with some retraining, as a valuable resource for patient queries in radiation oncology and other medical fields.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10988356 | PMC |
http://dx.doi.org/10.1001/jamanetworkopen.2024.4630 | DOI Listing |
J Clin Oncol
September 2025
Division of Hematology and Medical Oncology, Simmons Comprehensive Cancer Center, UT Southwestern Medical Center, Dallas, TX.
JCO Precis Oncol
September 2025
Shu-Ning Li, MS, Jun-Nv Xu, MD, PhD,and Nan-Nan Ji, MD, PhD, Department of Radiation Oncology, Cancer Treatment Center, The Second Affiliated Hospital of Hainan Medical University, Haikou, China, Ming Xue, MS, Department of Outpatient, The Second Affiliated Hospital of Hainan Medical University, Hai
JCO Precis Oncol
September 2025
Monica F. Chen, MD, Division of Solid Tumor Oncology, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, Daniel Gomez, MD, Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, and Helena A. Yu, MD, Division of Solid Tumor Oncology, Depart
PLoS Comput Biol
September 2025
Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, United States of America.
Biology has been transformed by the rapid development of computing and the concurrent rise of data-rich approaches such as, omics or high-resolution imaging. However, there is a persistent computational skills gap in the biomedical research workforce. Inherent limitations of classroom teaching and institutional core support highlight the need for accessible ways for researchers to explore developments in computational biology.
View Article and Find Full Text PDFAnn Acad Med Singap
August 2025
Division of Medical Oncology, National Cancer Centre Singapore, Singapore.
Introduction: Trastuzumab deruxtecan (T-DXd) has revolutionised treatment for metastatic breast cancer (MBC). While effective, its high cost and toxicities, such as fatigue and nausea, pose challenges.
Method: Medical records from the Joint Breast Cancer Registry in Singapore were used to study MBC patients treated with T-DXd (February 2021-June 2024).