Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background And Aims: Large language models (LLMs) can potentially support clinicians in their daily routine by providing easy access to information. Yet, they are plagued by stating incorrect facts and hallucinating when queried. Increasing the context by providing external databases while prompting LLMs may decrease the risk of misinformation. This study compares the influence of increased context on the coherence of LLM-based treatment recommendations with the recently updated WHO guidelines for the treatment of chronic hepatitis B (CHB).

Methods: GPT-4 was queried with five clinical case vignettes in two configurations: with and without additional context. The clinical vignettes were explicitly constructed so that treatment recommendations differed between the formerly applicable 2015 WHO guidelines and the updated 2024 ones. GPT-4 with context was provided access to the updated guidelines, while GPT-4 without context had to rely on its internal knowledge. GPT-4 was accessed only a few days after the release of the new WHO guidelines. Treatment recommendations were compared regarding guideline coherence, information inclusion, textual errors, wording clarity and preciseness by seven physicians.

Results: Using GPT-4 with context increased the coherence of the treatment recommendations with the new 2024 guidelines from 51% to 91% compared to GPT-4 without context. Similar trends were observed for all other categories, leading to an increase of 54% in preciseness and clarity, 24% in completeness of incorporating the case vignette information, and 12% in textual correctness.

Conclusions: If LLMs are consulted by clinicians for medical advice, they should be given access to external data sources to increase the chance of providing factually correct advice.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12402858PMC
http://dx.doi.org/10.1111/liv.70324DOI Listing

Publication Analysis

Top Keywords

treatment recommendations
20
gpt-4 context
16
llm-based treatment
8
chronic hepatitis
8
updated guidelines
8
guidelines treatment
8
context
7
guidelines
6
treatment
6
gpt-4
6

Similar Publications

Introduction: Diabetes Mellitus is a chronic disease characterised by elevated plasma glucose (PG) levels. HbA1c has been widely utilized for diabetes diagnosis. However, certain conditions restrict its use.

View Article and Find Full Text PDF

Importance: Research in behavioral economics has demonstrated that people have irrational biases, which make them susceptible to decisional shortcuts, or heuristics. The extent to which physicians consciously might use nudges to exploit these heuristics and thereby influence their patients' decision-making is unclear. In addition, ethical questions about the conscious use of nudges in medicine persist, yet little is known about how physicians experience and perceive their use.

View Article and Find Full Text PDF

African swine fever (ASF) is a contagious viral disease that affects domestic pigs and Eurasian wild boars, causing significant economic losses to the global pig industry. Since its first outbreak in February 2019, ASF has had a profound impact on the Vietnamese pig sector. This review presents a comprehensive analysis of ASF outbreaks in Vietnam from 2019 to 2024, focusing on outbreak dynamics, control strategies, economic impact, and key lessons learned.

View Article and Find Full Text PDF

This Letter to the Editor responds to the recent publication by Patel et al. (J Robot Surg. Jul 11;19(1):370, 2025), which outlines a framework and recommendations for telesurgery.

View Article and Find Full Text PDF

This review article, developed by the EASD Global Council, addresses the growing global challenges in diabetes research and care, highlighting the rising prevalence of diabetes, the increasing complexity of its management and the need for a coordinated international response. With regard to research, disparities in funding and infrastructure between high-income countries and low- and middle-income countries (LMICs) are discussed. The under-representation of LMIC populations in clinical trials, challenges in conducting large-scale research projects, and the ethical and legal complexities of artificial intelligence integration are also considered as specific issues.

View Article and Find Full Text PDF