Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Objective: The practice of evidence-based medicine can be challenging when relevant data are lacking or difficult to contextualize for a specific patient. Large language models (LLMs) could potentially address both challenges by summarizing published literature or generating new studies using real-world data.

Materials And Methods: We submitted 50 clinical questions to five LLM-based systems: OpenEvidence, which uses an LLM for retrieval-augmented generation (RAG); ChatRWD, which uses an LLM as an interface to a data extraction and analysis pipeline; and three general-purpose LLMs (ChatGPT-4, Claude 3 Opus, Gemini 1.5 Pro). Nine independent physicians evaluated the answers for relevance, quality of supporting evidence, and actionability (i.e., sufficient to justify or change clinical practice).

Results: General-purpose LLMs rarely produced relevant, evidence-based answers (2-10% of questions). In contrast, RAG-based and agentic LLM systems, respectively, produced relevant, evidence-based answers for 24% (OpenEvidence) to 58% (ChatRWD) of questions. OpenEvidence produced actionable results for 48% of questions with existing evidence, compared to 37% for ChatRWD and <5% for the general-purpose LLMs. ChatRWD provided actionable results for 52% of questions that lacked existing literature compared to <10% for other LLMs.

Discussion: Special-purpose LLM systems greatly outperformed general-purpose LLMs in producing answers to clinical questions. Retrieval-augmented generation-based LLM (OpenEvidence) performed well when existing data were available, while only the agentic ChatRWD was able to provide actionable answers when preexisting studies were lacking.

Conclusion: Synergistic systems combining RAG-based evidence summarization and agentic generation of novel evidence could improve the availability of pertinent evidence for patient care.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12159471PMC
http://dx.doi.org/10.1177/20552076251348850DOI Listing

Publication Analysis

Top Keywords

clinical questions
8
large language
8
retrieval-augmented generation
8
general-purpose llms
8
produced relevant
8
relevant evidence-based
8
evidence-based answers
8
questions
5
answering real-world
4
real-world clinical
4

Similar Publications

Beyond the screen: exploring digital health experiences of individuals affected by psoriasis - a qualitative interview study.

BMC Public Health

September 2025

Department of Dermatology and Allergy, TUM School of Medicine and Health, Technical University of Munich, Biedersteiner Str. 29, 80802, Munich, Germany.

Background: Psoriasis, a chronic inflammatory skin disorder, imposes a high burden on those affected, often leading to stigma and increased depression risk. With the increasing importance of digital media in medical contexts, there is a notable prevalence of misinformation and low-quality content. This study aims to explore the experiences of individuals affected by psoriasis regarding their disease-related digital media use.

View Article and Find Full Text PDF

The multi-kingdom cancer microbiome.

Nat Microbiol

September 2025

Joan and Sanford I. Weill Department of Medicine, Gastroenterology and Hepatology Division, Weill Cornell Medicine, New York, NY, USA.

Microbial influence on cancer development and therapeutic response is a growing area of cancer research. Although it is known that microorganisms can colonize certain tissues and contribute to tumour initiation, the use of deep sequencing technologies and computational pipelines has led to reports of multi-kingdom microbial communities in a growing list of cancer types. This has prompted discussions on the role and scope of microbial presence in cancer, while raising the possibility of microbiome-based diagnostic, prognostic and therapeutic tools.

View Article and Find Full Text PDF

Readiness for climate change mitigation among anesthesiologists : A before and after study at three German university hospitals.

Anaesthesiologie

September 2025

TUM School of Medicine and Health, Klinikum rechts der Isar, Department of Anesthesiology and Intensive Care, Technical University of Munich, Ismaninger Str. 22, 81675, Munich, Germany.

Background: Medical societies around the world are exploring strategies to reduce their carbon footprint. In this context, organizational readiness can serve as an important facilitator for the success of change. In this study we assessed whether a series of educational interventions improved anesthesia departments' organizational readiness for climate change mitigation.

View Article and Find Full Text PDF

Introduction: Pharmacy students were given the opportunity to participate in an online video-recorded objective structured clinical examination (OSCE) with pharmacist feedback. This study aimed to evaluate their views and experiences regarding this initiative and reviewing the recording.

Methods: Third year undergraduate pharmacy students (n = 68) were invited to participate in a formative video-recorded OSCE station online, followed by a one-to-one feedback discussion with a pharmacist facilitator.

View Article and Find Full Text PDF

[Towards an integrative understanding of complex trauma].

Encephale

September 2025

Département de psychiatrie de l'adolescent et du jeune adulte, institut mutualiste Montsouris, 42, boulevard Jourdan, Paris, France; UVSQ, Inserm U1178, PsyDev, CESP université Paris-Saclay, Villejuif, France; Université Paris-Cité, Paris, France.

The body of knowledge on trauma is rapidly expanding. Since 2022, the WHO has been calling for the history of adversity to be systematically taken into account when assessing the state of health of all individuals. But at this stage, our understanding of the precise mechanisms of complex trauma remains incomplete.

View Article and Find Full Text PDF