Introduction: Published systematic reviews vary in methodological quality, which can affect decision-making. Large language models (LLMs) could make the assessment of the methodological quality of systematic reviews more efficient, aiding the incorporation of their evidence into guideline recommendations. We aimed to develop an LLM-based tool to support the assessment of the methodological quality of systematic reviews.
Methods: We assessed the performance of eight LLMs (five base models and three fine-tuned models) in evaluating the methodological quality of systematic reviews. Each model evaluated 100 systematic reviews using a validated 27-item tool (ReMarQ). The fine-tuned models had been trained on a separate sample of 300 manually assessed systematic reviews. We compared the answers provided by the LLMs with those independently provided by human reviewers, computing the accuracy, kappa coefficient, and F1-score for this comparison.
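The comparison described in the Methods can be illustrated with a minimal sketch. It assumes each item answer is coded as 1 (yes) or 0 (no) and that human and model answers are paired in the same order; the function name `agreement_metrics` and the example data are hypothetical and are not taken from the study.

```python
def agreement_metrics(human, model):
    """Accuracy, Cohen's kappa, and F1 for paired dichotomous (1/0) answers.

    Hypothetical helper for illustration; `human` is treated as the reference.
    """
    if len(human) != len(model) or not human:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(human)
    pairs = list(zip(human, model))
    tp = sum(1 for h, m in pairs if h == 1 and m == 1)
    tn = sum(1 for h, m in pairs if h == 0 and m == 0)
    fp = sum(1 for h, m in pairs if h == 0 and m == 1)
    fn = sum(1 for h, m in pairs if h == 1 and m == 0)

    # Observed agreement
    accuracy = (tp + tn) / n

    # Agreement expected by chance, from each rater's marginal proportions
    p_yes = ((tp + fn) / n) * ((tp + fp) / n)
    p_no = ((tn + fp) / n) * ((tn + fn) / n)
    p_e = p_yes + p_no

    # Kappa is undefined when p_e == 1; returning 1.0 there is a convention
    # chosen for this sketch (both raters gave one identical answer throughout).
    kappa = (accuracy - p_e) / (1 - p_e) if p_e < 1 else 1.0

    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return {"accuracy": accuracy, "kappa": kappa, "f1": f1}


# Made-up answers for one item across ten reviews
human_answers = [1, 1, 0, 0, 1, 0, 1, 0, 1, 1]
model_answers = [1, 1, 0, 0, 1, 0, 1, 1, 1, 0]
metrics = agreement_metrics(human_answers, model_answers)
```

Computing the metrics per item and then averaging them would give mean values of the kind reported in the abstract, although the exact aggregation used by the authors is not stated here.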
Results: The best-performing LLM was a fine-tuned GPT-3.5 model (mean accuracy=96.5% [95%CI=89.9-100%]; mean kappa coefficient=0.90 [95%CI=0.71-1.00]; mean F1-score=0.91 [95%CI=0.83-1.00]). This model displayed an accuracy >80% and a kappa coefficient >0.60 for all individual items. When this LLM assessed the same set of systematic reviews 60 times, answers to 18 of the 27 items were always consistent (i.e., always the same), and only 11% of the assessed systematic reviews showed any inconsistency.
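The repeated-assessment check can be sketched as follows. This is one possible reading, not the authors' stated procedure: an item counts as consistent only if every run gave the same answer for every review, and a review counts as inconsistent if any of its items changed between runs. The function name `consistency_summary` and the toy data are hypothetical.

```python
def consistency_summary(runs):
    """Summarize answer stability across repeated runs.

    runs[r][s][i] is the 1/0 answer given in run r for review s, item i.
    Returns (number of always-consistent items, share of inconsistent reviews).
    """
    n_reviews = len(runs[0])
    n_items = len(runs[0][0])

    # stable[s][i] is True when all runs agree on review s, item i
    stable = [
        [len({run[s][i] for run in runs}) == 1 for i in range(n_items)]
        for s in range(n_reviews)
    ]

    # An item is consistent only if it is stable for every review
    consistent_items = sum(
        all(stable[s][i] for s in range(n_reviews)) for i in range(n_items)
    )

    # A review is inconsistent if at least one of its items is unstable
    inconsistent_reviews = sum(not all(row) for row in stable)
    return consistent_items, inconsistent_reviews / n_reviews


# Toy data: 3 runs, 2 reviews, 3 items (the study used 60 runs and 27 items)
toy_runs = [
    [[1, 0, 1], [0, 0, 1]],
    [[1, 0, 1], [0, 1, 1]],
    [[1, 0, 1], [0, 0, 1]],
]
items_ok, share_inconsistent = consistency_summary(toy_runs)
```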
Conclusion: Overall, LLMs have the potential to accurately support the assessment of the methodological quality of systematic reviews based on a validated tool comprising dichotomous items.
Source (DOI Listing): http://dx.doi.org/10.1016/j.jclinepi.2025.111944
JMIR Res Protoc
September 2025
University of Nevada, Las Vegas, Las Vegas, NV, United States.
Background: In-hospital cardiac arrest (IHCA) remains a public health conundrum with high morbidity and mortality rates. While early identification of high-risk patients could enable preventive interventions and improve survival, evidence on the effectiveness of current prediction methods remains inconclusive. Limited research exists on patients' prearrest pathophysiological status and predictive and prognostic factors of IHCA, highlighting the need for a comprehensive synthesis of predictive methodologies.
PLoS One
September 2025
Department of Neurology and Neurosurgery, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, The Netherlands.
Background: Attention to existential needs has become part of daily treatment. Studies have described the concepts of existential experiences and existential interventions. However, consensus and conceptual clarity regarding an existential approach for patients with cancer are currently lacking.
PLoS One
September 2025
Neonatology, Yan'an Hospital Affiliated to Kunming Medical University, Kunming City, Yunnan Province, China.
Purpose: To determine the experience of multiple medication use in elderly patients with multiple chronic conditions by systematically reviewing, retrieving, and synthesizing data from qualitative studies.
Methods: Nine databases were systematically searched for relevant contributions from database inception until October 30, 2024. All qualitative studies in English and Chinese exploring the real-life experiences and feelings of elderly patients with multiple chronic conditions regarding multiple medication use were included.
PLoS One
September 2025
School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America.
Background: Financial hardship (including financial stress, financial strain, asset depletion, and financial toxicity) is a highly relevant construct among the 6.9 million people living with Alzheimer's disease and related dementias (ADRD) in the United States and their family networks. This scoping review will identify existing measures and approaches for capturing financial strain among these families.
Arterial thrombosis is a multifaceted process characterized by platelet aggregation and fibrin deposition, leading to the occlusion of blood vessels. It plays a central role in cardiovascular conditions such as myocardial infarction and ischemic stroke. Gaining insight into the mechanisms underlying arterial thrombosis is essential for developing effective treatments aimed at preventing thrombotic events and reducing associated health burdens.