Assessment of Adherence to Reporting Guidelines by Commonly Used Clinical Prediction Models From a Single Vendor: A Systematic Review.

Jonathan H Lu , Alison Callahan , Birju S Patel , Keith E Morse , Dev Dash , Michael A Pfeffer , Nigam H Shah

JAMA Netw Open

Center for Biomedical Informatics Research, Stanford University School of Medicine, Stanford, California.

Published: August 2022

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Importance: Various model reporting guidelines have been proposed to ensure clinical prediction models are reliable and fair. However, no consensus exists about which model details are essential to report, and commonalities and differences among reporting guidelines have not been characterized. Furthermore, how well documentation of deployed models adheres to these guidelines has not been studied.

Objectives: To assess information requested by model reporting guidelines and whether the documentation for commonly used machine learning models developed by a single vendor provides the information requested.

Evidence Review: MEDLINE was queried using machine learning model card and reporting machine learning from November 4 to December 6, 2020. References were reviewed to find additional publications, and publications without specific reporting recommendations were excluded. Similar elements requested for reporting were merged into representative items. Four independent reviewers and 1 adjudicator assessed how often documentation for the most commonly used models developed by a single vendor reported the items.

Findings: From 15 model reporting guidelines, 220 unique items were identified that represented the collective reporting requirements. Although 12 items were commonly requested (requested by 10 or more guidelines), 77 items were requested by just 1 guideline. Documentation for 12 commonly used models from a single vendor reported a median of 39% (IQR, 37%-43%; range, 31%-47%) of items from the collective reporting requirements. Many of the commonly requested items had 100% reporting rates, including items concerning outcome definition, area under the receiver operating characteristics curve, internal validation, and intended clinical use. Several items reported half the time or less related to reliability, such as external validation, uncertainty measures, and strategy for handling missing data. Other frequently unreported items related to fairness (summary statistics and subgroup analyses, including for race and ethnicity or sex).

Conclusions And Relevance: These findings suggest that consistent reporting recommendations for clinical predictive models are needed for model developers to share necessary information for model deployment. The many published guidelines would, collectively, require reporting more than 200 items. Model documentation from 1 vendor reported the most commonly requested items from model reporting guidelines. However, areas for improvement were identified in reporting items related to model reliability and fairness. This analysis led to feedback to the vendor, which motivated updates to the documentation for future users.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9391954	PMC
http://dx.doi.org/10.1001/jamanetworkopen.2022.27779	DOI Listing

Publication Analysis

Top Keywords

reporting guidelines

single vendor

model reporting

reporting

documentation commonly

machine learning

items

vendor reported

commonly requested

items model

Similar Publications

Comparison of Cost-Effectiveness Between Digital Health Interventions and Pharmacotherapy for Depression: Systematic Review.

J Med Internet Res

September 2025

School of Pharmacy, Sungkyunkwan University, Gyeonggi-do, Republic of Korea.

Jiae Im , Byeong-Chan Oh , Ha-Jun Song , Jeong-Min Choi , Dong-Ho Yeo

Background: Owing to the unique characteristics of digital health interventions (DHIs), a tailored approach to economic evaluation is needed-one that is distinct from that used for pharmacotherapy. However, the absence of clear guidelines in this area is a substantial gap in the evaluation framework.

Objective: This study aims to systematically review and compare the economic evaluation literature on DHIs and pharmacotherapy for the treatment of depression.

View Article and Find Full Text PDF

Similar Publications

Neurofilament Light Chain and Differentiation of Behavioral Variant Frontotemporal Dementia From Psychiatric Disorders: A Systematic Review.

JAMA Psychiatry

September 2025

Norman Fixel Institute for Neurological Diseases, University of Florida, Gainesville.

Dimitry S Davydow , Morgan Brasfield , Christopher B Morrow , Adam M Staffaroni , Gregory M Pontone

Importance: Behavioral variant frontotemporal dementia (bvFTD), the most common subtype of FTD, is a leading form of early-onset dementia worldwide. Accurate and timely diagnosis of bvFTD is frequently delayed due to symptoms overlapping with common psychiatric disorders, and interest has increased in identifying biomarkers that may aid in differentiating bvFTD from psychiatric disorders.

Objective: To summarize and critically review studies examining whether neurofilament light chain (NfL) in cerebrospinal fluid (CSF) or blood is a viable aid in the differential diagnosis of bvFTD vs psychiatric disorders.

View Article and Find Full Text PDF

Similar Publications

From neglected to notoriety: a review of Mpox clinical features, virology, epidemiology, treatment and prevention strategies.

Eur J Clin Microbiol Infect Dis

September 2025

Department of Infectious and Tropical Diseases, Toulouse University Hospital, Toulouse, 31059 Cedex 9, France.

Clément Viguier , Pierre Delobel , François-Xavier Lescure , Simon Bessis , Jean-Michel Mansuy

Purpose: This narrative review aims to provide an overview of current knowledge on mpox, emphasizing updated epidemiology and recent advances in treatment and prevention strategies, in light of the latest outbreaks.

Methods: We searched PubMed and Google Scholar for publications on 'Mpox' and 'Monkeypox' up to June 5, 2025. Grey literature from governmental and health agencies was also accessed for outbreak reports and guidelines where published evidence was unavailable.

View Article and Find Full Text PDF

Similar Publications

Spirituality and Medicine in the USA, Europe, and the UK: A Systematic Review and Meta-analysis of Integrative Approaches to Patient Satisfaction, Quality of Life, and Health Outcomes.

J Relig Health

September 2025

Government Law College, Madurai, India.

Arbind Kumar Choudhary , R Abirami

Spiritual interventions, including meditation, prayer, mindfulness, and compassionate care, have gained increasing attention for their potential to enhance both psychological resilience and overall health. This systematic review and meta-analysis examined eight eligible studies conducted across the USA, Europe, and China to assess the impact of such interventions on key outcomes, namely anxiety reduction, quality of life, chronic disease symptom management, and patient satisfaction. Seven studies contributed quantitative data.

View Article and Find Full Text PDF

Similar Publications

Bacteriology, antibiotic treatment effect and adverse birth outcomes in pregnant women with and without bacteriuria: a registry study.

Infection

September 2025

Department of Infectious Diseases, Copenhagen University Hospital, Hvidovre, Hvidovre, Denmark.

Jon Dissing Sund , Mathilde Sif Frydensberg Nicolaisen , Jenny Dahl Knudsen , Michael Pedersen , Emil Hofman

Purpose: To investigate bacteriology, antibiotic treatment and adverse birth outcomes (ABOs) in pregnancies with and without bacteriuria and urinary tract infections (UTIs) based on urine cultures and clinical diagnoses.

Methods: Registry-based cohort study.

Population: Pregnancies with at least one urine culture analysed at one of two hospitals in the Capital Region, Denmark, between 2015 and 2021.

View Article and Find Full Text PDF

Similar Publications