Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Importance: Various model reporting guidelines have been proposed to ensure clinical prediction models are reliable and fair. However, no consensus exists about which model details are essential to report, and commonalities and differences among reporting guidelines have not been characterized. Furthermore, how well documentation of deployed models adheres to these guidelines has not been studied.

Objectives: To assess information requested by model reporting guidelines and whether the documentation for commonly used machine learning models developed by a single vendor provides the information requested.

Evidence Review: MEDLINE was queried using machine learning model card and reporting machine learning from November 4 to December 6, 2020. References were reviewed to find additional publications, and publications without specific reporting recommendations were excluded. Similar elements requested for reporting were merged into representative items. Four independent reviewers and 1 adjudicator assessed how often documentation for the most commonly used models developed by a single vendor reported the items.

Findings: From 15 model reporting guidelines, 220 unique items were identified that represented the collective reporting requirements. Although 12 items were commonly requested (requested by 10 or more guidelines), 77 items were requested by just 1 guideline. Documentation for 12 commonly used models from a single vendor reported a median of 39% (IQR, 37%-43%; range, 31%-47%) of items from the collective reporting requirements. Many of the commonly requested items had 100% reporting rates, including items concerning outcome definition, area under the receiver operating characteristics curve, internal validation, and intended clinical use. Several items reported half the time or less related to reliability, such as external validation, uncertainty measures, and strategy for handling missing data. Other frequently unreported items related to fairness (summary statistics and subgroup analyses, including for race and ethnicity or sex).

Conclusions And Relevance: These findings suggest that consistent reporting recommendations for clinical predictive models are needed for model developers to share necessary information for model deployment. The many published guidelines would, collectively, require reporting more than 200 items. Model documentation from 1 vendor reported the most commonly requested items from model reporting guidelines. However, areas for improvement were identified in reporting items related to model reliability and fairness. This analysis led to feedback to the vendor, which motivated updates to the documentation for future users.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9391954PMC
http://dx.doi.org/10.1001/jamanetworkopen.2022.27779DOI Listing

Publication Analysis

Top Keywords

reporting guidelines
24
single vendor
16
model reporting
16
reporting
15
documentation commonly
12
machine learning
12
items
12
vendor reported
12
commonly requested
12
items model
12

Similar Publications

Background: Owing to the unique characteristics of digital health interventions (DHIs), a tailored approach to economic evaluation is needed-one that is distinct from that used for pharmacotherapy. However, the absence of clear guidelines in this area is a substantial gap in the evaluation framework.

Objective: This study aims to systematically review and compare the economic evaluation literature on DHIs and pharmacotherapy for the treatment of depression.

View Article and Find Full Text PDF

Importance: Behavioral variant frontotemporal dementia (bvFTD), the most common subtype of FTD, is a leading form of early-onset dementia worldwide. Accurate and timely diagnosis of bvFTD is frequently delayed due to symptoms overlapping with common psychiatric disorders, and interest has increased in identifying biomarkers that may aid in differentiating bvFTD from psychiatric disorders.

Objective: To summarize and critically review studies examining whether neurofilament light chain (NfL) in cerebrospinal fluid (CSF) or blood is a viable aid in the differential diagnosis of bvFTD vs psychiatric disorders.

View Article and Find Full Text PDF

Purpose: This narrative review aims to provide an overview of current knowledge on mpox, emphasizing updated epidemiology and recent advances in treatment and prevention strategies, in light of the latest outbreaks.

Methods: We searched PubMed and Google Scholar for publications on 'Mpox' and 'Monkeypox' up to June 5, 2025. Grey literature from governmental and health agencies was also accessed for outbreak reports and guidelines where published evidence was unavailable.

View Article and Find Full Text PDF

Spiritual interventions, including meditation, prayer, mindfulness, and compassionate care, have gained increasing attention for their potential to enhance both psychological resilience and overall health. This systematic review and meta-analysis examined eight eligible studies conducted across the USA, Europe, and China to assess the impact of such interventions on key outcomes, namely anxiety reduction, quality of life, chronic disease symptom management, and patient satisfaction. Seven studies contributed quantitative data.

View Article and Find Full Text PDF

Purpose: To investigate bacteriology, antibiotic treatment and adverse birth outcomes (ABOs) in pregnancies with and without bacteriuria and urinary tract infections (UTIs) based on urine cultures and clinical diagnoses.

Methods: Registry-based cohort study.

Population: Pregnancies with at least one urine culture analysed at one of two hospitals in the Capital Region, Denmark, between 2015 and 2021.

View Article and Find Full Text PDF