Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Rationale And Objectives: This study demonstrates a method for quantifying the impact of overfitting on the receiving operator characteristic curve (AUC) when using standard analysis pipelines to develop imaging biomarkers. We illustrate the approach using two publicly available repositories of radiology and pathology images for breast cancer diagnosis.

Materials And Methods: For each dataset, we permuted the outcome (cancer diagnosis) values to eliminate any true association between imaging features and outcome. Seven types of classification models (logistic regression, linear discriminant analysis, Naïve Bayes, linear support vector machines, nonlinear support vector machine, random forest, and multi-layer perceptron) were fitted to each scrambled dataset and evaluated by each of four techniques (all data, hold-out, 10-fold cross-validation, and bootstrapping). After repeating this process for a total of 50 outcome permutations, we averaged the resulting AUCs. Any increase over a null AUC of 0.5 can be attributed to overfitting.

Results: Applying this approach and varying sample size and the number of imaging features, we found that failing to control for overfitting could result in near-perfect prediction (AUC near 1.0). Cross-validation offered greater protection against overfitting than the other evaluation techniques, and for most classification algorithms a sample size of at least 200 was required to assess as few as 10 features with less than 0.05 AUC inflation attributable to overfitting.

Conclusion: This approach could be applied to any curated dataset to suggest the number of features and analysis approaches to limit overfitting.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12034343PMC
http://dx.doi.org/10.1016/j.acra.2024.02.028DOI Listing

Publication Analysis

Top Keywords

standard analysis
8
analysis pipelines
8
imaging features
8
support vector
8
sample size
8
overfitting
5
response permutation
4
permutation measure
4
imaging
4
measure imaging
4

Similar Publications

Amyloidosis encompasses a spectrum of rare disorders characterized by extracellular amyloid deposition. Achieving an accurate early diagnosis of systemic amyloidosis necessitates biopsy-specific pathological evaluation. Formalin-fixed, paraffin-embedded liver biopsy specimens were examined using Congo red staining, electron microscopy, immunohistochemistry (IHC), immunofluorescence, and Congo red-assisted laser microdissection with mass spectrometry (LMD/MS).

View Article and Find Full Text PDF

Background: To assess the efficacy and safety of tenecteplase in patients presenting within 24 hours of symptom onset with a large vessel occlusion and target mismatch on perfusion computed tomography.

Methods: ETERNAL-LVO was a prospective, randomized, open-label, blinded end point, phase 3, superiority trial where adult participants with a large vessel occlusion, presenting within 24 hours of onset with salvageable tissue on computed tomography perfusion, were randomized to tenecteplase 0.25 mg/kg or standard care across 11 primary and comprehensive stroke centers in Australia.

View Article and Find Full Text PDF

Background: Mental health (MH) problems are more common in people with intellectual disabilities (ID), yet under-diagnosis persists, which may be partly due to a lack of appropriate assessment tools. This study presents a systematic review of instruments used to assess MH problems in Spanish-speaking adults with ID.

Method: Following PRISMA guidelines, a search was conducted in Web of Science, PsycINFO, and Scopus using terms related to ID, MH and assessment.

View Article and Find Full Text PDF

Introduction: Cutaneous scalp metastases from breast carcinoma (CMBC) represent an uncommon manifestation of metastatic disease, with heterogeneous clinical presentations, including nodular or infiltrative lesions and scarring alopecia (alopecia neoplastica). The absence of standardized diagnostic criteria, particularly for alopecic phenotypes, poses challenges to early recognition of CMBC, which may represent either the first indication of neoplastic progression or a late recurrence.

Materials And Methods: We retrospectively analyzed a multicenter cohort of 15 patients with histologically confirmed CMBC.

View Article and Find Full Text PDF

Introduction: Cardiac amyloidosis is an underdiagnosed disease, and its prevalence is probably higher than previously estimated. We aimed to investigate the effect of introducing a systemic diagnostic algorithm for cardiac amyloidosis in clinical practice.

Methods: A systematic diagnostic algorithm was developed and clinically applied in two hospitals in Eastern Denmark.

View Article and Find Full Text PDF