Federated Learning for Decentralized Artificial Intelligence in Melanoma Diagnostics.

Sarah Haggenmüller , Max Schmitt , Eva Krieghoff-Henning , Achim Hekler , Roman C Maron , Christoph Wies , Jochen S Utikal , Friedegund Meier , Sarah Hobelsberger , Frank F Gellrich , Mildred Sergon , Axel Hauschild , Lars E French , Lucie Heinzerling , Justin G Schlager , Kamran Ghoreschi , Max Schlaak , Franz J Hilke , Gabriela Poch , Sören Korsing

JAMA Dermatol

Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany.

Published: March 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Importance: The development of artificial intelligence (AI)-based melanoma classifiers typically calls for large, centralized datasets, requiring hospitals to give away their patient data, which raises serious privacy concerns. To address this concern, decentralized federated learning has been proposed, where classifier development is distributed across hospitals.

Objective: To investigate whether a more privacy-preserving federated learning approach can achieve comparable diagnostic performance to a classical centralized (ie, single-model) and ensemble learning approach for AI-based melanoma diagnostics.

Design, Setting, And Participants: This multicentric, single-arm diagnostic study developed a federated model for melanoma-nevus classification using histopathological whole-slide images prospectively acquired at 6 German university hospitals between April 2021 and February 2023 and benchmarked it using both a holdout and an external test dataset. Data analysis was performed from February to April 2023.

Exposures: All whole-slide images were retrospectively analyzed by an AI-based classifier without influencing routine clinical care.

Main Outcomes And Measures: The area under the receiver operating characteristic curve (AUROC) served as the primary end point for evaluating the diagnostic performance. Secondary end points included balanced accuracy, sensitivity, and specificity.

Results: The study included 1025 whole-slide images of clinically melanoma-suspicious skin lesions from 923 patients, consisting of 388 histopathologically confirmed invasive melanomas and 637 nevi. The median (range) age at diagnosis was 58 (18-95) years for the training set, 57 (18-93) years for the holdout test dataset, and 61 (18-95) years for the external test dataset; the median (range) Breslow thickness was 0.70 (0.10-34.00) mm, 0.70 (0.20-14.40) mm, and 0.80 (0.30-20.00) mm, respectively. The federated approach (0.8579; 95% CI, 0.7693-0.9299) performed significantly worse than the classical centralized approach (0.9024; 95% CI, 0.8379-0.9565) in terms of AUROC on a holdout test dataset (pairwise Wilcoxon signed-rank, P < .001) but performed significantly better (0.9126; 95% CI, 0.8810-0.9412) than the classical centralized approach (0.9045; 95% CI, 0.8701-0.9331) on an external test dataset (pairwise Wilcoxon signed-rank, P < .001). Notably, the federated approach performed significantly worse than the ensemble approach on both the holdout (0.8867; 95% CI, 0.8103-0.9481) and external test dataset (0.9227; 95% CI, 0.8941-0.9479).

Conclusions And Relevance: The findings of this diagnostic study suggest that federated learning is a viable approach for the binary classification of invasive melanomas and nevi on a clinically representative distributed dataset. Federated learning can improve privacy protection in AI-based melanoma diagnostics while simultaneously promoting collaboration across institutions and countries. Moreover, it may have the potential to be extended to other image classification tasks in digital cancer histopathology and beyond.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10851139	PMC
http://dx.doi.org/10.1001/jamadermatol.2023.5550	DOI Listing

Publication Analysis

Top Keywords

test dataset

federated learning

whole-slide images

artificial intelligence

ai-based melanoma

learning approach

diagnostic performance

classical centralized

external test

median range

Similar Publications

Artificial Intelligence in Contact Dermatitis: Current and Future Perspectives.

Dermatitis

September 2025

From the Department of Dermatology, Venereology and Leprology, All India Institute of Medical Sciences (AIIMS), Bhopal, India.

Akriti Agrawal

Contact dermatitis (CD), which includes both allergic CD and irritant CD, is a common inflammatory condition that can pose significant diagnostic challenges. Although patch testing is the gold standard for identifying causative allergens for allergic contact dermatitis (ACD), it is time-consuming, subjective, and requires expert interpretation. Recent advancements in artificial intelligence (AI), particularly in machine learning (ML) and deep learning, have shown promise in improving the accuracy, efficiency, and accessibility of CD diagnosis and management.

View Article and Find Full Text PDF

Similar Publications

Interpretable Machine Learning for Proteomics-Based Subtyping and Tumor Mutational Burden Prediction in Endometrial Cancer.

Proteomics Clin Appl

September 2025

AIBioMed Research Group, Taipei Medical University, Taipei, Taiwan.

Thi-My-Trang Luong , Xuan Lam Bui , Chii-Ruey Tzeng , Nguyen Quoc Khanh Le

Background: Endometrial carcinoma (EC) represents a significant clinical challenge due to its pronounced molecular heterogeneity, directly influencing prognosis and therapeutic responses. Accurate classification of molecular subtypes (CNV-high, CNV-low, MSI-H, POLE) and precise tumor mutational burden (TMB) assessment is crucial for guiding personalized therapeutic interventions. Integrating proteomics data with advanced machine learning (ML) techniques offers a promising strategy for achieving precise, clinically actionable classification and biomarker discovery in EC.

View Article and Find Full Text PDF

Similar Publications

[A myocardial infarction detection and localization model based on multi-scale field residual blocks fusion with modified channel attention].

Nan Fang Yi Ke Da Xue Xue Bao

August 2025

School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.

Qiucen Wu , Xueqi Lu , Yaoqi Wen , Yong Hong , Yuliang Wu

Objectives: We propose a myocardial infarction (MI) detection and localization model for improving the diagnostic accuracy for MI to provide assistance to clinical decision-making.

Methods: The proposed model was constructed based on multi-scale field residual blocks fusion modified channel attention (MSF-RB-MCA). The model utilizes lead II electrocardiogram (ECG) signals to detect and localize MI, and extracts different levels of feature information through the multi-scale field residual block.

View Article and Find Full Text PDF

Similar Publications

Factors Contributing to Geographical Variation in Maternal Smoking Rates Among Aboriginal and Torres Strait Islander Women.

Health Promot J Austr

October 2025

School of Medicine and Public Health, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, New South Wales, Australia.

Emilie Cameron , Matthew Clapham , Rita Hitching , Sandra Eades , Bob Davis

Issue Addressed: Smoking during pregnancy poses serious health risks for mother and baby. Addressing smoking among pregnant Aboriginal and Torres Strait Islander women is an Australian national priority. This study aimed to understand the geographical variation in rates of not smoking during pregnancy among Aboriginal and Torres Strait Islander women.

View Article and Find Full Text PDF

Similar Publications

A Deep Learning-Based Fully Automated Cardiac MRI Segmentation Approach for Tetralogy of Fallot Patients.

J Magn Reson Imaging

September 2025

Department of Medical Imaging and Intervention, Chang Gung Memorial Hospital at Linkou, Taoyuan City, Taiwan.

Wen-Yen Chai , Gigin Lin , Chao-Jan Wang , Hsin-Ju Chiang , Shu-Hang Ng

Background: Automated cardiac MR segmentation enables accurate and reproducible ventricular function assessment in Tetralogy of Fallot (ToF), whereas manual segmentation remains time-consuming and variable.

Purpose: To evaluate the deep learning (DL)-based models for automatic left ventricle (LV), right ventricle (RV), and LV myocardium segmentation in ToF, compared with manual reference standard annotations.

Study Type: Retrospective.

View Article and Find Full Text PDF

Similar Publications