Dive into the details of self-supervised learning for medical image analysis.

Chuyan Zhang , Hao Zheng , Yun Gu

Med Image Anal

Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai, China. Electronic address:

Published: October 2023

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Self-supervised learning (SSL) has achieved remarkable performance in various medical imaging tasks by dint of priors from massive unlabeled data. However, regarding a specific downstream task, there is still a lack of an instruction book on how to select suitable pretext tasks and implementation details throughout the standard "pretrain-then-finetune" workflow. In this work, we focus on exploiting the capacity of SSL in terms of four realistic and significant issues: (1) the impact of SSL on imbalanced datasets, (2) the network architecture, (3) the applicability of upstream tasks to downstream tasks and (4) the stacking effect of SSL and common policies for deep learning. We provide a large-scale, in-depth and fine-grained study through extensive experiments on predictive, contrastive, generative and multi-SSL algorithms. Based on the results, we have uncovered several insights. Positively, SSL advances class-imbalanced learning mainly by boosting the performance of the rare class, which is of interest to clinical diagnosis. Unfortunately, SSL offers marginal or even negative returns in some cases, including severely imbalanced and relatively balanced data regimes, as well as combinations with common training policies. Our intriguing findings provide practical guidelines for the usage of SSL in the medical context and highlight the need for developing universal pretext tasks to accommodate diverse application scenarios. The code of this paper can be found at https://github.com/EndoluminalSurgicalVision-IMR/Medical-SSL.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.media.2023.102879	DOI Listing

Publication Analysis

Top Keywords

self-supervised learning

pretext tasks

ssl

tasks

dive details

details self-supervised

learning

learning medical

medical image

image analysis

Similar Publications

TissueMosaic: Self-supervised learning of tissue representations enables differential spatial transcriptomics across samples.

Cell Syst

September 2025

Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA. Electronic address:

Sandeep Kambhampati , Luca D'Alessio , Fedor Grab , Stephen Fleming , Sophia Liu

Spatial transcriptomics allows for the measurement of gene expression within the native tissue context. However, despite technological advancements, computational methods to link cell states with their microenvironment and compare these relationships across samples and conditions remain limited. To address this, we introduce Tissue Motif-Based Spatial Inference across Conditions (TissueMosaic), a self-supervised convolutional neural network designed to discover and represent tissue architectural motifs from multi-sample spatial transcriptomic datasets.

View Article and Find Full Text PDF

Similar Publications

Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid-Self-Supervised Learning Framework.

IEEE Trans Cybern

September 2025

Cheol-Hui Lee , Hakseung Kim , Byung Chul Yoon , Dong-Joo Kim

Sleep is essential for maintaining human health and quality of life. Analyzing physiological signals during sleep is critical in assessing sleep quality and diagnosing sleep disorders. However, manual diagnoses by clinicians are time-intensive and subjective.

View Article and Find Full Text PDF

Similar Publications

Multimodal self-supervised retinal vessel segmentation.

Neural Netw

September 2025

Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen, China. Electronic address:

Pengshuai Yin , Jingqi Zhang , Huichou Huang , Ruirui Liu , Yanxia Liu

Automatic segmentation of retinal vessels from retinography images is crucial for timely clinical diagnosis. However, the high cost and specialized expertise required for annotating medical images often result in limited labeled datasets, which constrains the full potential of deep learning methods. Recent advances in self-supervised pretraining using unlabeled data have shown significant benefits for downstream tasks.

View Article and Find Full Text PDF

Similar Publications

Self-supervised representation learning with continuous training data improves the feel and performance of myoelectric control.

Comput Biol Med

September 2025

Department of Electrical and Computer Engineering and the Institute of Biomedical Engineering, University of New Brunswick, Fredericton, E3B 5A3, NB, Canada.

Shriram Tallam Puranam Raghu , Dawn T MacIsaac , Erik J Scheme

Pattern recognition-based myoelectric control is traditionally trained with static or ramp contractions, but this fails to capture the dynamic nature of real-world movements. This study investigated the benefits of training classifiers with continuous dynamic data, encompassing transitions between various movement classes. We employed both conventional (LDA) and deep learning (LSTM) classifiers, comparing their performance when trained with ramp data, continuous dynamic data, and an LSTM pre-trained with a self-supervised learning technique (VICReg).

View Article and Find Full Text PDF

Similar Publications

A foundation model for learning genetic associations from brain imaging phenotypes.

Bioinform Adv

August 2025

IBM Research, Yorktown Heights, NY, 10598, United States.

Diego Machado Reyes , Myson Burch , Laxmi Parida , Aritra Bose

Motivation: Due to the intricate etiology of neurological disorders, finding interpretable associations between multiomics features can be challenging using standard approaches.

Results: We propose COMICAL, a contrastive learning approach using multiomics data to generate associations between genetic markers and brain imaging-derived phenotypes. COMICAL jointly learns omics representations utilizing transformer-based encoders with custom tokenizers.

View Article and Find Full Text PDF

Similar Publications