Thyroid ultrasound diagnosis improvement via multi-view self-supervised learning and two-stage pre-training.

Jian Wang , Xin Yang , Xiaohong Jia , Wufeng Xue , Rusi Chen , Yanlin Chen , Xiliang Zhu , Lian Liu , Yan Cao , Jianqiao Zhou , Dong Ni , Ning Gu

Comput Biol Med

Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, China; Cardiovascular Disease Research Center, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Medi

Published: March 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Thyroid nodule classification and segmentation in ultrasound images are crucial for computer-aided diagnosis; however, they face limitations owing to insufficient labeled data. In this study, we proposed a multi-view contrastive self-supervised method to improve thyroid nodule classification and segmentation performance with limited manual labels. Our method aligns the transverse and longitudinal views of the same nodule, thereby enabling the model to focus more on the nodule area. We designed an adaptive loss function that eliminates the limitations of the paired data. Additionally, we adopted a two-stage pre-training to exploit the pre-training on ImageNet and thyroid ultrasound images. Extensive experiments were conducted on a large-scale dataset collected from multiple centers. The results showed that the proposed method significantly improves nodule classification and segmentation performance with limited manual labels and outperforms state-of-the-art self-supervised methods. The two-stage pre-training also significantly exceeded ImageNet pre-training.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.compbiomed.2024.108087	DOI Listing

Publication Analysis

Top Keywords

two-stage pre-training

nodule classification

classification segmentation

thyroid ultrasound

thyroid nodule

ultrasound images

segmentation performance

performance limited

limited manual

manual labels

Similar Publications

ITSEF: Inception-based two-stage ensemble framework for P300 detection.

Neural Netw

August 2025

College of Communication Engineering, Jilin University, Changchun, China. Electronic address:

Wenjun Hu , Dingguo Zhang , Wanzhong Chen

To address the problems of low signal-to-noise ratio, significant individual differences between subjects, and class imbalance in P300-based brain-computer interface (BCI), this paper proposes a novel Inception-based two-stage ensemble framework (ITSEF) to improve detection accuracy. Firstly, an Inception-based convolutional neural network (ICNN) is designed to extract multi-scale features and conduct cross-channel learning. In addition, a two-stage ensemble framework (TSEF) combined with a pre-training and fine-tuning strategy is developed, aiming to enhance the classification performance of the minority class and improve the generalization ability of the model.

View Article and Find Full Text PDF

Similar Publications

FastDIP: An effective approach for accelerating unsupervised low-count PET image reconstruction.

Comput Med Imaging Graph

September 2025

School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China. Electronic address:

Jinming Li , Jing Wang , Yang Lv , Puming Zhang , Jun Zhao

Introduction: Unsupervised deep learning methods can improve the image quality of positron emission tomography (PET) images without the need for large-scale datasets. However, these approaches typically require training a distinct network for each patient, making the reconstruction process extremely time-consuming and limiting their clinical applicability. In this paper, our research objective is to develop an efficient unsupervised learning framework for unsupervised PET image reconstruction, in order to fulfill the clinical requirement for real-time imaging capabilities.

View Article and Find Full Text PDF

Similar Publications

Optimized AI-based neural decoding from BOLD fMRI signal for analyzing visual and semantic ROIs in the human visual system.

J Neural Eng

August 2025

Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy.

Lorenzo Veronese , Andrea Moglia , Nicolò Pecco , Pasquale Anthony Della Rosa , Paola Scifo

. AI-based neural decoding reconstructs visual perception by leveraging generative models to map brain activity measured through functional magnetic resonance imaging (fMRI) into the observed visual stimulus..

View Article and Find Full Text PDF

Similar Publications

Deep reinforcement learning as an interaction agent to steer fragment-based 3D molecular generation for protein pockets.

Brief Bioinform

July 2025

Shanghai Key Laboratory of Maternal Fetal Medicine, Clinical and Translational Research Center of Shanghai First Maternity and Infant Hospital, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China.

Xudong Zhang , Jing Hou , Sanqing Qu , Fan Lu , Zhixin Tian

Designing high-affinity molecules for protein targets (especially novel protein families) is a crucial yet challenging task in drug discovery. Recently, there has been tremendous progress in structure-based 3D molecular generative models that incorporate structural information of protein pockets. However, the capacity for molecular representation learning and the generalization for capturing interaction patterns need substantial further developments.

View Article and Find Full Text PDF

Similar Publications

TS-DENet: a transferable self-supervised learning method for multi-modal fluorescence image denoising.

Appl Opt

April 2025

Liangliang Huang , Zhong Wen , Zhaokai Wang , Quanzhi Li , Qilin Deng

Recent fluorescence diagnostic tools have demonstrated effectiveness in detecting early-stage neoplasmatic tissue and monitoring therapy, allowing rapid non-invasive live imaging diagnosis. However, varying light conditions in environments and modalities of observation systems introduce multi-level noises to acquired images, causing degraded image quality. Deep learning (DL) has shown great potential in improving image quality, but its performance may be limited when dealing with insufficient labeled training data and the challenges of acquiring high-quality multi-modality fluorescence images in specific biomedical tasks.

View Article and Find Full Text PDF

Similar Publications