Automated lung tumor segmentation robust to various tumor sizes using a consistency learning-based multi-scale dual-attention network.

Jumin Lee , Min-Jin Lee , Bong-Seog Kim , Helen Hong

J Xray Sci Technol

Department of Software Convergence, Seoul Women's University, Seoul, Republic of Korea.

Published: September 2023

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Background: It is often difficult to automatically segment lung tumors due to the large tumor size variation ranging from less than 1 cm to greater than 7 cm depending on the T-stage.

Objective: This study aims to accurately segment lung tumors of various sizes using a consistency learning-based multi-scale dual-attention network (CL-MSDA-Net).

Methods: To avoid under- and over-segmentation caused by different ratios of lung tumors and surrounding structures in the input patch according to the size of the lung tumor, a size-invariant patch is generated by normalizing the ratio to the average size of the lung tumors used for the training. Two input patches, a size-invariant patch and size-variant patch are trained on a consistency learning-based network consisting of dual branches that share weights to generate a similar output for each branch with consistency loss. The network of each branch has a multi-scale dual-attention module that learns image features of different scales and uses channel and spatial attention to enhance the scale-attention ability to segment lung tumors of different sizes.

Results: In experiments with hospital datasets, CL-MSDA-Net showed an F1-score of 80.49%, recall of 79.06%, and precision of 86.78%. This resulted in 3.91%, 3.38%, and 2.95% higher F1-scores than the results of U-Net, U-Net with a multi-scale module, and U-Net with a multi-scale dual-attention module, respectively. In experiments with the NSCLC-Radiomics datasets, CL-MSDA-Net showed an F1-score of 71.7%, recall of 68.24%, and precision of 79.33%. This resulted in 3.66%, 3.38%, and 3.13% higher F1-scores than the results of U-Net, U-Net with a multi-scale module, and U-Net with a multi-scale dual-attention module, respectively.

Conclusions: CL-MSDA-Net improves the segmentation performance on average for tumors of all sizes with significant improvements especially for small sized tumors.

Download full-text PDF	Source
http://dx.doi.org/10.3233/XST-230003	DOI Listing

Publication Analysis

Top Keywords

multi-scale dual-attention

lung tumors

u-net multi-scale

consistency learning-based

segment lung

dual-attention module

lung tumor

sizes consistency

learning-based multi-scale

dual-attention network

Similar Publications

A novel dual-attention deep neural network with multi-scale fusion feature processing for predicting transcription factor binding sites.

IEEE J Biomed Health Inform

September 2025

Yuechuan Dai , Xianjun Shen , Weizhong Zhao , Xiaohua Hu

Transcription functions as a pivotal biological process in cell biology, which is required to complete the binding of transcription factors (TFs) to transcription factor binding sites (TFBSs) on the DNA. Accurate prediction of TFBSs can provide great potential to regulate the expression of interested genes, which can facilitate exploration of new drugs and treatment for diseases. Although many deep learning-based models have been proposed for predicting TFBSs, existing models still have problems, including the use of convolutional processing of DNA sequences that loses information about the DNA double helix structure and fails to adequately account for the stereoscopic structure of DNA shape data in three dimensions.

View Article and Find Full Text PDF

Similar Publications

Heterogeneous dual-decoder network for road extraction in remote sensing images.

Sci Rep

August 2025

Henan University, Software College, Kaifeng, 475000, China.

Shenming Qu , Gaigai Liu , Xiangnan Zhang , Yanhong Liu

Accurate road extraction from remote sensing images is crucial for autonomous driving, urban planning, and route planning. However, existing methods struggle to address the challenges of scale variation, occlusion, and blurred boundaries. To tackle these challenges, this paper proposes a heterogeneous dual-decoder network (HDDNet), which aims to simultaneously solve the multiple problems in remote sensing road extraction by designing two decoders with complementary functions.

View Article and Find Full Text PDF

Similar Publications

Texture-preserving and information loss minimization method for infrared and visible image fusion.

Sci Rep

July 2025

Institute of Image Understanding Research, North Minzu University, Yinchuan, 750021, China.

Qiyuan He , Yongdong Huang

In the task of infrared and visible image fusion, achieving high-quality fusion results typically requires preserving detailed texture and minimizing information loss, while maintaining high contrast and clear edges; however, existing methods often struggle to balance these objectives, leading to texture degradation and information loss during the fusion process. To address these challenges, we propose TPFusion, a texture-preserving and information loss minimization method for infrared and visible image fusion. TPFusion consists of the following key components: a multi-scale feature extraction module for enhancing the capability of capturing features; a texture enhancement module and contrast enhancement module, which helps to preserve fine-grained textures and extract salient contours and contrast information; a dual-attention fusion module for fusing the features extracted from the source images; an information content based loss function minimizing the feature discrepancy between the fused images and the source images and effectively reducing the information loss.

View Article and Find Full Text PDF

Similar Publications

One-dimensional time-frequency dual-channel visual transformer for bearing fault diagnosis under strong noise and limited data conditions.

Sci Rep

July 2025

College of Mathematics, SuQian University, Suqian, 223800, China.

Shaobin Cai , Yuchen Wang , Wanchen Cai , Yuchang Mo , Liansuo Wei

In industrial settings, bearing health directly affects equipment stability, making accurate and efficient fault diagnosis critical for operational safety. Recently, Transformer models have been widely adopted in bearing fault diagnosis due to their strong global modeling capabilities. However, they still face significant challenges under strong noise and limited data.

View Article and Find Full Text PDF

Similar Publications

Single-Image Super-Resolution via Cascaded Non-Local Mean Network and Dual-Path Multi-Branch Fusion.

Sensors (Basel)

June 2025

School of Computer and Control Engineering, Yantai University, Yantai 264005, China.

Yu Xu , Yi Wang

Image super-resolution (SR) aims to reconstruct high-resolution (HR) images from low-resolution (LR) inputs. It plays a crucial role in applications such as medical imaging, surveillance, and remote sensing. However, due to the ill-posed nature of the task and the inherent limitations of imaging sensors, obtaining accurate HR images remains challenging.

View Article and Find Full Text PDF

Similar Publications