BSA-Seg: A Bi-level sparse attention network combining narrow band loss for multi-target medical image segmentation.

Zhiyong Zhou , Zhechen Zhou , Xusheng Qian , Jisu Hu , Bo Peng , Chen Geng , Bin Dai , He Huang , Wenbin Zhang , Yakang Dai

Neural Netw

School of Biomedical Engineering (Suzhou), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230026, China; The Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Science, Suzhou 215163, China. Electronic address:

Published: August 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Segmentation of multiple targets of varying sizes within medical images is of significant importance for the diagnosis of disease and pathological research. Transformer-based methods are emerging in the medical image segmentation, leveraging the powerful yet computationally intensive self-attention mechanism. A variety of attention mechanisms have been proposed to reduce computation at the cost of accuracy loss, utilizing handcrafted patterns within local or artificially defined receptive fields. Furthermore, the common region-based loss functions are insufficient for guiding the transformer to focus on tissue regions, resulting in their unsuitability for the segmentation of tissues with intricate boundaries. This paper presents the development of a bi-level sparse attention network and a narrow band (NB) loss function for the accurate and efficient multi-target segmentation of medical images. In particular, we introduce a bi-level sparse attention module (BSAM) and formulate a segmentation network based on this module. The BSAM consists of coarse-grained patch-level attention and fine-grained pixel-level attention, which captures fine-grained contextual features in adaptive receptive fields learned by patch-level attention. This results in enhanced segmentation accuracy while simultaneously reducing computational complexity. The proposed narrow-band (NB) loss function constructs a target region in close proximity to the tissue boundary. The network is thus guided to perform boundary-aware segmentation, thereby simultaneously alleviating the issues of over-segmentation and under-segmentation. A series of comprehensive experiments on whole brains, brain tumors and abdominal organs, demonstrate that our method outperforms other state-of-the-art segmentation methods. Furthermore, the BSAM and NB loss can be applied flexibly to a variety of network frameworks.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.neunet.2025.107431	DOI Listing

Publication Analysis

Top Keywords

bi-level sparse

sparse attention

segmentation

attention network

narrow band

band loss

medical image

image segmentation

medical images

receptive fields

Similar Publications

Studying the performance of YOLOv11 incorporating DHSA BRA and PPA modules in railway track fasteners defect detection.

Sci Rep

July 2025

College of Engineering, Zhejiang Normal University, Yingbin Avenue, Jinhua, 321005, China.

Chengwei Zhang , Jiawei Zhu , Yihao Ma , Qingmei Huang

With the development of railway transportation and the advancement of deep learning, object detection algorithms are increasingly replacing manual inspection of track fasteners. However, current algorithms struggle with low accuracy in complex weather conditions or low-contrast backgrounds. To address this, we propose a track fastener defect detection algorithm based on YOLOv11 (You Only Look Once).

View Article and Find Full Text PDF

Similar Publications

MFFBi-Unet: Merging Dynamic Sparse Attention and Multi-scale Feature Fusion for Medical Image Segmentation.

Interdiscip Sci

July 2025

School of Computer Science and Technology, Tiangong University, Tianjin, 300387, China.

Baoshan Sun , Chunfei Liu , Qiuyan Wang , Kaiyu Bi , Wenxue Zhang

The advancement of deep learning has driven extensive research validating the effectiveness of U-Net-style symmetric encoder-decoder architectures based on Transformers for medical image segmentation. However, the inherent design requiring attention mechanisms to compute token affinities across all spatial locations leads to prohibitive computational complexity and substantial memory demands. Recent efforts have attempted to address these limitations through sparse attention mechanisms.

View Article and Find Full Text PDF

Similar Publications

Combining convolutional neural network with transformer to improve YOLOv7 for gas plume detection and segmentation in multibeam water column images.

PeerJ Comput Sci

May 2025

School of Marine Technology and Geomatics, Jiangsu Ocean University, Lianyungang, China.

Wenguang Chen , Xiao Wang , Junjie Chen , Jialong Sun , Guozhen Zha

Multibeam bathymetry has become an effective underwater target detection method by using echo signals to generate a high-resolution water column image (WCI). However, the gas plume in the image is often affected by the seafloor environment and exhibits sparse texture and changing motion, making traditional detection and segmentation methods more time-consuming and labor-intensive. The emergence of convolutional neural networks (CNNs) alleviates this problem, but the local feature extraction of the convolutional operations, while capturing detailed information well, cannot adapt to the elongated morphology of the gas plume target, limiting the improvement of the detection and segmentation accuracy.

View Article and Find Full Text PDF

Similar Publications

BioTransX: A novel bi-former based hybrid model with bi-level routing attention for brain tumor classification with explainable insights.

Comput Biol Med

September 2025

Computer Science & Engineering, Maulana Azad National Institute of Technology, Bhopal, 462003, M.P, India.

Reenu Rajpoot , Sweta Jain , Vijay Bhaskar Semwal

Brain tumors, known for their life-threatening implications, underscore the urgency of precise and interpretable early detection. Expertise remains essential for accurate identification through MRI scans due to the intricacies involved. However, the growing recognition of automated detection systems holds the potential to enhance accuracy and improve interpretability.

View Article and Find Full Text PDF

Similar Publications

A novel method of BiFormer with temporal-spatial characteristics for ECG-based PVC detection.

Front Physiol

May 2025

The First Clinical Medical College, Heilongjiang University of Chinese Medicine, Harbin, China.

Siyuan Chen , Zhen Wang , Hao Wang , Shuai Wang , Yang Li

Introduction: Premature Ventricular Contractions (PVCs) can be warning signs for serious cardiac conditions, and early detection is essential for preventing complications. The use of deep learning models in electrocardiogram (ECG) analysis has aided more accurate and efficient PVC identification. These models automatically extract and analyze complex signal features, providing valuable clinical decision-making support.

View Article and Find Full Text PDF

Similar Publications