Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Pixel-level medical image segmentation tasks are challenging due to factors such as variable target scales, complex geometric shapes, and low contrast. Although U-shaped hybrid networks have demonstrated strong performance, existing models often fail to effectively integrate the local features captured by convolutional neural networks (CNNs) with the global features provided by Transformers. Moreover, their self-attention mechanisms often lack adequate emphasis on critical spatial and channel information. To address these challenges, our goal was to develop a hybrid deep learning model that can effectively and robustly segment medical images, including but not limited to computed tomography (CT) and magnetic resonance (MR) images.

Methods: We propose an effective hybrid U-shaped network, named the effective multi-scale context aggregation hybrid network (EMCAH-Net). It integrates an effective multi-scale context aggregation (EMCA) block in the backbone, along with a dual-attention augmented self-attention (DASA) block embedded in the skip connections and bottleneck layers. Aimed at the characteristics of medical images, the former block focuses on fine-grained local multi-scale feature encoding, whereas the latter enhances global representation learning by adaptively combining spatial and channel attention with self-attention. This approach not only effectively integrates local multi-scale and global features but also reinforces skip connections, thereby highlighting segmentation targets and precisely delineating boundaries. The code is publicly available at https://github.com/AloneIsland/EMCAH-Net.

Results: Compared to previous state-of-the-art (SOTA) methods, the EMCAH-Net achieves outstanding performance in medical image segmentation, with Dice similarity coefficient (DSC) scores of 84.73% (+2.85), 92.33% (+0.27), and 82.47% (+0.76) on the Synapse, automated cardiac diagnosis challenge (ACDC), and digital retinal images for vessel extraction (DRIVE) datasets, respectively. Additionally, it maintains computational efficiency in terms of model parameters and floating point operations (FLOPs). For instance, EMCAH-Net surpasses TransUNet on the Synapse dataset by 7.25% in DSC while requiring only 25% of the parameters and 71% of the FLOPs.

Conclusions: EMCAH-Net has demonstrated significant advantages in segmenting multi-scale, small, and boundary-blurred features in medical images. Extensive experiments on abdominal multi-organ, cardiac, and retinal vessel medical segmentation tasks confirm that EMCAH-Net surpasses previous methods, including pure CNN, pure Transformer, and hybrid architectures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11994538PMC
http://dx.doi.org/10.21037/qims-24-1983DOI Listing

Publication Analysis

Top Keywords

effective multi-scale
12
multi-scale context
12
context aggregation
12
medical image
12
image segmentation
12
medical images
12
aggregation hybrid
8
hybrid network
8
segmentation tasks
8
global features
8

Similar Publications

Purpose: Total knee arthroplasty (TKA) is associated with acute postoperative effects that increase the risk of falls. These effects differ between the medial parapatellar (PP) and mid-vastus (MV) surgical techniques but have not been evaluated in terms of postural sway complexity. Loss of this complexity leads to increased randomness in the center of pressure and higher fall risk.

View Article and Find Full Text PDF

Tussah pupa protein (TPP), rich in diverse bioactive components and demonstrating extensive physiological activities, has attracted attention in food processing. However, its limited emulsion stability restricts application potential, requiring improvement of techno-functional properties. The effects of myofibrillar protein (MP) compounding coupled with ultrasonic treatment on the emulsifying properties and nutritional value of TPP were systematically investigated from a multi-scale perspective in this study.

View Article and Find Full Text PDF

Cerebrovascular Segmentation Network Based on Fast Fourier Convolution and Mamba.

Biomed Phys Eng Express

September 2025

College of Computer Science and Technology, China University of Petroleum East China - Qingdao Campus, College of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, China, Qingdao, Shandong, 266580, CHINA.

Purpose: Cerebrovascular segmentation is crucial for the diagnosis and treatment of cerebrovascular diseases. However, accurately extracting cerebral vessels from Time-of-Flight Magnetic Resonance Angiography (TOF-MRA) remains challenging due to the topological complexity and anatomical variability.

Methods: This paper presents a novel Y-shaped segmentation network with fast Fourier convolution and Mamba, termed F-Mamba-YNet.

View Article and Find Full Text PDF

Computer-aided diagnostic (CAD) systems for color fundus images play a critical role in the early detection of fundus diseases, including diabetes, hypertension, and cerebrovascular disorders. Although deep learning has substantially advanced automatic segmentation techniques in this field, several challenges persist, such as limited labeled datasets, significant structural variations in blood vessels, and persistent dataset discrepancies, which continue to hinder progress. These challenges lead to inconsistent segmentation performance, particularly for small vessels and branch regions.

View Article and Find Full Text PDF

Electrolytes are important components in lithium-ion batteries. However, battery degradation due to irreversible electrochemical reactions in the electrolyte can consume electrolyte molecules and severely reduce its effective operation lifetime. It is hence important to study the electrochemical reaction pathways in the battery electrolyte to further improve lithium-ion battery reliability.

View Article and Find Full Text PDF