EMCAH-Net: an effective multi-scale context aggregation hybrid network for medical image segmentation.

Yu Jin , Rui Tian , Qian Yu , Yu Bai , Guoqing Chao , Danqing Liu , Yanhui Guo

Quant Imaging Med Surg

School of Data and Computer Science, Shandong Women's University, Jinan, China.

Published: April 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Background: Pixel-level medical image segmentation tasks are challenging due to factors such as variable target scales, complex geometric shapes, and low contrast. Although U-shaped hybrid networks have demonstrated strong performance, existing models often fail to effectively integrate the local features captured by convolutional neural networks (CNNs) with the global features provided by Transformers. Moreover, their self-attention mechanisms often lack adequate emphasis on critical spatial and channel information. To address these challenges, our goal was to develop a hybrid deep learning model that can effectively and robustly segment medical images, including but not limited to computed tomography (CT) and magnetic resonance (MR) images.

Methods: We propose an effective hybrid U-shaped network, named the effective multi-scale context aggregation hybrid network (EMCAH-Net). It integrates an effective multi-scale context aggregation (EMCA) block in the backbone, along with a dual-attention augmented self-attention (DASA) block embedded in the skip connections and bottleneck layers. Aimed at the characteristics of medical images, the former block focuses on fine-grained local multi-scale feature encoding, whereas the latter enhances global representation learning by adaptively combining spatial and channel attention with self-attention. This approach not only effectively integrates local multi-scale and global features but also reinforces skip connections, thereby highlighting segmentation targets and precisely delineating boundaries. The code is publicly available at https://github.com/AloneIsland/EMCAH-Net.

Results: Compared to previous state-of-the-art (SOTA) methods, the EMCAH-Net achieves outstanding performance in medical image segmentation, with Dice similarity coefficient (DSC) scores of 84.73% (+2.85), 92.33% (+0.27), and 82.47% (+0.76) on the Synapse, automated cardiac diagnosis challenge (ACDC), and digital retinal images for vessel extraction (DRIVE) datasets, respectively. Additionally, it maintains computational efficiency in terms of model parameters and floating point operations (FLOPs). For instance, EMCAH-Net surpasses TransUNet on the Synapse dataset by 7.25% in DSC while requiring only 25% of the parameters and 71% of the FLOPs.

Conclusions: EMCAH-Net has demonstrated significant advantages in segmenting multi-scale, small, and boundary-blurred features in medical images. Extensive experiments on abdominal multi-organ, cardiac, and retinal vessel medical segmentation tasks confirm that EMCAH-Net surpasses previous methods, including pure CNN, pure Transformer, and hybrid architectures.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11994538	PMC
http://dx.doi.org/10.21037/qims-24-1983	DOI Listing

Publication Analysis

Top Keywords

effective multi-scale

multi-scale context

context aggregation

medical image

image segmentation

medical images

aggregation hybrid

hybrid network

segmentation tasks

global features

Similar Publications

Medial parapatellar surgical approach leads to greater loss of postural sway complexity compared to mid-vastus approach in women undergoing total knee arthroplasty.

Knee Surg Sports Traumatol Arthrosc

September 2025

Biomechanics Laboratory, School of Physical Education & Sport Science at Thessaloniki, Aristotle University of Thessaloniki, Thessaloniki, Greece.

Vasileios Mylonas , Stylianos Grigoriadis , Dimitris Metaxiotis , Eleftherios Kellis , Nick Stergiou

Purpose: Total knee arthroplasty (TKA) is associated with acute postoperative effects that increase the risk of falls. These effects differ between the medial parapatellar (PP) and mid-vastus (MV) surgical techniques but have not been evaluated in terms of postural sway complexity. Loss of this complexity leads to increased randomness in the center of pressure and higher fall risk.

View Article and Find Full Text PDF

Similar Publications

Effects of ultrasonication on the emulsification behavior and nutritional properties of myofibrillar protein-tussah pupa (Antheraea pernyi) protein complexes: A multiscale investigation.

Food Res Int

November 2025

College of Food Science, Shenyang Agricultural University, Shenyang, Liaoning 110866, PR China. Electronic address:

Yang Gao , Xiaoliang Liu , Jingxin Sun , Jun-Hua Shao

Tussah pupa protein (TPP), rich in diverse bioactive components and demonstrating extensive physiological activities, has attracted attention in food processing. However, its limited emulsion stability restricts application potential, requiring improvement of techno-functional properties. The effects of myofibrillar protein (MP) compounding coupled with ultrasonic treatment on the emulsifying properties and nutritional value of TPP were systematically investigated from a multi-scale perspective in this study.

View Article and Find Full Text PDF

Similar Publications

Cerebrovascular Segmentation Network Based on Fast Fourier Convolution and Mamba.

Biomed Phys Eng Express

September 2025

College of Computer Science and Technology, China University of Petroleum East China - Qingdao Campus, College of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, China, Qingdao, Shandong, 266580, CHINA.

Chaozhi Yang , Mingzhe Cao , Jinbao Zhu , Peigang Liu , Sibo Qiao

Purpose: Cerebrovascular segmentation is crucial for the diagnosis and treatment of cerebrovascular diseases. However, accurately extracting cerebral vessels from Time-of-Flight Magnetic Resonance Angiography (TOF-MRA) remains challenging due to the topological complexity and anatomical variability.

Methods: This paper presents a novel Y-shaped segmentation network with fast Fourier convolution and Mamba, termed F-Mamba-YNet.

View Article and Find Full Text PDF

Similar Publications

Automated segmentation of retinal vessel using HarDNet fully convolutional networks.

PLoS One

September 2025

School of Medical Engineering, Xinxiang Medical University, Xinxiang, China.

Yuanpei Zhu , Yong Liu , Xuezhi Zhou

Computer-aided diagnostic (CAD) systems for color fundus images play a critical role in the early detection of fundus diseases, including diabetes, hypertension, and cerebrovascular disorders. Although deep learning has substantially advanced automatic segmentation techniques in this field, several challenges persist, such as limited labeled datasets, significant structural variations in blood vessels, and persistent dataset discrepancies, which continue to hinder progress. These challenges lead to inconsistent segmentation performance, particularly for small vessels and branch regions.

View Article and Find Full Text PDF

Similar Publications

Rapid in-silico Battery Electrolyte Electrochemical Reaction Generation using 3T-VASP Multi-Scale Energy Minimization.

J Vis Exp

August 2025

Tencent Quantum Laboratory;

Jonathan P Mailoa , Xin Li , Zhengmi Tang , Jineng Ren , Mingyang Ni

Electrolytes are important components in lithium-ion batteries. However, battery degradation due to irreversible electrochemical reactions in the electrolyte can consume electrolyte molecules and severely reduce its effective operation lifetime. It is hence important to study the electrochemical reaction pathways in the battery electrolyte to further improve lithium-ion battery reliability.

View Article and Find Full Text PDF

Similar Publications