TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.

Pengfei Song , Jinjiang Li , Hui Fan , Linwei Fan

Comput Biol Med

School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, Shandong, 250014, China.

Published: December 2023

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Accurate and automatic segmentation of medical images is a key step in clinical diagnosis and analysis. Currently, the successful application of Transformers' model in the field of computer vision, researchers have begun to gradually explore the application of Transformers in medical segmentation of images, especially in combination with convolutional neural networks with coding-decoding structure, which have achieved remarkable results in the field of medical segmentation. However, most studies have combined Transformers with CNNs at a single scale or processed only the highest-level semantic feature information, ignoring the rich location information in the lower-level semantic feature information. At the same time, for problems such as blurred structural boundaries and heterogeneous textures in images, most existing methods usually simply connect contour information to capture the boundaries of the target. However, these methods cannot capture the precise outline of the target and ignore the potential relationship between the boundary and the region. In this paper, we propose the TGDAUNet, which consists of a dual-branch backbone network of CNNs and Transformers and a parallel attention mechanism, to achieve accurate segmentation of lesions in medical images. Firstly, high-level semantic feature information of the CNN backbone branches is fused at multiple scales, and the high-level and low-level feature information complement each other's location and spatial information. We further use the polarised self-attentive (PSA) module to reduce the impact of redundant information caused by multiple scales, to better couple with the feature information extracted from the Transformers backbone branch, and to establish global contextual long-range dependencies at multiple scales. In addition, we have designed the Reverse Graph-reasoned Fusion (RGF) module and the Feature Aggregation (FA) module to jointly guide the global context. The FA module aggregates high-level semantic feature information to generate an original global predictive segmentation map. The RGF module captures non-significant features of the boundaries in the original or secondary global prediction segmentation graph through a reverse attention mechanism, establishing a graph reasoning module to explore the potential semantic relationships between boundaries and regions, further refining the target boundaries. Finally, to validate the effectiveness of our proposed method, we compare our proposed method with the current popular methods in the CVC-ClinicDB, Kvasir-SEG, ETIS, CVC-ColonDB, CVC-300,datasets as well as the skin cancer segmentation datasets ISIC-2016 and ISIC-2017. The large number of experimental results show that our method outperforms the currently popular methods. Source code is released at https://github.com/sd-spf/TGDAUNet.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.compbiomed.2023.107583	DOI Listing

Publication Analysis

Top Keywords

semantic feature

multiple scales

segmentation

medical images

medical segmentation

attention mechanism

high-level semantic

rgf module

proposed method

popular methods

Similar Publications

CASHNet: Context-Aware Semantics-driven Hierarchical Network for Hybrid Diffeomorphic CT-CBCT Image Registration.

IEEE Trans Med Imaging

September 2025

Xiaoru Gao , Housheng Xie , Donghua Hang , Guoyan Zheng

Computed Tomography (CT) to Cone-Beam Computed Tomography (CBCT) image registration is crucial for image-guided radiotherapy and surgical procedures. However, achieving accurate CT-CBCT registration remains challenging due to various factors such as inconsistent intensities, low contrast resolution and imaging artifacts. In this study, we propose a Context-Aware Semantics-driven Hierarchical Network (referred to as CASHNet), which hierarchically integrates context-aware semantics-encoded features into a coarse-to-fine registration scheme, to explicitly enhance semantic structural perception during progressive alignment.

View Article and Find Full Text PDF

Similar Publications

Incremental Learning for Defect Segmentation With Efficient Transformer Semantic Complement.

IEEE Trans Neural Netw Learn Syst

September 2025

Xiqi Li , Zhifu Huang , Ge Ma , Yu Liu

In industrial scenarios, semantic segmentation of surface defects is vital for identifying, localizing, and delineating defects. However, new defect types constantly emerge with product iterations or process updates. Existing defect segmentation models lack incremental learning capabilities, and direct fine-tuning (FT) often leads to catastrophic forgetting.

View Article and Find Full Text PDF

Similar Publications

FmH2ST: foundation model-based spatial transcriptomics generation from histological images.

Nucleic Acids Res

September 2025

School of Software, Shandong University, Jinan 250101, Shandong, China.

Yuequn Wang , Jun Wang , Yanyu Xu , Ning Liu , Bin Liu

Spatial transcriptomics (ST) reveals gene expression distributions within tissues. Yet, predicting spatial gene expression from histological images still faces the challenges of limited ST data that lack prior knowledge, and insufficient capturing of inter-slice heterogeneity and intra-slice complexity. To tackle these challenges, we introduce FmH2ST, a foundation model-based method for spatial gene expression prediction.

View Article and Find Full Text PDF

Similar Publications

EXPRESS: Influence of word Age-of-Acquisition (AoA), vocabulary size, formal-lexical similarity, and semantic richness of words on lexical recognition and production: A study on foreign-word training.

Q J Exp Psychol (Hove)

September 2025

Psychology Department, Swansea University, Swansea, UK.

Miguel Á Pérez-Sánchez , Lidia Gómez-Cobos , Javier Marín , Hans Stadthagen-Gonzalez , Cristina Izura

A distinctive feature of the lexicon is its susceptibility to the order in which words are acquired; those learned earlier are accessed and retrieved more quickly than those acquired later-a phenomenon known as the age of acquisition (AoA) effect. This study investigates how vocabulary size (i.e.

View Article and Find Full Text PDF

Similar Publications

Cherry-Net: real-time segmentation algorithm of cherry maturity based on improved PIDNet.

Front Plant Sci

September 2025

College of Big Data, Yunnan Agricultural University, Kunming, China.

Jie Cui , Lilian Zhang , Lutao Gao , Chunhui Bai , Linnan Yang

Introduction: Accurate identification of cherry maturity and precise detection of harvestable cherry contours are essential for the development of cherry-picking robots. However, occlusion, lighting variation, and blurriness in natural orchard environments present significant challenges for real-time semantic segmentation.

Methods: To address these issues, we propose a machine vision approach based on the PIDNet real-time semantic segmentation framework.

View Article and Find Full Text PDF

Similar Publications