Hierarchical agent transformer network for COVID-19 infection segmentation.

Biomed Phys Eng Express

College of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai, 201620, People's Republic of China.

Published: March 2025


Article Abstract

Accurate and timely segmentation of COVID-19 infection regions is critical for effective diagnosis and treatment. While convolutional neural networks (CNNs) exhibit strong performance in medical image segmentation, they face challenges in handling complex lesion morphologies with irregular boundaries. Transformer-based approaches, though demonstrating superior capability in capturing global context, suffer from high computational costs and suboptimal multi-scale feature integration. To address these limitations, we propose the Hierarchical Agent Transformer Network (HATNet), a hierarchical encoder-bridge-decoder architecture that balances segmentation accuracy with computational efficiency. The encoder employs novel agent Transformer blocks specifically designed to capture subtle features of small COVID-19 lesions through agent tokens with linear computational complexity. A diversity restoration module (DRM) is embedded within each agent Transformer block to counteract feature degradation. The hierarchical structure simultaneously extracts high-resolution shallow features and low-resolution fine features, ensuring comprehensive feature representation. The bridge stage incorporates an improved pyramid pooling module (IPPM) that establishes hierarchical global priors, significantly improving contextual understanding for the decoder. The decoder integrates a full-scale bidirectional feature pyramid network (FsBiFPN) with a dedicated border-refinement module (BRM), which together enhance edge precision. HATNet was evaluated on the COVID-19-CT-Seg and CC-CCII datasets. Experimental results yielded Dice scores of 84.14% and 81.22%, respectively, demonstrating superior segmentation performance compared to state-of-the-art models. Furthermore, it achieved notable advantages in model parameters and computational complexity, highlighting its clinical deployment potential.
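
The abstract attributes the encoder's linear computational complexity to agent tokens but does not give the formulation. The following is a minimal sketch of the general agent-attention idea, not the authors' implementation: a small set of agent tokens first aggregates information from all keys and values, then broadcasts it back to every query, so the cost grows with N x n rather than N x N (N tokens, n agents). The function names, the average-pooling used to form agents, and the single-head NumPy layout are all assumptions made for illustration; the DRM and other HATNet modules are not modeled.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def agent_attention(q, k, v, num_agents):
    """Single-head agent attention sketch (hypothetical helper).

    q, k, v: arrays of shape (N, d). Returns an array of shape (N, d).
    Cost scales as O(N * num_agents * d) instead of O(N^2 * d).
    """
    n_tokens, dim = q.shape
    # Assumption: agent tokens are formed by average-pooling the queries
    # into `num_agents` contiguous groups.
    groups = np.array_split(np.arange(n_tokens), num_agents)
    agents = np.stack([q[idx].mean(axis=0) for idx in groups])  # (n, d)

    scale = 1.0 / np.sqrt(dim)
    # Step 1 (aggregation): agents act as queries over all keys/values.
    agent_values = softmax(agents @ k.T * scale) @ v  # (n, d)
    # Step 2 (broadcast): original queries attend to the agents only.
    return softmax(q @ agents.T * scale) @ agent_values  # (N, d)


rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((64, 16)) for _ in range(3))
out = agent_attention(q, k, v, num_agents=8)
print(out.shape)  # (64, 16)
```

With `num_agents` held fixed, doubling the number of tokens doubles the size of both attention maps, whereas standard self-attention would quadruple it.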

Source
http://dx.doi.org/10.1088/2057-1976/adbafa

Similar Publications

ASReview LAB v.2: Open-source text screening with multiple agents and a crowd of experts.

Patterns (N Y)

July 2025

Department of Methodology and Statistics, Faculty of Social and Behavioral Sciences, Utrecht University, Utrecht, the Netherlands.

ASReview LAB v.2 introduces an advancement in AI-assisted systematic reviewing by enabling collaborative screening with multiple experts ("a crowd of oracles") using a shared AI model. The platform supports multiple AI agents within the same project, allowing users to switch between fast general-purpose models and domain-specific, semantic, or multilingual transformer models.

Purpose: This study aimed to develop a deep learning (DL) framework using registration-guided generative adversarial networks (RegGAN) to synthesize contrast-enhanced CT (Syn-CECT) from non-contrast CT (NCCT), enabling iodine-free esophageal cancer (EC) T-staging.

Methods: A retrospective multicenter analysis included 1,092 EC patients (2013-2024) divided into training (N = 313), internal (N = 117), and external test cohorts (N = 116 and N = 546). RegGAN synthesized Syn-CECT by integrating registration and adversarial training to address NCCT-CECT misalignment.

The efficient separation of chiral molecules is a fundamental challenge in the manufacture of pharmaceuticals and light-polarising materials. We developed an approach that combines machine learning with a physics-based representation to predict resolving agents for chiral molecules, using a transformer-based neural network. In retrospective tests, our approach reaches a four- to six-fold improvement over the historical hit rate obtained by trial and error.

The diversity of ways in which pathogens infiltrate the host organism highlights the demand for equally sophisticated mechanisms for their prevention. The development of "intelligent" agents with molecular logic capabilities holds great promise, but their full theranostic potential has yet to be realized. The original concept of nanoagents based on "Biocomputing based on particle disassembly" technology has been extended to nucleic acid (NA) interfaces and inputs.

Introduction: Advanced artificial intelligence (AI) frameworks, particularly large language models (LLMs), have recently attracted attention for automating drug-drug interaction (DDI) extraction and prediction tasks. However, there is a scarcity of reviews on how LLMs can rapidly identify known and novel DDIs.

Areas Covered: This review summarizes the state of LLM-based DDI extraction and prediction, based on a broad search of PubMed, Embase, Web of Science, Scopus, IEEE Xplore, the Cochrane Library, ACM Digital Library, Google Scholar, and Semantic Scholar for literature published between January 2000 and February 2025.
