Hierarchical agent transformer network for COVID-19 infection segmentation.

Yi Tian, Qi Mao, Wenfeng Wang, Yan Zhang

Biomed Phys Eng Express

College of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai, 201620, People's Republic of China.

Published: March 2025

Accurate and timely segmentation of COVID-19 infection regions is critical for effective diagnosis and treatment. While convolutional neural networks (CNNs) perform strongly in medical image segmentation, they struggle with complex lesion morphologies that have irregular boundaries. Transformer-based approaches capture global context better, but they suffer from high computational cost and suboptimal multi-scale feature integration. To address these limitations, we propose the Hierarchical Agent Transformer Network (HATNet), a hierarchical encoder-bridge-decoder architecture that balances segmentation accuracy with computational efficiency. The encoder employs novel agent Transformer blocks designed to capture the subtle features of small COVID-19 lesions through agent tokens with linear computational complexity. A diversity restoration module (DRM) is embedded within each agent Transformer block to counteract feature degradation. The hierarchical structure simultaneously extracts high-resolution shallow features and low-resolution fine features, ensuring comprehensive feature representation. The bridge stage incorporates an improved pyramid pooling module (IPPM) that establishes hierarchical global priors, improving contextual understanding for the decoder. The decoder integrates a full-scale bidirectional feature pyramid network (FsBiFPN) with a dedicated border-refinement module (BRM), which together enhance edge precision. HATNet was evaluated on the COVID-19-CT-Seg and CC-CCII datasets. It achieved Dice scores of 84.14% and 81.22%, respectively, outperforming state-of-the-art models in segmentation. It also showed notable advantages in parameter count and computational complexity, highlighting its potential for clinical deployment.
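To make two ideas from the abstract concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes the agent Transformer block follows the general agent-attention pattern, in which a small set of agent tokens first aggregates information from the keys and values and then broadcasts it back to the queries, so the cost grows linearly with the number of tokens. Forming the agent tokens by average-pooling the queries, and the names `agent_attention`, `dice_score`, and `num_agents`, are illustrative assumptions; the DRM, IPPM, FsBiFPN, and BRM components are not modeled. The Dice function shows the standard overlap metric behind the reported scores.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def agent_attention(q, k, v, num_agents):
    """Single-head agent attention sketch (hypothetical helper).

    q, k: (n, d) arrays; v: (n, d_v) array.
    Cost is O(n * num_agents * d) rather than O(n^2 * d).
    """
    n, d = q.shape
    if not 1 <= num_agents <= n:
        raise ValueError("num_agents must be between 1 and the number of tokens")
    # Assumption: agent tokens are average-pooled groups of consecutive queries.
    groups = np.array_split(np.arange(n), num_agents)
    agents = np.stack([q[idx].mean(axis=0) for idx in groups])  # (m, d)
    scale = 1.0 / np.sqrt(d)
    # Agent aggregation: each agent attends over all keys and gathers values.
    agent_values = softmax(agents @ k.T * scale) @ v  # (m, d_v)
    # Agent broadcast: each query attends over the agents only.
    return softmax(q @ agents.T * scale) @ agent_values  # (n, d_v)


def dice_score(pred, target, eps=1e-7):
    """Dice coefficient between two binary masks of the same shape."""
    pred = np.asarray(pred).astype(bool)
    target = np.asarray(target).astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q, k, v = (rng.standard_normal((16, 8)) for _ in range(3))
    out = agent_attention(q, k, v, num_agents=4)
    print(out.shape)
    print(dice_score([[1, 1], [0, 0]], [[1, 0], [0, 0]]))
```

With `n` tokens and `m` agents, neither attention map is larger than `m × n`, which is where the linear scaling comes from when `m` is held fixed.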

DOI: http://dx.doi.org/10.1088/2057-1976/adbafa

Similar Publications

ASReview LAB v.2: Open-source text screening with multiple agents and a crowd of experts.

Patterns (N Y)

July 2025

Department of Methodology and Statistics, Faculty of Social and Behavioral Sciences, Utrecht University, Utrecht, the Netherlands.

Jonathan de Bruin, Peter Lombaers, Casper Kaandorp, Jelle Teijema, Timo van der Kuil

ASReview LAB v.2 introduces an advancement in AI-assisted systematic reviewing by enabling collaborative screening with multiple experts ("a crowd of oracles") using a shared AI model. The platform supports multiple AI agents within the same project, allowing users to switch between fast general-purpose models and domain-specific, semantic, or multilingual transformer models.


RegGAN-based contrast-free CT enhances esophageal cancer assessment: multicenter validation of automated tumor segmentation and T-staging.

Radiol Med

September 2025

Department of Medical Oncology, The Second People's Hospital of Hefei, Hefei, China.

Xiaoyu Huang, Weihang Li, Yaru Wang, Qibing Wu, Ping Li

Purpose: This study aimed to develop a deep learning (DL) framework using registration-guided generative adversarial networks (RegGAN) to synthesize contrast-enhanced CT (Syn-CECT) from non-contrast CT (NCCT), enabling iodine-free esophageal cancer (EC) T-staging.

Methods: A retrospective multicenter analysis included 1,092 EC patients (2013-2024) divided into training (N = 313), internal (N = 117), and external test cohorts (N = 116 and N = 546). RegGAN synthesized Syn-CECT by integrating registration and adversarial training to address NCCT-CECT misalignment.


Predictive design of crystallographic chiral separation.

Nat Commun

August 2025

Department of Physics, University of Cambridge, Cambridge, UK.

Rokas Elijošius, Emma King-Smith, Felix A Faber, Louise Bernier, Simon Berritt

The efficient separation of chiral molecules is a fundamental challenge in the manufacture of pharmaceuticals and light-polarising materials. We developed an approach that combines machine learning with a physics-based representation to predict resolving agents for chiral molecules, using a transformer-based neural network. In retrospective tests, our approach reaches a four- to six-fold improvement over the historical, trial-and-error-based hit rate.


All-in-one Biocomputing Nanoagents with Multilayered Transformable Architecture based on DNA Interfaces.

Theranostics

August 2025

Moscow Center for Advanced Studies, 20 Kulakova St, 123592, Moscow, Russia.

Vladimir R Cherkasov, Elizaveta N Mochalova, Andrey V Babenyshev, Maxim P Nikitin

The diversity of ways in which pathogens infiltrate the host organism highlights the demand for equally sophisticated mechanisms for their prevention. The development of "intelligent" agents with molecular logic capabilities holds great promise, but their full theranostic potential has yet to be realized. The original concept of nanoagents based on the "Biocomputing based on particle disassembly" technology has been extended to nucleic acid (NA) interfaces and inputs.


Language models for drug-drug interactions: current applications, pitfalls, and future directions.

Expert Opin Drug Metab Toxicol

August 2025

College of Pharmacy, Al Ain University, Abu Dhabi, United Arab Emirates.

Ahmad Z Al Meslamani, Abdallah Abou Hajal

Introduction: Advanced artificial intelligence (AI) frameworks, particularly large language models (LLMs), have recently attracted attention for automating drug-drug interaction (DDI) extraction and prediction tasks. However, there is a scarcity of reviews on how LLMs can rapidly identify known and novel DDIs.

Areas Covered: This review summarizes the state of LLM-based DDI extraction and prediction, based on a broad literature search from PubMed, Embase, Web of Science, Scopus, IEEE Xplore, the Cochrane Library, ACM Digital Library, Google Scholar, and Semantic Scholar published between January 2000 and February 2025.
