Residual-conditioned sparse transformer for photoacoustic image artifact reduction.

Xiaoxue Wang , Jinzhuang Xu , Chenglong Zhang , Moritz Wildgruber , Wenjing Jiang , Lili Wang , Xiaopeng Ma

Photoacoustics

School of Control Science and Engineering, Shandong University, 250061, Jinan Shandong, China.

Published: August 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Photoacoustic tomography (PAT) combines the high spatial resolution of ultrasound imaging with the high contrast of optical imaging. To reduce acquisition time and lower the cost of photoacoustic imaging, sparse sampling strategy is often employed. Conventional reconstruction methods often produce artifacts when dealing with sparse data, affecting image quality and diagnostic accuracy. This paper proposes a Residual-Conditioned Sparse Transformer (RCST) network for reducing artifacts in photoacoustic images, aiming to enhance image quality under sparse sampling. By introducing residual prior information, our algorithm encodes and embeds it into local enhancement and detail recovery stages. We utilize sparse transformer blocks to identify and reduce artifacts while preserving key structures and details of the images. Experiments on multiple simulated and experimental datasets demonstrate that our method significantly suppresses artifacts and improves image quality, offering new possibilities for the application of photoacoustic imaging in biomedical research and clinical diagnostics.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12164018	PMC
http://dx.doi.org/10.1016/j.pacs.2025.100731	DOI Listing

Publication Analysis

Top Keywords

sparse transformer

image quality

residual-conditioned sparse

photoacoustic imaging

sparse sampling

photoacoustic

sparse

transformer photoacoustic

image

photoacoustic image

Similar Publications

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering.

IEEE Trans Pattern Anal Mach Intell

September 2025

Enxin Song , Wenhao Chai , Tian Ye , Jenq-Neng Hwang , Xi Li

Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific vision tasks. Yet, existing methods either employ complex spatial-temporal modules or rely heavily on additional perception models to extract temporal features for video understanding, performing well only on short videos. For long videos, the computational complexity and memory costs associated with long-term temporal connections are significantly increased, posing additional challenges.

View Article and Find Full Text PDF

Similar Publications

Pixel-level transformer GAN for enhanced parametric mapping of DCE MRI analysis.

Med Phys

September 2025

Research Center for Medical AI, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.

Yuxi Jin , Gengjia Lin , Qian Yang , Zixiang Chen , Haizhou Liu

Background: Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) plays a crucial role in the diagnosis and monitoring of cancers, as it reveals physiological and vascular characteristics of tumors. Traditional pharmacokinetic modeling necessitates high temporal resolution, resulting in relatively low signal-to-noise ratio (SNR) and spatial resolution with limited allocated time for each phase.

Purpose: To explore the feasibility of using deep learning with sparse DCE MRI phases to generate dense temporal resolution DCE-MRI-derived parametric map.

View Article and Find Full Text PDF

Similar Publications

Bridging known and unknown dynamics by transformer-based machine-learning inference from sparse observations.

Nat Commun

August 2025

School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, AZ, USA.

Zheng-Meng Zhai , Benjamin D Stern , Ying-Cheng Lai

In applications, an anticipated issue is where the system of interest has never been encountered before and sparse observations can be made only once. Can the dynamics be faithfully reconstructed? We address this challenge by developing a hybrid transformer and reservoir-computing scheme. The transformer is trained without using data from the target system, but with essentially unlimited synthetic data from known chaotic systems.

View Article and Find Full Text PDF

Similar Publications

PU-DZMS: Point Cloud Upsampling via Dense Zoom Encoder and Multi-Scale Complementary Regression.

J Imaging

August 2025

School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510630, China.

Shucong Li , Zhenyu Liu , Tianlei Wang , Zhiheng Zhou

Point cloud imaging technology usually faces the problem of point cloud sparsity, which leads to a lack of important geometric detail. There are many point cloud upsampling networks that have been designed to solve this problem. However, the existing methods have limitations in local-global relation understanding, leading to contour distortion and many local sparse regions.

View Article and Find Full Text PDF

Similar Publications

RefSAM: Efficiently adapting segmenting anything model for referring video object segmentation.

Neural Netw

August 2025

College of Computer Science and Technology, National University of Defense Technology, Changsha, 410073, China. Electronic address:

Yonglin Li , Jing Zhang , Xiao Teng , Haoyu Zhang , Xinwang Liu

The Segment Anything Model (SAM) has gained significant attention for its impressive performance in image segmentation. However, it lacks proficiency in referring video object segmentation (RVOS) due to the need for precise user-interactive prompts and a limited understanding of different modalities, such as language and vision. This paper presents the RefSAM model, which explores the potential of SAM for RVOS by incorporating multi-view information from diverse modalities and successive frames at different timestamps in an online manner.

View Article and Find Full Text PDF

Similar Publications