98%
921
2 minutes
20
Photoacoustic tomography (PAT) combines the high spatial resolution of ultrasound imaging with the high contrast of optical imaging. To reduce acquisition time and lower the cost of photoacoustic imaging, sparse sampling strategy is often employed. Conventional reconstruction methods often produce artifacts when dealing with sparse data, affecting image quality and diagnostic accuracy. This paper proposes a Residual-Conditioned Sparse Transformer (RCST) network for reducing artifacts in photoacoustic images, aiming to enhance image quality under sparse sampling. By introducing residual prior information, our algorithm encodes and embeds it into local enhancement and detail recovery stages. We utilize sparse transformer blocks to identify and reduce artifacts while preserving key structures and details of the images. Experiments on multiple simulated and experimental datasets demonstrate that our method significantly suppresses artifacts and improves image quality, offering new possibilities for the application of photoacoustic imaging in biomedical research and clinical diagnostics.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12164018 | PMC |
http://dx.doi.org/10.1016/j.pacs.2025.100731 | DOI Listing |
IEEE Trans Pattern Anal Mach Intell
September 2025
Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific vision tasks. Yet, existing methods either employ complex spatial-temporal modules or rely heavily on additional perception models to extract temporal features for video understanding, performing well only on short videos. For long videos, the computational complexity and memory costs associated with long-term temporal connections are significantly increased, posing additional challenges.
View Article and Find Full Text PDFMed Phys
September 2025
Research Center for Medical AI, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
Background: Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) plays a crucial role in the diagnosis and monitoring of cancers, as it reveals physiological and vascular characteristics of tumors. Traditional pharmacokinetic modeling necessitates high temporal resolution, resulting in relatively low signal-to-noise ratio (SNR) and spatial resolution with limited allocated time for each phase.
Purpose: To explore the feasibility of using deep learning with sparse DCE MRI phases to generate dense temporal resolution DCE-MRI-derived parametric map.
Nat Commun
August 2025
School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, AZ, USA.
In applications, an anticipated issue is where the system of interest has never been encountered before and sparse observations can be made only once. Can the dynamics be faithfully reconstructed? We address this challenge by developing a hybrid transformer and reservoir-computing scheme. The transformer is trained without using data from the target system, but with essentially unlimited synthetic data from known chaotic systems.
View Article and Find Full Text PDFJ Imaging
August 2025
School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510630, China.
Point cloud imaging technology usually faces the problem of point cloud sparsity, which leads to a lack of important geometric detail. There are many point cloud upsampling networks that have been designed to solve this problem. However, the existing methods have limitations in local-global relation understanding, leading to contour distortion and many local sparse regions.
View Article and Find Full Text PDFNeural Netw
August 2025
College of Computer Science and Technology, National University of Defense Technology, Changsha, 410073, China. Electronic address:
The Segment Anything Model (SAM) has gained significant attention for its impressive performance in image segmentation. However, it lacks proficiency in referring video object segmentation (RVOS) due to the need for precise user-interactive prompts and a limited understanding of different modalities, such as language and vision. This paper presents the RefSAM model, which explores the potential of SAM for RVOS by incorporating multi-view information from diverse modalities and successive frames at different timestamps in an online manner.
View Article and Find Full Text PDF