Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Echocardiography is a widely used cardiac imaging modality in clinical practice. Physicians utilize echocardiography images to measure left ventricular volumes at end-diastole (ED) and end-systole (ES) frames, which are pivotal for calculating the ejection fraction and thus quantitatively assessing cardiac function. However, most existing approaches focus on features from ES frames and ED frames, neglecting the inter-frame correlations in unlabeled frames. Our model is based on an encoder-decoder architecture and consists of two modules: the Temporal Feature Fusion Module (TFFA) and the Vision Retentive Network (Vision RetNet) encoder. The TFFA leverages self-attention to learn inter-frame correlations across multiple consecutive frames and aggregates the features of the temporal-channel dimension through channel aggregation to highlight ambiguity regions. The Vision RetNet encoder introduces explicit spatial priors by constructing a spatial decay matrix using the Manhattan distance. We conducted experiments on the EchoNet-Dynamic dataset and the CAMUS dataset, where our proposed model demonstrates competitive performance. The experimental results indicate that spatial prior information and inter-frame correlations in echocardiography images can enhance the accuracy of semantic segmentation, and inter-frame correlations become even more effective when spatial priors are provided.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11946786PMC
http://dx.doi.org/10.3390/s25061909DOI Listing

Publication Analysis

Top Keywords

inter-frame correlations
16
vision retentive
8
retentive network
8
echocardiography images
8
vision retnet
8
retnet encoder
8
spatial priors
8
frames
5
temporal features-fused
4
vision
4

Similar Publications

Objective: The segmentation of ultrasound video objects aims to delineate specific anatomical structures or areas of injury in sequential ultrasound imaging data. Current methods exhibit promising results, but struggle with key aspects of ultrasound video analysis. They insufficiently capture inter-frame object motion, resulting in unsatisfactory segmentation for dynamic or low-contrast scenarios.

View Article and Find Full Text PDF

Background: Dark-field radiography of the human chest has been demonstrated to have promising potential for the analysis of the lung microstructure and the diagnosis of respiratory diseases. However, most previous studies of dark-field chest radiographs evaluated the lung signal only in the inspiratory breathing state.

Purpose: Our work aims to add a new perspective to these previous assessments by locally comparing dark-field lung information between different respiratory states to explore new ways of functional lung imaging based on dark-field chest radiography.

View Article and Find Full Text PDF

A new dataset and versatile multi-task surgical workflow analysis framework for thoracoscopic mitral valvuloplasty.

Med Image Anal

October 2025

Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong Special Administrative Region of China. Electronic address:

Surgical Workflow Analysis (SWA) on videos is critical for AI-assisted intelligent surgery. Existing SWA methods primarily focus on laparoscopic surgeries, while research on complex thoracoscopy-assisted cardiac surgery remains largely unexplored. In this paper, we introduce TMVP-SurgVideo, the first SWA video dataset for thoracoscopic cardiac mitral valvuloplasty (TMVP).

View Article and Find Full Text PDF

Owing to its ability to enable precise perception of dynamic and complex environments, point cloud semantic segmentation has become a critical task for autonomously driven vehicles in recent years. However, in complex, dynamic scenes, cumulative errors and the "many-to-one" mapping problem are challenges for existing semantic segmentation methods, which further limit their accuracy and efficiency. To address these, this paper introduces a new framework that balances accuracy and computational efficiency by utilizing temporal alignment (TA), projection multi-scale convolution (PMC), and priority point retention (PPR).

View Article and Find Full Text PDF

Data-driven assessment of optimal spatiotemporal resolutions for information extraction in noisy time series data.

J Chem Phys

June 2025

Department of Applied Science and Technology, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Torino, Italy.

In general, comprehension of any type of complex system depends on the resolution used to examine the phenomena occurring within it. However, identifying a priori, for example, the best time frequencies/scales to study a certain system over time, or the spatial distances at which correlations, symmetries, and fluctuations are most often non-trivial. Here, we describe an unsupervised approach that, starting solely from the data of a system, allows learning the characteristic length scales of the dominant key events/processes and the optimal spatiotemporal resolutions to characterize them.

View Article and Find Full Text PDF