Stereo video has been widely applied in various video systems in recent years, so an objective stereo video quality metric (SVQM) is needed to improve the viewing experience. However, because stereo video data are high-dimensional, existing metrics fall short in accuracy and robustness. Based on the characteristics of stereo video, this paper considers the coexistence and interaction of multi-dimensional information in stereo video and proposes an SVQM based on multi-dimensional analysis (MDA-SVQM). Specifically, a temporal-view joint decomposition (TVJD) model is established by analyzing and comparing the correlation in different dimensions; it adaptively decomposes each stereo group of frames (sGoF) into different subbands. Then, according to the generation mechanism and physical meaning of each subband, histogram-based features are extracted from the high-frequency subbands and LOID-based features from the low-frequency subbands, and the sGoF quality is obtained by regression. Finally, the weight of each sGoF is calculated by a spatial-temporal energy weighting (STEW) model, and the final stereo video quality is obtained by weighted summation of all sGoF qualities. Experiments on two stereo video databases demonstrate that the TVJD and STEW models adopted in MDA-SVQM are effective, and that the overall performance of MDA-SVQM is better than that of several existing SVQMs.
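The final pooling step described in the abstract (a weighted summation of per-sGoF quality scores) can be illustrated with a minimal sketch. The abstract does not define how STEW derives its weights, so this example assumes each weight is simply the sGoF's energy divided by the total energy; the function name `pool_sgof_quality` and the sample numbers are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def pool_sgof_quality(qualities: Sequence[float], energies: Sequence[float]) -> float:
    """Pool per-sGoF quality scores into one video score by weighted summation.

    Assumption (not from the paper): the weight of each sGoF is its energy
    divided by the total energy, so the weights sum to 1.
    """
    if len(qualities) != len(energies):
        raise ValueError("need one energy value per sGoF quality score")
    total = sum(energies)
    if total <= 0:
        raise ValueError("total energy must be positive")
    # Weighted summation: sum over k of (e_k / total) * q_k
    return sum(q * e / total for q, e in zip(qualities, energies))


# Three hypothetical sGoFs; the third carries half of the total energy.
video_quality = pool_sgof_quality([4.0, 2.0, 3.0], [1.0, 1.0, 2.0])
print(video_quality)
```

Normalizing the weights keeps the pooled score on the same scale as the per-sGoF scores, which is one common design choice; the actual STEW model may weight sGoFs differently.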
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8464717 | PMC |
| http://dx.doi.org/10.3390/e23091129 | DOI Listing |
Comput Med Imaging Graph
August 2025
Institute of Medical Robotics, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.
During minimally invasive robot-assisted surgical procedures, surgeons rely on stereo endoscopes to provide image guidance. Nevertheless, the field-of-view is typically restricted owing to the limited size of the endoscope and constrained workspace. Such a visualization challenge becomes even more severe when surgical instruments are inserted into the already restricted field-of-view, where important anatomical landmarks and relevant clinical contents may become occluded by the inserted instruments.
Acta Neurochir (Wien)
August 2025
Department of Neurosurgery, Fujita Health University Hospital School of Medicine, Aichi, Japan.
The lateralized efficacy of vagus nerve stimulation (VNS) remains insufficiently explored. We report a case of drug-resistant epilepsy with bilateral frontal lobe seizure onset, treated with left cervical VNS. Preoperative video-electroencephalogram revealed predominant interictal discharges in the right hemisphere and frequent seizures from both hemispheres.
IEEE J Biomed Health Inform
August 2025
Accurate segmentation of endoscopic instruments is essential in robot-assisted surgery, supporting precise navigation, enhancing safety, and advancing surgical automation. However, this task is challenging due to factors like complex environments, instrument-tissue similarity, and lighting variations. Instruments, due to their material properties, have distinct depth distributions compared to surrounding tissues.
IEEE Trans Pattern Anal Mach Intell
August 2025
This paper introduces Stereo-Talker, a novel one-shot audio-driven human video synthesis system that generates 3D talking videos with precise lip synchronization, expressive body gestures, temporally consistent photo-realistic quality, and continuous viewpoint control. The process follows a two-stage approach. In the first stage, the system maps audio input to high-fidelity motion sequences, encompassing upper-body gestures and facial expressions.
Adv Photonics Nexus
June 2025
Duke University, Biomedical Engineering, Durham, 27708, NC, USA.
This work presents the Fourier Lightfield Multi-view Stereoscope (FiLM-Scope), a novel imaging device that combines concepts from Fourier Light Field Microscopy and Multi-view Stereo imaging to capture high-resolution 3D videos over large fields-of-view. The FiLM-Scope optical hardware consists of a multi-camera array, with 48 individual micro-cameras, placed behind a high-throughput primary lens. This allows the FiLM-Scope to simultaneously capture 48 unique 12.