Stereo Video Quality Metric Based on Multi-Dimensional Analysis.

Zhouyan He , Haiyong Xu , Ting Luo , Yi Liu , Yang Song

Entropy (Basel)

College of Science and Technology, Ningbo University, Ningbo 315211, China.

Published: August 2021

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Stereo video has been widely applied in various video systems in recent years. Therefore, objective stereo video quality metric (SVQM) is highly necessary for improving the watching experience. However, due to the high dimensional data in stereo video, existing metrics have some defects in accuracy and robustness. Based on the characteristics of stereo video, this paper considers the coexistence and interaction of multi-dimensional information in stereo video and proposes an SVQM based on multi-dimensional analysis (MDA-SVQM). Specifically, a temporal-view joint decomposition (TVJD) model is established by analyzing and comparing correlation in different dimensions and adaptively decomposes stereo group of frames (sGoF) into different subbands. Then, according to the generation mechanism and physical meaning of each subband, histogram-based and LOID-based features are extracted for high and low frequency subband, respectively, and sGoF quality is obtained by regression. Finally, the weight of each sGoF is calculated by spatial-temporal energy weighting (STEW) model, and final stereo video quality is obtained by weighted summation of all sGoF qualities. Experiments on two stereo video databases demonstrate that TVJD and STEW adopted in MDA-SVQM are convincible, and the overall performance of MDA-SVQM is better than several existing SVQMs.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8464717	PMC
http://dx.doi.org/10.3390/e23091129	DOI Listing

Publication Analysis

Top Keywords

stereo video

video quality

stereo

quality metric

based multi-dimensional

multi-dimensional analysis

video

quality

metric based

analysis stereo

Similar Publications

SSIFNet: Spatial-temporal stereo information fusion network for self-supervised surgical video inpainting.

Comput Med Imaging Graph

August 2025

Institute of Medical Robotics, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China. Electronic address:

Xiaoyang Zou , Zhuyuan Zhang , Derong Yu , Wenyuan Sun , Wenyong Liu

During minimally invasive robot-assisted surgical procedures, surgeons rely on stereo endoscopes to provide image guidance. Nevertheless, the field-of-view is typically restricted owing to the limited size of the endoscope and constrained workspace. Such a visualization challenge becomes even more severe when surgical instruments are inserted into the already restricted field-of-view, where important anatomical landmarks and relevant clinical contents may become occluded by the inserted instruments.

View Article and Find Full Text PDF

Similar Publications

Asymmetric efficacy of VNS within a single patient with bilateral focal frontal lobe epilepsy: A case report.

Acta Neurochir (Wien)

August 2025

Department of Neurosurgery, Fujita Health University Hospital School of Medicine, Aichi, Japan.

Masanobu Kumon , Shunsuke Nakae , Daijiro Kojima , Noeru Kawase , Yuichi Hirose

The lateralized efficacy of vagus nerve stimulation (VNS) remains insufficiently explored. We report a case of drug-resistant epilepsy with bilateral frontal lobe seizure onset, treated with left cervical VNS. Preoperative video- electroencephalogram revealed predominant interictal discharges in the right hemisphere and frequent seizures from both hemispheres.

View Article and Find Full Text PDF

Similar Publications

EISegNet: Enhancing Instrument Segmentation Network via Dual-View Disparity Estimation.

IEEE J Biomed Health Inform

August 2025

Yongming Yang , Zhaoshuo Diao , Ziliang Song , Shenglin Zhang , Tiancong Liu

Accurate segmentation of endoscopic instruments is essential in robot-assisted surgery, supporting precise navigation, enhancing safety, and advancing surgical automation. However, this task is challenging due to factors like complex environments, instrument-tissue similarity, and lighting variations. Instruments, due to their material properties, have distinct depth distributions compared to surrounding tissues.

View Article and Find Full Text PDF

Similar Publications

Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts.

IEEE Trans Pattern Anal Mach Intell

August 2025

Xiang Deng , Youxin Pang , Xiaochen Zhao , Chao Xu , Lizhen Wang

This paper introduces Stereo-Talker, a novel one-shot audio-driven human video synthesis system that generates 3D talking videos with precise lip synchronization, expressive body gestures, temporally consistent photo-realistic quality, and continuous viewpoint control. The process follows a two-stage approach. In the first stage, the system maps audio input to high-fidelity motion sequences, encompassing upper-body gestures and facial expressions.

View Article and Find Full Text PDF

Similar Publications

Fourier Lightfield Multiview Stereoscope for Large Field-of-View 3D Imaging in Microsurgical Settings.

Adv Photonics Nexus

June 2025

Duke University, Biomedical Engineering, Durham, 27708, NC, USA.

Clare B Cook , Kevin C Zhou , Martin Bohlen , Mark Harfouche , Kanghyun Kim

This work presents the Fourier Lightfield Multi-view Stereoscope (FiLM-Scope), a novel imaging device that combines concepts from Fourier Light Field Microscopy and Multi-view Stereo imaging to capture high-resolution 3D videos over large fields-of-view. The FiLM-Scope optical hardware consists of a multi-camera array, with 48 individual micro-cameras, placed behind a high-throughput primary lens. This allows the FiLM-Scope to simultaneously capture 48 unique 12.

View Article and Find Full Text PDF

Similar Publications