Stereo Video Quality Metric Based on Multi-Dimensional Analysis.

Entropy (Basel)

College of Science and Technology, Ningbo University, Ningbo 315211, China.

Published: August 2021


Article Abstract

Stereo video has been widely used in video systems in recent years, so an objective stereo video quality metric (SVQM) is needed to improve the viewing experience. However, because stereo video data are high-dimensional, existing metrics fall short in accuracy and robustness. Based on the characteristics of stereo video, this paper considers the coexistence and interaction of multi-dimensional information in stereo video and proposes an SVQM based on multi-dimensional analysis (MDA-SVQM). Specifically, a temporal-view joint decomposition (TVJD) model is established by analyzing and comparing correlations across dimensions; it adaptively decomposes each stereo group of frames (sGoF) into subbands. Then, according to the generation mechanism and physical meaning of each subband, histogram-based features are extracted from the high-frequency subbands and LOID-based features from the low-frequency subbands, and the sGoF quality is obtained by regression. Finally, the weight of each sGoF is calculated by a spatial-temporal energy weighting (STEW) model, and the final stereo video quality is obtained as the weighted sum of all sGoF qualities. Experiments on two stereo video databases show that the TVJD and STEW models adopted in MDA-SVQM are effective and that MDA-SVQM outperforms several existing SVQMs overall.
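
The final pooling step described in the abstract, a weighted summation of per-sGoF quality scores, can be illustrated with a minimal sketch. The abstract does not give the STEW formula, so the code below assumes each sGoF already has a non-negative energy value and simply normalizes those energies into weights. The function name `pool_sgof_qualities`, the `energies` input, and the fallback to a plain average are all hypothetical and are not taken from the paper.

```python
from typing import Sequence


def pool_sgof_qualities(qualities: Sequence[float],
                        energies: Sequence[float]) -> float:
    """Pool per-sGoF quality scores into one video-level score.

    ASSUMPTION: `energies` stands in for whatever the STEW model outputs;
    the paper's actual weighting formula is not stated in the abstract.
    """
    if len(qualities) == 0 or len(qualities) != len(energies):
        raise ValueError("need exactly one energy value per sGoF quality score")
    if any(e < 0 for e in energies):
        raise ValueError("energies must be non-negative")

    total_energy = sum(energies)
    if total_energy == 0:
        # Hypothetical fallback: with no energy information, use a plain average.
        return sum(qualities) / len(qualities)

    # Weighted summation with weights normalized to sum to 1.
    return sum(e * q for e, q in zip(energies, qualities)) / total_energy


if __name__ == "__main__":
    # Three sGoFs with made-up scores; the second carries the most energy.
    print(pool_sgof_qualities([3.0, 4.0, 2.0], [1.0, 2.0, 1.0]))
```

Normalizing the weights keeps the pooled score on the same scale as the per-sGoF scores, which is one common design choice for this kind of pooling; the paper may normalize differently.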

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8464717 (PMC)
http://dx.doi.org/10.3390/e23091129 (DOI)
