深度时空一致性增强算法提升高压缩视频质量

103 浏览量更新于2024-08-27 收藏 399KB PDF 举报

本文档探讨了一种新颖的深度空间-时间一致性增强算法，该算法针对基于多视图视频加深度的自由视角视频系统。在这样的系统中，为了确保生成高质量的虚拟视图和提升压缩性能，消除深度视频中的不一致性至关重要。作者提出了一种预处理方法，利用贝叶斯概率模型和竞争学习策略（Rival penalized competitive learning）结合自组织映射（Self-Organizing Maps, SOM），对深度视频的时空信息进行校正。首先，通过聚类技术将深度视频中的每个灰度值分配到特定类别，这一步骤有助于识别并区分深度数据中的不同区域和模式。然后，贝叶斯概率模型被用于估计深度信息的准确性，它考虑了先前观测数据的概率分布，从而提供了一种统计框架来预测和修正潜在的不精确度。 Rival penalized competitive learning作为一种竞争性学习机制，通过引入惩罚机制来优化深度视频中的竞争过程。在SOM中，神经元之间的竞争有助于调整其权重，使得相邻神经元具有相似的输入特征，从而提高了深度数据在时间和空间维度上的一致性。这种方法不仅考虑了深度帧内的变化，还考虑到帧间的连续性，增强了整体的视觉效果和压缩编码的效率。实验部分展示了算法在多组图像序列（如Ballet_color_S7T4、Ballet_color_S7T5等）上的应用，以及在深度帧（Ballet_depth_S7T4、Ballet_depth_S7T5）中的效果。通过对比分析（如FDinA,B和FDinD,E），研究者证明了该算法在消除深度视频不一致性和提高压缩性能方面取得了显著的效果。总结来说，这项工作提出了一种创新的方法，有效地解决了多视点自由视角视频中深度信息的空间和时间一致性问题，对于提升虚拟现实体验和系统性能具有实际价值。对于那些关注视频压缩、多视点成像和深度学习的IT专业人士而言，这篇论文提供了有价值的技术参考和理论支持。

2011 4th International Congress on Image and Signal Processing

a)Ballet_color_S

b) Ballet_color_S

c)FD in A,B

d)Ballet_depth_S

e)Ballet_depth_S

f)FD in D,E

A novel depth spatial-temporal consistency

enhancement algorithm for high compression

performance

Ruiqing Zhang

1,2

, Zongju Peng

1,2

, Mei Yu

, Gangyi Jiang

, Wei Bi

1,2

Faculty of Information Science and Engineering

Ningbo University

Ningbo, China

{yumei,jianggangyi } @nbu.edu.cn

Zhejiang Provincial Key Laboratory of Information

Network Technology

Zhejiang University, Hangzhou, China

pengzongju@126.com

Abstract— In free viewpoint video system based on multiview

video plus depth, inconsistency with depth video need to be

eliminated to ensure high-quality virtual view generation and

compression performance. The preprocessing method proposed

can compensate both spatial and temporal depth information

inaccuracy by using Bayesian probability model and Rival

penalized competitive learning in Self-Organizing Maps. Firstly,

each gray value in depth video is assigned to specific class after

clustering. Then gradient filter is utilized in smoothing.

Experiments show that the proposed algorithm reduced the bit

rate ranging 7.97%-46.83% while ensuring quality of generated

virtual viewpoint.

Keywords-Depth video preprocessing; depth spatial-temporal

consistency enhancement; Bayesian Probability Model

I. INTRODUCTION

Free viewpoint video (FVV) is a new type of natural video

media that allows users to freely navigate in the real world

visual scenes. In order to represent 3D scenes, MPEG specified

a standard for efficient compression and transmission [1,2].

This context proposed Multiview Video plus Depth (MVD)

format, consisting of multiple color videos with associated

depth data. Depth maps are utilized in projecting image of

limited number of viewpoints to the image of other viewpoints

through depth image based rendering (DIBR) [3]. In the MVD

based FVV system, the total bitrate yield by Multiview Video

Coding (MVC) is proportional to the number of views [4].

Thus, MVD has huge data compared with monoview video.

Depth video is composed by depth maps which only have

luminance component. Its simpler texture makes it use only

10%-20% of bit rate of the normal color video without

decreasing PSNR [5]. However, disturbance like optical noise

in depth maps constitute inaccuracy in generating high-quality

virtual view and increase bit rate. Therefore, for transmission

on limited bandwidth channel, depth redundancies need to be

exploited. Preprocessing of depth maps is required mainly to

assure high-quality virtual view generation and suppressing

unnecessary details on both temporal and spatial dimension.

Kwanghee Jung had proposed K-Means cluster and smooth

filter [6], setting k to 5 factitiously. Varied by complexity of

sequences however, it is unpractical to define class number as

a constant. For this reason, this contribution presents a novel

depth spatial-temporal consistency enhancement algorithm. In

this algorithm, improved Self-Organizing Maps (SOM) is used

for adaptive depth map cluster. Firstly, Bayesian Probability

Model is used in network competition and Rival penalized

competitive learning (RPCL) [7] is appended to update neural

weight. Subsequently, gradient smoothing is used to enhance

spatial-temporal consistency. In Section II, challenge in pre-

procession and introduction of Self-Organizing Maps are

briefly introduced. In Section III, the application of Bayesian

probability and whole algorithm flow is presented. Section IV

introduces the experiments conducted and its result. Finally,

conclusion is given in Section V.

II. P

ROBLEM DESCRIPTION

In MVD format, depth maps are directly captured by depth

camera or estimated by computer vision algorithm. Gray value

in depth maps denotes the distance from camera image plane

to the 3D point. However, the gray value is not accurate, which

decreases the spatial and temporal correlation of depth video.

Figure 1. Comparison of FD in color and depth sequence

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38700320

粉丝: 4
资源: 931

深度时空一致性增强算法提升高压缩视频质量

Algorithm-spatial-temporal-LCMV.zip

Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting

Deep Spatial-Temporal 3D Convolutional Neural Networks for Traffic Data Forecasting

Fully-Connected Spatial-Temporal

Spatial-Temporal-Attention-Network-for-POI-Recommendation:WWW'21论文的代码。 最新的推荐系统，用于位置轨迹预测

Spatial-Temporal Content-Preserving Warping

matlab精度检验代码-Spatial-temporal-Adaptive-Transient-Stability-Assessment-f

A Mode of Resonators with Spatial-temporal Phase Modulation

TSCA: A Temporal-Spatial Real-Time Charging Scheduling Algorithm for On-Demand Architecture in Wireless Rechargeable Sensor Networks

Spatial-Temporal-Pooling-Networks-ReID:徐双杰（2017.https

最新资源

Spatial-Temporal-Attention-Network-for-POI-Recommendation:WWW'21论文的代码。最新的推荐系统，用于位置轨迹预测