3D-HEVC编码优化：贝叶斯决策规则的快速在线学习参数决策算法

需积分: 9 37 浏览量更新于2024-08-26 收藏 848KB PDF 举报

"这篇研究论文提出了一种基于贝叶斯决策规则的3D-高效视频编码（3D-HEVC）快速在线学习参数决策算法，旨在降低3D视频编码过程中的计算复杂性。3D-HEVC是高效视频编码（HEVC）标准的扩展，相较于多视图视频编码（MVC），在3D视频编码效率上有了显著提升，但同时也带来了更大的计算复杂度。为了应对这一挑战，该论文提出了一种新的算法，将编码单元（CU）、预测单元（PU）和变换单元（TU）的选择过程建模为基于贝叶斯决策规则的在线学习分类过程。通过训练选择的特征向量，分类器可以精确预测PU的预测模式以及当前CU和TU是否需要分割。实验结果显示，该算法能够在轻微的率失真（RD）下降情况下实现大约56%的计算时间减少。" 这篇研究论文专注于3D视频编码领域，特别是3D-HEVC标准的优化。3D-HEVC标准通过改进编码技术提高了3D视频的压缩效率，然而这增加了编码的计算复杂性。为了减少这种复杂性，作者提出了一个创新的解决方案，即使用贝叶斯决策规则进行快速在线学习参数决策。贝叶斯决策规则是一种统计学方法，它利用先验概率和条件概率来做出最优决策。在这个上下文中，它被用于构建一个分类系统，该系统能够根据特征向量的训练数据，预测CU、PU和TU的最佳编码策略。CU是编码的基本单元，PU负责预测，而TU则涉及变换编码。通过准确预测这些单元的分割和预测模式，算法能够减少不必要的计算步骤，从而大大加快编码速度。论文中提到的实验结果证明了这种方法的有效性。通过实施该算法，编码时间显著缩短了约56%，同时保持了良好的率失真性能。这意味着在不牺牲太多图像质量的情况下，视频编码过程变得更加高效。这项工作展示了贝叶斯决策理论在优化高复杂度视频编码问题上的潜力，为未来3D视频编码的优化提供了新的思路和工具。这样的算法对于实时或低延迟的3D视频传输特别有价值，例如在虚拟现实、远程教育、医疗影像和3D电视等领域。

Fast Online-Learning Parameters Decision Algorithm

Based on Bayesian Decision Rule for 3D-HEVC

Yayong Li, Xingang Liu, Tao Yu, Yongyong Mei and Peicheng Wang

School of Electronic Engineering

University of Electronic Science and Technology of China

Chengdu, China

liyayong163@163.com

Abstract—3D-HEVC, as the extension of High Efficiency Video

Coding standard, achieves a significant improvement in the coding

efficiency of 3D videos, compared with the Multi-View Video

Coding (MVC). However the improvement causes a great

computational complexity. In this paper, a fast coding parameters

decision algorithm is proposed to reduce the computational

complexity. The process of the selection of Coding Unit (CU),

Prediction Unit (PU) and Transform Unit (TU) are modeled as the

online-learning classification process based on the Bayesian

decision rule. Through the training of the selected feature vectors,

the classifiers can precisely predict the prediction mode of PUs and

whether or not the current CU and TU should be partitioned. The

experimental results show that the proposed algorithm can achieve

about 56% time reduction with a slight RD degradation.

Keywords—3D-HEVC; Bayesian decision rule; fast coding

parameters decision;

I. INTRODUCTION

Due to the immersive viewing experience, three-dimensional

videos are becoming more and more popular, which has been

pushing the advance of the 3D technology. Joint Collaborative

Team on 3D Video Coding Extension Development (JCT-3V)

was established in 2012, and proposed 3D-HEVC standard in

2013. The standard, based on the HEVC standard, is aimed at

achieving a higher compression rate to the multi-view

stereoscopic videos with little degradation in video quality. 3D-

HEVC not only takes full advantages of the spatiotemporal

correlation, but also exploits the inter-view correlation to

remove the redundant, which improves the coding efficiency

more.

Because of the special characteristics of 3D videos, 3D-

HEVC develops some new techniques. For the dependent views,

disparity-compensated prediction (DCP), Inter-View Motion

Parameters Prediction (IVMP), Inter-View Residual Prediction

(IVRP) are introduced to remove the inter-view redundant.

DCP adds the encoded inter-view frames at the same moment

to the reference frame list to reduce the redundant. IVMP

utilizes the inter-view motion information to predict the motion

parameters of the current view. IVRP, similar to the IVMP,

utilizes the inter-view residual to predict the residual of current

view. Since depth map has quite different characteristics than

the texture videos, which consists of large quantity of smooth

regions separated by sharp edges, 3D-HEVC introduces Depth

Modeling Modes(DMM), which can preserve the edge

information better.

For texture videos and depth maps, as showed in Figure 1,

3D-HEVC adopts the same quad-tree coding structure. Coding

Tree Unit (CTU) is still the basic coding unit. One CTU can be

split into four Coding Trees (CU), and one CU can be

recursively split into four sub-CUs in rate distortion

optimization (RDO) process. The maximum size of CU is 64x64

and minimum size is 8x8. Prediction Unit (PU) carries the

prediction information, and all of the prediction modes,

including Skip, 2Nx2N, 2NxN, Nx2N, NxN, 2NxnU, 2NxnD,

nRx2N and nLx2N, intra2Nx2N, intraNxN, PCM must be

checked to find the best mode. It should be noted that NxN mode

is only checked when the current CU is the smallest CU.

Transform Unit (TU) is the basic unit of transform and

quantization. TU, similar to CU, can be split in a quad-tree

structure and each split TU can be further split recursively,

whose maximum size is 32x32 and minimum size is 4x4.

The rest of the papers are organized as follows. Section II

presents the related works about 3D-HEVC. The process of

modeling the parameters selection is showed in section III.

Section IV presents the proposed algorithm, and the experiment

results are showed in the section V. Finally, section VI presents

the conclusion.

II. RELATED WORK

In 3D-HEVC, CU, TU partition process and PU selection

process cost most of the encoding time, and these processes have

a significant effect on the compression rate and the video quality,

so how to predict the best CU, PU and TU in advance is very

important for reducing the encoding time efficiently.

For CU partition process, 3D-HEVC has to recursively split

the current CU into four sub-CU and calculate the RD cost to

find the most optimal CU depth. In order to decrease the

CU PU

Fig. 1 Example of CU, PU, and TU split structure

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38651786

粉丝: 7
资源: 915

3D-HEVC编码优化：贝叶斯决策规则的快速在线学习参数决策算法

优化3D-HEVC编码的快速模式决策算法

3D-HEVC快速模式判决：基于深度图的优化算法

基于学习模型的3D-HEVC提前Merge模式优化算法：编码效率与时间降低41.9%

基于边缘建模的3D-HEVC纹理深度联合快速编码算法

用于3D-HEVC的快速CU大小决策算法

基于概率模型的3D-HEVC中从属视图的早期合并模式决策

MV-HEVC and 3D-HEVC Reference Software 16.2

Test Model 11 of 3D-HEVC and MV-HEVC.docx

Test Model 11 of 3D-HEVC and MV-HEVC： JVT3V-K1003

基于概率模型的3D-HEVC中依赖视图早期合并模式决策优化

最新资源