HEVC视频编码：基于SKIP/Merge RD成本的B帧CU快速终止策略

54 浏览量更新于2024-08-28 收藏 488KB PDF 举报

"这篇研究论文探讨了在HEVC（高效视频编码）标准中，如何针对B帧进行基于SKIP和合并模式率失真成本的早期CU（编码单元）终止，以降低编码复杂性。作者们来自上海交通大学图像通信与信息处理研究所，通过分析不同编码模式、深度分布与SKIP/Merge RD成本之间的关联，提出了一种快速决策方法，以避免不必要的模式决策和率失真优化。" 正文：在视频编码领域，HEVC（高效视频编码）相对于H.264/AVC有了显著的进步，能够在相同的视频质量下，将比特率降低一半。然而，这种提升伴随着编码过程计算复杂性的增加，主要体现在更适应的四叉树结构、率失真优化(RDO)等技术上。针对这一问题，论文提出了一个快速决策方法，特别是在处理B帧时，以减少HEVC编码器的复杂性。研究发现，预测单元（PU）的模式选择、编码单元的深度分布与SKIP/Merge模式的率失真成本之间存在强烈的关联性。论文对当前CU深度的SKIP/Merge RD成本进行了统计分析，并根据64x64和32x32 CU的SKIP/Merge RD成本来研究深度分布。这些分析结果为算法设计提供了基础。提出的算法包含早期检测机制，能够根据分析出的成本和模式分布规律，在编码过程中尽早判断是否需要继续进行更复杂的模式决策和RDO步骤。通过这种方式，算法能够在保证编码效率的同时，减少不必要的计算，从而显著降低编码时间，优化编码效率。此外，对于B帧编码，由于其涉及到前向和后向预测，模式决策的复杂性更高，因此早期CU终止策略对于B帧编码的优化尤为重要。该算法有望在不牺牲太多视频质量的前提下，为HEVC编码带来实质性的速度提升，对于实时视频编码和处理应用具有重要的实践价值。这篇论文对HEVC编码中的B帧处理提出了一种创新的优化策略，通过对SKIP和Merge模式率失真成本的深入分析，实现了早期CU终止，有效降低了编码复杂性，为未来视频编码技术的进一步优化提供了新的思路。

Early CU Termination Based on SKIP/Merge RD Cost for B Frames in HEVC

Jing Shen, Xiaoyun Zhang, Zhiyong Gao, Jia Wang

Institute of Image Communication & Information Processing

Shanghai Jiao Tong University

Shanghai, China

jingshen1990@gmail.com, {xiaoyun.zhang, zhiyong.gao, jiawang}@sjtu.edu.cn

Abstract—Compared to H.264/AVC, High Efficiency Video

Coding (HEVC) aims to deliver the same video quality at half

the bit rate. However, it imposes enormous computational

complexity because of taking advantage of a more adaptive

quad-tree structure, rate distortion optimization (RDO) and

etc. In this paper, a fast decision method is proposed to reduce

HEVC encoder complexity of B frames which avoids

unnecessary mode decision and rate distortion optimization

(RDO). It is found that there exist strong correlations between

PU modes, depth distribution and SKIP/Merge RD cost. We

statistically analyze the prediction mode distribution according

to SKIP/Merge RD cost of current CU depth and depth

distribution according to SKIP/Merge RD cost of 64x64 and

32x32 CUs. Based on the analysis results, the proposed

algorithm includes an early detection of SKIP mode and a

depth pruning to skip searching for 8x8 and 16x16 CUs. This

algorithm enables us to skip unnecessary RDO by early

termination of mode decisions in CUs. Experimental results

show that the encoding complexity can be reduced by 50% on

average with only 1.25% BD-rate increase compared to the

HEVC test model (HM) 12.0 reference software.

Keywords-HEVC, Mode Decision, SKIP/Merge RD cost,

RDO

INTRODUCTION

HEVC is the latest video coding standard developed by

ITU-T Video Coding Experts Group and the ISO/IEC

Moving Picture Experts Group. The main goal of the HEVC

standardization effort is to enable significantly improved

compression performance relative to existing standard - to

reduce 50% bit rate on average for equal perceptual video

quality [1]. To achieve this goal, several efficient coding

algorithms were introduced to HEVC which also made

HEVC encoders several times more complex than

H.264/AVC encoders. In [2], the authors described several

complexity-related aspects that were considered in the

standardization process, including quad-tree-based block

partitioning, motion estimation, RDO and so on.

HEVC adopts a more adaptive quad-tree structure based

on a coding tree unit (CTU) instead of a macroblock in

H.264/AVC. This allows HEVC to support coding unit (CU)

size from 64x64 to 8x8. Kim et al [3] show that the benefits

from the use of larger CU size become really significant

when the test sequences are high-resolution video sequences.

CU is a basic unit of region splitting used for intra/inter

prediction, which allows recursive subdividing into four

equally sized blocks. Additionally, in inter frames, each CU

enables different prediction unit (PU) modes: SKIP mode,

Merge mode, Inter 2Nx2N, Inter Nx2N, Inter 2NxN, Inter

2NxnU, Inter 2NxnD, Inter nLx2N, Inter nRx2N, Inter NxN

(only available for the smallest CU), Intra 2Nx2N and Intra

NxN (only available for the smallest CU). To obtain the best

CU block partitioning and coding modes, the HEVC

encoder needs to test all the possible modes and select the

one which provides the smallest RD cost by means of rate

distortion optimization (RDO) process. This greatly

increases computational complexity of the encoder. Tan et

al [4] illustrates that using a fixed CU size of 16x16

involves 1584 times of RDO process, while using a CTU

structure of 64x64 and a maximum quad-tree depth of 4

involves 8415 times of RDO process.

RD cost of coding modes is computed through the RDO

process by a Lagrange cost function (𝐽

𝑚𝑜𝑑𝑒

) as follows:

𝐽

𝑚𝑜𝑑𝑒

(

𝑆𝑆𝐸

𝑙𝑢𝑚𝑎

+ 𝜔

𝑐ℎ𝑟𝑜𝑚𝑎

∙ 𝑆𝑆𝐸

𝑐ℎ𝑟𝑜𝑚𝑎

)

+ 𝜆 ∙ 𝐵

𝑚𝑜𝑑𝑒

(1)

where 𝐵

𝑚𝑜𝑑𝑒

is the bitrate cost of corresponding coding

mode, SSE is the sum of square error between original

pixels and reconstructed pixels, 𝜔

𝑐ℎ𝑟𝑜𝑚𝑎

is a weighting

factor for chroma component and 𝜆 is the Lagrange

multiplier. To obtain reconstructed pixels and bitrate,

Motion Estimation (ME), Motion Compensation (MC),

Transform, Quantization, Entropy Coding, Inverse

Quantization, Inverse Transform are conducted at the

HEVC encoder, which make RD cost computation really

complex. However, RD cost computation of SKIP and

Merge mode is simple because there is no residue

information for SKIP mode and Merge mode doesn’t need

ME process. In addition, SKIP mode has a very high

occurrence probability in HEVC. So it is worthwhile to take

advantage of SKIP/Merge RD cost to skip other complex

RDO process. Our proposed algorithm is mainly based on

this idea.

The rest of the paper is organized as follows. Section 2

briefly reviews some relevant fast algorithms. Section 3

introduces the proposed fast algorithm in details.

Performance evaluations and analysis are presented in

Section 4. At last, the conclusion is drawn in Section 5.

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38589774

粉丝: 4

HEVC视频编码：基于SKIP/Merge RD成本的B帧CU快速终止策略

视频信号快速帧间编码论文翻译-An Effective CU Size Decision Method for HEVC Encoders

基于贝叶斯模型的HEVC早期SKIP模式决策

基于Neyman-Pearson的HEVC编码的早期模式决策

基于贝叶斯的HEVC早期SKIP模式决策算法：加速与优化编码性能

HEVC编码优化：基于单峰停止模型的早期SKIP模式决策

HEVC编码优化：一种快速CU尺寸决策算法

HEVC帧间快速算法：运动信息与率失真优化

vue.js v2.5.17

DM8-SQL语言详解及其数据管理和查询操作指南

1108_ba_open_report.pdf

最新资源