Analysis of the Block Partitioning Structure in the HEVC Standard

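This paper examines the block partitioning structure of the High Efficiency Video Coding (HEVC) standard in detail. It was published in an IEEE journal by authors from Samsung Electronics and Gachon University. HEVC is the latest video coding standard developed jointly by ITU-T WP 3/16 and ISO/IEC JTC 1/SC 29/WG 11. It will be published as twin text by ITU-T and ISO/IEC, and in the latter case it is also known as MPEG-H Part 2.

HEVC is the successor to H.264/AVC and introduces many new techniques that significantly improve coding efficiency. The block partitioning structure is one of the most significant changes relative to the previous generation. In place of the fixed-size 16 × 16 macroblock structure of H.264/AVC, HEVC defines three units according to their function, which allows a more flexible coding strategy.

1. **Coding unit (CU)**: the basic building block of the coding process. A CU defines a region sharing the same prediction mode (intra or inter) and is represented by a leaf node of a quadtree, so CU sizes can vary to follow the local complexity of the content.
2. **Prediction unit (PU)**: a region sharing the same prediction information. A CU can contain one or more PUs, whose shape and size depend on the splitting type, so the encoder can choose the prediction regions that best fit the content.
3. **Transform unit (TU)**: a region of the prediction residual sharing the same transform. A DCT-based transform is applied to the residual to reduce its statistical correlation before entropy coding. TUs are specified by a second quadtree inside the CU, so a suitable transform size can be chosen where needed.

The paper analyzes the coding efficiency and complexity of this block partitioning structure. These analyses matter for understanding the standard's performance, because partitioning directly affects bitstream size, decoding complexity, and the resulting video quality. The flexible partitioning lets HEVC reach higher compression ratios while maintaining or improving video quality, which is especially important where transmission bandwidth is limited.

Beyond block partitioning, HEVC includes other tools that further raise coding efficiency, such as intra prediction along many angular directions, multiple reference frames, and improved motion prediction. Together with the block partitioning structure, these tools form the core of HEVC.

In summary, the paper describes the key block partitioning structure of HEVC and reports an analysis of its coding efficiency and complexity, which makes it a useful reference for video coding research and practice.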

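The relationship between a CU and its PUs in the summary above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function name `pu_rects` and the rectangle layout are hypothetical, and only the four symmetric splitting types (2N×2N, 2N×N, N×2N, N×N) are covered; the asymmetric types that HEVC also defines are left out.

```python
from typing import Dict, List, Tuple

# (x, y, width, height) of a PU, relative to the top-left corner of its CU
Rect = Tuple[int, int, int, int]


def pu_rects(cu_size: int, part_mode: str) -> List[Rect]:
    """Return the PU rectangles of a square CU for a symmetric splitting type.

    `pu_rects` and the string keys are illustrative names, not part of any
    codec API.
    """
    n = cu_size // 2
    layouts: Dict[str, List[Rect]] = {
        # One PU covering the whole CU
        "2Nx2N": [(0, 0, cu_size, cu_size)],
        # Two PUs stacked vertically (horizontal split line)
        "2NxN": [(0, 0, cu_size, n), (0, n, cu_size, n)],
        # Two PUs side by side (vertical split line)
        "Nx2N": [(0, 0, n, cu_size), (n, 0, n, cu_size)],
        # Four square PUs
        "NxN": [(0, 0, n, n), (n, 0, n, n), (0, n, n, n), (n, n, n, n)],
    }
    return layouts[part_mode]


# Example: a 16x16 CU split into an upper and a lower PU
upper_lower = pu_rects(16, "2NxN")
```

Whatever the splitting type, the PUs tile the CU exactly, so their areas always add up to the CU area.
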
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 22, NO. 12, DECEMBER 2012 1697

Block Partitioning Structure in the HEVC Standard

Il-Koo Kim, Junghye Min, Tammy Lee, Woo-Jin Han, and JeongHoon Park

Abstract—High Efficiency Video Coding (HEVC) is the latest joint standardization effort of ITU-T WP 3/16 and ISO/IEC JTC 1/SC 29/WG 11. The resultant standard will be published as twin text by ITU-T and ISO/IEC; in the latter case, it will also be known as MPEG-H Part 2. This paper describes the block partitioning structure of the draft HEVC standard and presents the results of an analysis of coding efficiency and complexity. Of the many new technical aspects of HEVC, the block partitioning structure has been identified as representing one of the most significant changes relative to previous video coding standards. In contrast to the fixed size 16 × 16 macroblock structure of H.264/AVC, HEVC defines three different units according to their functionalities. The coding unit defines a region sharing the same prediction mode, e.g., intra and inter, and it is represented by the leaf node of a quadtree structure. The prediction unit defines a region sharing the same prediction information. The transform unit, specified by another quadtree, defines a region sharing the same transformation. This paper introduces technical details of the block partitioning structure of HEVC with an emphasis on the method of designing a consistent framework by combining the three different units together. Experimental results are provided to justify the role of each component of the block partitioning structure and a comparison with the H.264/AVC design is performed.

Index Terms—Advanced video coding (AVC), H.264, High Efficiency Video Coding (HEVC), Joint Collaborative Team on Video Coding (JCT-VC), standards, video.

I. Introduction

Due to the ever-increasing demand for bit rate to support higher resolution video, there is a requirement to develop video compression technologies which would provide significantly higher coding efficiency than the current generation of video coding standards. The Joint Collaborative Team on Video Coding (JCT-VC), a joint activity of ITU-T WP 3/16 and ISO/IEC JTC 1/SC 29/WG 11, was set up in April 2010 to address these requirements for a next generation video coding standard. The resultant standard, called High Efficiency Video Coding (HEVC), is expected to be completed in January 2013.

Manuscript received April 16, 2012; revised July 18, 2012; accepted August 20, 2012. Date of publication October 5, 2012; date of current version January 8, 2013. This work was supported by the Gachon University Research Fund of 2012 under Grant GCU-2011-R257. This paper was recommended by Associate Editor J. Ridge. (Corresponding author: W.-J. Han.)

I.-K. Kim, J. Min, T. Lee, and J. H. Park are with Samsung Electronics, Suwon 442-742, Korea (e-mail: ilkoo.kim@samsung.com; jh643.min@samsung.com; tammy.lee@samsung.com; jeonghoon@samsung.com).

W.-J. Han is with Gachon University, Seongnam 461-701, Korea (e-mail: hurumi@gmail.com).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TCSVT.2012.2223011

Over the past decades, video coding standards such as MPEG-1 Video [1], MPEG-2 Video [2], MPEG-4 Visual [3], and H.264/advanced video coding (AVC) [4] played an important role in enabling multimedia applications. The basic ingredients of these standards are block-based motion compensation and spatial transforms. The current state of the art, H.264/AVC, provides approximately double the coding efficiency of the earlier MPEG-2 standard, using more flexible macroblock and submacroblock partitioning and variable transform sizes of 4 × 4 and 8 × 8. However, due to some restrictions in the design, e.g., the fixed size of the macroblock, limited depth of block partitioning, and limited adaptivity between inter and intra prediction schemes, the coding efficiency is still not sufficient to cope with the ever-increasing demands for storage and transmission of video content.

To overcome these problems, several papers tried to investigate the effect of relaxing the restrictions and using more flexible block partitioning. One direction was to add larger size blocks on top of the existing block structure [6]–[10], including 16 × 16, 16 × 8, and 8 × 16 transforms. Another direction utilized the more general quadtree structure for block partitioning [11]–[17] in addition to enlarging the size of blocks. In these approaches, up to 128 × 128 block size was allowed and more flexible motion and transform block (TB) partitioning structures were utilized.

The emerging HEVC standard represents one of the more advanced versions of the second approach. In the main profile of HEVC, a slice is partitioned into multiple coding tree units (CTU) which are allowed to have sizes from 8 × 8 up to 64 × 64. For comparison, prior video coding standards typically support a maximum block size of 16 × 16. Inside the CTU, a quadtree structure is built to allow more flexibility for partitioning of the CTU while maintaining consistent design, even when the CTU size is larger than 16 × 16. Each leaf node of the coding tree is called a coding unit (CU); this specifies how the prediction should be done between spatial and temporal schemes. The CU can have multiple prediction units (PU) and transform units (TU); these define regions sharing the same prediction-related information and the same transformation, respectively. The shape of the PU is specified by the splitting type, as in H.264/AVC, whereas that of TU is represented by another quadtree, called the transform tree.
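
The coding tree described above can be sketched as a short recursive routine in Python. This is a minimal illustration rather than the standard's decoding process or any encoder's actual decision logic: the names `split_into_cus` and `contains_origin` are hypothetical, and the split rule is a toy stand-in for a real encoder decision such as a rate-distortion comparison.

```python
from typing import Callable, List, Tuple

# (x, y, size) of a square block, in luma samples
Block = Tuple[int, int, int]


def split_into_cus(x: int, y: int, size: int, min_cu_size: int,
                   should_split: Callable[[int, int, int], bool]) -> List[Block]:
    """Walk the coding quadtree rooted at one CTU and return its leaf CUs.

    `should_split` is a hypothetical callback standing in for the encoder's
    split decision; it is never consulted once the minimum CU size is reached.
    """
    if size > min_cu_size and should_split(x, y, size):
        half = size // 2
        leaves: List[Block] = []
        # Visit the four quadrants: top-left, top-right, bottom-left, bottom-right
        for dy in (0, half):
            for dx in (0, half):
                leaves.extend(
                    split_into_cus(x + dx, y + dy, half, min_cu_size, should_split))
        return leaves
    # Leaf node of the coding tree: this block is one CU
    return [(x, y, size)]


def contains_origin(x: int, y: int, size: int) -> bool:
    """Toy rule: keep splitting only the block that touches the CTU's top-left corner."""
    return x == 0 and y == 0


# A 64x64 CTU with an assumed minimum CU size of 8x8
cus = split_into_cus(0, 0, 64, 8, contains_origin)
```

With this toy rule the 64 × 64 CTU yields ten CUs: four 8 × 8, three 16 × 16, and three 32 × 32. The same recursion, rooted at a CU instead of a CTU, gives the general shape of the transform tree that defines the TUs.
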

This paper explains the issues with H.264/AVC motivating HEVC development in Section II. Technical details of the block partitioning structure of HEVC are presented in Section III. Section IV provides the experimental results and Section V concludes this paper.

II. H.264/AVC Block Partitioning Structure

The block partitioning structure of the H.264/AVC is designed to provide more flexibility compared with the prior standards such as MPEG-2 or MPEG-4 Visual. However,

1051-8215/$31.00 © 2012 IEEE
