IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 22, NO. 12, DECEMBER 2012 1697
Block Partitioning Structure in the HEVC Standard
Il-Koo Kim, Junghye Min, Tammy Lee, Woo-Jin Han, and JeongHoon Park
Abstract—High Efficiency Video Coding (HEVC) is the latest
joint standardization effort of ITU-T WP 3/16 and ISO/IEC
JTC 1/SC 29/WG 11. The resultant standard will be published
as twin text by ITU-T and ISO/IEC; in the latter case, it will
also be known as MPEG-H Part 2. This paper describes the
block partitioning structure of the draft HEVC standard and
presents the results of an analysis of coding efficiency and
complexity. Of the many new technical aspects of HEVC, the
block partitioning structure has been identified as representing
one of the most significant changes relative to previous video
coding standards. In contrast to the fixed size 16× 16 macroblock
structure of H.264/AVC, HEVC defines three different units
according to their functionalities. The coding unit defines a
region sharing the same prediction mode, e.g., intra and inter,
and it is represented by the leaf node of a quadtree structure.
The prediction unit defines a region sharing the same prediction
information. The transform unit, specified by another quadtree,
defines a region sharing the same transformation. This paper
introduces technical details of the block partitioning structure
of HEVC with an emphasis on the method of designing a
consistent framework by combining the three different units
together. Experimental results are provided to justify the role
of each component of the block partitioning structure and a
comparison with the H.264/AVC design is performed.
Index Terms—Advanced video coding (AVC), H.264, High
Efficiency Video Coding (HEVC), Joint Collaborative Team on
Video Coding (JCT-VC), standards, video.
I. Introduction
D
UE TO THE ever-increasing demand for bit rate to
support higher resolution video, there is a requirement to
develop video compression technologies which would provide
significantly higher coding efficiency than the current genera-
tion of video coding standards. The Joint Collaborative Team
on Video Coding (JCT-VC), a joint activity of ITU-T WP 3/16
and ISO/IEC JTC 1/SC 29/WG 11, was set up in April 2010 to
address these requirements for a next generation video coding
standard. The resultant standard, called High Efficiency Video
Coding (HEVC), is expected to be completed in January 2013.
Over the past decades, video coding standards such as
MPEG-1 Video [1], MPEG-2 Video [2], MPEG-4 Visual
[3], and H.264/advanced video coding (AVC) [4] played
Manuscript received April 16, 2012; revised July 18, 2012; accepted August
20, 2012. Date of publication October 5, 2012; date of current version January
8, 2013. This work was supported by the Gachon University Research Fund
of 2012 under Grant GCU-2011-R257. This paper was recommended by
Associate Editor J. Ridge. (Corresponding author: W.-J. Han.)
I.-K. Kim, J. Min, T. Lee, and J. H. Park are with Samsung
Electronics, Suwon 442-742, Korea (e-mail: ilkoo.kim@samsung.com;
jh643.min@samsung.com; tammy.lee@samsung.com; jeonghoon@samsung.
com).
W.-J. Han is with Gachon University, Seongnam 461-701, Korea (e-mail:
hurumi@gmail.com).
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TCSVT.2012.2223011
an important role in enabling multimedia applications. The
basic ingredients of these standards are block-based motion
compensation and spatial transforms. The current state-of-the-
art, H.264/AVC provides approximately double the coding
efficiency of the earlier MPEG-2 standard, using more flexi-
ble macroblock and submacroblock partitioning and variable
transforms sizes of 4 × 4 and 8 × 8. However, due to some
restrictions in the design, e.g., the fixed size of macroblock,
limited depth of block partitioning and limited adaptivity be-
tween inter and intra prediction schemes, the coding efficiency
is still not sufficient to cope with the ever increasing demands
for storage and transmission of video content.
To overcome these problems, several papers tried to inves-
tigate the effect of relaxing the restrictions and using more
flexible block partitioning. One direction was to add larger
size blocks on top of the existing block structure [6]–[10],
including 16 × 16, 16 × 8, and 8 × 16 transforms. Another
direction utilized the more general quadtree structure for block
partitioning [11]–[17] in addition to enlarging the size of
blocks. In these approaches, up to 128 × 128 block size was
allowed and more flexible motion and transform block (TB)
partitioning structures were utilized.
The emerging HEVC standard represents one of the more
advanced versions of the second approach. In the main profile
of HEVC, a slice is partitioned into multiple coding tree units
(CTU) which are allowed to have sizes from 8×8upto64×64.
For comparison, prior video coding standards typically support
a maximum block size of 16 × 16. Inside the CTU, a quadtree
structure is built to allow more flexibility for partitioning of the
CTU while maintaining consistent design, even when the CTU
size is larger than 16 × 16. Each leaf node of the coding tree
is called a coding unit (CU); this specifies how the prediction
should be done between spatial and temporal schemes. The
CU can have multiple prediction units (PU) and transform
units (TU); these define regions sharing the same prediction-
related information and the same transformation, respectively.
The shape of the PU is specified by the splitting type, as
in H.264/AVC whereas that of TU is represented by another
quadtree, called the transform tree.
This paper explains the issues with H.264/AVC motivat-
ing HEVC development in Section II. Technical details of
the block partitioning structure of HEVC are presented in
Section III. Section IV provides the experimental results and
Section V concludes this paper.
II. H.264/AVC Block Partitioning Structure
The block partitioning structure of the H.264/AVC is
designed to provide more flexibility compared with the prior
standards such as MPEG-2 or MPEG-4 Visual. However,
1051-8215/$31.00
c
2012 IEEE