HEVC标准中的Tile特性详解

需积分: 9 70 浏览量更新于2024-09-07 收藏 1.32MB PDF 举报

"本文是IEEE期刊中的一篇文章，深入探讨了HEVC（高效率视频编码）标准中的Tile特性，以及相关的实验数据和对比分析。文章由Kiran Misra等人撰写，详细介绍了Tiles如何增强并行处理的效率，改进最大传输单元（MTU）匹配，减少线性缓冲区内存，并实现兴趣区域编码等功能。" 在HEVC（H.265）标准中，Tile是一个重要的新特性，它将图片分割成独立的矩形区域，这一划分带来了诸多优势。首先，Tiles提高了新标准的“并行友好性”，相比于以前基于切片的方法，能更有效地利用并行架构提高编码效率。这使得在多核处理器上进行视频编码时，各个Tile可以独立处理，显著提升了处理速度。其次，Tiles有助于优化最大传输单元（MTU）的匹配。在视频流传输中，MTU大小的适应性对网络传输效率至关重要，而Tiles结构使得MTU大小可以根据Tile的边界灵活调整，减少了因不匹配导致的分包和重组开销。再者，Tiles特性还减少了线性缓冲区的内存需求。每个Tile编码时只需存储其自身的缓冲数据，降低了系统内存的总体需求，尤其对于内存有限的设备，这是一个显著的优点。此外，文章中还提出了一种基于Tile的兴趣区域编码方法。这种方法允许对视频中的特定区域（如人物、重要事件等）进行优先编码，以提高这部分区域的质量，满足特定应用场景的需求，例如监控、体育赛事直播等。文章通过实验数据展示了不同并行化因子和MTU大小要求下的编码效率，并分析了Tile-based ROI编码方法的效果。这些实验结果为理解Tiles在实际应用中的性能提供了有价值的参考。关键词：视频编码，多核处理，高效率视频编码，Tiles。这篇IEEE文章详细阐述了HEVC标准中Tiles的引入、工作原理及其带来的效益，包括并行处理能力的提升、MTU匹配的优化、内存使用效率的增加，以及兴趣区域编码的实现，对于理解和利用HEVC标准进行高效视频编码具有重要意义。

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 7, NO. 6, DECEMBER 2013 969

An Overview of Tiles in HEVC

Kiran Misra,Member,IEEE, Andrew Segall, Member, IEEE, Michael Horowitz, Shilin Xu, Arild Fuldseth, and

Minhua Zhou

Abstract—Tiles is a new feature in the High Efﬁciency Vi

deo

Coding (HEVC) standard that divides a picture into independent,

rectangular regions. This division provides a number of advan-

tages. Speciﬁcally, it increases the “parallel fri

endliness” of the

new standard by enabling improved c oding efﬁciency for parallel

architectures, as compared to previous sliced based m ethods.

Additionally, tiles facilitate improved m

aximum transmission unit

(MTU) size matching, reduced line buffer memory, an d add itional

region-of-interest functionality. In this paper, w e introduce the

tiles feature and survey the performance

of the tool. Coding efﬁ-

ciency is reported for different parallelization factors and MTU

size requirements. Additionally, a tile-based region of interest

coding method is developed.

Index Terms—Video coding, multico

re processing, high efﬁ-

ciency video coding, tiles.

I. INTRODUCTION

HE ISO/IE C’s Movin g Pictures Experts Group (MPE G)

and the International Telecommunications Union’s

(ITU-T) Video Coding Experts Group (VCEG) have recently

concluded work on the ﬁrst edition of the High Efﬁciency

Video Co din g (HEVC) standard [3]–[5]. This standard was

developed collaboratively by the Joint Collaborative Team on

Video Coding (JCT-V C). For consumer applications, HEVC

has been reported to achieve 50% improvement in coding

efﬁciency when compared to previous coding standards such

as MPEG-4 AVC/IT U- T H.264 [1], [5]. These coding gains

are achieved through a number of improvements that result in

an increase in compu tational complexity for both encoder and

decoder.

Here, computatio nal complexity refers to a com bination of

algorithmic operations and memory transfers. Algorithmic op -

erations correspond to the calculations required in a decoder to

convert bit-stream in formation to reconstructed pixel values or

in an encoder to convert the original p ixel values to a bit-stream.

For hardware, this corresponds to logic gates; for software, this

corresponds to calculations on a CPU, GPU, or other processing

units. Memory transfers represent the amoun t of data that must

Manuscript received February 01, 2013; revised May 10, 2013; acce pte d June

12, 2013. Date of publication June 27, 2013; date of current version November

18, 2013. The guest editor c oordinating the review of this manuscript and ap-

proving it for publication was Prof. Oscar C. Au.

K. Misra and A. Segall are with Sharp L ab oratories of A me rica, Inc., C amas,

WA 98607 USA (e-mail: misrak@sharplabs.com; asegall@sharplabs.com).

M.HorowitzandS.XuarewitheBriskVideo, Inc., Vancouver, BC V6E 2E9,

Canada (e-mail: michael@ebriskvideo.com; shilin@ebriskvideo .com).

A. Fuldseth is with Cisco Systems, Oslo 1367, Norway (e-mail:

arild.fuldseth@cisco.com).

M. Zhou is with Tex as Instruments, Inc., Dallas, TX 75243 USA (e-mail:

zhou@ti.com).

Color versions of one or more of the ﬁgures in this paper are available online

at http://ieeexplore.i eee.org.

Dig

ital Object Identiﬁer 10.1109/JSTSP.2013.2271451

be stored and accessed to perform the required calculat

ions.

Typical architectures contain multiple memory types

,ranging

from high speed memory that is on-chip (including cac

hes n ear

a CPU core) to lower speed memory that is off-chip or f

ar-

ther from the core. In general, on-chip memory is mo

re expen-

sive and therefore relatively small. Addit

ionally, for many ar-

chitectures, the critical bottleneck is t

he bandwidth necessary to

transfer data from off-chip to on-chip m em

oryintimetocom-

plete the required calculations.

The increase in computational com p lexity

in HEVC com-

pared with earlier standards directly im

pacts the im plem enta-

tion an d design. F or systems with a s

ingle-core processor, the

increased complexity requires hig

her clock speeds. This has the

additional cost of increased pow e

r consumption and heat dis-

sipation. Fo r many applications

of interest today, the increased

clock rate is not desirable.

An alternative solution for addr

essing the increased computa-

tional com plexity is par alleli

sm. Parallelism in a vid eo system is

not a new concept. For examp l

e, today’s software based video

conferencing systems that

operate at resolutions up to 1080 p

(1920

1080 pixels) and fra

me rates of 60 frames per second

(fps) rely on high-level p

arallelism (i.e., encoders and decoders

that can process differe

nt portions of a video picture in a rela-

tively independent fas

hion) despite using the less computation-

ally complex H.264/A

VC and its scalable extension SVC. With

previous standards

, high-level parallelism within a picture may

be realized by parti

tioning the source frames using slices and

assigning each sli

ce to one of several processing cores. Slices

were o riginally d

esigned to map a bit-stream into smaller in-

dependently deco

dable ch unk s for transmission. The size of a

coded slice was t

ypically determined by the network character-

istics; for ex

ample, the size is often selected to be less than the

maximum trans

mission u nit (MTU) s ize of the network being

considered.

In practice, u

sing slices for parallelization results in a

number of dis

advantages. For example, the pixel segm entation

achievedbys

lices using only network constraints often result

in partiti

oning where the correlation existing in the pixel data is

reduced. T

his lowers the achieva ble coding efﬁciency. More-

over, sli

ces contain header information to facilitate independent

processi

ng of pixel data. With th e high er coding efﬁciency of

HEVC, th

is becomes problematic—it is p ossible to transmit

high re

solution video at low bit rates such that the overhead

introd

uced by a slice header is not negligible. Finally, for

appli

cations that require both parallelization and packetization,

it is

difﬁcult to use slices to achieve an optimal partitioning for

both

goals.

Tile

s provide an alternative partitioning that divides a pictu re

into

rectangular sections that are processed in a relatively inde-

pen

dent fashion. Fig. 1 illustrates an exam ple where a picture

下载后可阅读完整内容，剩余8页未读，立即下载

yawooh

粉丝: 1
资源: 1

HEVC标准中的Tile特性详解

An Overview of Multi-Task Learning in Deep Neural Networks.pdf

An Overview of YOLOv8's Application in Object Detection

Getting Started with DBeaver: An Overview of the Open Source Database Tool

an overview of ieee 802.11n.pdf

写一下关于《自主学习中基于多模态数据的学习风格高精度识别》期刊论文的英文框架

palladium z1 overview introduction.pdf

application.yml

Write a 5,000-word thesis on "Application of Sandbox Technology in Cybersecurity Engineering."

用英文表达：接下来有请Dr.Tobias来给大家介绍一下NEW Organization overview

最新资源