1652 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 22, NO. 12, DECEMBER 2012
H.264/MPEG-4 AVC). Similar to H.264/MPEG-4 AVC,
multiple reference pictures are used. For each PB, either
one or two motion vectors can be transmitted, resulting
in unipredictive or bipredictive coding, respectively.
As in H.264/MPEG-4 AVC, a scaling and offset
operation may be applied to the prediction signal(s) in
a manner known as weighted prediction.
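The scale-and-offset operation can be illustrated with a short sketch. This is not the normative process; the function name, the `log2_denom` weight denominator, and the exact rounding and clipping details are simplifying assumptions for illustration only:

```python
def weighted_pred(pred, w, offset, log2_denom, bit_depth=8):
    """Illustrative scale-and-offset (weighted prediction) of one sample.

    The prediction sample is scaled by a weight w (with a log2_denom
    fractional precision), rounded, offset, and clipped to the sample
    range. The standard's normative rules differ in detail.
    """
    max_val = (1 << bit_depth) - 1
    rounding = 1 << (log2_denom - 1) if log2_denom > 0 else 0
    val = ((pred * w + rounding) >> log2_denom) + offset
    return min(max(val, 0), max_val)
```

With `log2_denom = 6`, a weight of 64 corresponds to unit gain, so only the offset changes the sample; results are clipped to the valid amplitude range.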
7) Intrapicture prediction: The decoded boundary samples
of adjacent blocks are used as reference data for spa-
tial prediction in regions where interpicture prediction
is not performed. Intrapicture prediction supports 33
directional modes (compared to eight such modes in
H.264/MPEG-4 AVC), plus planar (surface fitting) and
DC (flat) prediction modes. The selected intrapicture
prediction modes are encoded by deriving most probable
modes (e.g., prediction directions) based on those of
previously decoded neighboring PBs.
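The DC (flat) mode mentioned above admits a compact illustration: the block is filled with the rounded mean of the decoded boundary samples of the neighboring blocks. This is a simplified sketch (the standard additionally applies boundary filtering to small luma blocks, which is omitted here):

```python
def dc_prediction(left, top):
    """DC ("flat") intra prediction sketch: fill an NxN block with the
    rounded mean of the reconstructed boundary samples taken from the
    left and above neighbors."""
    n = len(left) + len(top)
    dc = (sum(left) + sum(top) + n // 2) // n  # rounded average
    size = len(top)
    return [[dc] * size for _ in range(size)]
```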
8) Quantization control: As in H.264/MPEG-4 AVC, uni-
form reconstruction quantization (URQ) is used in
HEVC, with quantization scaling matrices supported for
the various transform block sizes.
9) Entropy coding: Context adaptive binary arithmetic cod-
ing (CABAC) is used for entropy coding. This is sim-
ilar to the CABAC scheme in H.264/MPEG-4 AVC,
but has been enhanced to increase its throughput speed
(especially for parallel-processing architectures) and
its compression performance, and to
reduce its context memory requirements.
10) In-loop deblocking filtering: A deblocking filter similar
to the one used in H.264/MPEG-4 AVC is operated
within the interpicture prediction loop. However, the
design is simplified in regard to its decision-making and
filtering processes, and it is more amenable to parallel
processing.
11) Sample adaptive offset (SAO): A nonlinear amplitude
mapping is introduced within the interpicture prediction
loop after the deblocking filter. Its goal is to better
reconstruct the original signal amplitudes by using a
look-up table that is described by a few additional
parameters that can be determined by histogram analysis
at the encoder side.
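One flavor of this mapping, the band offset, can be sketched as follows. The sample amplitude range is split into 32 equal bands, and samples falling into a run of four consecutive bands receive transmitted offsets. The function name and the choice of four offsets starting at `band_start` follow the band-offset mode of SAO, but this is an illustrative sketch rather than the normative process:

```python
def sao_band_offset(samples, band_start, offsets, bit_depth=8):
    """Band-offset sketch of SAO: the amplitude range is divided into
    32 equal bands; samples whose band index falls in the signaled run
    of consecutive bands get the corresponding offset added, with the
    result clipped to the valid sample range."""
    shift = bit_depth - 5          # 32 bands => band index = sample >> shift
    max_val = (1 << bit_depth) - 1
    out = []
    for s in samples:
        band = s >> shift
        if band_start <= band < band_start + len(offsets):
            s = min(max(s + offsets[band - band_start], 0), max_val)
        out.append(s)
    return out
```

An encoder can pick `band_start` and the offsets by analyzing a histogram of the reconstruction error per band, as the text above indicates.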
B. High-Level Syntax Architecture
A number of design aspects new to the HEVC standard
improve flexibility for operation over a variety of applications
and network environments and improve robustness to data
losses. However, the high-level syntax architecture used in
the H.264/MPEG-4 AVC standard has generally been retained,
including the following features.
1) Parameter set structure: Parameter sets contain informa-
tion that can be shared for the decoding of several re-
gions of the decoded video. The parameter set structure
provides a robust mechanism for conveying data that are
essential to the decoding process. The concepts of se-
quence and picture parameter sets from H.264/MPEG-4
AVC are augmented by a new video parameter set (VPS)
structure.
2) NAL unit syntax structure: Each syntax structure is
placed into a logical data packet called a network
abstraction layer (NAL) unit. Using the content of a two-
byte NAL unit header, it is possible to readily identify
the purpose of the associated payload data.
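The two-byte header carries four fixed-length fields: a forbidden zero bit (1 bit), the NAL unit type (6 bits), a layer identifier (6 bits), and a temporal identifier plus one (3 bits). A minimal parsing sketch (field names follow the standard's syntax element names; the helper function itself is illustrative):

```python
def parse_nal_header(b0, b1):
    """Parse the two-byte HEVC NAL unit header into its four fields:
    forbidden_zero_bit (1 bit), nal_unit_type (6 bits),
    nuh_layer_id (6 bits), nuh_temporal_id_plus1 (3 bits)."""
    return {
        "forbidden_zero_bit": (b0 >> 7) & 0x1,
        "nal_unit_type": (b0 >> 1) & 0x3F,
        "nuh_layer_id": ((b0 & 0x1) << 5) | ((b1 >> 3) & 0x1F),
        "nuh_temporal_id_plus1": b1 & 0x7,
    }
```

For example, the byte pair 0x40 0x01 decodes to NAL unit type 32 (a video parameter set) in the base layer at the lowest temporal sublayer.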
3) Slices: A slice is a data structure that can be decoded
independently from other slices of the same picture, in
terms of entropy coding, signal prediction, and residual
signal reconstruction. A slice can either be an entire
picture or a region of a picture. One of the main
purposes of slices is resynchronization in the event of
data losses. In the case of packetized transmission, the
maximum number of payload bits within a slice is
typically restricted, and the number of CTUs in the slice
is often varied to minimize the packetization overhead
while keeping the size of each packet within this bound.
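A simple way to realize this is a greedy packing loop: accumulate CTUs into the current slice until adding the next CTU would exceed the payload bound, then start a new slice. The sketch below is one possible encoder-side strategy, not something mandated by the standard, and it assumes every CTU fits within the bound on its own:

```python
def pack_ctus_into_slices(ctu_sizes_bits, max_payload_bits):
    """Greedy packetization sketch: group consecutive CTU indices into
    slices so that each slice's total coded size stays within the
    payload bound. Returns a list of slices, each a list of CTU indices."""
    slices, current, used = [], [], 0
    for n, bits in enumerate(ctu_sizes_bits):
        if current and used + bits > max_payload_bits:
            slices.append(current)   # close the current slice
            current, used = [], 0
        current.append(n)
        used += bits
    if current:
        slices.append(current)
    return slices
```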
4) Supplemental enhancement information (SEI) and video
usability information (VUI) metadata: The syntax in-
cludes support for various types of metadata known as
SEI and VUI. Such data provide information about the
timing of the video pictures, the proper interpretation of
the color space used in the video signal, 3-D stereoscopic
frame packing information, other display hint informa-
tion, and so on.
C. Parallel Decoding Syntax and Modified Slice Structuring
Finally, four new features are introduced in the HEVC stan-
dard to enhance the parallel processing capability or modify
the structuring of slice data for packetization purposes. Each
of them may have benefits in particular application contexts,
and it is generally up to the implementer of an encoder or
decoder to determine whether and how to take advantage of
these features.
1) Tiles: The option to partition a picture into rectangular
regions called tiles has been specified. The main pur-
pose of tiles is to increase the capability for parallel
processing rather than provide error resilience. Tiles are
independently decodable regions of a picture that are
encoded with some shared header information. Tiles can
additionally be used for the purpose of spatial random
access to local regions of video pictures. A typical
tile configuration of a picture consists of segmenting
the picture into rectangular regions with approximately
equal numbers of CTUs in each tile. Tiles provide
parallelism at a coarser level of granularity (picture/
subpicture), and no sophisticated synchronization of
threads is necessary for their use.
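The uniform layout described above, with approximately equal numbers of CTUs per tile, can be sketched with a one-dimensional boundary computation applied independently to the CTU columns and rows (the function name is illustrative; the rounding convention is one plausible choice, not necessarily the normative one):

```python
def tile_boundaries(num_ctus, num_tiles):
    """Split num_ctus CTU columns (or rows) into num_tiles spans of
    approximately equal size; returns the num_tiles + 1 boundary
    positions, from 0 up to num_ctus."""
    return [(i * num_ctus) // num_tiles for i in range(num_tiles + 1)]
```

Applying this separately to the picture width and height in CTUs yields the rectangular tile grid; each tile is then the CTUs between consecutive column and row boundaries.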
2) Wavefront parallel processing: When wavefront parallel
processing (WPP) is enabled, a slice is divided into
rows of CTUs. The first row is processed in an ordinary
way, the second row can begin to be processed after
only two CTUs have been processed in the first row,
the third row can begin to be processed after only
two CTUs have been processed in the second row,
and so on. The context models of the entropy coder
in each row are inferred from those in the preceding
row with a two-CTU processing lag. WPP provides a
form of processing parallelism at a rather fine level of