modes (compared to 8 such modes in H.264/MPEG-4
AVC), plus planar (surface fitting) and DC (flat) prediction
modes. The selected intra prediction modes are encoded by
deriving most probable modes (e.g., prediction directions)
based on those of previously decoded neighboring PBs.
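As a concrete illustration, the following C sketch mirrors the three-candidate most-probable-mode (MPM) list construction for luma intra modes (numbering: 0 = planar, 1 = DC, 2-34 = angular, 26 = vertical). The function name and the convention that an unavailable or non-intra neighbor contributes DC are illustrative; the candidate arithmetic follows the standardized derivation.

    /* Three-candidate MPM list from the left and above neighbor modes. */
    void derive_mpm_list(int left, int above, int mpm[3])
    {
        if (left == above) {
            if (left < 2) {           /* both neighbors planar or DC */
                mpm[0] = 0;           /* planar   */
                mpm[1] = 1;           /* DC       */
                mpm[2] = 26;          /* vertical */
            } else {                  /* angular: same mode, then +/-1 wrapped */
                mpm[0] = left;
                mpm[1] = 2 + ((left + 29) % 32);
                mpm[2] = 2 + ((left - 1) % 32);
            }
        } else {
            mpm[0] = left;
            mpm[1] = above;
            if (left != 0 && above != 0)      mpm[2] = 0;   /* planar   */
            else if (left != 1 && above != 1) mpm[2] = 1;   /* DC       */
            else                              mpm[2] = 26;  /* vertical */
        }
    }

The encoder then signals either an index into this list or, via an escape, the selected mode among the remaining ones.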
Quantization control: As in H.264/MPEG-4 AVC,
uniform reconstruction quantization (URQ) is used in
HEVC, with quantization scaling matrices supported for
the various transform block sizes.
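The decoder-side scaling can be sketched as follows (a simplification of the exact specification arithmetic): the step size doubles with every increase of 6 in the quantization parameter, the levelScale[] constants carry the fractional part of that geometric progression, m is the scaling-matrix entry (16 for the default flat matrix), and bd_shift is assumed here to fold together the bit-depth- and transform-size-dependent normalization.

    static const int levelScale[6] = { 40, 45, 51, 57, 64, 72 };

    /* Reconstruct one transform coefficient from its quantized level. */
    int dequantize_coeff(int level, int qp, int m, int bd_shift)
    {
        long long d = ((long long)level * m * levelScale[qp % 6]) << (qp / 6);
        d = (d + (1LL << (bd_shift - 1))) >> bd_shift;  /* round to nearest */
        if (d < -32768) d = -32768;                     /* clip to 16 bits  */
        if (d >  32767) d =  32767;
        return (int)d;
    }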
Entropy coding: Context adaptive binary arithmetic
coding (CABAC) is used for entropy coding. This is
similar to the CABAC scheme in H.264/MPEG-4 AVC, but has
been modified in several respects to improve its throughput
(especially on parallel-processing architectures) and its
compression performance, and to reduce its context memory
requirements.
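At the level of a single context model, the update structure is the one inherited from H.264/MPEG-4 AVC, as sketched below (a 6-bit probability state plus the value of the most probable symbol per context; the standardized 64-entry LPS transition table is assumed rather than reproduced). HEVC's throughput gains come mainly from higher-level changes such as grouping of bypass bins and reduced context counts, which are not visible in this fragment.

    #include <stdint.h>

    typedef struct { uint8_t state; uint8_t mps; } CabacContext;

    extern const uint8_t transIdxLps[64];  /* standardized transition table */

    /* Update a context model after decoding one bin. */
    void update_context(CabacContext *ctx, int bin)
    {
        if (bin == ctx->mps) {                   /* MPS: raise confidence   */
            if (ctx->state < 62) ctx->state++;
        } else {                                 /* LPS: fall back by table */
            if (ctx->state == 0) ctx->mps ^= 1;  /* flip MPS at equiprob.   */
            ctx->state = transIdxLps[ctx->state];
        }
    }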
In-loop deblocking filtering (DF): A deblocking filter
similar to the one used in H.264/MPEG-4 AVC is operated
within the inter-picture prediction loop. However, the
design is simplified in regard to its decision-making and
filtering processes, and is made friendlier to parallel
processing.
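The simplified decision-making can be illustrated by the boundary-strength (Bs) derivation for an edge between two blocks, sketched below for the uni-predictive case with illustrative field names: only three strengths remain, where Bs = 2 enables luma and chroma filtering and Bs = 1 enables luma filtering only (motion vectors are in quarter-sample units, so 4 equals one integer sample).

    #include <stdlib.h>

    typedef struct {
        int is_intra;
        int has_coeffs;   /* nonzero transform coefficients       */
        int ref_idx;      /* reference picture (single list here) */
        int mv_x, mv_y;   /* motion vector, quarter-sample units  */
    } BlockInfo;

    int boundary_strength(const BlockInfo *p, const BlockInfo *q)
    {
        if (p->is_intra || q->is_intra)
            return 2;
        if (p->has_coeffs || q->has_coeffs ||
            p->ref_idx != q->ref_idx ||
            abs(p->mv_x - q->mv_x) >= 4 || abs(p->mv_y - q->mv_y) >= 4)
            return 1;
        return 0;
    }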
Sample adaptive offset (SAO): A non-linear amplitude
mapping is introduced in the inter-picture prediction loop
after the deblocking filter. The goal is to better reconstruct
the original signal amplitudes by using a look-up table that
is described by a few additional parameters that can be
determined by histogram analysis at the encoder side.
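Of SAO's two operating modes, band offset shows the look-up-table character most directly. The sketch below assumes 8-bit samples and illustrative names: the five most significant bits of a sample select one of 32 equal-width bands, and signaled offsets are added in the four consecutive bands starting at band_pos.

    #include <stdint.h>

    uint8_t sao_band_offset(uint8_t sample, int band_pos, const int offset[4])
    {
        int band = sample >> 3;      /* 32 bands, each 8 values wide */
        int idx  = band - band_pos;
        if (idx >= 0 && idx < 4) {
            int v = sample + offset[idx];
            if (v < 0)   v = 0;      /* clip to the 8-bit range */
            if (v > 255) v = 255;
            return (uint8_t)v;
        }
        return sample;               /* bands without offsets pass through */
    }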
B. High-level syntax architecture
A number of design aspects new to the HEVC standard
improve flexibility for operation over a variety of applications
and network environments and improve robustness to data
losses. However, the high-level syntax architecture used in the
H.264/MPEG-4 AVC standard has generally been retained,
including the following features:
Parameter set structure: Parameter sets contain
information that can be shared for the decoding of several
regions of the decoded video. The parameter set structure
provides a robust mechanism for conveying data that are
essential to the decoding process. The concepts of
sequence and picture parameter sets from H.264/MPEG-4
AVC are augmented by a new video parameter set (VPS)
structure.
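A minimal sketch of the resulting referencing chain follows; the structs and fields are illustrative rather than actual syntax elements. Each slice header identifies a PPS, each PPS an SPS, and each SPS a VPS, so infrequently changing data can be transmitted once (and, if desired, out of band).

    typedef struct { int vps_id; /* ... */ } VPS;
    typedef struct { int sps_id; int vps_id; int pic_width, pic_height; } SPS;
    typedef struct { int pps_id; int sps_id; int init_qp; } PPS;
    typedef struct { int pps_id; /* slice_type, ... */ } SliceHeader;

    /* Resolve the active SPS for a slice, assuming tables indexed by id
     * that were filled in as parameter-set NAL units arrived. */
    const SPS *active_sps(const SliceHeader *sh,
                          const PPS pps[], const SPS sps[])
    {
        return &sps[pps[sh->pps_id].sps_id];
    }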
NAL unit syntax structure: Each syntax structure is
placed into a logical data packet called a network
abstraction layer (NAL) unit. Depending on the content of
a two-byte NAL unit header, it is possible to readily
identify the purpose of the associated payload data.
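The header layout is forbidden_zero_bit (1 bit), nal_unit_type (6 bits), nuh_layer_id (6 bits), and nuh_temporal_id_plus1 (3 bits), which the following sketch parses (struct and function names are illustrative):

    #include <stdint.h>

    typedef struct {
        unsigned type;         /* VPS, SPS, PPS, slice, SEI, ...   */
        unsigned layer_id;     /* reserved for layered extensions  */
        unsigned temporal_id;  /* temporal sublayer of the payload */
    } NalHeader;

    int parse_nal_header(const uint8_t b[2], NalHeader *h)
    {
        if (b[0] & 0x80) return -1;        /* forbidden_zero_bit must be 0 */
        if ((b[1] & 0x07) == 0) return -1; /* temporal_id_plus1 must be >0 */
        h->type        = (b[0] >> 1) & 0x3F;
        h->layer_id    = ((b[0] & 1) << 5) | (b[1] >> 3);
        h->temporal_id = (b[1] & 0x07) - 1;
        return 0;
    }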
Slices: A slice is a data structure that can be decoded
independently from other slices of the same picture, in
terms of entropy coding, signal prediction, and residual
signal reconstruction. (This describes ordinary slices; an
alternative form known as dependent slices is discussed
below.) A slice can either be an entire picture or a region of
a picture. One of the main purposes of slices is
re-synchronization in the event of data losses. In the case
of packetized transmission, the maximum number of
payload bits within a slice is typically restricted, and the
number of CTUs in the slice is often varied to minimize the
packetization overhead while keeping the size of each
packet within this bound.
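A sketch of such a packetization policy follows; ctu_bits[] is a hypothetical per-CTU size estimate, not part of the standard. The encoder grows the current slice CTU by CTU and opens a new slice whenever adding the next CTU would exceed the payload budget.

    /* Returns the number of slices; first_ctu_of_slice[] receives the
     * starting CTU index of each slice. */
    int build_slices(const int ctu_bits[], int num_ctus, int max_slice_bits,
                     int first_ctu_of_slice[])
    {
        int num_slices = 0, bits = 0;
        for (int i = 0; i < num_ctus; i++) {
            if (i == 0 || bits + ctu_bits[i] > max_slice_bits) {
                first_ctu_of_slice[num_slices++] = i; /* open a new slice */
                bits = 0;
            }
            bits += ctu_bits[i];
        }
        return num_slices;
    }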
SEI and VUI metadata: The syntax includes support for
various types of metadata known as supplemental
enhancement information (SEI) and video usability
information (VUI). Such data provides information about
the timing of the video pictures, the proper interpretation of
the color space used in the video signal, 3D stereoscopic
frame packing information, other “display hint”
information, etc.
C. Parallel decoding syntax and modified slice structuring
Finally, four new features are introduced in the HEVC
standard to enhance parallel processing capability or modify
the structuring of slice data for packetization purposes. Each of
them may have benefits in particular application contexts, and
it is generally up to the implementer of an encoder or decoder to
determine whether and how to take advantage of these features.
Tiles: The option to partition a picture into rectangular
regions called tiles has been specified. The main purpose
of tiles is to increase the capability for parallel processing
rather than to provide error resilience. Tiles are
independently decodable regions of a picture that are
encoded with some shared header information. Therefore,
they could additionally be used for the purpose of random
access to local regions of video pictures. A typical tile
configuration of a picture consists of segmenting the
picture into rectangular regions with approximately equal
numbers of CTUs in each tile. Tiles provide parallelism at
a coarser (picture/sub-picture) level of granularity,
and no sophisticated synchronization of threads is
necessary for their use.
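With uniform spacing, the boundary computation reduces to integer division; the sketch below derives tile column boundaries in CTU units (rows are analogous), making the tile widths differ by at most one CTU, in line with the typical configuration described above.

    /* col_bd[] must hold num_cols + 1 entries; col_bd[0] = 0 and
     * col_bd[num_cols] = pic_width_in_ctus. */
    void tile_col_boundaries(int pic_width_in_ctus, int num_cols, int col_bd[])
    {
        for (int i = 0; i <= num_cols; i++)
            col_bd[i] = i * pic_width_in_ctus / num_cols;
    }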
Wavefront parallel processing: When wavefront parallel
processing (WPP) is enabled, a slice is divided into rows of
CTUs. The first row is processed in an ordinary way; the
second row can begin to be processed after only a few
decisions have been made in the first row; the third row can
begin to be processed after only a few decisions have been
made in the second row; etc. The context models of the
entropy coder in each row are inferred from those in the
preceding row with a small fixed processing lag. WPP
provides a form of processing parallelism at a rather fine
level of granularity, i.e., within a slice. WPP may often
provide better compression performance than tiles (and
avoid some visual artifacts that may be induced by tiles).
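The dependency can be expressed as a simple readiness test, sketched below with an illustrative row-major completion map: CTU (row, col) may start once its left neighbor is finished and the row above has been processed through column col + 1, the point from which each row's entropy-coding context is inherited.

    /* done[] marks finished CTUs; width is the picture width in CTUs. */
    int wpp_can_start(int row, int col, int width, const int *done)
    {
        int up = (col + 1 < width) ? col + 1 : width - 1;
        int left_ok  = (col == 0) || done[row * width + col - 1];
        int above_ok = (row == 0) || done[(row - 1) * width + up];
        return left_ok && above_ok;
    }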