H.264-MPEG4-AVC Amendment 3: Scalable Video Coding for Enhanced Quality and Compatibility

需积分: 1 111 浏览量更新于2024-08-02 收藏 3.64MB PDF 举报

H.264-MPEG4-AVC-Version 8 Final Draft 是一项于2007年7月通过的重要标准更新，它隶属于ISO/IEC 14496-10和ITU-T H.264编码体系，旨在提供更为先进的视频编码技术。这次修订的主要内容包括： 1. **扩展视频质量能力**：通过增加四个简表——高级、高级10、高级4:2:2和高级4:4:4，这些新的编码格式能够支持更广泛的图像样点精度和更高的分辨率，特别是色度信号处理，显著提升了视频的清晰度和细节表现，从而拓宽了标准的应用领域。 2. **补充数据类型**：标准定义了新的补充数据类型，这进一步增强了编码的灵活性，使得视频能够适应不同的应用场景，如高清、超高清（UHD）以及对色彩和细节要求更高的专业级内容。 3. **技术一致性**：这个版本与ISO/IEC 14496-10标准保持技术上的同步，确保了两个标准之间的兼容性和互操作性，这对于开发者来说是一个重要的考量，降低了技术集成的复杂性。 4. **错误修正**：文档还包含了对已出版版本中的一些错误进行修正，提高了标准的准确性，确保了编码过程的精确性。 5. **组织与作者**：本次更新由Joint Video Team (JVT)，由ISO/IEC JTC1/SC29/WG11和ITU-T SG16 Q.6共同组成，编辑和联系人包括来自Heinrich Hertz Institute, Microsoft Corporation, SpinetiX SA等机构的专家，他们共同负责标准的制定和修订工作。这个Final Draft对于所有涉及H.264编码的领域，如视频编码、流媒体、数字电视和互联网传输，都有着重大的影响，因为它不仅提升了视频质量和编码效率，还推动了高清、超高清视频技术的发展。对于任何希望利用最新编码技术进行视频处理或开发相关应用的工程师和技术人员来说，理解和掌握这一修订版是至关重要的。

Table 9-21 – Values of variables m and n for ctxIdx from 227 to 275...........................................................................246

Table 9-22 – Values of variables m and n for ctxIdx from 277 to 337...........................................................................247

Table 9-23 – Values of variables m and n for ctxIdx from 338 to 398...........................................................................248

Table 9-24 – Values of variables m and n for ctxIdx from 402 to 459...........................................................................249

Table 9-25 – Values of variables m and n for ctxIdx from 460 to 483...........................................................................250

Table 9-26 – Values of variables m and n for ctxIdx from 484 to 571...........................................................................250

Table 9-27 – Values of variables m and n for ctxIdx from 572 to 659...........................................................................252

Table 9-28– Values of variables m and n for ctxIdx from 660 to 717............................................................................253

Table 9-29– Values of variables m and n for ctxIdx from 718 to 775............................................................................255

Table 9-30 – Values of variables m and n for ctxIdx from 776 to 863...........................................................................256

Table 9-31 – Values of variables m and n for ctxIdx from 864 to 951...........................................................................257

Table 9-32 – Values of variables m and n for ctxIdx from 952 to 1011.........................................................................259

Table 9-33 – Values of variables m and n for ctxIdx from 1012 to 1023.......................................................................260

Table 9-34 – Syntax elements and associated types of binarization, maxBinIdxCtx, and ctxIdxOffset.........................262

Table 9-35 – Bin string of the unary binarization (informative).....................................................................................264

Table 9-36 – Binarization for macroblock types in I slices ............................................................................................266

Table 9-37 – Binarization for macroblock types in P, SP, and B slices..........................................................................267

Table 9-38 – Binarization for sub-macroblock types in P, SP, and B slices...................................................................268

Table 9-39 – Assignment of ctxIdxInc to binIdx for all ctxIdxOffset values except those related to the syntax elements

coded_block_flag, significant_coeff_flag, last_significant_coeff_flag, and coeff_abs_level_minus1...................270

Table 9-40 – Assignment of ctxIdxBlockCatOffset to ctxBlockCat for syntax elements coded_block_flag,

significant_coeff_flag, last_significant_coeff_flag, and coeff_abs_level_minus1.................................................271

Table 9-41 – Specification of ctxIdxInc for specific values of ctxIdxOffset and binIdx................................................279

Table 9-42 – Specification of ctxBlockCat for the different blocks ...............................................................................280

Table 9-43 – Mapping of scanning position to ctxIdxInc for ctxBlockCat = = 5, 9, or 13...........................................281

Table 9-44 – Specification of rangeTabLPS depending on pStateIdx and qCodIRangeIdx...........................................285

Table 9-45 – State transition table ..................................................................................................................................286

Table A-1 – Level limits.................................................................................................................................................301

Table A-2 – Specification of cpbBrVclFactor and cpbBrNalFactor...............................................................................304

Table A-3 – Baseline profile level limits........................................................................................................................304

Table A-4 – Main, High, High 10, High 4:2:2, High 4:4:4 Predictive, High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra,

and CAVLC 4:4:4 Intra profile level limits............................................................................................................305

Table A-5 – Extended profile level limits.......................................................................................................................306

Table A-6 – Maximum frame rates (frames per second) for some example frame sizes................................................307

Table D-1 – Interpretation of pic_struct .........................................................................................................................337

Table D-2 – Mapping of ct_type to source picture scan.................................................................................................338

Table D-3 – Definition of counting_type values ............................................................................................................338

Table D-4 – scene_transition_type values ......................................................................................................................345

Table D-5 – model_id values..........................................................................................................................................352

Table D-6 – blending_mode_id values...........................................................................................................................353

Table D-7 – filter_hint_type values ................................................................................................................................360

File: JVT-X201.doc

Page: 17 Date Saved: 2007-10-26

Table E-1 – Meaning of sample aspect ratio indicator ...................................................................................................367

Table E-2 – Meaning of video_format............................................................................................................................368

Table E-3 – Colour primaries .........................................................................................................................................369

Table E-4 – Transfer characteristics ...............................................................................................................................370

Table E-5 – Matrix coefficients......................................................................................................................................373

Table E-6 – Divisor for computation of ∆t

fi,dpb

( n ) ........................................................................................................375

Table G-1 – Name association to slice_type for NAL units with nal_unit_type equal to 20..........................................416

Table G-2 – Memory management control operation (memory_management_control_operation) values ....................424

Table G-3 – Allowed collective macroblock types for slice_type..................................................................................426

Table G-4 – Macroblock types for EI slices. ..................................................................................................................426

Table G-5 – Scale values cS for transform coefficient level scaling ..............................................................................480

Table G-6 – Macroblock type predictors mbTypeILPred...............................................................................................497

Table G-7 – Sub-macroblock type predictors subMbTypeILPred[ mbPartIdx ] ............................................................498

Table G-8 – 16-phase luma interpolation filter for resampling in Intra_Base prediction...............................................505

Table G-9 – Mapping of ( nX, nY ) to coeffTokenIdx and vice versa............................................................................527

Table G-10 - Association of ctxIdx and syntax elements for each slice type in the initialisation process......................530

Table G-11 - Values of variables m and n for ctxIdx from 1024 to 1026.......................................................................530

Table G-12 - Values of variables m and n for ctxIdx from 1027 to 1029.......................................................................531

Table G-13 - Syntax elements and associated types of binarization, maxBinIdxCtx, and ctxIdxOffset ........................531

Table G-14 - Assignment of ctxIdxInc to binIdx for the ctxIdxOffset values related to the syntax elements

base_mode_flag and residual_prediction_flag........................................................................................................532

Table G-15 – Scalable Baseline profile level limits........................................................................................................539

Table G-16 – Specification of cpbBrVclFactor and cpbBrNalFactor.............................................................................539

Foreword

The International Telecommunication Union (ITU) is the United Nations specialized agency in the field of

telecommunications. The ITU Telecommunication Standardization Sector (ITU-T) is a permanent organ of ITU. ITU-T

is responsible for studying technical, operating and tariff questions and issuing Recommendations on them with a view

to standardising telecommunications on a world-wide basis. The World Telecommunication Standardization Assembly

(WTSA), which meets every four years, establishes the topics for study by the ITU-T study groups that, in turn, produce

Recommendations on these topics. The approval of ITU-T Recommendations is covered by the procedure laid down in

WTSA Resolution 1. In some areas of information technology that fall within ITU-T's purview, the necessary standards

are prepared on a collaborative basis with ISO and IEC.

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form

the specialised system for world-wide standardisation. National Bodies that are members of ISO and IEC participate in

the development of International Standards through technical committees established by the respective organisation to

deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual

interest. Other international organisations, governmental and non-governmental, in liaison with ISO and IEC, also take

part in the work. In the field of information technology, ISO and IEC have established a joint technical committee,

ISO/IEC JTC 1. Draft International Standards adopted by the joint technical committee are circulated to national bodies

for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a

vote.

This Recommendation | International Standard was prepared jointly by ITU-T SG 16 Q.6, also known as VCEG (Video

Coding Experts Group), and by ISO/IEC JTC 1/SC 29/WG 11, also known as MPEG (Moving Picture Experts Group).

VCEG was formed in 1997 to maintain prior ITU-T video coding standards and develop new video coding standard(s)

appropriate for a wide range of conversational and non-conversational services. MPEG was formed in 1988 to establish

standards for coding of moving pictures and associated audio for various applications such as digital storage media,

distribution, and communication.

In this Recommendation | International Standard Annexes A through E contain normative requirements and are an

integral part of this Recommendation | International Standard.

File: JVT-X201.doc

Page: 19 Date Saved: 2007-10-26

ITU-T Recommendation H.264

Advanced video coding for generic audiovisual services

0 Introduction

This clause does not form an integral part of this Recommendation | International Standard.

0.1 Prologue

This subclause does not form an integral part of this Recommendation | International Standard.

As the costs for both processing power and memory have reduced, network support for coded video data has

diversified, and advances in video coding technology have progressed, the need has arisen for an industry standard for

compressed video representation with substantially increased coding efficiency and enhanced robustness to network

environments. Toward these ends the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture

Experts Group (MPEG) formed a Joint Video Team (JVT) in 2001 for development of a new

Recommendation | International Standard.

0.2 Purpose

This subclause does not form an integral part of this Recommendation | International Standard.

This Recommendation | International Standard was developed in response to the growing need for higher compression

of moving pictures for various applications such as videoconferencing, digital storage media, television broadcasting,

internet streaming, and communication. It is also designed to enable the use of the coded video representation in a

flexible manner for a wide variety of network environments. The use of this Recommendation | International Standard

allows motion video to be manipulated as a form of computer data and to be stored on various storage media,

transmitted and received over existing and future networks and distributed on existing and future broadcasting channels.

0.3 Applications

This subclause does not form an integral part of this Recommendation | International Standard.

This Recommendation | International Standard is designed to cover a broad range of applications for video content

including but not limited to the following:

CATV Cable TV on optical networks, copper, etc.

DBS Direct broadcast satellite video services

DSL Digital subscriber line video services

DTTB Digital terrestrial television broadcasting

ISM Interactive storage media (optical disks, etc.)

MMM Multimedia mailing

MSPN Multimedia services over packet networks

RTC Real-time conversational services (videoconferencing, videophone, etc.)

RVS Remote video surveillance

SSM Serial storage media (digital VTR, etc.)

0.4 Publication and versions of this specification

This subclause does not form an integral part of this Recommendation | International Standard.

This specification has been jointly developed by ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving

Picture Experts Group. It is published as technically-aligned twin text in both organizations ITU-T and ISO/IEC.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 1 refers to the first approved version of this Recommendation |

International Standard. Version 1 was approved by ITU-T on 30 May 2003. The first published version in ISO/IEC

corresponded to version 1.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 2 refers to the integrated text containing the corrections specified in the

first technical corrigendum. The first fully-published version in the ITU-T was version 2 as approved by ITU-T on

7 May 2004, due to the development of the corrigendum during the publication process. Version 2 was also published in

integrated form by ISO/IEC.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 3 refers to the integrated text containing both the first technical

corrigendum (2004) and the first amendment, which is referred to as the "Fidelity range extensions". Version 3 was

approved by ITU-T on 1 March 2005.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 4 refers to the integrated text containing the first technical corrigendum

(2004), the first amendment (the "Fidelity range extensions"), and an additional technical corrigendum (2005).

Version 4 was approved by ITU-T on 13 September 2005. In both ITU-T and ISO/IEC, the next complete published

version after version 2 was version 4.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 5 refers to the integrated version 4 text with its specification of the High

4:4:4 profile removed.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 6 refers to the integrated version 5 text after its amendment to support

additional colour space indicators. In the ITU-T, the changes for versions 5 and 6 were approved on 13 June 2006 and

were published as a single amendment.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 7 refers to the integrated version 6 text after its amendment to define

five new profiles intended primarily for professional applications (the High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra,

CAVLC 4:4:4 Intra, and High 4:4:4 Predictive profiles) and two new types of supplemental enhancement information

(SEI) messages (the post-filter hint SEI message and the tone mapping information SEI message). Version 7 was

approved by ITU-T on 6 April 2007.

ITU-T Rec. H.264 | ISO/IEC 14496-10 version 8 (the current specification) refers to the integrated version 7 text after

its amendment to specify scalable video coding in three profiles (Scalable Baseline, Scalable High, and Scalable High

Intra profiles). Version 8 was approved by the ITU-T on day month 2007.

0.5 Profiles and levels

This subclause does not form an integral part of this Recommendation | International Standard.

This Recommendation | International Standard is designed to be generic in the sense that it serves a wide range of

applications, bit rates, resolutions, qualities, and services. Applications should cover, among other things, digital storage

media, television broadcasting and real-time communications. In the course of creating this Specification, various

requirements from typical applications have been considered, necessary algorithmic elements have been developed, and

these have been integrated into a single syntax. Hence, this Specification will facilitate video data interchange among

different applications.

Considering the practicality of implementing the full syntax of this Specification, however, a limited number of subsets

of the syntax are also stipulated by means of "profiles" and "levels". These and other related terms are formally defined

in clause 3.

A "profile" is a subset of the entire bitstream syntax that is specified by this Recommendation | International Standard.

Within the bounds imposed by the syntax of a given profile it is still possible to require a very large variation in the

performance of encoders and decoders depending upon the values taken by syntax elements in the bitstream such as the

specified size of the decoded pictures. In many applications, it is currently neither practical nor economic to implement

a decoder capable of dealing with all hypothetical uses of the syntax within a particular profile.

In order to deal with this problem, "levels" are specified within each profile. A level is a specified set of constraints

imposed on values of the syntax elements in the bitstream. These constraints may be simple limits on values.

Alternatively they may take the form of constraints on arithmetic combinations of values (e.g. picture width multiplied

by picture height multiplied by number of pictures decoded per second).

Coded video content conforming to this Recommendation | International Standard uses a common syntax. In order to

achieve a subset of the complete syntax, flags, parameters, and other syntax elements are included in the bitstream that

signal the presence or absence of syntactic elements that occur later in the bitstream.

0.6 Overview of the design characteristics

This subclause does not form an integral part of this Recommendation | International Standard.

剩余562页未读，继续阅读

shengixucui

粉丝: 0
资源: 1

H.264-MPEG4-AVC Amendment 3: Scalable Video Coding for Enhanced ...

最新资源

H.264-MPEG4-AVC Amendment 3: Scalable Video Coding for Enhanced ...

ISO/IEC 18000-3:2010 - RFID通信协议标准

Qi无线充电系统Power Class 0设计规范（V1.2.4终稿）

H.264-MPEG4-AVC-Version8-FinalDraft.pdf

Qi-PC0-part1&2-v1.2.4_FinalDraft.pdf

Qi Wireless Power Qi-PC0-part3-v1.2.4_FinalDraft

Qi Wireless Power Qi-PC0-part4-v1.2.4_FinalDraft

编剧软件FinalDraft_41143.zip

Qi Wireless Power Qi-PC0-part1&2-v1.2.4_FinalDraft

IPCC_AR6_WGII_FinalDraft_Chapter01.pdf

ISO IEC 18000-3-2010.pdf

最新资源