没有合适的资源?快使用搜索试试~ 我知道了~
首页H.264-MPEG4-AVC Amendment 3: Scalable Video Coding for Enhanced Quality and Compatibility
H.264-MPEG4-AVC Amendment 3: Scalable Video Coding for Enhanced ...
需积分: 1 4 下载量 28 浏览量
更新于2024-08-02
收藏 3.64MB PDF 举报
H.264-MPEG4-AVC-Version 8 Final Draft 是一项于2007年7月通过的重要标准更新,它隶属于ISO/IEC 14496-10和ITU-T H.264编码体系,旨在提供更为先进的视频编码技术。这次修订的主要内容包括:
1. **扩展视频质量能力**:通过增加四个简表——高级、高级10、高级4:2:2和高级4:4:4,这些新的编码格式能够支持更广泛的图像样点精度和更高的分辨率,特别是色度信号处理,显著提升了视频的清晰度和细节表现,从而拓宽了标准的应用领域。
2. **补充数据类型**:标准定义了新的补充数据类型,这进一步增强了编码的灵活性,使得视频能够适应不同的应用场景,如高清、超高清(UHD)以及对色彩和细节要求更高的专业级内容。
3. **技术一致性**:这个版本与ISO/IEC 14496-10标准保持技术上的同步,确保了两个标准之间的兼容性和互操作性,这对于开发者来说是一个重要的考量,降低了技术集成的复杂性。
4. **错误修正**:文档还包含了对已出版版本中的一些错误进行修正,提高了标准的准确性,确保了编码过程的精确性。
5. **组织与作者**:本次更新由Joint Video Team (JVT),由ISO/IEC JTC1/SC29/WG11和ITU-T SG16 Q.6共同组成,编辑和联系人包括来自Heinrich Hertz Institute, Microsoft Corporation, SpinetiX SA等机构的专家,他们共同负责标准的制定和修订工作。
这个Final Draft对于所有涉及H.264编码的领域,如视频编码、流媒体、数字电视和互联网传输,都有着重大的影响,因为它不仅提升了视频质量和编码效率,还推动了高清、超高清视频技术的发展。对于任何希望利用最新编码技术进行视频处理或开发相关应用的工程师和技术人员来说,理解和掌握这一修订版是至关重要的。
Table 9-21 – Values of variables m and n for ctxIdx from 227 to 275...........................................................................246
Table 9-22 – Values of variables m and n for ctxIdx from 277 to 337...........................................................................247
Table 9-23 – Values of variables m and n for ctxIdx from 338 to 398...........................................................................248
Table 9-24 – Values of variables m and n for ctxIdx from 402 to 459...........................................................................249
Table 9-25 – Values of variables m and n for ctxIdx from 460 to 483...........................................................................250
Table 9-26 – Values of variables m and n for ctxIdx from 484 to 571...........................................................................250
Table 9-27 – Values of variables m and n for ctxIdx from 572 to 659...........................................................................252
Table 9-28– Values of variables m and n for ctxIdx from 660 to 717............................................................................253
Table 9-29– Values of variables m and n for ctxIdx from 718 to 775............................................................................255
Table 9-30 – Values of variables m and n for ctxIdx from 776 to 863...........................................................................256
Table 9-31 – Values of variables m and n for ctxIdx from 864 to 951...........................................................................257
Table 9-32 – Values of variables m and n for ctxIdx from 952 to 1011.........................................................................259
Table 9-33 – Values of variables m and n for ctxIdx from 1012 to 1023.......................................................................260
Table 9-34 – Syntax elements and associated types of binarization, maxBinIdxCtx, and ctxIdxOffset.........................262
Table 9-35 – Bin string of the unary binarization (informative).....................................................................................264
Table 9-36 – Binarization for macroblock types in I slices ............................................................................................266
Table 9-37 – Binarization for macroblock types in P, SP, and B slices..........................................................................267
Table 9-38 – Binarization for sub-macroblock types in P, SP, and B slices...................................................................268
Table 9-39 – Assignment of ctxIdxInc to binIdx for all ctxIdxOffset values except those related to the syntax elements
coded_block_flag, significant_coeff_flag, last_significant_coeff_flag, and coeff_abs_level_minus1...................270
Table 9-40 – Assignment of ctxIdxBlockCatOffset to ctxBlockCat for syntax elements coded_block_flag,
significant_coeff_flag, last_significant_coeff_flag, and coeff_abs_level_minus1.................................................271
Table 9-41 – Specification of ctxIdxInc for specific values of ctxIdxOffset and binIdx................................................279
Table 9-42 – Specification of ctxBlockCat for the different blocks ...............................................................................280
Table 9-43 – Mapping of scanning position to ctxIdxInc for ctxBlockCat = = 5, 9, or 13...........................................281
Table 9-44 – Specification of rangeTabLPS depending on pStateIdx and qCodIRangeIdx...........................................285
Table 9-45 – State transition table ..................................................................................................................................286
Table A-1 – Level limits.................................................................................................................................................301
Table A-2 – Specification of cpbBrVclFactor and cpbBrNalFactor...............................................................................304
Table A-3 – Baseline profile level limits........................................................................................................................304
Table A-4 – Main, High, High 10, High 4:2:2, High 4:4:4 Predictive, High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra,
and CAVLC 4:4:4 Intra profile level limits............................................................................................................305
Table A-5 – Extended profile level limits.......................................................................................................................306
Table A-6 – Maximum frame rates (frames per second) for some example frame sizes................................................307
Table D-1 – Interpretation of pic_struct .........................................................................................................................337
Table D-2 – Mapping of ct_type to source picture scan.................................................................................................338
Table D-3 – Definition of counting_type values ............................................................................................................338
Table D-4 – scene_transition_type values ......................................................................................................................345
Table D-5 – model_id values..........................................................................................................................................352
Table D-6 – blending_mode_id values...........................................................................................................................353
Table D-7 – filter_hint_type values ................................................................................................................................360
File: JVT-X201.doc
Page: 17 Date Saved: 2007-10-26
Table E-1 – Meaning of sample aspect ratio indicator ...................................................................................................367
Table E-2 – Meaning of video_format............................................................................................................................368
Table E-3 – Colour primaries .........................................................................................................................................369
Table E-4 – Transfer characteristics ...............................................................................................................................370
Table E-5 – Matrix coefficients......................................................................................................................................373
Table E-6 – Divisor for computation of ∆t
fi,dpb
( n ) ........................................................................................................375
Table G-1 – Name association to slice_type for NAL units with nal_unit_type equal to 20..........................................416
Table G-2 – Memory management control operation (memory_management_control_operation) values ....................424
Table G-3 – Allowed collective macroblock types for slice_type..................................................................................426
Table G-4 – Macroblock types for EI slices. ..................................................................................................................426
Table G-5 – Scale values cS for transform coefficient level scaling ..............................................................................480
Table G-6 – Macroblock type predictors mbTypeILPred...............................................................................................497
Table G-7 – Sub-macroblock type predictors subMbTypeILPred[ mbPartIdx ] ............................................................498
Table G-8 – 16-phase luma interpolation filter for resampling in Intra_Base prediction...............................................505
Table G-9 – Mapping of ( nX, nY ) to coeffTokenIdx and vice versa............................................................................527
Table G-10 - Association of ctxIdx and syntax elements for each slice type in the initialisation process......................530
Table G-11 - Values of variables m and n for ctxIdx from 1024 to 1026.......................................................................530
Table G-12 - Values of variables m and n for ctxIdx from 1027 to 1029.......................................................................531
Table G-13 - Syntax elements and associated types of binarization, maxBinIdxCtx, and ctxIdxOffset ........................531
Table G-14 - Assignment of ctxIdxInc to binIdx for the ctxIdxOffset values related to the syntax elements
base_mode_flag and residual_prediction_flag........................................................................................................532
Table G-15 – Scalable Baseline profile level limits........................................................................................................539
Table G-16 – Specification of cpbBrVclFactor and cpbBrNalFactor.............................................................................539
Foreword
The International Telecommunication Union (ITU) is the United Nations specialized agency in the field of
telecommunications. The ITU Telecommunication Standardization Sector (ITU-T) is a permanent organ of ITU. ITU-T
is responsible for studying technical, operating and tariff questions and issuing Recommendations on them with a view
to standardising telecommunications on a world-wide basis. The World Telecommunication Standardization Assembly
(WTSA), which meets every four years, establishes the topics for study by the ITU-T study groups that, in turn, produce
Recommendations on these topics. The approval of ITU-T Recommendations is covered by the procedure laid down in
WTSA Resolution 1. In some areas of information technology that fall within ITU-T's purview, the necessary standards
are prepared on a collaborative basis with ISO and IEC.
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form
the specialised system for world-wide standardisation. National Bodies that are members of ISO and IEC participate in
the development of International Standards through technical committees established by the respective organisation to
deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual
interest. Other international organisations, governmental and non-governmental, in liaison with ISO and IEC, also take
part in the work. In the field of information technology, ISO and IEC have established a joint technical committee,
ISO/IEC JTC 1. Draft International Standards adopted by the joint technical committee are circulated to national bodies
for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a
vote.
This Recommendation | International Standard was prepared jointly by ITU-T SG 16 Q.6, also known as VCEG (Video
Coding Experts Group), and by ISO/IEC JTC 1/SC 29/WG 11, also known as MPEG (Moving Picture Experts Group).
VCEG was formed in 1997 to maintain prior ITU-T video coding standards and develop new video coding standard(s)
appropriate for a wide range of conversational and non-conversational services. MPEG was formed in 1988 to establish
standards for coding of moving pictures and associated audio for various applications such as digital storage media,
distribution, and communication.
In this Recommendation | International Standard Annexes A through E contain normative requirements and are an
integral part of this Recommendation | International Standard.
File: JVT-X201.doc
Page: 19 Date Saved: 2007-10-26
ITU-T Recommendation H.264
Advanced video coding for generic audiovisual services
0 Introduction
This clause does not form an integral part of this Recommendation | International Standard.
0.1 Prologue
This subclause does not form an integral part of this Recommendation | International Standard.
As the costs for both processing power and memory have reduced, network support for coded video data has
diversified, and advances in video coding technology have progressed, the need has arisen for an industry standard for
compressed video representation with substantially increased coding efficiency and enhanced robustness to network
environments. Toward these ends the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture
Experts Group (MPEG) formed a Joint Video Team (JVT) in 2001 for development of a new
Recommendation | International Standard.
0.2 Purpose
This subclause does not form an integral part of this Recommendation | International Standard.
This Recommendation | International Standard was developed in response to the growing need for higher compression
of moving pictures for various applications such as videoconferencing, digital storage media, television broadcasting,
internet streaming, and communication. It is also designed to enable the use of the coded video representation in a
flexible manner for a wide variety of network environments. The use of this Recommendation | International Standard
allows motion video to be manipulated as a form of computer data and to be stored on various storage media,
transmitted and received over existing and future networks and distributed on existing and future broadcasting channels.
0.3 Applications
This subclause does not form an integral part of this Recommendation | International Standard.
This Recommendation | International Standard is designed to cover a broad range of applications for video content
including but not limited to the following:
CATV Cable TV on optical networks, copper, etc.
DBS Direct broadcast satellite video services
DSL Digital subscriber line video services
DTTB Digital terrestrial television broadcasting
ISM Interactive storage media (optical disks, etc.)
MMM Multimedia mailing
MSPN Multimedia services over packet networks
RTC Real-time conversational services (videoconferencing, videophone, etc.)
RVS Remote video surveillance
SSM Serial storage media (digital VTR, etc.)
0.4 Publication and versions of this specification
This subclause does not form an integral part of this Recommendation | International Standard.
This specification has been jointly developed by ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving
Picture Experts Group. It is published as technically-aligned twin text in both organizations ITU-T and ISO/IEC.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 1 refers to the first approved version of this Recommendation |
International Standard. Version 1 was approved by ITU-T on 30 May 2003. The first published version in ISO/IEC
corresponded to version 1.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 2 refers to the integrated text containing the corrections specified in the
first technical corrigendum. The first fully-published version in the ITU-T was version 2 as approved by ITU-T on
7 May 2004, due to the development of the corrigendum during the publication process. Version 2 was also published in
integrated form by ISO/IEC.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 3 refers to the integrated text containing both the first technical
corrigendum (2004) and the first amendment, which is referred to as the "Fidelity range extensions". Version 3 was
approved by ITU-T on 1 March 2005.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 4 refers to the integrated text containing the first technical corrigendum
(2004), the first amendment (the "Fidelity range extensions"), and an additional technical corrigendum (2005).
Version 4 was approved by ITU-T on 13 September 2005. In both ITU-T and ISO/IEC, the next complete published
version after version 2 was version 4.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 5 refers to the integrated version 4 text with its specification of the High
4:4:4 profile removed.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 6 refers to the integrated version 5 text after its amendment to support
additional colour space indicators. In the ITU-T, the changes for versions 5 and 6 were approved on 13 June 2006 and
were published as a single amendment.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 7 refers to the integrated version 6 text after its amendment to define
five new profiles intended primarily for professional applications (the High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra,
CAVLC 4:4:4 Intra, and High 4:4:4 Predictive profiles) and two new types of supplemental enhancement information
(SEI) messages (the post-filter hint SEI message and the tone mapping information SEI message). Version 7 was
approved by ITU-T on 6 April 2007.
ITU-T Rec. H.264 | ISO/IEC 14496-10 version 8 (the current specification) refers to the integrated version 7 text after
its amendment to specify scalable video coding in three profiles (Scalable Baseline, Scalable High, and Scalable High
Intra profiles). Version 8 was approved by the ITU-T on day month 2007.
0.5 Profiles and levels
This subclause does not form an integral part of this Recommendation | International Standard.
This Recommendation | International Standard is designed to be generic in the sense that it serves a wide range of
applications, bit rates, resolutions, qualities, and services. Applications should cover, among other things, digital storage
media, television broadcasting and real-time communications. In the course of creating this Specification, various
requirements from typical applications have been considered, necessary algorithmic elements have been developed, and
these have been integrated into a single syntax. Hence, this Specification will facilitate video data interchange among
different applications.
Considering the practicality of implementing the full syntax of this Specification, however, a limited number of subsets
of the syntax are also stipulated by means of "profiles" and "levels". These and other related terms are formally defined
in clause 3.
A "profile" is a subset of the entire bitstream syntax that is specified by this Recommendation | International Standard.
Within the bounds imposed by the syntax of a given profile it is still possible to require a very large variation in the
performance of encoders and decoders depending upon the values taken by syntax elements in the bitstream such as the
specified size of the decoded pictures. In many applications, it is currently neither practical nor economic to implement
a decoder capable of dealing with all hypothetical uses of the syntax within a particular profile.
In order to deal with this problem, "levels" are specified within each profile. A level is a specified set of constraints
imposed on values of the syntax elements in the bitstream. These constraints may be simple limits on values.
Alternatively they may take the form of constraints on arithmetic combinations of values (e.g. picture width multiplied
by picture height multiplied by number of pictures decoded per second).
Coded video content conforming to this Recommendation | International Standard uses a common syntax. In order to
achieve a subset of the complete syntax, flags, parameters, and other syntax elements are included in the bitstream that
signal the presence or absence of syntactic elements that occur later in the bitstream.
0.6 Overview of the design characteristics
This subclause does not form an integral part of this Recommendation | International Standard.
剩余562页未读,继续阅读
2008-06-26 上传
2011-02-16 上传
2024-10-21 上传
2024-10-21 上传
2024-10-21 上传
shengixucui
- 粉丝: 0
- 资源: 1
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- Postman安装与功能详解:适用于API测试与HTTP请求
- Dart打造简易Web服务器教程:simple-server-dart
- FFmpeg 4.4 快速搭建与环境变量配置教程
- 牛顿井在围棋中的应用:利用牛顿多项式求根技术
- SpringBoot结合MySQL实现MQTT消息持久化教程
- C语言实现水仙花数输出方法详解
- Avatar_Utils库1.0.10版本发布,Python开发者必备工具
- Python爬虫实现漫画榜单数据处理与可视化分析
- 解压缩教材程序文件的正确方法
- 快速搭建Spring Boot Web项目实战指南
- Avatar Utils 1.8.1 工具包的安装与使用指南
- GatewayWorker扩展包压缩文件的下载与使用指南
- 实现饮食目标的开源Visual Basic编码程序
- 打造个性化O'RLY动物封面生成器
- Avatar_Utils库打包文件安装与使用指南
- Python端口扫描工具的设计与实现要点解析
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功