Bitstream-based Model Standard for 4K/UHD:
ITU-T P.1204.3 – Model Details, Evaluation,
Analysis and Open Source Implementation
Rakesh Rao Ramachandra Rao∗, Steve Göring∗, Peter List†,
Werner Robitza∗, Bernhard Feiten†, Ulf Wüstenhagen†, Alexander Raake∗
∗Dept. of Audio Visual Technology, Technische Universität Ilmenau, Germany
Email: [rakesh-rao.ramachandra-rao, steve.goering, werner.robitza, alexander.raake]@tu-ilmenau.de
†Deutsche Telekom AG, Technology & Innovation, Germany
Email: [peter.list, bernhard.feiten, ulf.wuestenhagen]@telekom.de
Abstract—With the increasing demand of users to view
high-quality videos under constrained bandwidth, typically realized
using HTTP-based adaptive streaming, it becomes more and
more important to accurately determine the quality of the encoded
videos, to assess and possibly optimize the overall streaming
quality. In this paper, we describe a bitstream-based no-reference
video quality model developed as part of the latest model-
development competition conducted by ITU-T Study Group 12
and the Video Quality Experts Group (VQEG), “P.NATS Phase
2”. It is now part of the new P.1204 series of Recommendations as
P.1204.3. It can be applied to bitstreams encoded with H.264/AVC,
HEVC and VP9, using various encoding options, including
resolution, bitrate, framerate and typical encoder settings such as
number of passes, rate control variants and speeds. The proposed
model follows an ensemble-modelling–inspired approach with
weighted parametric and machine-learning parts to efficiently
leverage the performance of both approaches. The paper provides
details about the general approach to modelling, the features used
and the final feature aggregation. The model creates per-segment
and per-second video quality scores on the 5-point Absolute
Category Rating scale, and is applicable to segments of 5–10
seconds duration. It covers both PC/TV and mobile/tablet viewing
scenarios. We outline the databases on which the model was
trained and validated as part of the competition, and perform
an additional evaluation using a total of four independently
created databases, where resolutions varied from 360p to 2160p,
and frame rates from 15–60fps, using realistic coding and
bitrate settings. We found that the model performs well on the
independent dataset, with a Pearson correlation of 0.942 and
an RMSE of 0.42. We also provide an open-source reference
implementation of the described P.1204.3 model, as well as the
multi-codec bitstream parser required to extract the input data,
which is not part of the standard.
Index Terms—bitstream model, video quality, machine learning, HTTP adaptive streaming
2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX)
I. INTRODUCTION
With the advancement in image capture technology in both
cameras and mobile phones, the number of content producers
generating and streaming 4K content is increasing rapidly. In
addition to this, most of today’s streaming platforms such
as Netflix, YouTube, or Amazon Prime Video also stream
content in 4K to provide a more immersive experience to the
end-user. Along with the need of Internet Service Providers
(ISP) and “over-the top” (OTT) streaming providers to ensure
a high degree of satisfaction among their customers, these
developments highlight the need for efficient video quality
algorithms that can be used to assess, benchmark and possibly
optimize the overall streaming quality, and enhance the Quality
of Experience (QoE) for the end-user.
HTTP-based adaptive streaming (HAS) is the preferred
technology for streaming video content over the Internet, with
Dynamic Adaptive Streaming over HTTP (DASH) or HTTP
Live Streaming (HLS) being two popular implementations.
As a consequence, video quality algorithms need to consider
HAS-specific features such as quality switches and stalling
during playout.
In view of both these developments, namely, the pervasion
of high-quality contents and the increasing usage of HAS-
based technologies for streaming, it becomes important to
develop adequate video quality algorithms. In general, video
quality models can be distinguished into three main categories
depending on their input data, namely, 1) media-layer
or pixel-based models, 2) bitstream-layer models, and
3) hybrid models. Here, media-layer models use the decoded
video to estimate video quality scores, bitstream-layer models
use the encoded bitstream for video quality estimation, and
hybrid models use a combination of decoded video and
encoded bitstream as input to predict video quality [15, 1].
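The distinction between the three categories lies purely in the input data each model consumes. A minimal sketch illustrates this; the function names and the toy scoring logic are hypothetical and are not part of ITU-T P.1204 or any other standard:

```python
# Hypothetical sketch of the three model categories, distinguished by
# input type. The names and scoring logic below are illustrative only,
# NOT taken from any standardized model.

def media_layer_model(decoded_frames: list) -> float:
    """Media-layer (pixel-based): input is the decoded video only."""
    # Placeholder: a real model would compute pixel-domain features here.
    return 5.0 if decoded_frames else 1.0

def bitstream_layer_model(bitstream_stats: dict) -> float:
    """Bitstream-layer: input is the encoded bitstream, without decoding."""
    # Toy mapping from an average quantization parameter (QP) to the
    # 5-point ACR scale: higher QP -> lower predicted quality.
    qp = bitstream_stats.get("avg_qp", 30)
    return max(1.0, min(5.0, 5.0 - (qp - 20) / 10.0))

def hybrid_model(decoded_frames: list, bitstream_stats: dict) -> float:
    """Hybrid: combines decoded video and encoded bitstream inputs."""
    return (media_layer_model(decoded_frames)
            + bitstream_layer_model(bitstream_stats)) / 2.0
```

Bitstream-layer models such as the one described in this paper thus avoid the computational cost of decoding, at the price of requiring access to codec-level statistics.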
One example of a video quality algorithm developed to
handle HAS-specific scenarios is the audiovisual QoE model
according to ITU-T Rec. P.1203. The bitstream-based model
comprises components for short-term video and short-term
audio quality prediction, and for quality integration – together
with initial loading delay and stalling information. P.1203 is
generally suitable for more accurate quality predictions based
on full bitstream access, or can be used for more lightweight
quality estimation based only on metadata, such as resolution,
framerate and bitrate [1].
In the context of the standardization work conducted in
ITU-T Study Group 12 (SG12), the bitstream-layer models
978-1-7281-5965-2/20/$31.00 ©2020 IEEE