HEVC框架下的屏幕内容高效编码策略

161 浏览量更新于2024-08-26 1 收藏 1.9MB PDF 举报

"基于HEVC框架的屏幕内容编码是一种针对屏幕内容视频的高效编码技术，旨在提高编码效率和视觉质量。研究中，作者分析了屏幕内容的特性，如定向相关性、基本颜色表示、高效率视频编码以及屏幕内容编码等，并提出了一种新的编码方案。该方案将屏幕内容分为颜色分量和结构分量两部分，利用非变换表示方法，设计了两种编码模式，以利用视频序列中的方向相关性和非翻译变化。实施到HEVC范围扩展参考软件HM9.0后，实验结果显示，与HM9.0相比，新方案可以节省高达52.6%的比特率，平均而言，在内部、随机访问和低延迟配置下分别节省35.1%、29.2%和23.6%的比特率。此外，解码后的视频序列在视觉质量上也有显著提升，减少了尖锐边缘的振铃伪影，同时保持了文本的清晰度，避免模糊现象。" 屏幕内容编码是视频编码领域的一个重要研究方向，因为屏幕内容（如动画、计算机屏幕捕获、带有文本覆盖的视频等）具有独特的特点，传统视频编码技术可能无法有效处理。HEVC（高效视频编码，High Efficiency Video Coding）作为一种先进的视频编码标准，已经在视频压缩领域取得了显著成果，但针对屏幕内容，其编码效率还有待提升。本研究提出的编码方案，首先识别出屏幕内容的特性，如定向相关性（屏幕内容中常见线条和结构的特定方向性）和非翻译变化（文本、图形等元素相对稳定的局部移动）。通过将屏幕内容分离为颜色分量和结构分量，新方案能够更精确地捕捉这些特性。颜色分量主要关注图像的颜色信息，而结构分量则关注形状和轮廓。接着，针对这两种分量，设计了两种编码模式，充分利用了屏幕内容的定向相关性和非翻译变化，以实现更高的编码效率。为了验证新方案的有效性，研究人员将其整合到HEVC的范围扩展参考软件HM9.0中，进行了实际的编码和解码测试。实验结果表明，新方案在比特率节省方面表现优异，尤其是在内部、随机访问和低延迟场景下，比特率节省幅度较大，这意味着在同等质量下，视频文件的大小可以显著减小，这对于网络传输和存储资源的节省非常有利。此外，解码后的视频质量改善也是该方案的一大亮点。通过对尖锐边缘的处理，减少了振铃伪影的出现，同时保持了文本的清晰度，确保了观看者的视觉体验。这些改进对于那些包含大量文本和图形的屏幕内容视频尤其重要，例如在线教育、远程会议和新闻报道等应用。这项工作为HEVC框架下的屏幕内容编码提供了新的思路和方法，对于进一步优化视频编码效率和提高解码后视频质量具有重要的理论和实践意义。未来的研究可能会在此基础上继续探索更高效的编码策略，以适应不断发展的屏幕内容视频需求。

1318 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 16, NO. 5, A UGUST 2014

Variable Size Transforms and transform skipping: 4x4

and 8x8 DCT transforms are adopted in H.264/AVC to remove

the redundancy of residual signal [22]. Since video content with

high resolution shows up stronger correlatio n than that of lower

resolution, HEVC extends the largest DCT transform size to

32x32 to fully ex plo it the correlation in video signal.

However blocks with screen content contain complex struc-

tures and sharp edges, which cannot be compactly represented

in DCT domain. Th us the coding efﬁciency of HEVC on screen

contents is compromised. To code the screen content more ef-

ﬁciently, transform skipping modes proposed in [16] an d [17]

are incorporated into HEVC, which skip the transform process

to minim ize the R-D cost [23].

Loop Filters: Duetothehybridblockcoding structure, block

artifacts and inter-block inconsistence can b e observed at block

boundaries. Loop ﬁlter is adopted to smo oth neighboring coding

blocks and impro ves the visual quality of reconstru

cted blocks.

In HEV C, both strong ﬁlter and weak ﬁlter are adopted to re-

move the blocking artifacts. However, when applied t o screen

content, the l oop ﬁ lter may downgrade the visual quality o f

screen content when neighboring blocks are inconsistent with

each other at boundaries. S o a clipping operation is applied

on the strong ﬁlter to avoid averaging pi

xels with big differ-

ence [24].

III. P

ROPOSED CODING S

CHEME BASED ON HEVC

HEVC introduces several new techniques to improve the

coding efﬁciency of natural video, however its coding efﬁ-

ciency on screen content is compromised. On one hand, blocks

with complex structures in screen content cannot be efﬁciently

compressed. Although the transform skipping (TS) modes can

be used for screen content, they change the d istribution of

the residual ene rgy rather than r edu cing it. On the other hand,

non-translational motions of screen content video cannot be

handled efﬁciently in HEVC. To solve the above problems,

the multi-stage directional mode (MDM) and the multi-stage

temporal mode (MTM) are proposed and incorporated into

HEVC for screen content coding. The proposed modes are

based on the base co lor representation. The main d ifference

between them is how the i ndex map is coded.

A. Base Color Representation for Screen C ontents

Screen content can be regarded as a combination of two parts:

colors and structure. From the histogram of screen content, we

can ﬁnd that the colors can be compactly represented using sev-

eral base colors. The structure can be represented using an index

map, which speciﬁes the color of each pixel. As shown in Fig. 3,

the proposed representation ﬁrst decomposes the input image

block into base colors and index map using colo r quantization.

Then the base colors and index map will be entropy coded using

CABAC [25] to achieve better coding performance. At the de-

coder side, the base colors and in dex map will be obtained and

then combined together to generate the recons tru c ted block.

There are several advan tages in the base color representation.

First the color component and structure component are sepa-

rated, which makes it possible to exploit t he color redundancy

and structure redundancy using different schemes. Second , due

Fig. 3. Base color representation.

Fig. 4. Multi-stage index pr ed ictio n scheme.

to the high degree ﬂexibility of the in dex map, the complex

structures can be compactly represented to achieve better coding

efﬁciency. Last, the tr a nsfo rm is completely skipped, th us the

energy will not be scattered into m any coefﬁcients.

B. Multi-Stage Directional Mode (MDM)

The indexes in the index map are highly correlated with their

spatial neighbors. In our p reviou s work [21], we proposed a

multi-stage index coding scheme to exploit the correlations

among indexes. Fig. 4 shows the prediction process of the

multi-stage index coding scheme. T he current index is ﬁrst

compared with the ﬁrst prediction and the comparison result

is stored to the ﬁrst matching table. If the current index is not

matchedbytheﬁrst prediction, it will be further compared with

the second prediction. If the current index is not matched by all

predictions, it will be stored to the un matched ind ex map. In ou r

previous work [21 ], t he two indexes with minim al differences

along the four texture directions (vertical, horizontal, diagonal

and negative-diagonal) are selected as predictions. The cost of

theindexmatchedbytheﬁrst p rediction is one bit. Tw o bits are

needed for the index matched by the second prediction. Tw o

bits and additional Log(K-2) bits are needed to encode each

unmatched index, where

denotes the number of base colors.

In this paper, we propose a multi-stage directional mode

based on the two-stage index decomposition, which employs

asimpliﬁed directional index prediction scheme to reduce

剩余10页未读，继续阅读

weixin_38699352

粉丝: 8

HEVC框架下的屏幕内容高效编码策略

在HEVC屏幕内容编码扩展中使用机器学习进行帧内编码的快速模式和分区决策

高效率视频编码（HEVC）解码器的数据流模型开发与优化

基于哈希的块匹配提升屏幕内容编码效率：HEVC框架下的创新策略

HEVC屏幕内容编码：机器学习驱动的快速模式与分区决策

基于C++的学生假期视频播放器开发

【视频编码质量评估】：掌握H265(HEVC)的客观与主观评价方法

【能耗影响分析】：HEVC视频扩展包对移动设备电池寿命的影响

初识DirectShow和音视频采集编码技术

利用ONVIF协议实现视频编码和解码处理

多媒体框架集成秘术：Ingenic Zeratul T31沉浸式用户体验打造

最新资源