High-Definition AVS Video Encoder: Rate-Distortion Optimized Mode Decision Algorithm and Hardware Architecture

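The paper "Algorithm analysis and architecture design for rate distortion optimized mode decision in high definition AVS video encoder" examines how to improve coding efficiency in a high-definition AVS (Audio Video coding Standard) video encoder by optimizing mode decision. The AVS standard provides many intra and inter prediction modes, which make more efficient spatial and temporal prediction possible. Evaluating them, however, requires so much processing that the implementation complexity is very high. The authors propose a hardware-oriented mode decision algorithm for VLSI (Very Large Scale Integration) implementation to meet the demands of high-definition video coding. Its core is mode preselection, which significantly reduces the overall computational burden and so allows a more efficient encoding process at low hardware cost.

The paper also introduces a pipeline scheduling mechanism designed to break the data dependency in intra prediction. This reduces the data bottleneck caused by mode decision, so the simplified algorithm remains suitable for hardware implementation with minimal impact on performance. In designing the VLSI architecture, the authors trade off circuit consumption against rate-distortion performance. Rate-distortion optimization is a key step in video coding: it seeks the best balance between coding quality and bit rate, and is therefore essential for delivering high-quality video streams at low bandwidth.

The abstract page gives the dates on which the article was received and accepted, along with the keywords: AVS, mode decision, rate distortion optimization, and VLSI architecture. Overall, the paper studies how a dedicated algorithm design and an optimized VLSI architecture can achieve efficient and economical mode decision in a high-definition AVS video encoder, improving both coding efficiency and video quality. The results are relevant to video coding hardware design and to the improvement of video coding standards.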

may also be well-suited for VLSI implementation using a simplified decision criterion; however, they suffer from obvious performance degradation. Three typical simplified criteria are the sum of absolute differences (SAD), the sum of absolute transformed differences (SATD), and the weighted SAD (WSAD) [3,4]. By employing the Lagrangian optimization technique, the WSAD criterion outperforms SAD and SATD. Nevertheless, its coding performance is still clearly worse than that of the genuine RDO based method, with non-negligible image quality degradation.

The performance degradation mainly stems from the simplified measures of rate and distortion. Suppose S and S' are the original MB and the reconstructed one, and P is the predicted version of the current MB for a certain mode. Q_P and λ are the MB quantization step and the Lagrange multiplier for mode decision. The two mode decision criteria, RDcost and WSAD, are described by the following equations:

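$$\mathrm{RDcost}(S, S', \mathit{mode} \mid Q_P) = \mathrm{SSD}(S, S', \mathit{mode}, Q_P) + \lambda \cdot R(S, S', \mathit{mode}, Q_P) \tag{1}$$

$$\mathrm{WSAD}(S, P, \mathit{mode} \mid Q_P) = \mathrm{SAD}(S, P, \mathit{mode}, Q_P) + \lambda' \cdot R_{\mathrm{MBheader}}(S, P, \mathit{mode}, Q_P) \tag{2}$$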

Here, SSD(S, S', mode, Q_P) is the sum of the squared differences between S and S' in the case of Q_P and λ, while SAD(S, P, mode, Q_P) is the sum of the absolute differences between S and P. R(S, S', mode, Q_P) is the number of coding bits of all syntax elements in the MB in the case of Q_P, and R_MBheader(S, P, mode, Q_P) is the number of coding bits of the syntax elements in the MB header.
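The two criteria can be contrasted with a minimal Python sketch. Everything in it is illustrative rather than taken from the paper: the function names, the toy 2×2 blocks, the bit counts, and the multiplier values are assumptions chosen only to show which inputs each cost needs. RDcost requires the reconstructed block and the full bit count, whereas WSAD needs only the prediction and the header bits.

```python
def sad(a, b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def ssd(a, b):
    """Sum of squared differences between two equally sized blocks."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def rd_cost(orig, recon, total_bits, lam):
    """Eq. (1): SSD(S, S') + lambda * R, with R counting all syntax elements."""
    return ssd(orig, recon) + lam * total_bits


def wsad(orig, pred, header_bits, lam_prime):
    """Eq. (2): SAD(S, P) + lambda' * R_MBheader, using header bits only."""
    return sad(orig, pred) + lam_prime * header_bits


# Toy 2x2 "macroblock" with made-up values (a real MB is far larger).
S = [[10, 12], [14, 16]]        # original
P = [[9, 12], [15, 13]]         # prediction for some mode
S_rec = [[10, 11], [14, 17]]    # reconstruction after quantization

print(rd_cost(S, S_rec, total_bits=20, lam=4.0))   # 2 + 4.0 * 20 = 82.0
print(wsad(S, P, header_bits=6, lam_prime=2.0))    # 5 + 2.0 * 6 = 17.0
```

Note that `wsad` can be evaluated as soon as the prediction is available, while `rd_cost` must wait for the transform, quantization, entropy coding, and reconstruction of every candidate mode, which is where the extra hardware cost comes from.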

RDO based mode decision achieves superior coding performance thanks to Lagrangian optimization. With the RDcost criterion, genuine distortion is measured with SSD(S, S', mode, Q_P), and genuine rate is measured with R(S, S', mode, Q_P), with all syntax elements considered. By comparison, only the rate factor is considered for mode decision with the WSAD criterion, in which rate is estimated with SAD(S, P, mode, Q_P) and R_MBheader(S, P, mode, Q_P). The prediction residue SAD(S, P, mode, Q_P) serves as an approximate rate measure for the quantized DCT coefficients.

It is these simplified measures of rate and distortion in WSAD that cause the obvious performance degradation compared with RDcost. To sustain the superiority of AVS, this work focuses on RDO based mode decision for hardware implementation.

RDO based mode decision is very computationally intensive in H.264 because of the abundant modes it adopts. As a result, almost all H.264 video encoder architectures adopt simplified mode decision, using the WSAD, SATD, or SAD criterion instead. The challenges of RDO based mode decision in an AVS video encoder are lower than in H.264. On the one hand, AVS has fewer inter and intra modes than H.264, so its processing throughput burden is also lower. On the other hand, the processing unit granularity in AVS for DCT, quantization, inverse DCT, and inverse quantization in the rate and distortion calculation loop is the 8×8 block, whereas in H.264 it is the smaller 4×4 block. This means that the circuit consumption for the basic processing unit of AVS is higher than that of H.264. Thus, it is possible to implement RDO based mode decision with reasonable mode preselection to alleviate the throughput burden without too much additional circuit consumption.

2.3. Computation analysis for RDcost estimation

Fig. 2 shows the framework of RDcost calculation for one 8×8 block of a certain MB coding mode. First, the difference between the input block S and its prediction P is calculated by residue generation. Then, the residue r is transformed by DCT and quantized (Quant, Q), and the coding bit rate R is computed by entropy coding (EC). The quantized coefficients are also fed into inverse quantization (Inverse Quant, IQ), inverse transform (IDCT), and compensation to reconstruct the block S'. The SSD between the original residue (r) and the reconstructed residue (r') is computed as the distortion measure. Finally, RDcost is obtained from R and SSD. RDcost is used to evaluate the RD performance of all candidate modes, and the mode with the smallest RDcost is selected for bitstream generation.
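The loop described above can be sketched in Python for a single 8×8 block. This is a simplified illustration under stated assumptions, not the paper's datapath: a floating-point orthonormal DCT-II stands in for the AVS integer transform, a uniform quantizer with a hypothetical step `qstep` stands in for AVS quantization, and a signed Exp-Golomb code length stands in for the real entropy coder. The mode names and test data are made up. Because S' = P + r', the SSD between r and r' equals the SSD between S and S' when clipping is ignored.

```python
import math

N = 8  # block size used in the RDcost loop

# Orthonormal DCT-II matrix: a floating-point stand-in for the AVS integer transform.
C = [[(math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N))
      * math.cos((2 * n + 1) * k * math.pi / (2 * N))
      for n in range(N)] for k in range(N)]
CT = [list(col) for col in zip(*C)]  # transpose of C


def matmul(a, b):
    """Plain N x N matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]


def exp_golomb_bits(level):
    """Bit length of a signed Exp-Golomb code: a toy stand-in for entropy coding."""
    code_num = 2 * abs(level) - (1 if level > 0 else 0)
    return 2 * (code_num + 1).bit_length() - 1


def rd_cost_block(S, P, qstep, lam):
    """RDcost of one block: residue -> DCT -> Q -> rate, then IQ -> IDCT -> SSD."""
    r = [[S[i][j] - P[i][j] for j in range(N)] for i in range(N)]   # residue
    coef = matmul(matmul(C, r), CT)                                  # DCT
    levels = [[round(c / qstep) for c in row] for row in coef]       # Quant
    bits = sum(exp_golomb_bits(v) for row in levels for v in row)    # rate R
    deq = [[v * qstep for v in row] for row in levels]               # IQ
    r_rec = matmul(matmul(CT, deq), C)                               # IDCT
    ssd = sum((r[i][j] - r_rec[i][j]) ** 2
              for i in range(N) for j in range(N))                   # distortion
    return ssd + lam * bits


def best_mode(S, candidates, qstep, lam):
    """Evaluate every candidate prediction and keep the smallest RDcost."""
    costs = {name: rd_cost_block(S, P, qstep, lam)
             for name, P in candidates.items()}
    return min(costs, key=costs.get), costs


# Horizontal ramp as the input block; two hypothetical candidate predictions.
S = [[100 + 2 * j for j in range(N)] for _ in range(N)]
candidates = {
    "mode_flat": [[107] * N for _ in range(N)],   # constant prediction
    "mode_exact": [row[:] for row in S],          # perfect prediction
}
mode, costs = best_mode(S, candidates, qstep=4, lam=10.0)
print(mode)  # mode_exact
```

Every candidate mode runs the full transform, quantization, and reconstruction chain, so the work grows linearly with the number of candidates. That is the throughput burden that mode preselection is meant to reduce.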

RDO based mode decision for hardware implementation is challenged by the following two factors. On the one hand, intrinsic data dependencies exist in video coding algorithms. At the MB level, integer pixel motion estimation (IME), fractional motion estimation (FME), mode decision (MD), and intra prediction (IP), deblocking filter

Fig. 2. Framework of cost function calculation and mode decision. [Figure not reproduced. Block labels: reference (reconstructed), inter prediction, intra prediction, DCT, Q, IQ, IDCT, zigzag scan, VLC coding bits estimation, distortion estimation, RDcost comparison and mode decision, filter, S'.]

Hai bing Yin et al. / Signal Processing: Image Communication 25 (2010) 633–647 635
