Multiview Video Coding Optimization: Adaptive Mode Decision with a Macroblock Position Constraint Model

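The research paper "Adaptive mode decision for multiview video coding based on macroblock position constraint model" studies adaptive mode decision for multiview video coding (MVC) using a macroblock position constraint model (MPCM), with the goal of reducing computational complexity while keeping coding efficiency close to that of the reference encoder. MVC achieves a high compression ratio through mode decision, motion estimation and disparity estimation, but these steps make encoding computationally expensive.

The paper first introduces the MPCM, which models the mode correlation of macroblocks in the temporal-spatial domain and across views. By analyzing the modes of macroblocks in neighboring frames and in frames of other views, the model predicts the best prediction direction for the current macroblock, which shrinks the set of candidate modes and simplifies the decision. In the proposed scheme, the MPCM is built from the mode correlation and rate-distortion cost (RD cost) of previously encoded frames/views, and is then used to predict more accurately which mode the current macroblock should use, whether intra or inter. For inter modes in particular, the optimal prediction direction is fixed early, which shortens the encoding process further.

Experimental results show that the proposed method saves about 86.03 % of encoding time compared with the exhaustive mode decision in the JMVC reference software, at a cost of 0.077 dB in BDPSNR and a 2.29 % increase in BDBR. The lower computational demand makes MVC better suited to real-time and power-constrained use. The approach may matter for multiview applications that need fast, efficient encoding, such as virtual reality and 3D television, and could help deliver smoother multiview video when bandwidth is limited.

By exploiting the temporal-spatial and inter-view correlation of macroblock modes, an encoder can choose coding modes more intelligently, which is valuable for systems that process large volumes of multiview video and may serve as a reference for future coding standards.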

ORIGINAL RESEARCH PAPER

Adaptive mode decision for multiview video coding based on macroblock position constraint model

Yue Li • Gaobo Yang • Yapei Zhu • Can Liu • Kai Liu

Received: 8 April 2015 / Accepted: 15 August 2015

© Springer-Verlag Berlin Heidelberg 2015

Abstract Multiview video coding (MVC) exploits mode decision, motion estimation and disparity estimation to achieve a high compression ratio, which results in extensive computational complexity. This paper presents an efficient mode decision approach for MVC using a macroblock (MB) position constraint model (MPCM). The proposed approach reduces the number of candidate modes by utilizing the mode correlation and rate distortion cost (RD cost) in the previously encoded frames/views. Specifically, the mode correlations both in the temporal-spatial domain and in the inter-view domain are modeled with MPCM. Then, MPCM is exploited to select the optimal prediction direction for the current MB. Finally, the inter mode is determined early in the optimal prediction direction. Experimental results show that the proposed method can save 86.03 % of encoding time compared with the exhaustive mode decision used in the reference software of joint multiview video coding, with only 0.077 dB loss in Bjontegaard delta peak signal-to-noise ratio (BDPSNR) and a 2.29 % increment of the total Bjontegaard delta bit rate (BDBR), which is superior to the performance of state-of-the-art approaches.

Keywords Multiview video coding · Mode decision · Macroblock position constraint model · H.264/AVC
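
The candidate-reduction idea in the abstract can be illustrated with a minimal sketch. It is not the paper's MPCM, which this excerpt does not define: the mode names, the neighbour sets, and the rule "keep only modes already used by encoded neighbours, plus SKIP" are assumptions chosen for the example, and the RD-cost part of the model is omitted.

```python
from typing import Iterable, List

# Hypothetical H.264/AVC-style macroblock modes (names are illustrative only).
ALL_MODES: List[str] = [
    "SKIP", "INTER_16x16", "INTER_16x8", "INTER_8x16",
    "INTER_8x8", "INTRA_16x16", "INTRA_4x4",
]


def reduce_candidates(spatial_temporal: Iterable[str],
                      inter_view: Iterable[str]) -> List[str]:
    """Return the modes worth testing for the current macroblock.

    spatial_temporal: modes of already-encoded neighbours in the same view.
    inter_view: modes of corresponding macroblocks in neighbouring views.
    """
    neighbours = list(spatial_temporal) + list(inter_view)
    if not neighbours:
        # No context available: fall back to the exhaustive candidate list.
        return list(ALL_MODES)
    keep = set(neighbours) | {"SKIP"}  # SKIP is cheap, so always test it
    return [mode for mode in ALL_MODES if mode in keep]


candidates = reduce_candidates(
    spatial_temporal=["SKIP", "INTER_16x16", "SKIP"],
    inter_view=["INTER_16x8"],
)
print(candidates)  # ['SKIP', 'INTER_16x16', 'INTER_16x8']
```

In this toy case the encoder would evaluate three modes instead of seven; a real scheme would also need a safeguard for macroblocks whose best mode differs from all of their neighbours.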

1 Introduction

Multi-view video refers to a set of temporally synchronized videos of the same scene captured by multiple cameras from different viewpoints [1]. Compared with single-view video, multi-view video provides more interactivity and a more realistic experience for viewers, and has great potential in new video applications such as Free-viewpoint Television (FTV) and Three-dimensional Television (3DTV) [2, 3]. To facilitate research on multi-view video coding (MVC), the Joint Video Team (JVT), composed of experts from both ISO/IEC MPEG and the ITU-T Video Coding Experts Group (VCEG) [4, 5], developed the Joint Multiview Video Coding (JMVC) reference software on the basis of the H.264/AVC video coding standard. In JMVC, the hierarchical B picture (HBP) structure achieves higher coding efficiency than the straightforward solution of independently encoding each view with H.264/AVC. Figure 1 shows the HBP architecture in JMVC, where each arrow denotes the direction of a reference frame. All the views are divided into two classes: even views and odd views. The even views (V0, V2, V4 and V6) use a variable block-size motion estimation (ME) technique to exploit the spatial-temporal correlation. Meanwhile, the odd views (V1, V3, V5 and V7) adopt a new variable block-size disparity estimation (DE) technique, which exploits the inter-view correlation to improve coding efficiency. Because ME and DE are performed separately and repeatedly for each MB, the computational complexity of mode decision is very high.
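
The cost of such an exhaustive search can be shown with a minimal sketch. It assumes the standard Lagrangian rate-distortion cost J = D + λ·R used in H.264/AVC-style encoders; the mode list, the prediction directions, and the `evaluate` callback are hypothetical stand-ins for the real ME/DE routines in JMVC, and the numbers in the toy table are made up for illustration.

```python
from itertools import product
from typing import Callable, Sequence, Tuple

# (distortion, rate_in_bits) returned by a hypothetical ME/DE evaluation.
Measurement = Tuple[float, float]


def exhaustive_mode_decision(
    modes: Sequence[str],
    directions: Sequence[str],
    evaluate: Callable[[str, str], Measurement],
    lam: float,
) -> Tuple[str, str, float, int]:
    """Try every (mode, direction) pair and keep the lowest RD cost.

    Returns (best_mode, best_direction, best_cost, number_of_evaluations).
    """
    if not modes or not directions:
        raise ValueError("need at least one mode and one direction")
    best_mode, best_dir, best_cost = "", "", float("inf")
    evaluations = 0
    for mode, direction in product(modes, directions):
        distortion, rate = evaluate(mode, direction)  # the expensive step
        evaluations += 1
        cost = distortion + lam * rate  # Lagrangian RD cost J = D + lambda * R
        if cost < best_cost:
            best_mode, best_dir, best_cost = mode, direction, cost
    return best_mode, best_dir, best_cost, evaluations


# Toy (distortion, rate) table standing in for real motion/disparity search.
TOY_RESULTS = {
    ("INTER_16x16", "temporal"): (100.0, 20.0),
    ("INTER_16x16", "inter_view"): (120.0, 18.0),
    ("INTER_8x8", "temporal"): (80.0, 45.0),
    ("INTER_8x8", "inter_view"): (90.0, 50.0),
}

result = exhaustive_mode_decision(
    modes=["INTER_16x16", "INTER_8x8"],
    directions=["temporal", "inter_view"],
    evaluate=lambda m, d: TOY_RESULTS[(m, d)],
    lam=1.0,
)
print(result)  # ('INTER_16x16', 'temporal', 120.0, 4)
```

The number of evaluations grows with the product of modes and directions, and each evaluation hides a full motion or disparity search, which is why skipping unlikely candidates saves so much time.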

To reduce the computational complexity, several fast mode decision approaches have been presented in the literature. They can be categorized into two classes. The first class terminates the SKIP/DIRECT mode decision process early [6–8]. If the SKIP/DIRECT mode is considered as the

✉ Gaobo Yang
yanggaobo@hnu.edu.cn

School of Information Science and Engineering, Hunan University, Changsha 410082, China

Faculty of Physics and Electronic Information Science, Hengyang Normal University, Hengyang 421002, China

North University of China, Taiyuan 030051, China

J Real-Time Image Proc

DOI 10.1007/s11554-015-0527-1
