优化多视图深度视频编码的快速宏块模式选择算法

30 浏览量更新于2024-08-30 收藏 1.67MB PDF 举报

"Fast macroblock mode selection algorithm for multiview depth video coding" 本文提出了一种针对多视点视频加深度（MVD）编码的快速宏块模式选择算法，旨在降低多视点深度视频编码的计算复杂性。在当前的多视点视频编码中，由于需要处理多个视角的数据，其计算复杂度非常高，这限制了MVD技术在实际应用中的推广。该算法在联合编码方案上实现，结合了有效的预测机制和对象边界区分方法，以减少编码过程中的计算负担。预测机制是该算法的核心组成部分，它基于宏块模式的相似性来设计。通过对相邻宏块模式的分析，预测机制能够减少需要评估的模式数量，从而显著降低了编码过程中的计算量。这种机制理解了视频内容的局部特性，能够更准确地预测当前宏块应采用的编码模式，避免了对所有可能模式的逐一比较。同时，对象边界辨别方法在算法中也起到了关键作用。在多视点视频中，对象边界信息对于正确编码和重建深度信息至关重要。通过区分这些边界，算法可以优化宏块的分割，提高编码效率，同时保持图像质量和深度信息的准确性。这种方法有助于减少由于边界处理不当导致的编码失真，尤其是在复杂场景中。此外，由于多视点视频涉及到多个视角的同步和一致性，因此，快速宏块模式选择算法还需要考虑不同视角之间的相关性。通过有效利用这些相关性，算法能够在减少计算量的同时，保持各视点间的视差一致性，确保观看者的立体视觉体验。总结来说，这篇论文提出的快速宏块模式选择算法通过创新的预测机制和对象边界处理策略，成功降低了多视点深度视频编码的复杂性，提高了编码效率，为MVD技术的实际应用提供了可行性。这一算法对于解决多视点视频编码的计算效率问题具有重要意义，有助于推动3D视频和虚拟现实等领域的技术发展。

February 10, 2010 / Vol. 8, No. 2 / CHINESE OPTICS LETTERS 151

Fast macroblock mode selection algorithm for multiview

depth video coding

Zongju Peng ($$$mmmÞÞÞ)

1,2,3

, Mei Yu ( rrr)

, Gangyi Jiang (öööfffÀÀÀ)

1,2∗

, Feng Shao ( ¶¶¶)

Yun Zhang (ÜÜÜ )

1,3

, and You Yang ( fff)

1,3

Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China

Faculty of Information Science and Engineering, Ningbo University, Ningbo 315211, China

Graduate University of Chinese Academy of Sciences, Beijing 100049, China

∗

E-mail: jianggangyi@126.com

Received April 17, 2009

Huge computational complexity of multiview video plus depth (MVD) coding is an obstacle for putting

MVD into applications. A fast macroblock mode selection algorithm is proposed to reduce the computa-

tional complexity of multiview depth video coding. The proposed algorithm, implementing on a joint coding

scheme, combines an eﬀective prediction mechanism and an object boundary discriminating method. The

prediction mechanism which is designed based on the macroblock mode similarities reduces the number of

macroblo ck mode candidates in depth video coding. The object boundary discriminating method extracts

the regions, which are with discontinuous depth values and important for virtual view rendering, by using

macroblo ck deviation factor. Exp erimental results show that the prop osed algorithm can signiﬁcantly

promote the coding speed of depth video by 2.00–3.40 times, while maintaining high rate distortion (RD)

p erformance in comparison with the full search algorithm.

OCIS codes: 110.0110, 100.6890, 330.1690.

doi: 10.3788/COL20100802.0151.

With the fast development in the areas of integrated

optics with sensors and network infrastructures, three-

dimensional (3D) video systems will soon be used in a

great number of applications. Integral imaging technol-

ogy, one of the most promising methods for 3D scenes

representation, attracts a lot of research interests

[1,2]

Multiview video plus depth (MVD)

[3]

is an alternative

to integral imaging for representing 3D scenes. MVD

signals include multiple texture videos and associated

depth videos of the same scene. MVD signals are ﬁrst

captured at diﬀerent sparse viewpoints and compressed,

then transmitted to client. The MVD bit streams are

decoded and utilized to synthesize the virtual views with

depth-image-based rendering (DIBR) technique.

To eﬃciently compress MVD signals, Park et al. pro-

posed the view-temporal prediction structures that can

be adjusted to various characteristics of general multi-

view video

[4]

. In Ref. [5], an eﬀective algorithm was

proposed to eliminate the color inconsistency between

multiview videos for better coding and rendering perfor-

mances. Yang et al. proposed an image region partition

and regional disparity estimation algorithm for mul-

tiview video coding

[6]

. For standardizing encoding of

MVD, the joint multiview video model (JMVM) was de-

veloped, based on the video coding standard H.264/AVC.

In JMVM, an exquisite view-temporal prediction struc-

ture based on hierarchical B pictures (HBP) is used to

exploit not only the temporal correlations within a single

view, but also the inter-view correlations among diﬀerent

views

[7]

The JMVM has nine macroblock modes, including

SKIP, Inter 16×16, Inter 16 × 8, Inter 8 × 16, Inter 8 × 8,

Inter 8×8 Frext, Intra 16×16, Intra 8×8, and Intra 4×4.

These modes are probed by the full search algorithm

to determine the optimal macroblock mode for the best

rate distortion (RD) performance. The mode with the

minimal RD cost is then selected as the best mode for

Inter frame coding. Unfortunately, the full search algo-

rithm is time consuming. The computational complexity

of MVD coding can be approximately expressed as O

(η×α×β×θ), where η, α, β, and θ denote the number

of videos in each view, views, average reference frames,

and macroblock modes, respectively. It is an obstacle for

putting MVD into applications. To reduce the complex-

ity of MVD coding, the fast macroblock mode selection

algorithms were proposed to accelerate the coding speed

of multiview texture video

[8,9]

. However, the fast algo-

rithms for multiview depth video so far are marginal.

This letter focuses on reducing the computational com-

plexity of multiview depth video coding. Firstly, a joint

coding scheme is proposed based on macroblock mode

similarity between the texture videos and the associated

depth videos. Then, a fast depth video coding algorithm

is presented by combining an eﬀective prediction mech-

anism and an object boundary discriminating method.

Finally, the fast algorithm is implemented and evaluated.

Figure 1 shows an MVD-based 3D video system. In tex-

ture video and associated depth video, boundaries of ob-

jects in the scene coincide and directions of object move-

ments are also very similar. Therefore, the macroblock

mode distributions of the texture image and its associ-

ated depth image will be similar. Figures 2(a) and (b)

show the mode distributions of the texture image and the

associated depth image of a frame in Ballet test sequence.

The blocks with red, green, and blue borders denote the

macroblocks encoded with SKIP, Inter, and Intra modes.

It can be found that the macroblock modes are similar

between these two images. The similarity can be utilized

to speed up the coding process. Based on the analy-

ses above, a joint MVD coding scheme is proposed and

1671-7694/2010/020151-04

° 2010 Chinese Optics Letters

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38699830

粉丝: 6
资源: 972

优化多视图深度视频编码的快速宏块模式选择算法

ISO/IEC 14496-10:2004(E) - Advanced Video Coding Standard

H.264编码器JM8.6核心函数encode_one_macroblock解析

x264_macroblock_analyse深度解析：P类型与skip模式详解

Global-local correlation-based early large-size mode decision for multiview video coding

1. A novel macroblock-tree algorithm for high-performance optimization of.pdf

Dynamic macroblock wavefront parallelism for parallel video coding

A Fast and Efficient Inter Mode Decision Algorithm for the H.264

Motion Estimation Techniques for Digital Video Coding

human centered perceptual adaptation for video coding

Macroblock Level Rate Control for H264.pdf.zip

最新资源