深度图压缩与高效3D视频视图渲染技术

59 浏览量更新于2024-08-30 收藏 497KB PDF 举报

"这篇研究论文探讨了三维视频系统的深度图压缩和深度辅助视图渲染技术，旨在降低压缩和渲染的复杂性，同时保持高质量的渲染效果。作者提出了一个创新的方法，利用不同层次对深度图进行压缩，并结合多种优化技术，如时空一致变形、色彩校正和时间一致Kong填充等，显著减少了计算复杂度。实验结果显示，与传统方法相比，这种方法在保持高渲染质量的前提下，压缩计算复杂度降低了79%以上，渲染计算复杂度降低了45%以上。" 三维视频技术已经成为当前的热门领域，它能够为用户带来高度真实和沉浸式的体验。在这个过程中，深度图起着关键作用，因为它能通过深度图像为基础的渲染技术生成虚拟视图。然而，如何在不牺牲渲染质量的情况下，有效地压缩深度图并简化渲染过程是一个尚未解决的问题。论文中提出的新方法解决了这一挑战。首先，深度图被分成了不同的层次，每层采用不同的宏块模式决策程序进行压缩，这允许更灵活且高效的编码策略。这种多层次的处理方式有助于优化数据存储和传输，减少了处理深度信息所需的计算量。接着，为了进一步降低渲染复杂性并提高效率，研究者集成了一系列优化技术。时空一致变形技术确保了在不同时间点和空间位置的视图之间的一致性，减少了失真和不连续性。色彩校正则用于调整因深度信息处理导致的颜色偏差，以保持图像的视觉一致性。时间一致Kong填充是一种特殊的空洞填充技术，它能够在时间轴上保持连续性，避免在渲染过程中出现闪烁或跳变。实验结果表明，这种方法在降低系统复杂性方面取得了显著成效。压缩计算复杂度下降超过79%，这意味着处理器可以更快地处理深度图数据，节省了计算资源。而渲染计算复杂度降低45%以上，意味着系统可以在保持图像质量的同时，更快地生成虚拟视图，提升了用户体验。这项研究为3D视频系统提供了一个高效的解决方案，通过深度图的智能压缩和深度辅助视图渲染，实现了性能与质量之间的良好平衡。这对于推动3D视频技术的发展，尤其是在资源有限的设备上实现高质量3D视频体验具有重要意义。

Published in IET Signal Processing

Received on 18th February 2011

Revised on 14th November 2011

doi: 10.1049/iet-spr.2011.0062

ISSN 1751-9675

Depth map compression and depth-aided view

rendering for a three-dimensional video system

F. Shao M. Yu G. Jiang F. Li Z. Peng

Faculty of Information Science and Engineering, Ningbo University, Ningbo 315211, People’s Republic of China

E-mail: shaofeng@nbu.edu.cn

Abstract: Three-dimensional (3D) video technologies are becoming increasingly popular, as they can provide high quality and

immersive experience to end users, where depth maps are employed to generate the virtual views by depth-image-based rendering

technique. However, how to reduce the compression and rendering complexities for depth maps while maintaining high rendering

quality is still unresolved. In this study, a novel depth map compression and depth-aided view rendering method is proposed. In

the proposed method, depth maps are represented with different layers and compressed with di fferent macroblock-mode decision

procedure, and several optimisation techniques, including spatio-temporal consistent warping, colour correction and temporal

consistent hole ﬁlling are embed ded into the view rendering framework. Experimental results show that compared with the

traditional method, the proposed method can reduce more than 79% compression computational complexity and more than

45% rendering computational complexity, while maintaining high rendering quality.

1 Introduction

Three-dimensional (3D) video has gained popularity with its

ability to give viewers an enhanced experience of multimedia

in comparison to traditional two-dimensional (2D) video.

With these features, 3D video (3DV) will revolutionise

visual media by providing 3D television (3DTV) and free

viewpoint television (FTV) applications [1, 2]. In order to

promote the 3DTV and FTV applications, relevant

problems such as capturing, pre-processing, coding and

rendering of 3DV data are now very active research topic.

In order to represent 3D scene, different 3DV formats were

proposed, among which multi-view video plus depth (MVD)

format was recommended by MPEG of ISO/IEC and VCEG

of ITU-T because of its ﬂexible representation and

compatibility with the existing compression and

transmission technologies [3]. Since MVD representation

causes a huge amount of data to be stored or transmitted to

the user, it is essential to develop efﬁcient coding

techniques. Multi-view video coding (MVC) had been

widely researched [4]. Instead of directly using the MVC

technique, some totally different compression methods were

proposed for depth maps. Morvan et al. [5] concentrated on

depth smooth properties, and proposed quadtree

decomposition scheme to model those regions. Oh et al. [6]

proposed a depth boundary reconstruction ﬁlter and utilised

it as an in-loop ﬁlter to compress the depth map.

Furthermore, by considering the joint characteristics from

MVD representation, some fast MVC methods were

proposed by sharing the same macroblock (MB) mode or

motion vector information between colour videos and depth

maps [7, 8]. However, besides the compression efﬁciency,

the compression complexity and the effect of depth

distortion on view rendering quality are also important

issues to be solved in depth map compression.

In MVD representation, virtual view can be rendered from

MVD data by using depth image-based rendering (DIBR)

technique [9]. Since accurate depth map acquisition is still

an unsolved problem, DIBR requires solving high rendering

quality because of the inaccurate depth information. Many

depth pre-processing and depth post-processing methods

were proposed. Lai et al. [10] proposed iterative joint

multilateral ﬁltering to process the estimated depth maps,

and aimed to align their edges with those in video frames

and to reduce false contours. Ekmekcioglu et al. [11]

proposed content adaptive ﬁlters for different depth map

regions to enforce consistency across the spatial, temporal

and inter-view dimensions of depth maps. Lee and Effendi

[12] proposed an adaptive edge-oriented smoothing ﬁlter to

deal with the problems of hole occurrences, geometric

distortions and computational complexity in DIBR.

However, even though these methods can eliminate the

inﬂuence of inaccurate depth information, how to improve

the rendering quality in DIBR is still a very interesting

problem in 3DV research.

From anothe r perspective, in order to improve the view

rendering quality in DIBR, many optimisation methods

were proposed. Bulbul et al. [13] proposed a perceptually

based approach to improve the view rendering quality by

utilising binocular suppression mechanism in the human

visual system. Do et al. [14] proposed supersampling

technique to reduce the warping errors for obtaining higher

view rendering quality. Zhao et al. [15] proposed a novel

solution of suppression of misalignment and alignment

enforcement between colour videos and depth maps to

reduce boundary artefact. Tech et al. [16] evaluated the

IET Signal Process., 2012, Vol. 6, Iss. 3, pp. 247 –254 247

doi: 10.1049/iet-spr.2011.0062

The Institution of Engineering and Technology 2012

www.ietdl.org

下载后可阅读完整内容，剩余7页未读，立即下载

weixin_38670318

粉丝: 6
资源: 919

深度图压缩与高效3D视频视图渲染技术

三维四子棋

用于多视图深度视频编码的跨视图下/上采样方法

Android平台实现STL文件的三维显示方法

OpenGL实现三维动物动态展示及视角控制技术

从二维到三维：SolidWorks基本建模技巧

JavaFX 3D图形渲染管线全解析：CPU到GPU的优化之道

立体视觉里程计仿真框架深度剖析：构建高效仿真流程

【Java图形算法进阶】：掌握高级技术，优化图形应用性能

从零开始精通数据可视化：进阶秘籍带你图表到动态图形

OpenGL中的3D模型加载和展示

最新资源