An Edge-Preserving Up/Down-Sampling Optimization Method for Depth Map Compression


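In depth map compression for High-Efficiency Video Coding (HEVC), edge-preserving down-sampling and up-sampling play a key role. Because depth information is critical to the quality of synthesized views, the paper proposes a novel edge-preserving depth up-sampling method designed for down/up sampling-based depth coding. The method takes into account both the edge similarity between a depth map and its corresponding texture image and the structural similarity among depth maps. The authors build a weight model and use the local covariance coefficients of the depth map to estimate the optimal minimum mean square error (MMSE) up-sampling coefficients. In this way the depth map can be compressed effectively while edge detail is preserved. The method targets not only coding efficiency but also the quality of the synthesized views, that is, how closely the depth data reconstructed after compression approaches the original depth map. Experimental results show that, compared with conventional methods, the proposed edge-preserving depth interpolation for down/up sampling-based coding improves both coding efficiency and synthesized view quality. This matters for 3D video coding, particularly when spatial resolution is adjusted dynamically, bandwidth must be saved, or transmission takes place at low bit rates, because the technique restores depth information at higher quality in those settings. By balancing edge preservation against compression performance, the work offers a strategy for depth map processing in efficient video coding that may help 3D video technology in real-time streaming, virtual reality, and augmented reality.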

EDGE-PRESERVING INTERPOLATION FOR DOWN/UP SAMPLING-BASED DEPTH COMPRESSION

Huiping Deng, Li Yu, and Zixiang Xiong∗

Dept. of Electronics and Information Engineering, Huazhong Univ. of Sci. & Tech., Wuhan, China

∗Dept. of ECE, Texas A&M University, USA

Email: denghuiping.hust@gmail.com, hustlyu@mail.hust.edu.cn, zx@ece.tamu.edu

ABSTRACT

Preserving the edges in depth compression is important for improving the synthesized view quality. This paper presents a novel edge-preserving depth up-sampling method for down/up sampling-based depth coding that uses both the texture and depth information. We take into account the edge similarity between depth maps and their corresponding texture images, as well as the structural similarity among depth maps, to build a weight model. Based on the weight model, the optimal MMSE up-sampling coefficients are estimated from the local covariance coefficients of the down-sampled depth map. Experimental results show that our proposed interpolation method for down/up sampling-based depth coding improves both the coding efficiency and the synthesized view quality.

Index Terms: 3D video, down/up sampling-based depth coding, view synthesis, edge-preserving interpolation

1. INTRODUCTION

R&D in 3D video has been growing rapidly in recent years in both industry and academia. An attractive 3D video representation is to utilize depth information of the scene (on top of the 2D texture information). With the help of depth maps, many interesting applications such as glasses-free 3D video, free-viewpoint television (FTV), and gesture/motion-based human-computer interaction are becoming possible. In addition, an arbitrary number of intermediate views can be synthesized with low-cost depth image-based rendering (DIBR) techniques, but their quality depends on the accuracy of the depth maps. Consequently, efficient depth coding is one of the key issues in 3D video systems.

Work supported by the NSFC grants 60972016 and 60903172 and the China-Finnish cooperation project 2010DFB10570.

Depth maps generally have more spatial redundancy than natural images. This property can be exploited by compressing a down-sampled depth map at the encoder, followed by decompression and up-sampling at the decoder. MPEG 3DV experiments demonstrate that this down/up sampling-based depth coding approach can improve the depth map coding efficiency [1]. Since the quality of the synthesized views depends on the accuracy of the depth information, depth coding-induced distortion affects not only the depth quality but also the synthesized view quality. Therefore, the depth up-sampling method at the decoder needs to be carefully designed to guarantee synthesized view quality.
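The effect of the up-sampling choice on a depth edge can be illustrated with a minimal sketch. It is not the codec pipeline used in the MPEG 3DV experiments: the compression step is omitted, the 16x16 synthetic depth map, the factor of 2, and the helper names `downsample`, `upsample_repeat`, and `upsample_bilinear` are all assumptions made for illustration.

```python
import numpy as np

def downsample(depth, factor=2):
    """Decimate by keeping every `factor`-th sample (no pre-filter)."""
    return depth[::factor, ::factor]

def upsample_repeat(low, factor=2):
    """Pixel repetition (nearest-neighbour up-sampling)."""
    return np.repeat(np.repeat(low, factor, axis=0), factor, axis=1)

def upsample_bilinear(low, factor=2):
    """Separable linear interpolation back to the full-resolution grid."""
    h, w = low.shape
    # Positions of the full-resolution samples on the low-resolution grid.
    ys = np.clip(np.arange(h * factor) / factor, 0, h - 1)
    xs = np.clip(np.arange(w * factor) / factor, 0, w - 1)
    # Interpolate along each row, then along each column.
    rows = np.stack([np.interp(xs, np.arange(w), r) for r in low])
    return np.stack([np.interp(ys, np.arange(h), c) for c in rows.T], axis=1)

# Synthetic depth map (illustrative): background 50, foreground 200,
# with a vertical object boundary between columns 6 and 7.
depth = np.full((16, 16), 50.0)
depth[:, 7:] = 200.0

low = downsample(depth)            # what the encoder would compress
rep = upsample_repeat(low)         # decoder option 1
bil = upsample_bilinear(low)       # decoder option 2

# Pixel repetition moves the edge by one pixel; bilinear interpolation
# creates a depth value that belongs to neither object.
print("true:", depth[0, 6:9], "repeat:", rep[0, 6:9], "bilinear:", bil[0, 6:9])
```

In this toy case pixel repetition shifts the boundary, while bilinear interpolation introduces an intermediate depth at the boundary column. Both kinds of error fall on object boundaries, which is where a DIBR renderer is most sensitive.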

Classical techniques, such as pixel repetition and bilinear or bi-cubic interpolation, cause jagged boundaries, blurred edges, and annoying artifacts around edges. In [2, 3], a median up-sampling method is applied to down/up sampling-based depth map coding. In [4], Oh et al. proposed a depth boundary reconstruction filter to correct depth coding errors; the filter is designed on the basis of occurrence frequency, depth difference, and pixel position distances, all of which are related to the depth map itself. Liu et al. [5] designed a trilateral in-loop filter to reconstruct the depth map that takes into account both the similarity among depth samples and that among corresponding texture pixels. Wildeboer et al. [6, 7] proposed a joint bilateral up-sampling algorithm that utilizes the high-resolution texture video in the process of depth up-sampling; they calculated a weight-cost based on pixel positions and intensity similarities. Although these depth map reconstruction methods achieve good performance, they do not consider any special characteristics of the depth maps. Since depth maps contain sharp intensity changes at object boundaries, depth coding-induced distortion in the synthesized view is most pronounced along object boundaries. Therefore, the depth map up-sampling algorithm at the decoder needs to be carefully designed to preserve depth edges.
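The idea of weighting by pixel position and texture intensity similarity can be sketched generically as follows. This is a plain joint bilateral up-sampler written for illustration, not the exact algorithm of [6, 7]: the Gaussian weight functions, the parameters `radius`, `sigma_s`, and `sigma_r`, the function name, and the synthetic test data are all assumptions.

```python
import numpy as np

def joint_bilateral_upsample(low_depth, texture, factor=2, radius=1,
                             sigma_s=1.0, sigma_r=10.0):
    """Up-sample `low_depth` guided by a full-resolution grey `texture`."""
    H, W = texture.shape
    h, w = low_depth.shape
    out = np.zeros((H, W))
    for y in range(H):
        for x in range(W):
            # Position of the output pixel on the low-resolution grid.
            cy, cx = y / factor, x / factor
            y0 = min(int(round(cy)), h - 1)
            x0 = min(int(round(cx)), w - 1)
            num = den = 0.0
            for qy in range(max(y0 - radius, 0), min(y0 + radius, h - 1) + 1):
                for qx in range(max(x0 - radius, 0), min(x0 + radius, w - 1) + 1):
                    # Spatial weight: distance measured on the low-res grid.
                    ws = np.exp(-((qy - cy) ** 2 + (qx - cx) ** 2)
                                / (2 * sigma_s ** 2))
                    # Range weight: texture similarity between the output
                    # pixel and the full-res location of the low-res sample.
                    ty, tx = min(qy * factor, H - 1), min(qx * factor, W - 1)
                    diff = texture[y, x] - texture[ty, tx]
                    wr = np.exp(-diff ** 2 / (2 * sigma_r ** 2))
                    num += ws * wr * low_depth[qy, qx]
                    den += ws * wr
            out[y, x] = num / den
    return out

# Illustrative data: the depth edge and the texture edge coincide.
depth = np.full((16, 16), 50.0)
depth[:, 7:] = 200.0
texture = np.full((16, 16), 100.0)
texture[:, 7:] = 200.0

low = depth[::2, ::2]                      # decimated depth map
jbu = joint_bilateral_upsample(low, texture)
print("max abs error vs. original depth:", np.abs(jbu - depth).max())
```

When the texture edge coincides with the depth edge, as in this toy input, the range weight suppresses low-resolution samples from the wrong side of the boundary. Texture edges that have no counterpart in depth would be weighted the same way, which is one reason a texture-guided filter alone does not account for depth-specific structure.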

Edge-directed interpolation techniques recover sharp edges while suppressing pixel jaggedness and blurring artifacts by imposing accurate source models. Li and Orchard [8] proposed a new edge-directed interpolation (NEDI) algorithm for natural images, which exploits image geometric regularity by using the covariance of a low resolution image to estimate that of a high resolution image. Asuni and Giachetti [9] improved the stability of NEDI by using edge segmentation. Zhang et al. [10] estimated the low resolution covariance adaptively with improved non-local edge-directed interpolation. Since NEDI needs a relatively large window
