Exemplar-based depth inpainting with arbitrary-shape patches and
cross-modal matching
Sen Xiang a,b, Huiping Deng a,b,*, Lei Zhu a,b, Jin Wu a,b, Li Yu c

a School of Inform. Sci. & Engn., Wuhan Univ. of Sci. & Tech., Wuhan, 430081, China
b Engin. Research Center of Metallurgical Auto. and Measurement Tech., Ministry of Education, Wuhan, 430081, China
c School of Electron. Inform. & Commun., Huazhong Univ. of Sci. & Tech., Wuhan, 430074, China
* Corresponding author at: School of Inform. Sci. & Engn., Wuhan Univ. of Sci. & Tech., Wuhan, 430081, China. E-mail address: denghuiping@wust.edu.cn (H. Deng).
Article info
Keywords: Depth map; Inpainting; Exemplar; Edge-preserving; 3D video

Abstract
Commodity RGB-D cameras can provide texture and depth maps in real time, and have thus facilitated the rapid development of various depth-dependent applications. However, depth maps suffer from the loss of valid values, which leads to holes and impairs both research and applications. In this paper, we propose a novel exemplar-based method to fill depth holes and thus improve depth quality. The method is based on the fact that a depth map contains many similar or even identical parts, so lost depth values can be restored by referring to valid ones. Considering the intrinsic property of depth maps, i.e., the sharpness of object boundaries, we propose to use arbitrary-shape matching patches, instead of fixed squares, to avoid inter-depth-layer distortion and thus improve boundary quality. In addition, since depth values do not have distinct features, cross-modal matching, in which both depth and texture are involved, is utilized. Moreover, we investigate the similarity criteria in cross-modal matching in order to improve the matching accuracy between source and target patches. Experimental results demonstrate that the proposed method accurately recovers lost depth information, especially at boundaries, and outperforms state-of-the-art exemplar-based inpainting methods.
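For intuition only, the following is a minimal sketch of exemplar-based depth filling with a cross-modal matching cost, not the algorithm proposed in this paper: it uses fixed square patches rather than arbitrary shapes, a local search window, and a hypothetical weight alpha balancing the depth and texture terms, all of which are assumptions made for illustration.

# Minimal sketch of exemplar-based depth hole filling with cross-modal
# matching. This is an illustrative simplification, NOT the authors' method:
# patch size, search radius and the depth/texture weight `alpha` are assumed.
import numpy as np

def fill_depth_holes(depth, gray, patch=5, search=30, alpha=0.5):
    """Fill zero-valued (invalid) depth pixels by copying depth from the
    best-matching valid source patch, scored jointly on depth and texture."""
    d = depth.astype(np.float64).copy()
    g = gray.astype(np.float64)
    h, w = d.shape
    r = patch // 2
    for y, x in np.argwhere(d == 0):
        if y < r or x < r or y >= h - r or x >= w - r:
            continue  # skip image borders for brevity
        td = d[y - r:y + r + 1, x - r:x + r + 1]
        tg = g[y - r:y + r + 1, x - r:x + r + 1]
        valid = td > 0  # compare depth only where the target patch is known
        best, best_cost = None, np.inf
        y0, y1 = max(r, y - search), min(h - r - 1, y + search)
        x0, x1 = max(r, x - search), min(w - r - 1, x + search)
        for sy in range(y0, y1 + 1):
            for sx in range(x0, x1 + 1):
                sd = d[sy - r:sy + r + 1, sx - r:sx + r + 1]
                if (sd == 0).any():
                    continue  # source patch must be fully valid
                sg = g[sy - r:sy + r + 1, sx - r:sx + r + 1]
                # cross-modal cost: depth SSD on valid pixels + texture SSD
                cost = (alpha * np.sum((sd[valid] - td[valid]) ** 2)
                        + (1 - alpha) * np.sum((sg - tg) ** 2))
                if cost < best_cost:
                    best_cost, best = cost, sd
        if best is not None:
            d[y, x] = best[r, r]  # copy the centre depth of the best exemplar
    return d

In a full exemplar-based inpainting pipeline the filling order (e.g., boundary pixels with the most valid neighbors first) also matters; the sketch simply processes hole pixels in scan order.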
1. Introduction
Depth information is a fundamental element in various applications, such as free-viewpoint video [1], 3D reconstruction [2] and face recognition [3]. In recent years, commodity RGB-D cameras, based on structured light [4] or time-of-flight [5] techniques, have made depth acquisition easy and convenient. However, owing to the limitations of the depth generation principles and hardware, the captured depth maps contain many holes, which impairs further research and applications. To
solve this problem, many researchers have studied the topic of depth
inpainting. In general, these methods can be categorized into filtering-
based ones and exemplar-based ones.
In filtering-based methods, special filters are designed to diffuse valid
depth values to invalid ones. Min et al. [6] proposed the weighted mode filter; instead of relying on spatial and intensity similarity alone, this filter exploits the statistics of valid depth values, and it yields sharp depth edges. Yang et al. [7] proposed to use an auto-regressive model to estimate the filter coefficients, so that the filtering adapts to the local context. Miao et al. [8] and Xiang et al. [9] considered the homogeneity of depth gradients, and obtained depth values under gradient constraints by solving partial differential equations. Milani et al. [10] proposed to
use a set of local differential equations to interpolate the missing depth
samples. Xue et al. [11] introduced a low-gradient regularization method in which gradual depth changes are allowed by reducing the penalty for small gradients while still penalizing non-zero gradients. Zhao et al. [12] proposed a two-stage filtering scheme for blurred depth maps, in which the distorted depth maps are successively processed with binary segmentation-based depth filtering and MRF-based reconstruction.
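To make the filtering-based family concrete, the following toy sketch (not taken from any of the cited methods) diffuses valid depth into hole pixels by Jacobi iterations of the Laplace equation; the zero-as-invalid convention, the wrap-around borders of np.roll, and the iteration count are simplifying assumptions.

# Toy illustration of diffusion-style depth hole filling: hole pixels are
# repeatedly replaced by the average of their 4-neighbours, i.e., a Jacobi
# iteration for the Laplace equation (smoothness inside the hole).
import numpy as np

def diffuse_depth(depth, iters=200):
    d = depth.astype(np.float64).copy()
    hole = depth == 0
    # initialise holes with the mean of valid depth to speed up convergence
    d[hole] = d[~hole].mean()
    for _ in range(iters):
        up    = np.roll(d,  1, axis=0)
        down  = np.roll(d, -1, axis=0)
        left  = np.roll(d,  1, axis=1)
        right = np.roll(d, -1, axis=1)
        avg = (up + down + left + right) / 4.0
        d[hole] = avg[hole]  # update only the invalid pixels
    return d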
Owing to the consistency between texture and depth, texture-
guided filtering is also quite popular. These filters incorporate texture
similarity, and thus different objects can be distinguished and depth
boundaries can be well preserved. The simplest examples are the joint bilateral filter [13,14] and the trilateral filter [15], in which the weighting kernel includes a texture-similarity term. Kim et al. [16] modified the color weights by considering texture-depth consistency. Bapat et al. [17] proposed an iterative median filter that also takes the RGB components into account; color similarity is measured with the absolute difference between neighboring pixels and their median value. Chang et al. [18] proposed adaptive texture-similarity-based hole filling, in which luminance, instead of RGB, is used as guidance.
Bhattacharya et al. [19] focused on removing depth edge distortions.
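As an illustration of the texture-guided filters discussed above, the following sketch fills each hole pixel with a joint-bilateral weighted average of its valid neighbors, where the weight combines spatial distance with color similarity taken from the registered texture image; the window radius and the two sigmas are illustrative values, not parameters from any cited work.

# Minimal sketch of texture-guided (joint bilateral) depth hole filling.
# Window radius and sigma values are assumptions chosen for illustration.
import numpy as np

def joint_bilateral_fill(depth, gray, radius=7, sigma_s=3.0, sigma_c=10.0):
    d = depth.astype(np.float64)
    g = gray.astype(np.float64)
    out = d.copy()
    h, w = d.shape
    ys, xs = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    spatial = np.exp(-(ys ** 2 + xs ** 2) / (2 * sigma_s ** 2))
    for y, x in np.argwhere(d == 0):
        y0, y1 = max(0, y - radius), min(h, y + radius + 1)
        x0, x1 = max(0, x - radius), min(w, x + radius + 1)
        nd = d[y0:y1, x0:x1]
        ng = g[y0:y1, x0:x1]
        sp = spatial[y0 - y + radius:y1 - y + radius,
                     x0 - x + radius:x1 - x + radius]
        # colour similarity to the centre pixel of the guidance image
        colour = np.exp(-((ng - g[y, x]) ** 2) / (2 * sigma_c ** 2))
        wgt = sp * colour * (nd > 0)  # only valid depth samples contribute
        if wgt.sum() > 0:
            out[y, x] = (wgt * nd).sum() / wgt.sum()
    return out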