多视图深度图估计：像素级关联适应性选择

需积分: 14 22 浏览量更新于2024-09-07 收藏 17.07MB PDF 举报

"这篇论文提出了一种多视角深度图估计方法，旨在自适应地确定参考图像与源图像集所有元素之间的像素级数据关联。通过在概率框架内解决像素级视图选择和深度图估计问题，利用局部两两图像一致性进行建模。相应的图形模型通过基于EM的视图选择概率推理和类似PatchMatch的深度采样和传播算法来求解。实验结果表明，这种方法在标准多视角基准测试中提供了最先进的估计精度，通过减少错误的像素级数据关联。此外，大量互联网众包数据的实验验证了我们方法对无结构和异构图像捕获特性的鲁棒性。此外，我们的方法线性的计算和存储需求以及固有的并行性，使得能够实现高效且可扩展的GPU基础实现。" 详细说明: 这篇论文的核心是提出了一种新的多视角深度图估计技术，该技术着重于如何选择最合适的源图像子集来估计参考图像中特定像素的深度。这解决了在多视角场景中，如何有效利用不同视角的信息以提高深度估计的准确性的问题。首先，作者将问题置于一个概率框架下，通过考虑局部的两两图像一致性（photoconsistency）来同时建模像素级视图选择和深度图估计。这种框架允许模型在估计深度的同时，选择出对目标像素深度估计最有贡献的源图像视图。然后，他们引入了一种基于期望最大化（EM）的视图选择概率推理方法，这有助于确定哪些源图像对于某个像素的深度估计最有价值。同时，他们借鉴了PatchMatch算法的思想，进行深度采样和传播，这使得深度估计过程更加高效。实验部分展示了该方法在标准多视角基准测试中的优秀性能，证明了避免错误像素级数据关联可以显著提高深度估计的准确性。此外，通过在大规模的互联网众包数据上进行测试，证明了该方法在面对不规则、多样化的图像捕获条件时仍能保持稳健性。最后，由于算法的线性计算和存储需求以及并行性，它能够适应GPU平台，从而实现高效和可扩展的实现，这对于处理大数据量的多视角深度图估计任务至关重要。这篇论文提供了一种创新的多视角深度图估计方法，通过优化像素级的数据关联和利用概率模型，提高了深度估计的精度和鲁棒性，同时考虑了实际应用中的计算效率。

PatchMatch Based Joint View Selection and Depthmap Estimation

Enliang Zheng, Enrique Dunn, Vladimir Jojic, and Jan-Michael Frahm

The University of North Carolina at Chapel Hill

{ezheng,dunn,vjojic,jmf}@cs.unc.edu

Abstract

We propose a multi-view depthmap estimation approach

aimed at adaptively ascertaining the pixel level data asso-

ciations between a reference image and all the elements of

a source image set. Namely, we address the question, what

aggregation subset of the source image set should we use to

estimate the depth of a particular pixel in the reference im-

age? We pose the problem within a probabilistic framework

that jointly models pixel-level view selection and depthmap

estimation given the local pairwise image photoconsistency.

The corresponding graphical model is solved by EM-based

view selection probability inference and PatchMatch-like

depth sampling and propagation. Experimental results on

standard multi-view benchmarks convey the state-of-the art

estimation accuracy afforded by mitigating spurious pixel-

level data associations. Additionally, experiments on large

Internet crowd sourced data demonstrate the robustness of

our approach against unstructured and heterogeneous im-

age capture characteristics. Moreover, the linear computa-

tional and storage requirements of our formulation, as well

as its inherent parallelism, enables an efﬁcient and scalable

GPU-based implementation.

1. Introduction

Multi-view depthmap estimation (MVDE) methods

strive to determine a view dependent depthﬁeld by leverag-

ing the local photoconsistency of a set overlapping images

observing a common scene. Applications beneﬁting from

high quality depthmap estimates include dense 3D model-

ing, classiﬁcation/recognition [20] and image based render-

ing [6]. However, achieving highly accurate depthmaps is

inherently difﬁcult even for well controlled environments

where factors such as viewing geometry, image-set color

constancy, and optical distortions are rigorously measured

and/or corrected. Conversely, practical challenges for ro-

bust depthmap estimation from non-controlled input im-

agery (i.e. Internet collected data) include mitigating het-

erogeneous resolution and scene illuminations, unstructured

viewing geometry, scene content variability and image reg-

istration errors (i.e. outliers). Moreover, the increasing

availability of crowd sourced datasets has explicitly brought

efﬁciency and scalability to the forefront of application re-

quirements, while implicitly increasing the importance of

data association management when processing such large

scale datasets.

The input for MVDE is commonly assumed to consist

of a convergent set of images along with reliable estimates

of their pose and calibration parameters. The extracted

depthmap will correspond to the pixel-wise 3D structure hy-

potheses that best explain the available image observations

in terms of some measure of visual similarity w.r.t. a ref-

erence image. Ironically, the potential robustness afforded

by having multiple available images is compromised by the

inherent variability in pairwise photoconsistency observa-

tions. In practice, correct depth hypotheses may provide

low photoconsistency in a source image subset (e.g. oc-

clusions or illumination aberrations), while incorrect depth

hypotheses may register high image similarity (e.g. repet-

itive structure or homogeneous texture). These technical

challenges render multi-view depth hypothesis evaluation

as a problem of robust model ﬁtting, where a demarcation

among inlier and outlier photoconsistency observations is

required. We tackle this implicit data association problem

by addressing the question: What aggregation subset of the

source image set should be used to estimate the depth of a

particular pixel in the reference image.

We propose a probabilistic framework for depthmap es-

timation that jointly models pixel-level view selection and

depthmap estimation given pairwise image photoconsis-

tency. An overview is depicted in Figure 1. The cor-

responding graphical model is solved by EM-based view

selection probability inference and PatchMatch-like depth

sampling and propagation. Our approach iteratively alter-

nates between exploration of the depth search space and

updating our formulated probabilistic model. The insight

leveraged by our method is the spatial smoothness in the

photoconsistency at the correct depth hypothesis of a given

pixel w.r.t. the images in the source image dataset [22, 13].

Our expectation of having a high overlap of photoconsistent

source images among neighboring pixels in the reference

下载后可阅读完整内容，剩余7页未读，立即下载

qq_39808441

粉丝: 31
资源: 2

多视图深度图估计：像素级关联适应性选择

IJCNN2016_P2_Cameraready_WSNmatlab_

matlab代码画界面-StoryGraphs_CVPR2014:StoryGraphs-将角色交互可视化为时间轴

Model Identification and Adaptive Control.pdf

semanticfusion.tar.gz

王五的绩效考核.xml

FA_HyperLink.xls

火炬用的76登陆器啊啊啊啊啊

GAPDLL111111111111111111111

Microsoft Project 2007 Guide.doc

全要素生产率_LP（剔除金融STPT）.dta

最新资源