PatchMatch：快速立体匹配算法与图像应用

立体匹配

需积分: 10 108 浏览量更新于2023-05-21 收藏 59.09MB PDF 举报

身份认证购VIP最低享 7 折!

领优惠券(最高得80元）

"patchmatch stereo matching" PatchMatch是一种高效、随机化的图像匹配算法，最初由Connelly Barnes在其2011年的博士论文中提出，旨在快速寻找图像中小局部区域的对应关系。该算法在立体匹配领域得到了广泛应用，因为它能有效地解决亚像素级别的匹配问题，大大提升了立体视觉系统的性能。立体匹配是计算机视觉中的一个关键任务，目的是找到同一场景在不同视角下的两幅图像之间的对应像素，从而计算出场景的深度信息。这个过程通常涉及到大量的计算，尤其是在处理高分辨率图像时。PatchMatch算法的出现为这个问题提供了一个高效的解决方案。 PatchMatch的核心思想是基于自然图像的统计特性，即相邻像素的对应关系往往具有相似性或“一致性”。算法通过迭代的方式，利用这一观察来加速匹配过程。在每次迭代中，每个像素的匹配候选会传播到其邻居，如果邻居的像素找到了更优的匹配，则更新当前像素的匹配结果。这种传播和更新的过程使得算法能够在较短的时间内收敛到一个近似的最优解。除了基本的最近邻匹配，PatchMatch算法还可以扩展到寻找k-最近邻匹配，使用具有不同大小和形状的图像块（patch）来增加匹配的灵活性。这使得算法能够适应不同的图像特征和场景复杂性。在立体匹配中，PatchMatch算法通常与半全局匹配（Semi-Global Matching, SGM）等策略结合使用，进一步优化匹配结果，减少误匹配和不连续性。通过这些组合技术，可以生成更准确的深度图，为自动驾驶、虚拟现实、3D重建等领域提供支持。此外，PatchMatch算法不仅限于图像匹配，它还被应用于视频分析、纹理合成、图像修复等多个领域。例如，在视频处理中，通过利用时间连续性，可以加速帧间的匹配过程，提高视频分析的效率。 PatchMatch是一种革命性的快速匹配算法，它的出现极大地推动了立体匹配以及相关计算机视觉任务的发展。通过其随机化和迭代的特性，PatchMatch能够快速找到图像和视频中的对应关系，为各种应用场景提供了高效且实用的解决方案。

资源详情

资源推荐

Figure 2.3: Dataset of 108 image pairs used to produce the histograms in this chapter. This consists

of stereo pairs, similar images, dissimilar images, and frames taken from near and far times in videos.

Images are from the Caltech-256 dataset, the Middlebury stereo pair dataset, and from the short

ﬁlm Kind of a Blur by Jon Goldman. There are 24 image pairs from diﬀerent classes, 24 pairs from

the same class, 6 similar pairs of input and output from our image editing applications, 21 wide

baseline stereo pairs, 21 mismatched image pairs from the stereo dataset, and 12 pairs from near

and far times in video.

(a) D = 100 (b) D = 50 (c) D = 25 (d) D = 10

Figure 2.4: 2D histogram showing peaked distribution of where better matches are located. The

center denotes the current match’s location. For correspondences with high patch distance D (a)

better matches are uniformly distributed throughout the image. As patch distance is lowered, (b)

the better matches become more peaked, with more good matches nearby the current match. Even

lower patch distance (c), (d) causes further peaking behavior. These histograms are measured from

a dataset of 108 pairs of matched images, both similar and dissimilar.

Another way to visualize this peaked distribution is to ask: suppose we have a current, suboptimal

correspondence for a single patch, which has some distance D associated with it. Then keeping the

source location in image A ﬁxed, where are the better target positions for the correspondence in

image B relative to our current target position? Integrating over all possible correspondences, we

can plot a 2D histogram of intensity versus x, y relative position, where intensity represents the

probability that a given relative oﬀset in the target image B will yield a better correspondence.

These plots are shown in Figure 2.4, after integrating over our full dataset of 108 natural images.

Note that the 2D histograms of where better neighbors are located are not uniform, but instead

follow a peaked distribution. When the distance D of our correspondence is high, the better locations

to look at are uniformly distributed across the image. As the distance D is lowered, the better

locations become more clustered around the current location (origin), until eventually, for very low

distances, it is good to search very close to the current position.

Observe also that these priors do not hold for all possible input images, but only the large set

(a) D = 100 (b) D = 75 (c) D = 50 (d) D = 25

Figure 2.5: 2D histogram of where better matches are located, matching random images, with

Gaussian, uniform, or octave random noise. Unlike natural images, there is little peaking, and

better matches are distributed uniformly across the image. The small peak in the center is due to

the 7x7 patch size, which causes a small amount of coherence.

3.1 High Level Motivation

The high level intuition behind our algorithm is shown in Figure 3.1. We have two manifolds A

and B with descriptors computed at lattice points, visualized as colored circles. We wish to ﬁnd for

each descriptor in A, the most similar descriptor in B. We do this by taking advantage of spatial

locality properties observed in the previous chapter: when we have a good match, we can propagate

it to adjacent points on the lattice, and if we have a reasonable match, we can try to improve it by

randomly searching for better matches around the target position. The ﬁrst stage — propagation

— takes advantage of the property that many matches are coherent, or have the same relative

matching coordinates, as discussed in the previous chapter (Figure 2.1 and Figure 2.2). The second

stage — random search — looks for better correspondences relative to the current correspondence’s

target position, according to a peaked search distribution, similar to the measured distributions in

Figure 2.4.

In this chapter, we develop our algorithm for 2D images, which have a regular lattice of the

positions of all pixels. However, our algorithm has also been applied to 1D contours and 3D geometric

data, so we can imagine generalizing these propagation and random search operations to any space

that locally is Euclidean or nearly so.

3.2 Introduction

Many methods recently have been developed for manipulating images at a high level, such as

retargeting algorithms that change image aspect ratios, or image completion algorithms that remove

unwanted objects from photographs. Many of the most powerful of these methods are patch based:

they divide the image into many small, overlapping rectangles of ﬁxed size, called patches.

To understand our matching algorithm, we must consider the common

components of patch based algorithms: The core element of nonparametric patch

sampling methods is a repeated search of all patches in one image region for the

most similar patch in another image region. In other words, given images or

regions A and B, ﬁnd for every patch in A the nearest neighbor in B under a

patch distance metric such as L

. We call this mapping the Nearest-Neighbor Field

(NNF), illustrated schematically in the inset ﬁgure. Approaching this problem with

a na¨ıve brute force search is expensive – O(mM

) for image regions and patches

of size M and m pixels, respectively. Even using acceleration methods such as approximate nearest

剩余99页未读，继续阅读

交大雨声

粉丝: 0
资源: 10

会员权益专享

PatchMatch：快速立体匹配算法与图像应用

BM3D代码matlab-boostBM3D_betterPatchMatching:％这是WACV2019论文中增强的BM3D的实现：“用于

基于PatchMatch的图像修复代码

patchmatch 算法细节

介绍一下PatchMatch Stereo方法

PatchMatch Stereo怎么实现双目测距

pyramid stereo matching network

我要python实现stereo matching的代码

我想要python实现object stereo matching的代码

安卓上可以实现的双目深度算法有哪些

# stereo matching algorithm: 'tvl1', 'msmw', 'hirschmuller08', # hirschmuller08_laplacian', 'sgbm', 'mgm', 'mgm_multi'

它们比Guided Aggregation Net for End-to-end Stereo Matching的GA-Net泛化能力方面有什么优势

Halcon官方提供了哪些立体匹配算法

ResNet、DenseNet相比Guided Aggregation Net for End-to-end Stereo Matching的GA-Net的泛化能力方面有什么优势

PMS立体匹配的基本思想

stereo matching algorithm: 'tvl1', 'msmw', 'hirschmuller08', # hirschmuller08_laplacian', 'sgbm', 'mgm', 'mgm_multi'

createTrackbar("numDisparities:\n", "paramemnt", &numDisparities, 20, stereo_match

colmap中的多视图重建算法原理

轻量级的深度学习立体匹配有哪些

近四年提出的立体匹配算法

会员权益专享

最新资源