非局部匹配成本聚合方法提升立体匹配精度

需积分: 10 152 浏览量更新于2024-09-09 收藏 910KB PDF 举报

《非局部成本聚合方法在立体匹配中的应用》(A Non-Local Cost Aggregation Method for Stereo Matching)是由清雄杨(Qingxiong Yang)博士在 City University of Hong Kong 所撰写的一篇论文。这篇研究论文关注的是立体匹配中的一个重要技术环节——成本聚合。传统的成本聚合方法通常依赖于用户定义的局部支持区域，通过对邻域内像素的成本进行求和或平均来确定最佳匹配。然而，这种方法存在局限性：它仅能做到局部最优，且随着支持区域大小增加，计算复杂度也随之提高。论文作者重新审视了这一问题，并提出了一个非局部解决方案。核心思想是将匹配成本值根据立体图像对中像素间的相似性进行自适应聚合。这种聚合不是基于固定的邻域，而是建立在由图像对构建的树结构上。树的节点代表所有的像素，边则连接最近邻的像素。两个像素之间的相似度由它们在树上的最短距离决定。这种设计使得每个节点都能接收到整个树中所有其他节点的支持，实现了全局信息的融合。非局部方法的优势在于能够跨越局部限制，更好地保留深度边缘信息，从而可能提高匹配精度。由于每一像素都考虑了与其相连的所有像素的信息，这种方法能够在保持效率的同时，提高匹配的全局优化程度。尽管非局部成本聚合增加了计算复杂性，但它有望在处理复杂场景和保持细节完整性的挑战时提供更佳的结果。该论文对于改进传统的立体匹配算法有着重要的理论贡献，通过引入非局部概念，为解决立体匹配中的难题提供了新的思路。通过这种树结构和相似性度量，研究人员期待能在实际应用中实现更准确、更具鲁棒性的立体匹配性能。

A Non-Local Cost Aggregation Method for Stereo Matching

Qingxiong Yang

City University of Hong Kong

http://www.cs.cityu.edu.hk/

qiyang/

Abstract

Matching cost aggregation is one of the oldest and still

popular methods for stereo correspondence. While effec-

tive and efﬁcient, cost aggregation methods typically ag-

gregate the matching cost by summing/averaging over a

user-speciﬁed, local support region. This is obviously on-

ly locally-optimal, and the computational complexity of the

full-kernel implementation usually depends on the region

size. In this paper, the cost aggregation problem is re-

examined and a non-local solution is proposed. The match-

ing cost values are aggregated adaptively based on pixel

similarity on a tree structure derived from the stereo im-

age pair to preserve depth edges. The nodes of this tree

are all the image pixels, and the edges are all the edges

between the nearest neighboring pixels. The similarity be-

tween any two pixels is decided by their shortest distance

on the tree. The proposed method is non-local as every n-

ode receives supports from all other nodes on the tree. As

can be expected, the proposed non-local solution outper-

forms all local cost aggregation methods on the standard

(Middlebury) benchmark. Besides, it has great advantage

in extremely low computationalcomplexity: only a total of 2

addition/subtraction operations and 3 multiplication oper-

ations are required for each pixel at each disparity level. It

is very close to the complexity of unnormalized box ﬁltering

using integral image which requires 6 addition/subtraction

operations. Unnormalized box ﬁlter is the fastest local cost

aggregation method but blurs across depth edges. The pro-

posed method was tested on a MacBook Air laptop comput-

er with a 1.8 GHz Intel Core i7 CPU and 4 GB memory. The

average runtime on the Middlebury data sets is about 90

milliseconds, and is only about 1.25× slower than unnor-

malized box ﬁlter. A non-local disparity reﬁnement method

is also proposed based on the non-local cost aggregation

method.

1. Introduction

Stereo correspondence has traditionally been, and con-

tinues to be, one of the most extensively researched topics

in computer vision. Stereo algorithms generally perform

(subsets of) the following four steps:

1. matching cost computation;

2. cost (support) aggregation;

3. disparity computation/optimization; and

4. disparity reﬁnement.

Scharstein and Szeliski [21] developed a taxonomy and

categorization scheme for stereo algorithms, and separat-

ed different stereo algorithms into two broad classes: local

and global algorithms. In a local algorithm, the disparity

computation at a given image pixel depends only on image

intensity/color values within a window. All local algorithms

require cost aggregation (step 2), and usually make implic-

it smoothness assumptions by aggregating support. Global

algorithms, on the other hand, make explicit smoothness as-

sumptions and then solve an optimization problem. Such al-

gorithms typically omit the cost aggregation step, but rather

seek a disparity solution (step 3) that minimizes a global

cost function. Popular global methods include dynamicpro-

gramming [2, 23], belief propagation [13, 14, 15, 16] and

graph cuts [3]. Unlike local algorithms, a global algorithm

estimates the disparity at one pixel using the disparity esti-

mates at all the other pixels.

Cost aggregationmethods are traditionally performed lo-

cally by summing/averaging matching cost over windows

with constant disparity. The most efﬁcient local cost aggre-

gation method is unnormalized box ﬁltering which runs in

linear time (relative to the number of image pixels) using in-

tegral image [24] (also known as a summed area table [8])

but blurs across depth edges. Yoon and Kweon [6] demon-

strated that edge-aware ﬁlters like bilateral ﬁlter [22] are

very effective for preserving depth edges and Yang et al.

[5] used bilateral ﬁlter for depth superresolution. However,

full-kernel implementation of the bilateral ﬁlter is slow.

A number of approximation methods have been devel-

oped to accelerate the bilateral ﬁlter, including Paris and

Durand’s fast bilateral ﬁlter [12], Porikli’s O(1) bilateral ﬁl-

ter [17] and Yang’s real-time bilateral ﬁlters [25, 26]. These

methods rely on quantization, and will degrade the perfor-

mance as demonstrated in [18]. Paris and Durand’s method

聚合

立体的

对应，一致

有效的

高效的

指定的

最佳的

实现，实施，履行

代价

重新检查，调查

自适应地

导出的

保护，保持，保留

相似性

预料的

做的比...好

基准

极其

加/减法

操作

乘法

非标准化的

积分图像

使..模糊

视差

优化

广泛地

最优化

分类学

分类方案

分开，隔开

广阔

类别

强度

隐式地

明确的

忽略，遗漏，省略

最小化

动态规划

置信传播

图切割

估计

固定的

非正规化的

积分图像

也被称为一个总结区域表

使..模糊

演示

边缘

感知

双边的

有效的

深度分辨

加速

量化

降低

演示的

近似法，接近

视差

与...有关

下载后可阅读完整内容，剩余7页未读，立即下载

小白的进阶

粉丝: 1515
资源: 12

非局部匹配成本聚合方法提升立体匹配精度

杨庆雄的《A Non-Local Cost Aggregation Method for Stereo Matching》代码

A Non-Local Cost Aggregation Method for Stereo Matching code

A Non-Local Cost Aggregation Method for Stereo Matching

A Non-Local Cost Aggregation Method for Stereo Matching source code

A Non-Local Cost Aggregation method for stereo Matching配套ppt讲解

A Non-Local Cost Aggregation Method for Stereo Matching 核心算法PPT讲解

杨庆雄 立体匹配算法 A Non-Local Cost Aggregation Method for Stereo Matching代码

A Non-Local Aggregation Method Stereo Matching相关资料

航空公司客户满意度数据转换与预测分析Power BI案例研究

课题设计-基于MATLAB平台的图像去雾处理+项目源码+文档说明+课题介绍+GUI界面

最新资源

杨庆雄立体匹配算法 A Non-Local Cost Aggregation Method for Stereo Matching代码