改进的半全局匹配与平面拟合的基于人口普查的立体视觉算法

需积分: 13 18 浏览量更新于2024-09-08 收藏 440KB PDF 举报

"A Census-Based Stereo Vision Algorithm Using Modified Semi-Global Matching and Plane Fitting to Improve Matching Quality" 这篇论文提出了一种基于人口普查的立体视觉算法，旨在通过改进的局部半全局匹配（Modified Semi-Global Matching, SGM）和平面拟合来提高匹配质量。在立体视觉系统中，两个不同角度拍摄的图像（左图像和右图像）被用来计算场景深度，即视差图。这种技术广泛应用于机器人导航、自动驾驶和3D重建等领域。首先，论文引入了一种基于分割的方法，针对遮挡区域和纹理稀疏区域的匹配质量进行了显著提升。这通过将左彩色图像或计算出的纹理图像进行分割来实现。局部成本计算采用了一种基于人口普查的关联方法，这种方法可以更好地捕捉图像的局部结构信息，与传统的绝对差分和平方差分相比，能提供更稳健的匹配。论文进一步提出，对匹配的置信度进行测量，只对那些不自信或者无纹理的像素进行估算。这通过为对应分割区域计算一个视差平面来完成。这种方式能够减少噪声和不准确匹配的影响，特别是在边缘和纹理不明显的地方。为了进一步提高局部优化匹配的质量，论文采用了带有亚像素精度的修改版半全局匹配步骤。与标准的SGM方法不同，该算法不是在整个图像上进行视差优化，而是选择有高置信度的像素区域，这样可以降低全局优化过程中的计算复杂性，同时保持较高的匹配精度。此外，通过平面拟合，算法可以更好地处理平坦区域的视差估计，减少错误匹配，尤其是在物体表面平行于相机视线的情况下。这种方法有助于生成更加连续和平滑的视差图，从而提高立体视觉系统的整体性能。这篇论文为立体视觉匹配问题提供了一个创新的解决方案，通过结合分割、人口普查和改进的半全局匹配策略，提高了匹配的准确性，特别是对于困难场景，如遮挡和纹理稀疏区域。这对于实际应用中的立体视觉系统具有重要的实用价值。

A Census-Based Stereo Vision Algorithm Using Modiﬁed Semi-Global Matching

and Plane Fitting to Improve Matching Quality

∗

Martin Humenberger, Tobias Engelke, Wilfried Kubinger

AIT Austrian Institute of Technology

Donau-City-Strasse 1, 1220 Vienna, Austria

martin.humenberger@ait.ac.at, tobias.engelke@ait.ac.at, wilfried.kubinger@ait.ac.at

Abstract

This paper introduces a new segmentation-based ap-

proach for disparity optimization in stereo vision. The main

contribution is a signiﬁcant enhancement of the matching

quality at occlusions and textureless areas by segmenting

either the left color image or the calculated texture image.

The local cost calculation is done with a Census-based cor-

relation method and is compared with standard sum of ab-

solute differences. The conﬁdence of a match is measured

and only non-conﬁdent or non-textured pixels are estimated

by calculating a disparity plane for the corresponding seg-

ment. The quality of the local optimized matches is in-

creased by a modiﬁed Semi-Global Matching (SGM) step

with subpixel accuracy. In contrast to standard SGM, not

the whole image is used for disparity optimization but hor-

izontal stripes of the image. It is shown that this modi-

ﬁcation signiﬁcantly reduces the memory consumption by

nearly constant matching quality and thus enables embed-

ded realization. Using the Middlebury ranking as evalua-

tion criterion, it is shown that the proposed algorithm per-

forms well in comparison to the pure Census correlation.

It reaches a top ten rank if subpixel accuracy is supposed.

Furthermore, the matching quality of the algorithm, espe-

cially of the texture-based plane ﬁtting, is shown on two

real-world scenes where a signiﬁcant enhancement could

be achieved.

1. Introduction

3D data perception of the surrounding environment of a

robot platform or an autonomous vehicle is essential for re-

liable operation. Common sensors are based on laser, radar,

or time-of-ﬂight. These techniques enable high quality 3D

perception with the drawback of low resolution and high

costs. For a number of robot applications such as people or

∗

This work was has been supported by the European Union project

ROBOTS@HOME under grant FP6-2006-IST-6-045350.

scene recognition as well as robot navigation digital cam-

eras are used. Stereo vision is technology that uses two in

parallel mounted digital cameras to determine the depth of

a scene. Advantages are the low price, the high resolution

and the fact that the images can be used for any other appli-

cation as well. For home applications it is also quite useful

because it is purely passive technology and thus does not

effect the surrounding environment.

For depth calculation the so called correspondence prob-

lem (stereo matching), which is the search for correspond-

ing projections of the same scene point onto both camera

planes, has to be solved. The horizontal displacement of

corresponding pixels is denoted as disparity. Area-based

stereo matching algorithms try to calculate the complete

disparity map, which is an image of the same size as the

camera images with the disparity instead of the intensity

value for each pixel. The advantage is that with a single

capture a huge number of surrounding 3D points can be de-

termined. The matching process is based on similarity com-

parison of areas of the images (correlation), thus textureless

areas are a difﬁcult challenge. Pixels visible in only one of

the images are called occlusions and obviously cannot be

found by correlation.

In general, area-based matching algorithms calculate the

costs for each matching candidate and optimize them af-

terwards to ﬁnd the correct matches. Once the local costs

are calculated, a minimum search (winner takes all,WTA)

can be used to ﬁnd the best matching pixels. Another strat-

egy is to apply global optimization to the local costs to en-

hance the probability of correct matching. Here, not only

the pixels’ neighborhoods are used to calculate the costs,

but the whole scanline or even the whole image. With

these techniques, especially on textureless areas better re-

sults can be achieved. The drawback of global optimizing

algorithms is the huge processing time and memory con-

sumption. To the authors’ knowledge, no implementation of

a global optimization is commercially available for purely

embedded real-time platforms without dedicated hardware

such as ﬁeld programmable gate arrays (FPGA).

下载后可阅读完整内容，剩余7页未读，立即下载

jackknife999999

粉丝: 19
资源: 10

改进的半全局匹配与平面拟合的基于人口普查的立体视觉算法

Python库 code_census-0.0.10 从 PyPI 官网下载指南

Python官方库ps2_census-0.12.0发布，强化云原生应用支持

Python库census-0.8.4快速安装指南与介绍

Adult-Census-Income-Classification

liv-census-test-spa

census-custom-api-docs

census-fwmt-csv-service

US-Census-Foreign-Trade-Interactive-Vizualization

census-data-challenge

United-States-Census-Data-Analysis-using-MapReduce

最新资源