非校准图像匹配的鲁棒方法：恢复未知的等距几何

需积分: 10 195 浏览量更新于2024-07-25 收藏 4.45MB PDF 举报

本文主要探讨了张正友等人在1994年发表的一篇关于非标定图像匹配的创新性研究论文，题目为《通过恢复未知的基线几何实现两幅未校准图像的鲁棒匹配》(A Robust Technique for Matching Two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry)。该文章发表在INRIA的《机器人、图像与视觉》(Robotique, image et vision)杂志上，第2273号期，特别关注的是在摄像机参数未知的情况下，如何利用图像间的唯一几何约束——即基线约束，来实现图像匹配。由于图像未被校准，这意味着它们可能来自不同的相机或者同一相机在不同时间拍摄，这就需要一种鲁棒的方法来处理这种不一致性。论文的核心贡献在于提出了一种基于未知基线几何的匹配策略，这种方法能够在不依赖于精确相机参数的情况下，有效地解决图像对之间的对应关系。作者团队，包括张正友、Rachid Deriche和Olivier Faugeras以及Quang-Tuan Luong，针对这一问题，提出了一个稳健的算法，它能够充分利用图像间的内在联系，如像素间的对应关系，来估计或恢复出未知的基线信息。他们可能采用了松弛法或相关技术来优化求解过程，确保在实际应用中即使面对噪声和不完美匹配，也能获得相对准确的结果。论文的摘要概述了研究的主要目标：通过一个稳健的方法，克服图像未标定的挑战，找到最佳的匹配策略，以增强图像配准的稳定性和可靠性。这种方法的应用领域可能包括计算机视觉中的导航、三维重建、物体跟踪或运动分析，尤其是在那些无法轻易获取相机参数的场景下。这篇论文对于非标定图像匹配问题的研究具有重要意义，它提供了一种实用且鲁棒的解决方案，对于推动计算机视觉技术在实际环境中的广泛应用具有深远影响。通过理解和掌握这种技术，研究人员和工程师可以扩展其技术能力，处理更广泛和复杂的真实世界图像匹配问题。

Zhang, Deriche, Faugeras, Luong

review the epip olar geometry, and then describ e in detail the three steps of the proposed

approach. A preliminary version of this pap er app eared in the pro ceedings of the third

European Conference on Computer Vision [12].

A similar idea has b een indep endently exploited by Xu et al. [57, 40], who also searched

for image corresp ondences through the recovery of the epip olar geometry. There are however

two main dierences:



The weak p ersp ective camera mo del is used in their work, and a full persp ective model is

used in ours. The choice of the most appropriate criterion for the recovery of the epipolar

geometry is not addressed in their work.



The search for the epipolar geometry is carried out with an exhaustive strategy in their

work. The complexity is prohibitively high even for a weak p ersp ective mo del (

(

where

and

are the numberofpoints in the rst and second image, respectively). The

complexity is reduced bychecking only a few closest points. In our work, some classical

techniques are applied to nd an initial set of correspondences.

We could apply the same strategy as that of Xu et al. [57, 40]. In fact, it has been applied to

solve the corresp ondence problem between two sets of 3D line segments [59]. When applying

it to the problem addressed in this paper, we need 8 p oint corresp ondences in order to

estimate the epipolar geometry. The complexity is then

(

). Supp ose b oth

and

are 100, the complexity is in the order of 10

! Xu et al. [57, 40] deal with also the motion

segmentation problem using epip olar constraint, which is not addressed in this pap er.

Recently, computer vision researchers havepaidmuch attention to the robustness of vi-

sion algorithms because the data are unavoidably error prone [17, 60]. Many the so-called

ro-

bust regression

methods have been proposed that are not so easily aected by outliers [25,48].

The reader is referred to [48, Chap. 1] for a review of dierent robust metho ds. The two most

popular robust metho ds are the

M-estimators

and the

least-median-of-squares

(LMedS) me-

thod (see Sect. 6.3). Kumar and Hanson [26] compared dierent robust metho ds for p ose

renement from 3D-2D line correspondences, while Meer et al. [38], for image smo othing.

Haralick et al. [18] applied M-estimators to solve the pose problem from p oint corresp on-

dences. Thompson et al. [51] applied the LMedS estimator to detect moving ob jects using

point corresp ondences b etween orthographic views. Other recentworks on the application

of robust techniques to motion segmentation include [52, 42, 3].

Regarding the robust recovery of the epipolar geometry, our work is closely related to

that of Olsen [43] and that of Shapiro and Brady [49]. Olsen uses a linear method to estimate

the epip olar geometry, which has already been shown in one of our previous work [32]to

be insuciently accurate. He further assumes that knowledge of the epipolar geometry,as

in many practical cases, is available. In particular, he assumes the epipolar lines are almost

aligned horizontally. This knowledge is then used to nd matches between the stereo image

pair, and a robust metho d (the M-estimator, see Sect. 6.3) is used to detect false matches

and to obtain a b etter estimate of the epip olar geometry. Shapiro and Brady also use a

linear metho d. The camera model is however a simplied one, namely an ane camera.

Correspondences are established by tracking corner features over time. False matches are

INRIA

ARobust Technique for Matching Two Uncalibrated Images 5

rejected through a

regression diagnostic

, which computes an initial estimate of the epipolar

geometry over all matches, and sees how the estimate changes if a match is deleted. The

match whose removal maximally reduces the residual is identied to be an

outlier

and is

rejected. The pro cedure is then repeated with the reduced set of matches until all outliers

have been removed. These two approaches (M-estimators and Regression diagnostics) work

well when the p ercentage of outliers is small and more imp ortantly when their derivations

from the valid matches are not to o large, as in the abovetwoworks. In the case described

in this paper, two images can b e quite dierent. There may b e a large percentage of false

matches (usually around 20%, sometimes 40%) using heuristic matching techniques such

as correlation, and a false matchmay b e completely dierent from the valid matches. The

robust technique describ ed in this pap er deals with these issues and can theoretically detect

outliers when they makeupas much as 50% of whole data.

2 Notation

A camera is describ ed by the widely used pinhole model. The co ordinates of a 3-D point

x; y; z

]

inaworld co ordinate system and its retinal image co ordinates

u; v

]

are related by

;

where

is an arbitrary scale, and

is a 3



4 matrix, called the p erspective pro jection

matrix. Denoting the homogeneous coordinates of a vector

x; y;



]

, i.e.,

[

x; y;



;

,wehave

The matrix

can b e decomposed as

[

]

;

where

isa3



3 matrix, mapping the normalized image co ordinates to the retinal image

coordinates, and (

;

) is the 3D displacement (rotation and translation) from the world

coordinate system to the camera co ordinate system. The most general matrix

can b e

written as

;

cot

 u

;

sin



0 0 1

;

(1)

where



is the fo cal length of the camera,



and

are the horizontal and vertical scale factors, whose inverses characterize the

size of the pixel in the world coordinate unit,

RR n2273

剩余42页未读，继续阅读

casmli

粉丝: 0
资源: 1

非校准图像匹配的鲁棒方法：恢复未知的等距几何

A Practical and Robust Bump-mapping Technique for Today's GPUs

A Robust Delaunay Triangulation Matching for Multispectral/Multidate Remote Sensing Image Registration

A robust sequence image matching algorithm for solving flight position and heading

Scale and rotation robust line-based matching for high resolution images

A Simple and Robust Feature Point Matching Algorithm Based on Restricted Spatial Order Constraints for Aerial Image Registration

A robust method for inverse halftoning via two-dimensional nonlinear pyramid

Robust salient object detection for RGB images

A Robust Corner Matching Technique：一种新颖的角匹配技术，使用基于轮廓的 arcss 检测器检测到的角。-matlab开发

Fast Algorithm for Robust Template Matching.pdf

A Robust Algorithm for Portfolio.pdf

最新资源