3D物体识别中范围图像的不变表征特性研究：高斯曲率的应用

需积分: 9 120 浏览量更新于2024-07-31 收藏 14.37MB PDF 举报

在1986年的计算机视觉、图形学和图像处理领域的重要文献《3D对象识别中的不变表面特性：基于范围图像的方法》（"InvariantSurfaceCharacteristicsfor3DObjectRecognitioninRangeImages"，作者Paul J. Besl）中，研究者探讨了如何利用高斯曲率这一等距不变量来增强三维物体识别的精度。在范围图像（或深度映射）的研究中，这种类型的传感器输入数据因其直接表示表面信息而备受关注。早期的研究表明，通过数据驱动的方式处理传感器数据至关重要，目标是生成丰富的局部描述，以便于后续更深入的理解和分析。高斯曲率是表面的一种等距不变量，这意味着它只依赖于表面的E、F、G函数及其导数。这些函数在微分几何中起着关键作用，它们构成了光滑表面的两个基本形式——第一基本形式（测地线的度量）和第二基本形式（曲率）。第一基本形式提供了关于表面形状的局部信息，如曲率线和长度测度，而第二基本形式则描述了曲面的凹凸性，包括正曲率区域（如球面），负曲率区域（如鞍面）和零曲率区域（平面）。文章提出，通过对范围图像中的高斯曲率进行计算和分析，可以提取出对形状和结构有用的不变特征。这对于诸如工业自动化中的物体抓取、导航机器人中的障碍物避开等任务具有重要意义，因为这些任务要求系统能够准确识别和理解复杂的三维空间结构。此外，这种方法还可能应用于医学成像、机器人导航、虚拟现实等领域，通过不变性提高了环境理解和物体识别的鲁棒性。该论文的主要贡献在于开发了一套算法，能够有效地从范围图像中提取高斯曲率特征，并将其与传统的图像处理技术相结合，如边缘检测、纹理分析等，以实现更精确的3D物体识别。这种方法不仅扩展了计算机视觉的理论基础，也为实际应用中的复杂场景提供了有效的解决方案。这篇论文对现代计算机视觉和机器人技术的发展产生了深远的影响。

3D OBJECT RECOGNITION

purpose vision system must be aware that key object features may not be visible even

when an object is present and visible in thefierd of view.

In situations where it is not

possible to take intelligent actions based on what is currently visible, the general

purpose system should automatically request the acquisition of image data from new

vantage points [32].

The visible-invariant surface characteristics that we have decided to use are the

Gaussian curvature (K)

and the

mean curvature

(H), which are referred to collec-

tively as

surface curvature.

We abbreviate this term as

S-curvature.

When a surface

region is visible, its S-curvature is invariant to

changes in surface parameterization

and to

translations and rotations

of object surfaces. In addition, mean curvature is an

extrinsic

surface property whereas Gaussian curvature is

intrinsic.

These terms are

discussed later. Differential geometry emphasizes that these are quite reasonable

surface features to consider.

Since we can seldom obtain perfect sensor data from the real world, it is desirable

to compute a “rich” characterization of a surface that preserves the surface structure

information and is insensitive to noise. Noise insensitivity may be achieved by

computing redundant, or at least “overlapping,” information about a surface. In

order to have a very rich geometric representation, we propose to combine surface

critical points (local maxima, minima, and saddle points) and large metric determi-

nant points (depth-discontinuities) with the surface curvature information to char-

acterize a depth map surface in more detail. They provide useful complementary

information and can be computed for a small additional cost. Given a depth map

surface characterization, we suggest that depth map surface region characteristics

can be matched against pre-computed object model surface region characteristics

guided by depth-discontinuity and critical point information to achieve object

recognition.

The matching algorithm of a robust 3-D object recognition system must be

view-independent. One could use multiple view ideas similar to those of Koenderink

and van Doorn (visual potential) [30, 311 or Chakravarty and Freeman (characteris-

tic views) [lo], but we are pursuing a new, more compact, scheme that does not

increase its storage requirements so dramatically as object complexity increases.

After the matching algorithm has produced a list of possible objects and their

respective locations and orientations, we can use a depth-buffer algorithm to create a

synthetic depth map using the world model. Verification matching could be done

directly between the synthetic depth map and the sensor data, or we may run the

surface characterization algorithm on the synthetic data to yield a synthetic scene

description that could be matched against the surface characterization scene descrip-

tion computed from the sensor data. If major discrepancies exist, the system should

try to remedy the problems in its understanding automatically. It may also be

necessary to compute our surface characterization using different window sizes

(scales) and correlate features in this scale-space dimension to help overcome the

effects of noise. The matching algorithm, the matching object representation, the

feedback process, and scale-space ideas require further study.

5. REVIEW OF DIFFERENTIAL GEOMETRY OF SURFACES

In Section 3, we discussed how range-image object recognition might be decom-

posed into a surface recognition problem. We assume that surfaces can be recog-

nized by their characteristics. But what does this term “surface characteristic” mean?

BESL AND JAIN

We define a

characteristic

of a mathematical entity, such as a surface function, to be

any well-defined feature that can be used to distinguish between different mathe-

matical entities of the same type. We may consider characteristics that uniquely

determine a corresponding entity or characteristics that are many-to-one although

the former are more desirable. One simple example of a characteristic that uniquely

determines a function is a detailed description of the function itself. Another simple

example of a many-to-one characteristic is the following: A circle and an ellipse are

round figures. This round characteristic distinguishes them from rectangles, trian-

gles, and other polygons; however, it does not distinguish between circles and

ellipses. In this section, we aim to find a good mathematical characterization of

depth-map function surfaces.

It is well known that

curvature, torsion, and speed

uniquely determine the shape

of 3-D space curves [5, 15, 23, 37, 451. We must assume that the reader is familiar

with these basic concepts. These characteristics are the ideal type of characteristic for

a mathematical entity. They are invariant to coordinate transformations and they

have a one-to-one relationship with curve shapes. We now discuss surface character-

istics with similar properties.

We first write down the explicit parametric form of a general surface S with

respect to a known coordinate system:

S = {(x,

z): x = d(u, u), y = e(u, u), z =f(u, u), (u, u) E D c

R2).

We refer to this general parametric representation as x( U, u), where the x component

of the x function is

d(u,u),

the y component of x is e(

u,u),

and the z component is

f( U, u). In a later section, we use the graph surface (Monge patch surface) form to

describe depth map surface functions. In the graph surface case,

d(u,

u) = u and

e( U, u) = u, which are extremely simple functions. We consider only

smooth

surfaces,

where all three parametric functions possess continuous second partial derivatives.

There are two basic mathematical entities that are considered in the classical

analysis of smooth surfaces. They are referred to as the first and second fundamental

forms of a surface [23, 37,451. It is shown subsequently how complete knowledge of

these forms uniquely characterizes and quantifies general smooth surface shape.

Modern mathematics favors an equivalent formulation of this knowledge in terms of

the metric tensor and the Weingarten mapping (the “shape” operator), which we

also discuss. We begin our review by defining the fundamental forms of a surface in

terms of the general explicit surface parameterization x( U, u).

The first fundamental form I of a surface defined by x(u, u) is given by the

following quadratic form:

Z(u, u, du, du) = dx * dx = [du du]

where the [g] matrix elements are defined to be

811

= E = x, . x,

g22

= G = x, . x,

g12 = ET21 = F = x, * xv

剩余47页未读，继续阅读

xieyong444091658

粉丝: 0

3D物体识别中范围图像的不变表征特性研究：高斯曲率的应用

Invariant Local Feature for Object Recognition

Convolutional-Recursive Deep Learning for 3D Object Classification

Bionic RSTN invariant feature extraction method for image recognition and its application

PATTERN RECOGNITION IN GREY LEVEL IMAGES USING MOMENT BASED INVARIANT FEATURES

sift-Object Recognition from Local Scale-Invariant Features

Learning complex cell features with cooperating pooling operation for object recognition

Visual Object Recognition

Invariant pattern recognition

View-Invariant Gait Recognition Method by 3D Convolutional Neural Network

The illumination-invariant recognition of 3D objects using local color invariants.pdf

最新资源