Shape Google: a computer vision approach to isometry invariant shape retrieval
Maks Ovsjanikov
ICME
Stanford University
maks@stanford.edu
Alexander M. Bronstein
Dept. of Computer Science
Technion
bron@cs.technion.ac.il
Michael M. Bronstein
Dept. of Computer Science
Technion
mbron@cs.technion.ac.il
Leonidas J. Guibas
Dept. of Computer Science
Stanford University
guibas@cs.stanford.edu
Abstract
Feature-based methods have recently gained popularity
in computer vision and pattern recognition communities, in
applications such as object recognition and image retrieval.
In this paper, we explore analogous approaches in the 3D
world applied to the problem of non-rigid shape search and
retrieval in large databases.
1. Introduction
Large databases of 3D models available in the public domain have created the demand for shape search and retrieval algorithms capable of finding similar shapes in the same way a search engine responds to text queries. Since many shapes manifest rich variability, shape retrieval is often required to be invariant to different classes of transformations and shape variations. One of the most challenging settings is the case of non-rigid or deformable shapes, in which the class of transformations may be very wide due to the capability of such shapes to bend and assume different forms.
An analogous problem in the image domain is image retrieval, the problem of finding images depicting similar scenes or objects. Images, like three-dimensional shapes, may manifest significant variability, and the main challenge is to create retrieval techniques that are insensitive to such changes while still providing sufficient discrimination power to distinguish between different shapes. In the computer vision and pattern recognition communities, feature-based methods have recently gained popularity with the introduction of the scale invariant feature transform (SIFT) [12] and similar algorithms [14, 1]. The consistently good performance of these methods in problems such as object recognition and image retrieval, together with the public availability of code, has made SIFT-like approaches a commodity and a de facto standard.
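To make this concrete, the following is a minimal sketch (not part of the original paper) of how such a local feature pipeline is typically used in practice. It assumes OpenCV's SIFT implementation; "query.jpg" is a hypothetical example image.

import cv2

# A minimal sketch, assuming OpenCV (cv2) is installed; "query.jpg" is a
# hypothetical example image, not data from the paper.
img = cv2.imread("query.jpg", cv2.IMREAD_GRAYSCALE)
sift = cv2.SIFT_create()
# keypoints: detected interest points; descriptors: one 128-dimensional
# vector per keypoint, designed to be robust to scale and rotation changes.
keypoints, descriptors = sift.detectAndCompute(img, None)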
One of the advantages of feature-based approaches in image retrieval problems is that they allow one to think of images as collections of primitive elements (visual “words”), and hence to use well-developed methods from text search. One of the best-known implementations of these ideas is Video Google,¹ a web application for object search in large collections of images and videos developed at Oxford University by Zisserman and collaborators [28, 6], so named by analogy with the famous text search engine. Video Google makes use of feature detectors and descriptors to represent an image as a collection of visual words indexed in a “visual vocabulary.” Counting the frequency of visual word occurrences in the image yields a representation referred to as a “bag of features.” Images containing similar visual information tend to have similar bags of features, and thus comparing bags of features allows one to retrieve similar images. Such a method is suitable for indexing and searching very large (Internet-scale) databases of images.
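As an illustration, the bag-of-features construction and comparison described above can be sketched as follows. This is a sketch under our own assumptions, not the authors' implementation: the visual vocabulary is assumed to have been built offline (e.g., by k-means clustering of descriptors from a training set), and the function names are hypothetical.

import numpy as np

def bag_of_features(descriptors, vocabulary):
    """Quantize local descriptors (n x d) against a visual vocabulary (k x d)
    and return the normalized k-bin histogram of visual-word occurrences."""
    # Hard vector quantization: assign each descriptor to its nearest word.
    dists = np.linalg.norm(descriptors[:, None, :] - vocabulary[None, :, :], axis=2)
    words = dists.argmin(axis=1)
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / hist.sum()  # frequency of each visual word in the image

def retrieve(query_bof, database_bofs, top_k=5):
    """Rank database images by L1 distance between their bags of features."""
    distances = np.abs(database_bofs - query_bof).sum(axis=1)
    return np.argsort(distances)[:top_k]

Hard vector quantization and an L1 distance are only one possible choice; soft assignment and other histogram metrics are common variants of the same idea.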
While very popular in computer vision, feature-based approaches are less known and used in the shape analysis community. The first reason is the lack of efficient and robust feature descriptors, similar to SIFT, that could be as ubiquitously adopted. One of the important properties of SIFT is its discrimination power combined with robustness to different image transformations. While several works proposed feature-based approaches for rigid shapes [20, 10, 13, 7, 9], very few are capable of dealing with non-rigid shape deformations [18, 22, 3, 32]. Second, shapes are usually poorer in features compared to images, and thus descriptors are less discriminative.
In this paper, we bring the spirit of feature-based com-
¹ The Oxford Video Google project is not affiliated with the company Google, Inc.