LSOD：图像匹配的新型局部稀疏正交描述符

171 浏览量更新于2024-08-26 收藏 440KB PDF 举报

"LSOD是Local Sparse Orthogonal Descriptor的缩写，是一种用于图像匹配的新型局部稀疏正交描述符。该方法受到了自动编码器（autoencoder）的启发，自动编码器是一种人工神经网络，旨在学习高效的编码。在自动编码器的基础上，LSOD引入了稀疏性和正交性约束，使得生成的描述符具有高度的判别能力。实验表明，LSOD不仅对几何变换（如视角变化、强度变化）和光度变换（如噪声、图像模糊、JPEG压缩）具有不变性，而且在效率上也表现出色。与现有的最先进的描述符在标准基准数据集上的比较显示，LSOD方法在准确性和效率两方面都表现更优。关键词包括：图像匹配、自动编码器、局部描述符。" LSOD（局部稀疏正交描述符）是一种创新的图像处理技术，主要应用于图像匹配。它借鉴了自动编码器的原理，自动编码器是一种能自我学习并压缩输入数据的深度学习模型。在图像处理中，特征描述符是用来识别和比较图像中的关键点的关键元素，而LSOD的目标是生成一种能够有效描述图像特征的表示方式。在LSOD中，通过在自动编码器结构中施加稀疏性和正交性约束，可以得到更加独特且区分度高的特征描述符。稀疏性意味着描述符中大部分元素接近于零，只保留最关键的信息，这样可以减少计算量，提高效率；而正交性则有助于提高描述符的独立性和互异性，使得每个特征向量在特征空间中彼此正交，从而增强描述符的稳定性。 LSOD的这种设计使其能够抵抗各种图像变换的影响，包括视点变化（例如旋转和平移）、光照强度的变化、图像噪声、模糊以及常见的数字图像压缩格式如JPEG压缩。这些特性使得LSOD在实际应用中更具鲁棒性。在性能评估中，LSOD与其他当前最先进的描述符进行了比较，结果表明，无论是在匹配的准确性还是执行速度上，LSOD都有明显的优势。这意味着它可能成为图像匹配领域的一个有力工具，尤其对于需要快速准确地进行图像分析和识别的任务，如目标检测、场景理解或自动驾驶等应用场景。 LSOD通过结合自动编码器和正交稀疏约束，提供了一种高效且具有强大鲁棒性的图像特征描述方法，为图像匹配带来了改进的解决方案。它的成功在于将机器学习的理论应用于实际问题，并通过优化设计提升了实际应用的性能。

LSOD: Local Sparse Orthogonal Descriptor for Image

Matching

Yiru Zhao, Yaoyi Li, Zhiwen Shao, Hongtao Lu

∗

Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering

Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R.China

{yiru.zhao, dsamuel, shaozhiwen, htlu}@sjtu.edu.cn

ABSTRACT

We propose a novel method for feature description used for

image matching in this paper. Our method is inspired by the

autoencoder, an artiﬁcial neural network designed for learn-

ing eﬃcient codings. Sparse and orthogonal constraints are

imposed on the autoencoder and make it a highly discrimi-

native descriptor. It is shown that the proposed descriptor

is not only invariant to geometric and photometric transfor-

mations (such as viewpoint change, intensity change, noise,

image blur and JPEG compression), but also highly eﬃcient.

We compare it with existing state-of-the-art descriptors on

standard benchmark datasets, the experimental results show

that our LSOD method yields better performance both in

accuracy and eﬃciency.

Keywords

Image matching; autoencoder; local feature descriptor

1. INTRODUCTION

Local feature descriptor is basal research of many com-

puter vision problems, such as image stitching [11], camera

calibration [19], object detection [14], and so on. SIFT key-

point detector and descriptor [12], which was proposed a

decade ago, has been proved eﬀective in many image match-

ing scenarios [18, 20], but it imposes a large computational

cost, especially when used for real-time applications such

as simultaneous localization and mapping (SLAM) systems.

Many algorithms were proposed to improve SIFT in the fol-

lowing years, SURF [3] is one of them, which is faster but

less accurate than SIFT. DSP-SIFT [5] raises a modiﬁcation

based on pooling gradient orientations. KAZE [1] introduces

a feature detection and description algorithm in nonlinear

scale spaces. It is accelerated in [2], by a descriptor called

AKAZE.

On the other hand, machine learning and neural network

are two of the rapidly growing ﬁelds in recent years and

∗

Corresponding author.

Permission to make digital or hard copies of all or part of this work for personal or

classroom use is granted without fee provided that copies are not made or distributed

for proﬁt or commercial advantage and that copies bear this notice and the full cita-

tion on the ﬁrst page. Copyrights for components of this work owned by others than

ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or re-

publish, to post on servers or to redistribute to lists, requires prior speciﬁc permission

and/or a fee. Request permissions from permissions@acm.org.

MM ’16, October 15-19, 2016, Amsterdam, Netherlands

 2016 ACM. ISBN 978-1-4503-3603-1/16/10. . . $15.00

DOI: http://dx.doi.org/10.1145/2964284.2967217

(1)

(2)

(3)

(n-2)

(n-1)

(n)

...

(1)

(2)

(3)

(n-2)

(n-1)

(n)

...

Image Patch Sparse Orthogonal Autoencoder

LSOD

descriptor

Figure 1: Illustration of calculating the LSOD de-

scriptor for an image patch.

have achieved great success in many classical computer vi-

sion problems, such as image classiﬁcation [9] and action

recognition [8]. Inspired by sparse autoencoder, one of the

well-known neural network models, we propose a new im-

age local feature descriptor. With the orthogonal features

learned from image dataset, autoencoder encodes an image

patch as the descriptor. Our method is called Local Sparse

Orthogonal Descriptor(LSOD), an example is shown in Fig-

ure 1. The main contributions of this paper include:

• Enhancing FAST detector with median ﬁlter scale pyra-

mid and intensity centroid.

• Proposing a method of training a sparse orthogonal

autoencoder used to describe the local image feature

patch.

2. RELATED WORK

Detector: The ﬁrst step in image matching is detect-

ing interest points in the image and there have been many

productive interest point detectors. Harris corner detector

[7] gives a mathematical approach for determining whether

an image patch is ﬂat, edge or corner. SIFT calculates his-

tograms of gray level gradient and chooses the peak orien-

tation as the main direction. SURF uses approximation of

block patterns, which is faster than computation of gradi-

ents. FAST and its extensions [16, 17] are good choices for

keypoints detecting in real-time systems. They are stable

and eﬃcient to ﬁnd corner keypoints, but sensitive to scale

variance. Therefore the FAST detector is often applied with

pyramid schemes for scale change.

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38546817

粉丝: 8
资源: 911

LSOD：图像匹配的新型局部稀疏正交描述符

LS7366:一个与LS7366正交编码器计数器接口的Arduino库

figure64.rar_CS_图像_图像 块 matlab_图像处理_图像稀疏分解

OrthogonalPolynomia​ls:用于评估正交多项式和使用它们进行信号近似的工具箱。-matlab开发

tinyMOBY:用于BioMoby服务的语义丰富的WSDL 2.0描述符-开源

image-matching-benchmark-baselines:图像匹配基准和挑战的基准

正交匹配追踪算法（OMP）：正交匹配追踪算法（OMP）是一种贪婪的压缩感知恢复算法。-matlab开发

改进的WLS-SVM：增强鲁棒性和稀疏近似

提升JPEG-LS效率：两步编码法图像压缩算法

LS-DYNA: 图像处理中的直线测长与角度计算

Linux基础：文件I/O、描述符与权限管理详解

最新资源

figure64.rar_CS_图像_图像块 matlab_图像处理_图像稀疏分解

OrthogonalPolynomials:用于评估正交多项式和使用它们进行信号近似的工具箱。-matlab开发