SIFT特征在双语印刷文档图像检索中的应用

需积分: 5 35 浏览量更新于2024-08-12 收藏 772KB PDF 举报

"基于SIFT特征的双语印刷文档图像检索" 本文主要探讨了一种基于尺度不变特征变换（Scale-Invariant Feature Transform，SIFT）的双语印刷文档图像检索系统，该系统能够从印刷文档图像中检索出中文和维吾尔语的关键词。作者来自新疆大学软件学院和信息科学与工程研究所。在摘要中，作者介绍了印刷文档检索系统的框架和处理步骤，这些步骤可以作为基于单词识别的文档检索系统的基础。系统的核心是利用局部特征，特别是SIFT特征，从图像中提取关键信息。SIFT特征是一种强大的图像描述符，它对尺度、旋转和亮度变化具有不变性，因此非常适合于在不同条件下的图像匹配任务。系统采用欧氏距离为基础的匹配算法来查询并找到印刷文档图像中的匹配单词。文章的关键词包括单词识别、SIFT和文档检索系统，表明了研究的主要焦点。作者指出，本系统中的一些创新思路可能对其他双语印刷文档图像检索系统有所启发。引言部分提到，世界上有大量的印刷文献，这强调了开发有效检索系统的重要性。在信息爆炸的时代，能够快速准确地定位到双语文档中的特定信息是一项挑战，而本文提出的基于SIFT特征的方法旨在解决这一问题。正文可能会详细介绍SIFT特征的提取过程，包括高斯差分尺度空间的构建、关键点的检测和描述符的计算。接着，会讨论欧氏距离匹配算法的工作原理以及如何优化匹配效率和精度。此外，系统的设计和实现细节，如数据预处理、特征匹配后的后处理策略、检索性能评估等方面也会有所阐述。在实验部分，作者可能会展示系统在各种测试集上的表现，比较与其他方法的性能差异，并分析可能影响检索效果的因素，比如图像质量、噪声、字体变化等。最后，讨论部分会总结研究成果，指出系统的优点和局限性，并对未来的研究方向提出建议。这篇研究论文深入研究了如何利用SIFT特征进行双语印刷文档图像的检索，对于多语言信息检索领域有着重要的理论和实践价值。

Bilingual printed Document Image retrieval Based on SIFT Feature

Eksan Firkat

, Abdusalam Dawut

, Palidan Tuerxun

, Askar Hamdulla

School of software, Xinjiang University Urumqi 830046, P.R. China

Institute of Information Science and Engineering, Xinjiang University Urumqi 830046, P.R.China

*corresponding author’s email: askarhamdulla@sina.com

Abstract—

This paper present a printed document

retrieval system which can retrieve Chinese and

Uyghur keywords from printed document images. In

this paper we introduce the framework of the

printed document retrieval system and processing

step behind it which can be based line for the word

spotting based document retrieval system, we also

describe the extraction algorithm that use local

feature as SIFT to extract the feature from image

and use Euclidean based matching algorithm to

query the matching word in printed document

image. Some novel idea applied in this system might

be helpful for some Other Bilingual printed

document image retrieve system.

Keywords- word spotting; SIFT; document

retrieval system

I. INTRODUCTION

The world has Huge amount of printed

literature which is a valuable asset for people to

study. But lots of them has not turn into the

search-able format which cause some difficulty

for people to use it very well , such as search

some specific information is very time

consuming. But with the development of the

information retrieval approach made this kind of

problem not so much obstacle anymore. However,

there are still some challenges in providing

effective search mechanisms. There are two main

retrieval approaches currently been proposed .

The first proposed method is Optical

Character Recognition (OCR) approach , this

approach just convert the printed document

image into text file not only an efficient storage

of the content but also makes it thoroughly

search-able. However ,when the quality of the

image became weak and some other noises

interruption, The OCR approach doesn’t work

very well . As alternative to OCR, keyword

spotting (KWS) approach can retrieving

document image very effectively, because of it’s

independent of the complex language structure

and focuses on the features of the given word, in

this approach , spotting is done by matching the

feature points between the query word and the

printed document images[1 2].

This paper presents a word spotting based

printed document retrieval system that uses

information of local descriptors as SIFT [3] . The

main steps of the proposed system is as follows :

Firstly the document image is segmented into

words which is the basic unit for matching , and

extract the SIFT features from the matching

units . In the next phase , the segmented words’

feature and location information of the matching

units it are stored as matrix prepare for the

matching stage. during the retrieval phase , the

query word is turned into image and it’s SIFT

feature is extracted , then Euclidean distance

based ratio matching algorithm is used to achieve

printed document image retrieval . The most

astonishing point of this system is that it can

retrieve both Chinese and Uyghur printed

document Image , and the detail of this approach

will be describe in the following part of this

paper.

II. SYSTEM FRAMEWORK

The framework of this proposed system is

describe as follow. The user input the query

word , then the word is turn into the image

prepared for matching step. The printed

document images are segmented into word image

clusters also prepare for matching step. To

overcome the aforementioned steps, with the help

of extraction algorithm to extract the local

features of the printed document image and query

word image to build feature vector cluster and

together with the Euclidean distance based

matching algorithm to retrieve the target word

and pinpoint the location of the query word

which satisfy the user’s propose for retrieve the

query word from printed document image . The

flow diagram is shown is figure 1:

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38669093

粉丝: 4
资源: 874

SIFT特征在双语印刷文档图像检索中的应用

两篇基于sift特征的图像检索论文

基于SIFT特征点提取的图像检索研究

基于SIFT特征向量的图像检索优化 (2013年)

基于SIFT特征的图像检索.docx

基于SIFT特征的图像检索.pdf

基于SIFT特征点改进聚类的图像检索方法研究.docx

基于 SIFT 特征匹配的监控图像自动拼接

基于SIFT特征的图像检索 (2).docx

基于SIFT特征图像检索的分布式应用.pdf

基于SIFT特征匹配的监控图像自动拼接.docx

最新资源