PLSA与卡方模型结合的视觉词袋分类方法

162 浏览量更新于2024-07-15 收藏 461KB PDF 举报

"这篇研究论文提出了一种基于概率潜在语义分析(PLSA)和卡方模型的视觉词袋方法，旨在解决传统BoVW模型中的视觉单词同义和歧义问题，以及去除视觉停用词，提升图像分类的性能。通过PLSA分析，推断出图像中的潜在主题，然后利用KL散度确定语义相关的同义词。接着，应用自适应软分配策略进行特征映射，并使用卡方模型重构视觉词汇直方图，以消除视觉停用词的影响。最终，借助支持向量机(SVM)进行对象分类。实验结果显示，这种方法能有效地提高视觉语义分辨率和分类性能。" 本文的核心内容围绕改进传统的基于视觉单词袋(BoVW)模型的方法展开。BoVW模型在对象分类中常遇到视觉单词的同义性和歧义性问题，这些问题可能导致分类效果下降。为了解决这一问题，研究者引入了PLSA(概率潜在语义分析)。PLSA是一种统计模型，能够分析视觉单词在不同图像中的共现概率，以此推断出图像中的潜在主题，进一步得到每个单词引起潜在主题的概率分布。这有助于揭示隐藏在大量视觉单词背后的语义结构。接下来，研究者使用KL散度来衡量视觉单词间的语义距离，识别出语义相关的同义词。KL散度是一种信息论中的距离度量，可以度量两个概率分布的相似程度。通过这种方式，可以创建一个更精确的视觉词汇表，降低同义词带来的影响。随后，文章提出了自适应软分配策略，它允许SIFT特征在多个同义词之间进行软映射，而不是传统的硬分配，这样可以更好地捕捉到特征的模糊边界，增强模型的表达能力。为了消除“视觉停用词”的影响，研究引入了卡方模型。视觉停用词指的是那些频繁出现但对图像分类贡献不大的视觉单词。卡方检验是一种统计分析方法，用于检测变量间的关系强度。在这里，它被用来识别和排除那些对图像语义分辨率影响较小的视觉单词，从而提升模型的语义分辨率。最后，支持向量机(SVM)作为分类器，用于根据改进后的视觉词汇直方图对图像进行分类。SVM是一种强大的监督学习模型，特别适合处理小样本和高维数据，因此在图像分类任务中表现出色。通过以上一系列步骤，该方法显著提高了视觉语义分辨率，增强了对象分类的准确性和鲁棒性。实验结果验证了这种方法的有效性，与传统方法相比，它在克服同义词和歧义问题上表现出色，同时提升了图像分类的性能。

2636 Zhao et al.: Bag of Visual Words method based on PLSA and Chi-Square Model for Object Category

solution using the “visual phrase” technique based on an improved frequency itemset mining

algorithm and a likelihood ratio test method, but this method only considers the co-occurrence

information among visual words and ignores the spatial information of visual words. Chen et

al. [25] gave a high discriminative visual phrase (DVP) method which can filter noise

efficiently overcoming the problem of feature information loss in traditional visual phrases

construction methods [26]. Roman et al. [27] proposed a new methodology for the automatic

estimation of the optimal amount of visual words that can be removed from a visual dictionary.

This method relies on a special definition of the entropy of each visual word when considered

as a random variable, and a new definition of the overlap of class models computed with a

normalized Bhattacharyya coefficient.

However, these methods all ignore the interrelationships between the visual words and

semantic concepts of different images. Therefore, some visual words with less occurrence

frequency but with high discrimination are easily mistaken for “visual stop-words”.

For the purpose that identifying the semantic relevance more accurately among visual

words, selecting soft assignment words number for different local features adaptively, as well

as eliminating “visual stop-words” effectively, a novel bag of visual words method based on

PLSA and chi-square model for object category is proposed in this paper. The main

contribution is to mine image semantic topics using PLSA model and K-L divergence,

accurately measuring the semantic distance between words. Meanwhile, the analyzing of the

ambiguity of SIFT features perform soft-assignment more accurately between features and

homoionym. Based on that, some “visual stop-words” are eliminated by chi-square model,

Therefore, the method described in this paper can effectively solve the problem of synonymy

and ambiguity of visual words, and improve the image classification accuracy by enhancing

distinguishing ability of visual dictionary.

3. Bag of visual words method based on PLSA and chi-square model for

object category

SIFT

features

Visual

words

Visual

dictionary

AKM

clustering

mapping

PLSA

K-L divergence

SVM classifer

Classification

result

Test

image

SIFT features

Image training set

category1

category2

categoryk

……

Homoionyms

obtained

Adaptive

soft assignment

Visual

vocabulary

histogram

Adaptive

soft assignment

Homoionyms

obtained

Visual

vocabulary

histogram

Visual

stop-words

elimination

Visual

vocabulary

histogram

reconstructed

test

Visual

stop-words

elimination

Visual

vocabulary

histogram

reconstructed

training

(| )

Pz w

(,)PwI

Fig. 2. The flow of bag of visual words method based on PLSA and chi-square model for object

PLSA与卡方模型结合的视觉词袋分类方法

pLSA_Topic_Model:基于pLSA的话题模型

人体动作识别新方法：基于pLSA的主题模型

基于PLSA的新闻评论情绪类别自动标注方法

基于双层PLSA模型的多标签图像标注

PLSA主题模型

一种融合PLSA 模型和树模型的文本病历语义分析新方法 (2013年)

PLSA模型详解

基于PLSA模型的中文文本分割方法

滚动轴承故障检测新方法：基于pLSA的小波包变换分析

LDA与pLSA：主题模型的贝叶斯视角

最新资源