Improving Unsupervised Dictionary-Based Clustering with the Fisher Discriminant


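"This research paper proposes a novel Fisher discriminant unsupervised dictionary learning (FD-UDL) method that aims to improve the clustering performance of dictionary learning methods in unsupervised settings. It does so by applying a Fisher discriminant criterion to the dictionary elements, which encourages diversity between different sub-dictionaries while preserving coherence within each sub-dictionary. The discriminant is incorporated into the optimization problem of unsupervised dictionary learning, and an analytical solution to that problem is provided, yielding the learned dictionary."

Summary: In machine learning and computer vision, unsupervised dictionary learning has become an important tool for data representation and analysis. The goal of dictionary learning is to find a set of basis elements that can sparsely represent the input data without prior knowledge of its class labels. The Fisher discriminant unsupervised dictionary learning (FD-UDL) method proposed in this paper uses the theory of Fisher discriminant analysis, within an unsupervised framework, to improve the clustering performance of dictionary learning.

Fisher discriminant analysis is a statistical method that seeks the projection maximizing the ratio of between-class scatter to within-class scatter, so that classes become easier to separate. In FD-UDL this idea is carried over to dictionary learning: the aim is to make different sub-dictionaries dissimilar from one another while keeping the elements inside each sub-dictionary coherent. Different sub-dictionaries can then capture different characteristics of the data, which improves the clustering result.

Specifically, FD-UDL introduces a new Fisher discriminant criterion that accounts for the relationships between dictionary elements. Optimizing this criterion yields a dictionary whose sub-dictionaries are well separated from each other while each remains compact internally, which helps to uncover latent structure and patterns in unlabeled data.

In addition, the authors provide an analytical solution to the optimization problem, which makes the learning procedure more efficient because the solution can be computed in closed form rather than by a generic numerical search.

The keywords "Fisher discriminant", "Dictionary learning", "Sparse representation" and "Unsupervised learning" reflect the core content of the article. The Fisher discriminant supplies a statistical measure of class separability, dictionary learning concerns the sparse representation of data, and unsupervised learning means that the model is trained without label information. Together these concepts form the basis of FD-UDL and give it a potential advantage in unsupervised clustering tasks.

Overall, the paper offers a new perspective on unsupervised dictionary-based clustering: by introducing Fisher discriminant analysis, it opens a new route to better clustering performance in unsupervised learning. The paper evaluates the method on synthetic data and on face and handwritten digit clustering; the approach may also extend to other applications that need to mine intrinsic structure and patterns from large amounts of unlabeled data.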

Unsupervised dictionary learning with Fisher discriminant for clustering

Mai Xu, Haoyu Dong, Chen Chen, Ling Li

Beihang University, Beijing 100191, China

University of Kent, UK

article info

Article history:

Received 10 September 2015

Received in revised form 8 December 2015

Accepted 28 January 2016

Communicated by Deng Cai

Available online 23 February 2016

Keywords:

Fisher discriminant

Dictionary learning

Sparse representation

Unsupervised learning

abstract

In this paper, we propose a novel Fisher discriminant unsupervised dictionary learning (FD-UDL) approach, for improving the clustering performance of state-of-the-art dictionary learning approaches in unsupervised scenarios. This is achieved by employing a novel Fisher discriminant criterion on dictionary elements to encourage the diversity between different sub-dictionaries, and also the coherence within each sub-dictionary. Such a discriminant is incorporated to formulate the optimization problem of unsupervised dictionary learning. Furthermore, we provide an analytical solution to the proposed optimization problem, obtaining the learned dictionary for clustering tasks. Unlike previous approaches for unsupervised clustering, the proposed FD-UDL approach takes into account both within-class and between-class scatters of sub-dictionaries, rather than only considering diversity between different sub-dictionaries. Finally, experiments on synthetic data, face and handwritten digit clustering tasks show the improved clustering accuracy over other state-of-the-art dictionary learning and clustering approaches.
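To make the notions of within-class and between-class scatter of sub-dictionaries concrete, the following is a minimal sketch of the classical Fisher scatter quantities computed over dictionary atoms. It is not the criterion defined in the paper, whose exact form is not given in this excerpt. The function name `fisher_scatters`, the helper `unit_columns`, the grouping of atoms by a `labels` array and the toy data are all assumptions made for illustration.

```python
import numpy as np


def fisher_scatters(D, labels):
    """Return (within, between): traces of the classical scatter matrices.

    D      : (n_features, n_atoms) array, one dictionary atom per column.
    labels : (n_atoms,) integer array giving each atom's sub-dictionary.
    """
    global_mean = D.mean(axis=1, keepdims=True)
    within, between = 0.0, 0.0
    for c in np.unique(labels):
        sub = D[:, labels == c]                      # atoms of one sub-dictionary
        mean_c = sub.mean(axis=1, keepdims=True)
        within += np.sum((sub - mean_c) ** 2)        # spread inside the group
        between += sub.shape[1] * np.sum((mean_c - global_mean) ** 2)
    return within, between


def unit_columns(M):
    """Scale every column (atom) to unit Euclidean norm."""
    return M / np.linalg.norm(M, axis=0)


rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 5)              # two sub-dictionaries, 5 atoms each
centers = np.array([[1.0, 0.0, 0.0],
                    [0.0, 1.0, 0.0]]).T    # one centre per column, shape (3, 2)

# Toy dictionaries: atoms clustered around two centres vs. unstructured atoms.
D_grouped = unit_columns(centers[:, labels] + 0.05 * rng.standard_normal((3, 10)))
D_random = unit_columns(rng.standard_normal((3, 10)))

w_g, b_g = fisher_scatters(D_grouped, labels)
w_r, b_r = fisher_scatters(D_random, labels)
print("grouped atoms: between/within =", b_g / w_g)
print("random atoms:  between/within =", b_r / w_r)
```

A dictionary whose sub-dictionaries are internally coherent and mutually distinct gives a large between/within ratio, which is the kind of structure a Fisher-style criterion rewards.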

1. Introduction

Dictionary learning [1] aims to learn an over-complete dictionary from a set of training signals (e.g. image patches) using machine learning approaches. The elements (also called atoms) of such a learned over-complete dictionary can be seen as basic texture patterns of generic signals. In the late 1990s, Olshausen and Field [2] showed that the sparse representation of texture elements, from a learned dictionary, is very similar to the simple-cell receptive fields in the mammalian primary visual cortex. Thereby, an image patch can be efficiently approximated by a linear combination of a few elements (the so-called sparse representation) from an over-complete dictionary, which is learned from training image patches either offline [3] or online [4]. Together with sparse representation, dictionary learning has been extensively applied in both the image processing and computer vision communities [1,5], such as in the tasks of super-resolution [6] and visual tracking [7].
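As an illustration of approximating a signal by a linear combination of a few atoms from an over-complete dictionary, here is a minimal NumPy sketch of greedy orthogonal matching pursuit. The function name `omp`, the random dictionary and the synthetic signal are assumptions for this example; they are not taken from the paper, and the sketch omits the refinements found in library implementations.

```python
import numpy as np


def omp(D, x, n_nonzero):
    """Greedy OMP: approximate x using at most n_nonzero columns (atoms) of D."""
    coef = np.zeros(D.shape[1])
    support = []
    residual = x.copy()
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break  # nothing new to add (residual is numerically exhausted)
        support.append(k)
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
        coef[:] = 0.0
        coef[support] = sol
    return coef


rng = np.random.default_rng(0)
D = rng.standard_normal((64, 128))         # over-complete: 128 atoms in 64 dims
D /= np.linalg.norm(D, axis=0)             # unit-norm atoms

# Synthetic signal built from 3 atoms of the dictionary.
true_coef = np.zeros(128)
true_support = rng.choice(128, size=3, replace=False)
true_coef[true_support] = rng.uniform(1.0, 2.0, size=3)
x = D @ true_coef

coef = omp(D, x, n_nonzero=3)
print("atoms selected by OMP:", np.flatnonzero(coef))
print("atoms used to build x:", np.sort(true_support))
print("relative error:", np.linalg.norm(x - D @ coef) / np.linalg.norm(x))
```

Greedy selection is not guaranteed to find the generating atoms for every dictionary, so the printed supports should be compared rather than assumed equal.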

1.1. Related work

At the beginning, reconstructive dictionaries, aiming at faithfully reconstructing signals, were learned for sparse representation based image reconstruction tasks [3,8–11]. A representative algorithm for effectively learning the reconstructive dictionary is K-SVD [3]. Specifically, K-SVD iteratively alternates between sparse representation and dictionary update steps. In the first step, K-SVD utilizes orthogonal matching pursuit (OMP) [12] to obtain the sparse coefficients of all training signals. In the next step, each dictionary element is updated sequentially by using singular value decomposition (SVD) to minimize the error of reconstructing the relevant training signals. Such an update is similar to the k-means algorithm [13], hence the name K-SVD.
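The dictionary update step described above can be sketched as follows: for one atom, collect the signals whose codes use it, remove that atom's contribution from their reconstruction, and replace the atom and its coefficients with the best rank-1 fit from an SVD. This is a minimal sketch, not a full K-SVD implementation. The function name `ksvd_update_atom` is hypothetical, and the sparse codes in the demo are random placeholders rather than the output of an OMP stage.

```python
import numpy as np


def ksvd_update_atom(X, D, A, k):
    """Update atom k of D and row k of A in place with a rank-1 SVD fit.

    X : (n_features, n_signals) training signals
    D : (n_features, n_atoms) dictionary with unit-norm columns
    A : (n_atoms, n_signals) sparse coefficient matrix
    """
    users = np.flatnonzero(A[k, :])          # signals whose code uses atom k
    if users.size == 0:
        return                               # unused atom: left unchanged here
    # Reconstruction error of those signals when atom k is left out.
    E = X[:, users] - D @ A[:, users] + np.outer(D[:, k], A[k, users])
    U, s, Vt = np.linalg.svd(E, full_matrices=False)
    D[:, k] = U[:, 0]                        # new unit-norm atom
    A[k, users] = s[0] * Vt[0, :]            # new coefficients, same support


rng = np.random.default_rng(0)
n_features, n_atoms, n_signals = 8, 12, 30
X = rng.standard_normal((n_features, n_signals))
D = rng.standard_normal((n_features, n_atoms))
D /= np.linalg.norm(D, axis=0)

# Placeholder sparse codes: 3 random atoms per signal (stand-in for OMP output).
A = np.zeros((n_atoms, n_signals))
for j in range(n_signals):
    idx = rng.choice(n_atoms, size=3, replace=False)
    A[idx, j] = rng.standard_normal(3)
support_before = A != 0

err_before = np.linalg.norm(X - D @ A)
for k in range(n_atoms):                     # sequential sweep over all atoms
    ksvd_update_atom(X, D, A, k)
err_after = np.linalg.norm(X - D @ A)
print("reconstruction error before:", err_before)
print("reconstruction error after: ", err_after)
```

Because the current atom and its coefficients are themselves a rank-1 candidate, the SVD fit cannot increase the reconstruction error, and restricting the update to the signals that already use the atom keeps the codes sparse.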

Most recently, discriminative dictionary learning [14–19] has emerged as a promising way to deal with the tasks of classification and clustering in the computer vision area. The approaches to discriminative dictionary learning can be categorized into two classes: supervised and unsupervised forms. For supervised classification, discriminants need to be developed to enhance the discriminative ability of the learned dictionary. For example, Mairal et al. [15] proposed a softmax discriminative cost term for multiple sub-dictionaries, each of which is relevant to the training signals of one class. In this way, an image patch can be classified by seeking the minimum reconstruction error of these sub-dictionaries over the test image patch. Besides, Yang et al. [16] proposed a Fisher discriminant on sparse coefficients to learn a structured dictionary in the supervised scenario, so that the sparse coefficients have small within-class scatters but large between-class scatters. Then, the test patches can be classified using sparse representation with discriminative information in both the minimum reconstruction error and the sparse coefficients. Gao

Contents lists available at ScienceDirect

journal homepage: www.elsevier.com/locate/neucom

Neurocomputing

http://dx.doi.org/10.1016/j.neucom.2016.01.076

Corresponding author.

E-mail address: MaiXu@buaa.edu.cn (M. Xu).

Neurocomputing 194 (2016) 65–73
