Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Le Li^1, Jianjun Yang^2, Kaili Zhao^3, Yang Xu^3, Honggang Zhang^3, Zhuoyi Fan^4

^1 School of Computer Science, University of Waterloo, Ontario N2L3G1, Canada
^2 Department of Computer Science, University of North Georgia, Oakwood, GA 30566, USA
^3 PRIS Lab, Beijing University of Posts and Telecommunications, Beijing 100876, P.R. China
^4 SEEE, Huazhong University of Science and Technology, Hubei 430074, P.R. China

Email: l248li@uwaterloo.ca, jianjun.yang@ung.edu, {xj992adolphxy, kailizhao1989}@gmail.com, zhhg@bupt.edu.cn, fanzhuoyi@hust.edu.cn
Abstract— Non-negative matrix factorization (NMF) has proved effective in many clustering and classification tasks. The classic ways to measure the error between the original and the reconstructed matrix are the $\ell_2$ distance and the Kullback-Leibler (KL) divergence. However, these error measures do not properly handle nonlinear cases. As a consequence, alternative measures based on nonlinear kernels, such as correntropy, have been proposed. However, current correntropy-based NMF only targets the low-level features without considering the intrinsic geometric distribution of the data. In this paper, we propose a new NMF algorithm that preserves local invariance by adding graph regularization to the process of max-correntropy-based matrix factorization. Meanwhile, each feature can learn a corresponding kernel from the data. Experimental results on Caltech101 and Caltech256 show the benefits of this combination over other NMF algorithms for unsupervised image clustering.
I. INTRODUCTION
Given a collection of images, clustering algorithms attempt to group the dataset into multiple clusters such that images in the same cluster are semantically similar to each other. In this process, a good feature extraction method is vital to clustering performance. Essentially, clustering algorithms (e.g. k-means) and classification algorithms (e.g. support vector machines [1]–[4]) map low-level image features to semantic information. These algorithms have a variety of applications in different areas [5]–[19]. If the extracted image features reflect the latent semantic concepts, we expect them to boost clustering/classification performance.
Recently, non-negative matrix factorization (NMF) has proven to be a powerful matrix factorization tool for data representation. Matrix factorization decomposes the original matrix X into multiple low-rank matrices such that their product approximates X. In NMF, X is decomposed into two non-negative matrices, H (the basis matrix) and W (the coefficient matrix). The vectors in H span a latent semantic space in which each basis vector defines a semantic
topic. Unlike other matrix factorization methods, such as singular value decomposition, that interpret the data as both additive and subtractive combinations of semantics, NMF allows only additive relationships. This constraint has proved to be closer to the way humans perceive and understand data [20]–[22]. Based on this methodology, we can map low-level image features into additive combinations of latent semantics for clustering.
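For reference, the conventional $\ell_2$-based NMF objective described above can be written down explicitly. This is standard background from the NMF literature, not the objective proposed in this paper, and the dimension convention (columns of $\mathbf{X}$ as samples) is assumed here for illustration:

\[
\min_{\mathbf{H} \ge 0,\ \mathbf{W} \ge 0} \ \left\| \mathbf{X} - \mathbf{H}\mathbf{W} \right\|_F^2 ,
\]

where $\mathbf{X} \in \mathbb{R}_{+}^{m \times n}$ stacks the $n$ data points as columns, $\mathbf{H} \in \mathbb{R}_{+}^{m \times k}$ holds the $k$ non-negative basis vectors, and $\mathbf{W} \in \mathbb{R}_{+}^{k \times n}$ gives the non-negative coefficients of each data point in that basis.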
Extensive work has been done to investigate NMF algorithms for different clustering tasks. Generally, the matrix decomposition is performed by minimizing the error between the original and the reconstructed matrix using the $\ell_2$ distance or the KL divergence [23]–[25]. One concern is that these are linear similarity measures, which may not be suitable for data with nonlinear structure, such as images [26]. A possible solution is to use a nonlinear similarity measure to model the error (e.g. kernelized nonlinear mapping [27], [28]). Among these nonlinear methods, we are especially interested in NMF based on the maximum correntropy criterion (MCC). Correntropy is a generalized nonlinear similarity measure between two variables. MCC-based methods have proved effective in many areas, e.g. cancer clustering [29], face recognition [30] and software defect prediction [31]. Another approach is to preserve the geometric structure of the data based on the manifold assumption about the data distribution. To be more specific, the authors in [32], [33] exploit local invariance and encode such geometric information by constructing a nearest neighbor graph. Thus, at least two factors are included in modeling this process: the distance between X and HW based on the $\ell_2$ distance (or KL divergence) in the low-level feature space, and the graph regularization. Furthermore, the authors in [34] incorporate such intrinsic geometric information on multiple manifolds.
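To make these two ingredients concrete, the sketch below (in Python with NumPy; the variable names, the Gaussian kernel width sigma, and the neighborhood size k are illustrative assumptions, not this paper's notation) computes an empirical Gaussian-kernel correntropy between a data matrix and its reconstruction, and a graph regularizer of the form Tr(W L W^T) built from a symmetrized k-nearest-neighbor graph over the samples:

import numpy as np

def correntropy(X, R, sigma=1.0):
    # Empirical correntropy between X and its reconstruction R:
    # average Gaussian kernel value over all matrix entries.
    diff = X - R
    return np.mean(np.exp(-(diff ** 2) / (2.0 * sigma ** 2)))

def knn_graph_laplacian(X, k=5):
    # Unnormalized Laplacian L = D - A of a symmetrized k-NN graph
    # built over the columns of X (one column per sample).
    n = X.shape[1]
    sq = np.sum(X ** 2, axis=0)
    D2 = sq[:, None] + sq[None, :] - 2.0 * (X.T @ X)   # pairwise squared distances
    A = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(D2[i])[1:k + 1]               # skip the point itself
        A[i, nbrs] = 1.0
    A = np.maximum(A, A.T)                              # symmetrize the adjacency
    return np.diag(A.sum(axis=1)) - A

def graph_regularizer(W, L):
    # Smoothness penalty Tr(W L W^T): nearby samples should get
    # nearby coefficient vectors (columns of W).
    return np.trace(W @ L @ W.T)

# Toy usage with random non-negative data and a random factorization.
rng = np.random.default_rng(0)
X = rng.random((100, 60))    # 100 low-level features, 60 images
H = rng.random((100, 10))    # basis matrix
W = rng.random((10, 60))     # coefficient matrix
L = knn_graph_laplacian(X, k=5)
print(correntropy(X, H @ W, sigma=0.5), graph_regularizer(W, L))

In graph regularized NMF of the kind discussed in [32], [33], a penalty of this Tr(W L W^T) form is added to the reconstruction objective; the sketch only illustrates the quantities involved, not the specific update rules derived later in this paper.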
In this paper, we propose a graph regularized NMF algorithm based on the maximum correntropy criterion for unsupervised image clustering. We leverage MCC to properly model the errors in the low-level feature space. Furthermore, the graph regularization preserves the intrinsic geometric information during the factorization process. To our
knowledge, this is the first work that combines MCC and