概念分解与局部流形正则化的多视图聚类新方法

PDF格式 | 339KB | 更新于2024-08-28 | 30 浏览量 | 举报

在当前的信息时代，多视图数据已经成为许多领域研究的重要组成部分，如计算机视觉、生物信息学、社交网络分析等。多视图聚类的目标是通过整合不同视角或来源的数据，挖掘出更精确的潜在结构和类别信息，从而提高聚类性能。本文介绍了一种新颖的多视图聚类方法——基于概念分解和局部流形正则化的多视图概念聚类（Multi-View Concept Clustering via Concept Factorization with Local Manifold Regularization）。首先，概念分解是一种强大的数据分析工具，它将复杂的数据表示转化为一组简单的概念或者特征向量，有助于揭示数据的内在结构。在多视图聚类中，作者提出通过概念分解来提取各视图中的共同特征，形成一个共享的共识表示，这有助于克服单个视图可能存在的噪声和冗余，提升整体的聚类效果。局部流形正则化则是为了保护数据在高维空间中的局部几何结构。在多模态数据中，数据点往往不是均匀分布的，而是沿着低维的局部结构聚集。因此，将局部流形信息融入聚类过程，可以更好地捕捉数据点之间的相似性，避免了全局优化可能导致的局部最优问题。该方法的核心是将局部流形正则化与概念分解相结合，这既保留了数据的局部特性，又推动了不同视图之间的融合。同时，通过自动学习每个视图的权重，算法能够自适应地衡量各个视角的重要性，确保了多视图融合的有效性和一致性。此外，为了使多视图融合更为有意义，作者设计了一种加权归一化策略，它确保了在形成共同共识表示的过程中，每个视图的贡献是均衡且有意义的。这种策略有助于提高聚类结果的稳定性，使得最终的聚类更加准确和可靠。整个算法采用迭代优化的方式，通过不断更新和调整模型参数，使得概念分解的结果逐渐逼近真实数据的内在结构，从而达到理想的多视图聚类效果。这种方法具有很好的理论基础和实践潜力，对于处理复杂多源数据集的聚类任务具有重要意义，可以显著提升聚类的精度和鲁棒性。

Multi-View Clustering via Concept Factorization with

Local Manifold Regularization

Hao Wang, Yan Yang

∗

and Tianrui Li

School of Information Science and Technology

Southwest Jiaotong University, Chengdu 611756, China

∗

Corresponding Author, email: yyang@swjtu.edu.cn

Abstract—Real-world datasets often have representations in

multiple views or come from multiple sources. Exploiting

consistent or complementary information from multi-view data,

multi-view clustering aims to get better clustering quality

rather than relying on the individual view. In this paper, we

propose a novel multi-view clustering method called multi-view

concept clustering based on concept factorization with local

manifold regularization, which drives a common consensus

representation for multiple views. The local manifold regular-

ization is incorporated into concept factorization to preserve

the locally geometrical structure of the data space. Moreover,

the weight of each view is learnt automatically and a co-

normalized approach is designed to make fusion meaningful

in terms of driving the common consensus representation. An

iterative optimization algorithm based on the multiplicative

rules is developed to minimize the objective function. Experi-

mental results on nine reality datasets involving different ﬁelds

demonstrate that the proposed method performs better than

several state-of-the-art multi-view clustering methods.

I. INTRODUCTION

Many datasets in real life naturally comprise of different

representations in multiple views or come from multiple

sources, which are called multi-view data. Some examples

are illustrated in Fig. 1. Among these examples, each in-

dividual view sufﬁces for mining knowledge on its own,

but multiple views provide more information which may

improve the performance and quality (e.g., clustering). How-

ever, the main challenge is how to integrate these informa-

tion and give a compatible solution across all views.

Multi-view clustering provides a hopeful way to cluster

data with multiple views, which has attracted more and more

attentions in recent years [1], [2], [3], [4]. Among these

methods, one of the most widely used technique is nonneg-

ative matrix factorization (NMF) [5]. A joint nonnegative

matrix factorization process with the consistency constraint

was formulated in [3], which pushed each view’s solution

towards a common consensus. Besides, one method based

on weighted NMF with L

2,1

regularization was proposed

to learn a latent representation for all views and generate

a consensus matrix in [6]. A feature extraction method via

NMF with local graph regularization for multi-view data

was presented in [7]. Then, the extracted features were used

to cluster the data. However, NMF is not appropriate to

Figure 1. Multi-view data

handle the negative data. As an extension of NMF, concept

factorization (CF) was proposed in [8], which is adapted

to deal with the data containing negative and also is easily

performed in the kernel space.

Besides, both NMF and CF only consider the global

structure of the data space and fail to preserve the locally

geometrical structure. To handle this issue, the graph regu-

larized nonnegative matrix factorization [9] and the locally

consistent concept factorization (LCCF) [10] were proposed

by imposing the manifold regularization on NMF and CF

formulation respectively for single-view data. While, LCCF

does not give a consensus solution for multi-view data.

Additionally, all views were treated equally and the differ-

ence of each view was not considered in such methods [1],

[2], [3], which may degrade the clustering quality. In this

paper, a novel multi-view clustering method called multi-

view concept clustering (MVCC) is proposed. It incorporates

CF, the local manifold regularization and the consistency

constraint into a uniﬁed framework, which learns a solution

for each view as well as drives a consensus solution across

all the views. The co-normalized approach is designed

and the weight of each view is respected, which make

the solution of every view compatible and the consensus

meaningful during the factorization process. Besides, an

alternating iterative optimization algorithm is developed to

solve the proposed objective function.

The rest of this paper is organized as follows. In Section

II, a presentation of CF and the local manifold regularization

2016 IEEE 16th International Conference on Data Mining

DOI 10.1109/ICDM.2016.34

1245

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38576922

粉丝: 6

概念分解与局部流形正则化的多视图聚类新方法

集成流形正则化多视图聚类生成模型：挖掘非线性结构的有效方法

矩阵三分解与流形正则化的零镜头学习算法

多视图学习的MATLAB实现：流形正则化与向量值RKHS框架

Graph-Multi-NMF-Feature-Clustering:通过局部图正则化的多视图非负矩阵分解进行特征提取

人脸图像特征提取matlab代码-multi_view-learning-summarize:我正在做一些关于多视图学习的研究，我想总结一下我

用matlab对图像进行频谱分析代码-multiview_learning:multiview_learning

局部图正则化多视图NMF特征提取方法研究

多视图正则化矩阵分解算法：一种有效处理多特征数据的聚类方法

大规模数据下非负矩阵分解的单通道多视图聚类优化算法

MATLAB图像频谱分析与多视图学习方法

最新资源