非负数据的多任务多视图聚类框架与算法

62 浏览量更新于2024-08-26 收藏 2.55MB PDF 举报

"这篇研究论文探讨了非负数据的多任务多视图聚类问题，提出了一种集成在视图内任务聚类、多视图关系学习和多任务关系学习的MTMVC框架，并设计了一个特定算法来优化该框架。实验结果证明了所提算法在处理多任务多视图数据聚类时优于传统的多任务或多视图聚类算法。" 在机器学习领域，聚类是一种无监督学习方法，旨在发现数据内在的结构和群体。多任务学习（Multi-Task Learning）和多视图学习（Multi-View Learning）是两种增强聚类效果的重要技术。多任务学习假设不同的任务之间存在相关性，通过共享信息来提高各个任务的学习性能；多视图学习则考虑同一数据可以从多个角度或特征表示（视图）进行分析，每种视图提供不同的信息。这篇论文特别关注的是非负数据的多任务多视图聚类问题。非负数据通常出现在诸如文档、图像或音频等场景，其中每个样本的特征值都是非负的。例如，在文档分析中，词频通常是非负的，表示单词在文档中出现的次数。作者提出的MTMVC框架综合了多任务和多视图的优点，旨在处理那些任务间紧密相关且每个任务都可以从多个视图分析的情况。这个框架包括三部分：在视图内任务聚类，旨在利用每个视图内的信息进行任务聚类；多视图关系学习，目的是发现不同视图之间的相关性和一致性；以及多任务关系学习，用于挖掘不同任务之间的共性。为了优化MTMVC框架，他们开发了一个特定的优化算法。这个算法可能基于梯度下降或其他优化策略，以有效地平衡和协同这三个组件，从而最大化聚类的准确性和任务间的相关性。实验部分对比了他们的方法与单独的多任务聚类算法和多视图聚类算法，证明了在处理多任务多视图数据时，所提的MTMVC框架能够提供更优的聚类结果。这表明，结合多任务和多视图的信息可以显著提升非负数据聚类的性能，尤其在那些任务和视图都具有复杂关联性的场景下。这篇论文为非负数据的聚类提供了新的视角和解决方案，对于理解和处理复杂数据集的聚类问题具有重要的理论和实践意义。

Multi-Task Multi-View Clustering for Non-Negative Data

Xianchao Zhang and Xiaotong Zhang and Han Liu

School of Software

Dalian University of Technology

Dalian 116620, China

xczhang@dlut.edu.cn, zxt.dut@hotmail.com, liu.han.dut@gmail.com

Abstract

Multi-task clustering and multi-view clustering

have severally found wide applications and re-

ceived much attention in recent years. Neverthe-

less, there are many clustering problems that in-

volve both multi-task clustering and multi-view

clustering, i.e., the tasks are closely related and

each task can be analyzed from multiple views. In

this paper, for non-negative data (e.g., documents),

we introduce a multi-task multi-view clustering

(MTMVC) framework which integrates within-

view-task clustering, multi-view relationship learn-

ing and multi-task relationship learning. We then

propose a speciﬁc algorithm to optimize the MT-

MVC framework. Experimental results show the

superiority of the proposed algorithm over either

multi-task clustering algorithms or multi-view clus-

tering algorithms for multi-task clustering of multi-

view data.

1 Introduction

Multi-task clustering improves individual clustering perfor-

mance by learning the relationship among related tasks.

Multi-view clustering makes use of the consistency among

different views to achieve better performance. Both multi-

task clustering and multi-view clustering have severally

found wide applications and received much attention in re-

cent years. Nevertheless, there are many practical problems

that involve both multi-task clustering and multi-view clus-

tering, i.e., the tasks are closely related and each task can

be analyzed from multiple views. For example, the tasks for

clustering the web pages from four universities are four re-

lated tasks. The four tasks all have word features in the main

texts, they also have many other features, such as the words

in the hyperlinks pointing to the web pages, and the words in

the titles of the web pages. For another example, the tasks for

clustering the web images collected from Chinese web sites

and English web sites are two related tasks. The two tasks

both have visual features in the images, they also have word

features in the surrounding texts in Chinese and English re-

spectively. To tackle the clustering problem of such data sets,

existing algorithms can only utilize limited information, i.e.,

multi-view clustering algorithms only use the information of

the views in a single task, multi-task clustering algorithms

only exploit the mutual information shared by all the related

tasks from a single view. However, we can get better per-

formance if both the multi-task and multi-view information

could be utilized.

Recently, multi-task multi-view learning algorithms, which

learn multiple related tasks with multi-view data, have been

proposed. The graph-based framework in

[

He and Lawrence,

2011

]

takes full advantages of both the feature heterogene-

ity and task heterogeneity. Within each task, the consistency

among different views is obtained by requiring them to pro-

duce the same classiﬁcation function, and across different

tasks, the relationship is established by utilizing the similar-

ity constraint on the common views. The inductive learning

framework in

[

Zhang and Huan, 2012

]

uses co-regularization

and task relationship learning, which increases the practi-

cality of multi-task multi-view learning. These methods

have demonstrated their superiorities over either multi-task

or multi-view learning algorithms. However, they all tackle

classiﬁcation. To the best of our knowledge, there is no exist-

ing approach to the multi-task multi-view clustering problem.

In this paper, we aim to deal with the multi-task multi-view

clustering of non-negative data, which arises in many appli-

cations, such as various types of documents. Based on the ob-

servation that the related tasks have both common views and

task speciﬁc views, we propose a bipartite graph based multi-

task multi-view clustering (MTMVC) framework, which con-

sists of three parts. (1) Within-view-task clustering: this part

clusters the data of each view in each task. It is the base

of the framework and mutually boosts with the other two

parts. (2) Multi-view relationship learning: this part uses

the consistency among different views to improve the clus-

tering performance. (3) Multi-task relationship learning: this

part learns the relationship among related tasks to improve

the clustering performance. We integrate the three parts into

one objective function and optimize it with a gradient as-

cent method. Because of the unitary constraints, we further

solve the optimization problem by mapping the variables to

the Stiefel manifold

[

Manton, 2002

]

. Experimental results on

several real data sets show the superiority of the proposed al-

gorithm over either multi-task clustering algorithms or multi-

view clustering algorithms for multi-task clustering of multi-

view data.

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015)

4055

下载后可阅读完整内容，剩余6页未读，立即下载

weixin_38705723

粉丝: 5
资源: 917

非负数据的多任务多视图聚类框架与算法

基于LLE和LE的异构场景下的多任务多视图聚类算法。

多视图聚类数据集mfeat

L2,1正则化加权非负矩阵分解提升不完整视图聚类效果

基于聚类分析的多源异构数据挖掘技术研究.pdf

联合非负矩阵分解在多层网络中的社区检测

无参数自动加权多图正则化非负矩阵分解与图像聚类技术

鲁棒多视图嵌入：噪声环境下的非负矩阵分解方法

图正则多视图非负矩阵分解：结构化稀疏与信息整合

AI企联系统 Ai企业级系统开心版 uniapp适配 Web+H5+微信小程序+抖音小程序+双端APP

2000d.doc

最新资源