Knowledge-Based Systems 163 (2019) 510–517
Short communication
Low-rank kernel learning for graph-based clustering
Zhao Kang, Liangjian Wen, Wenyu Chen∗, Zenglin Xu∗
School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, China
∗ Corresponding authors. E-mail addresses: cwy@uestc.edu.cn (W. Chen), zlxu@uestc.edu.cn (Z. Xu).
Article info
Article history:
Received 9 May 2018
Received in revised form 4 September 2018
Accepted 7 September 2018
Available online 26 September 2018
Keywords:
Low-rank kernel matrix
Graph construction
Multiple kernel learning
Clustering
Noise
Abstract
Constructing the adjacency graph is fundamental to graph-based clustering. Graph learning in kernel
space has shown impressive performance on a number of benchmark data sets. However, its performance
is largely determined by the chosen kernel matrix. To address this issue, multiple kernel learning
algorithms have been applied to learn an optimal kernel from a group of predefined kernels. However,
this approach can be sensitive to noise and limits the representation ability of the consensus kernel.
In contrast to existing methods, we propose to learn a low-rank kernel matrix that exploits the
similarity structure of the kernel matrix and seeks an optimal kernel in the neighborhood of the
candidate kernels. By formulating graph construction and kernel learning in a unified framework,
the graph and the consensus kernel can iteratively enhance each other. Extensive experimental results validate the efficacy of the proposed
method.
© 2018 Elsevier B.V. All rights reserved.
1. Introduction
Clustering is a fundamental and important technique in ma-
chine learning, data mining, and pattern recognition [1–3]. It aims
to divide data samples into certain clusters such that similar ob-
jects lie in the same group. It has been utilized in various do-
mains, such as image segmentation [4], gene expression analy-
sis [5], motion segmentation [6], image clustering [7], heteroge-
neous data analysis [8], document clustering [9,10], social media
analysis [11], and subspace learning [12,13]. Over the past decades,
clustering has been extensively studied and many clustering meth-
ods have been developed, such as K-means clustering [14,15],
spectral clustering [16,17], subspace clustering [18,19], hierarchi-
cal clustering [20], matrix factorization-based algorithms [21–23],
graph-based clustering [24,25], and kernel-based clustering [26].
Among them, K-means and spectral clustering are especially
popular and have been widely applied in practice.
Basically, the K-means method iteratively assigns data points
to their closest clusters and updates cluster centers. Nonetheless,
it cannot partition arbitrarily shaped clusters and is notorious for
its sensitivity to the initialization of cluster centers [27]. Later,
kernel K-means (KKM) was proposed to capture the nonlinear
structure of the data [28]. However, the user has to specify a
kernel matrix as input; that is, the user must assume a certain
shape for the data distribution, which is generally unknown.
Consequently, the performance of KKM is highly dependent on
the choice of the kernel matrix, which is a stumbling block for
the practical use of kernel methods in real applications. This issue is partially
alleviated by the multiple kernel learning (MKL) technique, which
lets an algorithm pick or combine kernels from a set of candidates
[29,30]. However, since the kernels might be corrupted by noise
and outliers contaminating the original data, the induced kernel
might still not be optimal [31]. Moreover, forcing the optimal
kernel to be a linear combination of the base kernels can limit its
representation ability. Indeed, the MKL approach sometimes
performs worse than a single kernel method [32].
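To make the kernel dependence concrete, the following minimal Python sketch (a generic baseline, not the method proposed in this paper) runs kernel K-means on a precomputed kernel matrix and forms a fixed linear combination of base kernels in the spirit of standard MKL. The weights `w` would normally be learned; here they are illustrative assumptions.

```python
import numpy as np

def kernel_kmeans(K, n_clusters, n_iter=100, seed=0):
    """Lloyd-style kernel K-means on a precomputed kernel matrix K (n x n)."""
    rng = np.random.default_rng(seed)
    n = K.shape[0]
    labels = rng.integers(n_clusters, size=n)
    for _ in range(n_iter):
        dist = np.zeros((n, n_clusters))
        for c in range(n_clusters):
            idx = np.flatnonzero(labels == c)
            if idx.size == 0:            # re-seed an empty cluster
                idx = rng.integers(n, size=1)
            # ||phi(x_i) - mu_c||^2 = K_ii - 2*mean_j K_ij + mean_jl K_jl
            dist[:, c] = (np.diag(K)
                          - 2 * K[:, idx].mean(axis=1)
                          + K[np.ix_(idx, idx)].mean())
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels

def combine_kernels(kernels, w):
    """Fixed linear combination of base kernels, as in standard MKL."""
    return sum(wi * Ki for wi, Ki in zip(w, kernels))
```

For example, `kernel_kmeans(combine_kernels([K1, K2], [0.5, 0.5]), n_clusters=3)` clusters with an equally weighted consensus of two base kernels; a different choice of kernels or weights can change the result substantially, which is exactly the sensitivity discussed above.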
Spectral clustering, another classic method, is more capable
of detecting complex data structures than many other clustering
methods [33,34]. It works by embedding the data points into a
vector space spanned by the spectrum of the affinity matrix (i.e.,
the data similarity matrix). Therefore, the quality of the similarity
graph is crucial to the performance of the spectral clustering
algorithm. Traditionally, the Gaussian kernel function is employed
to build the graph matrix. Unfortunately, how to select a proper
Gaussian parameter is an open problem [35]. Moreover, the
Gaussian kernel function is sensitive to noise and outliers.
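As a point of reference, here is a minimal sketch of plain normalized spectral clustering with a Gaussian affinity. The width `sigma` is exactly the Gaussian parameter whose selection the text describes as an open problem; this is a textbook baseline, not the graph construction proposed here.

```python
import numpy as np
from scipy.linalg import eigh
from sklearn.cluster import KMeans

def spectral_clustering(X, n_clusters, sigma=1.0):
    """Normalized spectral clustering with a Gaussian (RBF) affinity graph."""
    # Pairwise squared distances and Gaussian affinity
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-sq / (2 * sigma ** 2))
    np.fill_diagonal(W, 0)
    # Symmetric normalized Laplacian: L = I - D^{-1/2} W D^{-1/2}
    d = W.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    L = np.eye(len(X)) - D_inv_sqrt @ W @ D_inv_sqrt
    # Embed points into the space spanned by the bottom eigenvectors
    _, vecs = eigh(L)
    U = vecs[:, :n_clusters]
    U /= np.linalg.norm(U, axis=1, keepdims=True) + 1e-12
    return KMeans(n_clusters, n_init=10).fit_predict(U)
```

Rerunning this with different values of `sigma` typically yields different partitions, which illustrates why a learned graph is preferable to a hand-tuned Gaussian one.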
Recently, some advanced techniques have been developed to
construct better similarity graphs. For instance, Zhu et al. [36]
used a random forest-based method to identify discriminative fea-
tures, so that subtle and weak data affinities can be captured. More
importantly, the adaptive neighbors method [37] and the self-expression
approach [38] have been proposed to learn a graph automatically
from the data. This automatic strategy can handle data whose
structures vary in scale and density, and it often yields a
high-quality graph, as demonstrated in clustering [37,39], semi-
supervised classification [40,41], and many other tasks.
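The self-expression idea can be illustrated with a short sketch: each sample is reconstructed as a combination of the other samples, and the reconstruction coefficients serve as the affinity graph. The ridge (Frobenius-norm) penalty and the parameter `lam` below are simplifying assumptions chosen to admit a closed-form solution; the cited methods use other regularizers (e.g., sparse or low-rank penalties).

```python
import numpy as np

def self_expressive_graph(X, lam=1.0):
    """Learn an affinity graph by self-expression:
    min_Z ||X - X Z||_F^2 + lam * ||Z||_F^2, solved in closed form."""
    n = X.shape[1]                 # X has shape (features, samples)
    G = X.T @ X                    # Gram matrix of the samples
    # Closed-form ridge solution: Z = (X^T X + lam I)^{-1} X^T X
    Z = np.linalg.solve(G + lam * np.eye(n), G)
    np.fill_diagonal(Z, 0)         # heuristic: a sample should not explain itself
    # Symmetrize to obtain an undirected affinity matrix
    return 0.5 * (np.abs(Z) + np.abs(Z.T))
```

The resulting matrix can be fed directly to a spectral clustering routine in place of a hand-built Gaussian graph, which is the sense in which such graphs are learned "automatically from the data".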
In this paper, we learn the graph in kernel space. To address
the kernel dependence issue, we develop a novel method to learn