A Survey of Locally Consistent Low-Rank Representation Methods for Multi-Subspace Image Clustering


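Locally consistent low-rank representation for image clustering is an effective approach to high-dimensional data. It is particularly suited to fields such as computer vision, pattern recognition and bioinformatics, where data tend to be highly complex and expensive to store. High dimensionality raises the computational cost and memory footprint of algorithms, and performance often degrades when the data are noisy or when the number of samples is small relative to the dimension of the data space.

Subspace clustering (SC) methods were developed to address these problems. They group the data, assign each group to its own subspace, and find a low-dimensional embedding for each group. This idea has led to a family of SC algorithms that can be roughly divided into four categories:

1. **Graph-based methods**: These build a graph from the similarities between data points and then identify subspaces with community detection or spectral clustering. Self-Organizing Map (SOM) and Neighborhood Embedding are given as representatives of this strategy.
2. **Matrix factorization-based methods**: These treat the data as a matrix and use factorizations such as singular value decomposition (SVD) or nonnegative matrix factorization (NMF) to uncover a latent low-rank structure. They look for a set of basis matrices such that the data can be approximated as a linear combination of them. Common examples are Low-Rank Representation (LRR) and Robust Principal Component Analysis (RPCA).
3. **Manifold learning and locality-preserving methods**: These assume that the data form a low-dimensional manifold in the high-dimensional space and cluster by capturing local consistency between data points. Laplacian Eigenmaps and Isomap build on this idea.
4. **Sparse coding-based strategies**: These rely on sparse representation and seek a solution in which each data point is expressed with only a few basis vectors. Typical examples are Sparse Subspace Clustering (SSC) and its variants, such as Sparse Subspace Clustering with Norm Constraints (SSCAN).

Locally consistent low-rank representation falls within this taxonomy. It emphasizes low-rank relationships and local consistency among data points inside a subspace. Compared with a global low-rank model it is more flexible: the representation is required to be low-rank within each subspace, while no such structure is imposed across points from different subspaces. This is especially useful for multi-modal data, such as the motions of different objects in a video sequence, and for data sets drawn from mixed distributions, because each subspace can correspond to a different data characteristic or behavior pattern.

Locally consistent low-rank representation for image clustering is an important research branch of modern data mining and machine learning. Its core idea is to exploit the intrinsic low-rank property of the data together with local structure information to cluster high-dimensional data, improving the robustness and accuracy of the algorithm. A deeper understanding and further optimization of these methods can lead to more efficient and more accurate tools for image analysis and processing.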

[…] Method of Multipliers (ADMM) [7] for solving problem (5).

By introducing some auxiliary variables, problem (5) is converted to the following equivalent problem:

$$\min_{W,E,Z,J,K}\ \|J\|_* + \lambda\|E\|_{2,1} + \tau\,\mathrm{Tr}(ZLZ^{T}) \quad \text{s.t.}\quad X = XWZ + E,\ \ Z = J,\ \ W = K. \tag{6}$$

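This problem can be solved by the ADMM method, which minimizes the following augmented Lagrangian function:

$$\begin{aligned} O(W,E,Z,J,K) ={}& \|J\|_* + \lambda\|E\|_{2,1} + \tau\,\mathrm{Tr}(ZLZ^{T}) \\ &+ \mathrm{Tr}[Y_1^{T}(X - XWZ - E)] + \mathrm{Tr}[Y_2^{T}(Z - J)] + \mathrm{Tr}[Y_3^{T}(W - K)] \\ &+ \frac{\mu}{2}\left(\|X - XWZ - E\|_F^2 + \|Z - J\|_F^2 + \|W - K\|_F^2\right), \end{aligned} \tag{7}$$

where $\mu$ is a penalty parameter, and $Y_1$, $Y_2$ and $Y_3$ are the Lagrange multipliers.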

Then, we use the alternating minimization strategy to update $W$, $E$, $Z$, $J$ and $K$ sequentially, i.e. updating one of the five variables with the other four being fixed.

Updating J

The above problem is unconstrained, so minimizing Eq. (7) with respect to $J$ gives the following problem:

$$\min_{J}\ \|J\|_* + \mathrm{Tr}[Y_2^{T}(Z - J)] + \frac{\mu}{2}\|Z - J\|_F^2. \tag{8}$$

Eq. (8) can be further reformulated as:

$$\min_{J}\ \frac{1}{\mu}\|J\|_* + \frac{1}{2}\left\|J - \left(Z + \frac{Y_2}{\mu}\right)\right\|_F^2, \tag{9}$$

which can be solved by the well-known singular value thresholding (SVT) operator.
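The SVT step for the J update can be sketched in NumPy as follows. This is a minimal illustration, assuming the subproblem has the form $\min_J \frac{1}{\mu}\|J\|_* + \frac{1}{2}\|J - (Z + Y_2/\mu)\|_F^2$, with $Y_2$ the multiplier of the constraint $Z = J$. The function names `svt` and `update_J` are illustrative and do not come from the paper.

```python
import numpy as np

def svt(M, thresh):
    """Singular value thresholding.

    Returns the minimizer of thresh * ||J||_* + 0.5 * ||J - M||_F^2,
    obtained by soft-thresholding the singular values of M.
    """
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    s_shrunk = np.maximum(s - thresh, 0.0)  # shrink each singular value toward zero
    return (U * s_shrunk) @ Vt              # scale columns of U, then recombine

def update_J(Z, Y2, mu):
    """J update, assuming the subproblem form stated in the lead-in."""
    return svt(Z + Y2 / mu, 1.0 / mu)
```

Singular values below the threshold are set to zero, which is what drives J toward low rank as the iterations proceed.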

Updating Z

To update $Z$, the minimization of Eq. (7) with respect to $Z$ can be written as

$$\min_{Z}\ \tau\,\mathrm{Tr}(ZLZ^{T}) + \mathrm{Tr}[Y_1^{T}(X - XWZ - E)] + \mathrm{Tr}[Y_2^{T}(Z - J)] + \frac{\mu}{2}\left(\|X - XWZ - E\|_F^2 + \|Z - J\|_F^2\right).$$

The above problem is a smooth convex program. Differentiating the above function with respect to $Z$ and setting the derivative to zero, the optimal solution $Z^*$ satisfies

$$\mu\,(W^{T}X^{T}XW + I)\,Z + 2\tau\,ZL = W^{T}X^{T}Y_1 - Y_2 + \mu\left[W^{T}X^{T}(X - E) + J\right]. \tag{10}$$

Eq. (10) is a standard Sylvester equation related to variable $Z$.
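A Sylvester equation of the form $AZ + ZB = Q$ with symmetric $A$ and $B$ can be solved through two eigendecompositions. The sketch below assumes that the Z update takes the form $A = \mu(W^T X^T X W + I)$, $B = 2\tau L$ and $Q = W^T X^T Y_1 - Y_2 + \mu[W^T X^T (X - E) + J]$, where $L$ is a symmetric graph Laplacian and the factor 2 comes from differentiating $\tau\,\mathrm{Tr}(ZLZ^T)$. The names `solve_sylvester_sym` and `update_Z` are hypothetical.

```python
import numpy as np

def solve_sylvester_sym(A, B, Q):
    """Solve A Z + Z B = Q for symmetric A (k x k) and symmetric B (n x n)."""
    a, U = np.linalg.eigh(A)                # A = U diag(a) U^T
    b, V = np.linalg.eigh(B)                # B = V diag(b) V^T
    Q_rot = U.T @ Q @ V                     # rotate into the two eigenbases
    Z_rot = Q_rot / (a[:, None] + b[None, :])  # decoupled scalar equations
    return U @ Z_rot @ V.T

def update_Z(X, W, E, J, L, Y1, Y2, mu, tau):
    """Z update under the assumed form of the stationarity condition."""
    XW = X @ W
    A = mu * (XW.T @ XW + np.eye(W.shape[1]))  # positive definite for mu > 0
    B = 2.0 * tau * L                          # positive semidefinite for a Laplacian
    Q = XW.T @ Y1 - Y2 + mu * (XW.T @ (X - E) + J)
    return solve_sylvester_sym(A, B, Q)
```

Because $A$ is positive definite and $B$ is positive semidefinite, every denominator `a[i] + b[j]` is strictly positive. A general-purpose alternative for non-symmetric coefficients is `scipy.linalg.solve_sylvester`.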

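Updating W

To update $W$, we need to solve

$$\min_{W}\ \mathrm{Tr}[Y_1^{T}(X - XWZ - E)] + \mathrm{Tr}[Y_3^{T}(W - K)] + \frac{\mu}{2}\left(\|X - XWZ - E\|_F^2 + \|W - K\|_F^2\right).$$

As in the update of $Z$, the optimal solution $W^*$ should satisfy

$$W + X^{T}XWZZ^{T} = \frac{1}{\mu}\left(X^{T}Y_1Z^{T} - Y_3\right) + \left(X^{T}XZ^{T} - X^{T}EZ^{T} + K\right). \tag{11}$$

The above equation is a Stein equation in the variable $W$.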
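A Stein-type equation $W + AWB = C$ with symmetric positive semidefinite $A$ and $B$ decouples in the eigenbases of $A$ and $B$. The sketch below assumes that the W update has this form with $A = X^T X$, $B = ZZ^T$ and $C = (X^T Y_1 Z^T - Y_3)/\mu + X^T (X - E) Z^T + K$. The names `solve_stein_sym` and `update_W` are hypothetical.

```python
import numpy as np

def solve_stein_sym(A, B, C):
    """Solve W + A W B = C for symmetric PSD A (n x n) and B (k x k)."""
    a, U = np.linalg.eigh(A)                # A = U diag(a) U^T
    b, V = np.linalg.eigh(B)                # B = V diag(b) V^T
    C_rot = U.T @ C @ V
    # In the rotated basis each entry satisfies (1 + a_i * b_j) * w_ij = c_ij.
    W_rot = C_rot / (1.0 + a[:, None] * b[None, :])
    return U @ W_rot @ V.T

def update_W(X, Z, E, K, Y1, Y3, mu):
    """W update under the assumed form of the stationarity condition."""
    A = X.T @ X
    B = Z @ Z.T
    C = (X.T @ Y1 @ Z.T - Y3) / mu + X.T @ (X - E) @ Z.T + K
    return solve_stein_sym(A, B, C)
```

Since the eigenvalues of $X^T X$ and $ZZ^T$ are nonnegative, the factors `1 + a[i] * b[j]` stay at or above one up to rounding, so the division is well conditioned.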

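Updating E

Optimizing Eq. (7) with respect to $E$ is converted to the following problem:

$$\min_{E}\ \lambda\|E\|_{2,1} + \mathrm{Tr}[Y_1^{T}(X - XWZ - E)] + \frac{\mu}{2}\|X - XWZ - E\|_F^2.$$

This problem can be written in the following form:

$$\min_{E}\ \frac{\lambda}{\mu}\|E\|_{2,1} + \frac{1}{2}\|E - \tilde{Y}\|_F^2, \tag{12}$$

where $\tilde{Y} = X - XWZ + Y_1/\mu$. This problem can be solved column by column via the following formula:

$$E(:,i) = \begin{cases} \dfrac{\|\tilde{y}_i\|_2 - \lambda/\mu}{\|\tilde{y}_i\|_2}\,\tilde{y}_i, & \text{if } \lambda/\mu < \|\tilde{y}_i\|_2, \\[2mm] 0, & \text{otherwise}, \end{cases} \tag{13}$$

where $\tilde{y}_i$ denotes the $i$-th column of $\tilde{Y}$.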
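The column-wise rule in Eq. (13) is the proximal operator of the $\ell_{2,1}$ norm. A minimal NumPy sketch, assuming the E subproblem reduces to $\min_E \alpha\|E\|_{2,1} + \frac{1}{2}\|E - Q\|_F^2$ with $Q = X - XWZ + Y_1/\mu$ and $\alpha = \lambda/\mu$. Here `lam` is an assumed name for the weight on the $\ell_{2,1}$ term, and `l21_shrink` and `update_E` are illustrative names.

```python
import numpy as np

def l21_shrink(Q, alpha):
    """Column-wise shrinkage.

    Each column q of Q is scaled by max(||q||_2 - alpha, 0) / ||q||_2,
    so columns with norm at most alpha become exactly zero.
    """
    norms = np.linalg.norm(Q, axis=0)
    # The small floor only guards against 0/0 for all-zero columns;
    # the numerator is already zero in that case.
    scale = np.maximum(norms - alpha, 0.0) / np.maximum(norms, 1e-12)
    return Q * scale

def update_E(X, W, Z, Y1, mu, lam):
    """E update under the assumed subproblem form."""
    return l21_shrink(X - X @ W @ Z + Y1 / mu, lam / mu)
```

Zeroing whole columns matches the role of the $\ell_{2,1}$ norm as a sample-specific corruption model: a column of $E$ is either removed entirely or kept with reduced magnitude.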

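Updating K and Lagrangian multipliers

It is easy to get the iteration of $K$:

$$K = W + \frac{Y_3}{\mu}. \tag{14}$$

After updating the variables, the Lagrange multipliers are updated by

$$\begin{aligned} Y_1 &= Y_1 + \mu\,(X - XWZ - E), \\ Y_2 &= Y_2 + \mu\,(Z - J), \\ Y_3 &= Y_3 + \mu\,(W - K). \end{aligned} \tag{15}$$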
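The closing steps of one ADMM iteration, Eqs. (14) and (15), can be sketched as below. The sketch assumes that $K$ enters the augmented Lagrangian only through the constraint $W = K$, which gives $K = W + Y_3/\mu$, and that each multiplier takes a gradient ascent step of size $\mu$ on its constraint residual. The function names are illustrative.

```python
import numpy as np

def update_K(W, Y3, mu):
    """K update, assuming K appears only in the constraint W = K."""
    return W + Y3 / mu

def update_multipliers(X, W, Z, E, J, K, Y1, Y2, Y3, mu):
    """Dual ascent on the three equality constraints."""
    Y1 = Y1 + mu * (X - X @ W @ Z - E)  # constraint X = XWZ + E
    Y2 = Y2 + mu * (Z - J)              # constraint Z = J
    Y3 = Y3 + mu * (W - K)              # constraint W = K
    return Y1, Y2, Y3
```

When all three constraints hold exactly, the residuals vanish and the multipliers stop changing, which is a natural point to check for convergence.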

4 EXPERIMENTS

In this section, we conduct several experiments on image data to demonstrate the efficacy of our proposed approach. We also compare the clustering results obtained by CLRR and LCLRR with other related methods, including LRR [8], K-means, Principal Component Analysis (PCA), Graph regularized Sparse Coding (GSC) [15] and Graph regularized Nonnegative Matrix Factorization (GNMF) [16]. For a fair comparison, K-means is applied for clustering after dimension reduction by all algorithms. All numerical experiments are performed on a personal computer with a 3.10 GHz processor and 4 GB of memory. This computer runs Windows 8 with Matlab 8.0 installed.

Accuracy and Normalized Mutual Information (NMI) [15,16], two standard clustering metrics, are used to measure the performance. For both measures, a larger value indicates better clustering performance. The result is evaluated by comparing the cluster label of each sample with the label provided by the data set. Our empirical studies on clustering were carried out on the image database COIL20.
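NMI can be computed directly from the contingency table of true and predicted labels. The sketch below uses one common variant that divides the mutual information by the larger of the two label entropies; whether this matches the exact definition used in [15,16] is an assumption. Accuracy additionally requires an optimal matching between cluster labels and class labels, which is not shown here. The function name `nmi` is illustrative.

```python
import numpy as np

def nmi(labels_true, labels_pred):
    """Normalized mutual information, MI / max(H_true, H_pred)."""
    # Map arbitrary label values to 0..m-1 indices.
    t = np.unique(np.asarray(labels_true).ravel(), return_inverse=True)[1]
    p = np.unique(np.asarray(labels_pred).ravel(), return_inverse=True)[1]
    n = t.size

    # Joint distribution from the contingency table.
    counts = np.zeros((t.max() + 1, p.max() + 1))
    np.add.at(counts, (t, p), 1.0)
    pxy = counts / n
    px = pxy.sum(axis=1)
    py = pxy.sum(axis=0)

    # Mutual information, skipping empty cells (0 * log 0 is taken as 0).
    nz = pxy > 0
    mi = np.sum(pxy[nz] * np.log(pxy[nz] / np.outer(px, py)[nz]))

    h_true = -np.sum(px * np.log(px))
    h_pred = -np.sum(py * np.log(py))
    denom = max(h_true, h_pred)
    # Convention assumed here: two trivial one-cluster labelings agree perfectly.
    return float(mi / denom) if denom > 0 else 1.0
```

The score is invariant to a renaming of the cluster labels, so predicted labels do not need to be aligned with the ground-truth classes before calling it.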

2016 28th Chinese Control and Decision Conference (CCDC) 3963
