Robust Subspace Segmentation by Low-Rank Representation
Guangcan Liu†  roth@sjtu.edu.cn
Zhouchen Lin‡  zhoulin@microsoft.com
Yong Yu†  yyu@apex.sjtu.edu.cn

†Shanghai Jiao Tong University, NO. 800, Dongchuan Road, Min Hang District, Shanghai, China, 200240
‡Microsoft Research Asia, NO. 49, Zhichun Road, Hai Dian District, Beijing, China, 100190
Abstract
We propose low-rank representation (LRR) to segment data drawn from a union of multiple linear (or affine) subspaces. Given a set of data vectors, LRR seeks the lowest-rank representation among all the candidates that represent all vectors as linear combinations of the bases in a dictionary. Unlike the well-known sparse representation (SR), which computes the sparsest representation of each data vector individually, LRR aims at finding the lowest-rank representation of a collection of vectors jointly. LRR better captures the global structure of data, giving a more effective tool for robust subspace segmentation from corrupted data. Both theoretical and experimental results show that LRR is a promising tool for subspace segmentation.
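Schematically, writing X = [x_1, x_2, ..., x_n] for the matrix of data vectors and A for a dictionary (symbols introduced here only to illustrate the abstract; the formal setup is developed in the body of the paper), the representation described above amounts to a nuclear-norm program of the form

    min_Z ||Z||_*   s.t.   X = AZ,

where ||Z||_* denotes the nuclear norm of Z (the sum of its singular values), a convex surrogate of rank(Z), and each column of Z holds the coefficients that express the corresponding column of X as a linear combination of the bases in A.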
1. Introduction
In scientific data analysis and system engineering, one usually needs a parametric model to characterize a given set of data. To this end, linear models such as linear subspaces are possibly the most common choice, mainly because they are easy to compute and are also often effective in real applications. (There is no loss of generality in assuming that the subspaces are linear, i.e., they all contain the origin. For the affine subspaces that do not contain the origin, we can always increase the dimension of the ambient space by one and identify each affine subspace with the linear subspace that it spans. So we always use the term "subspace" to denote "linear subspace" and "affine subspace" in this work.) So the
subspaces have been gaining much attention in recent years. For example, the hotly discussed matrix completion problem (Candès & Recht, 2009; Keshavan et al., 2009; Candès et al., 2009) is essentially based on the hypothesis that the data is drawn from a low-rank subspace. However, a given data set is seldom well
described by a single subspace. A more reasonable model is to consider data as lying near several subspaces, leading to the challenging problem of subspace segmentation. Here, the goal is to segment (or cluster) data into clusters with each cluster corresponding to a subspace. Subspace segmentation is an important data clustering problem as it arises in numerous research areas, including machine learning (Lu & Vidal, 2006), computer vision (Ho et al., 2003), image processing (Fischler & Bolles, 1981) and system identification.
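Stated a little more formally (with notation used here only for illustration), one observes data vectors X = [x_1, x_2, ..., x_n] sampled from a union of k subspaces S_1 ∪ S_2 ∪ ... ∪ S_k, and the goal is to partition the columns of X into k groups such that each group contains exactly the samples lying in (or near, when noise is present) one of the S_i. For affine subspaces, the lifting x ↦ (x, 1) discussed above reduces the problem to the linear case.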
Previous Work. According to their mechanisms of representing the subspaces, existing works can be roughly divided into four main categories: mixture of Gaussians, factorization, algebraic, and compressed sensing.
In statistical learning, mixed data are typically modeled as a set of independent samples drawn from a mixture of probabilistic distributions. As a single subspace can be well modeled by a Gaussian distribution, it is straightforward to assume that each probabilistic distribution is Gaussian, hence the name mixture of Gaussians model. The problem of segmenting the data is then converted into a model estimation problem. The estimation can be performed either by using the Expectation Maximization (EM) algorithm to find a maximum likelihood estimate, as done in (Gruber & Weiss, 2004), or by iteratively finding a min-max estimate, as adopted by K-subspaces (Ho et al., 2003) and Random Sample Consensus (RANSAC) (Fischler & Bolles, 1981).
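To make the alternating mechanism behind K-subspaces concrete, the following NumPy sketch (an illustration written for this discussion, not the exact procedure of Ho et al. (2003); the function name, signature, and the assumption that all subspaces share a common dimension are ours) repeatedly fits a fixed-dimension basis to each cluster via SVD and reassigns every sample to the subspace with the smallest projection residual:

import numpy as np

def k_subspaces(X, k, dim, n_iter=50, seed=0):
    # X: (D, N) data matrix, one sample per column; k subspaces of assumed dimension dim.
    rng = np.random.default_rng(seed)
    D, N = X.shape
    labels = rng.integers(k, size=N)              # random initial segmentation
    for _ in range(n_iter):
        bases = []
        for j in range(k):
            Xj = X[:, labels == j]
            if Xj.shape[1] == 0:                  # re-seed an empty cluster
                Xj = X[:, rng.integers(N, size=dim)]
            U, _, _ = np.linalg.svd(Xj, full_matrices=False)
            bases.append(U[:, :dim])              # PCA basis of cluster j
        # distance of every sample to every fitted subspace (projection residual)
        residuals = np.stack(
            [np.linalg.norm(X - B @ (B.T @ X), axis=0) for B in bases])
        new_labels = residuals.argmin(axis=0)     # reassignment step
        if np.array_equal(new_labels, labels):    # converged
            break
        labels = new_labels
    return labels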
These methods are sensitive to noise and outliers. So