Tensor RPCA by Bayesian CP Factorization with Complex Noise

Qiong Luo^{1,2}, Zhi Han^{1,*}, Xi’ai Chen^{1,2}, Yao Wang^{3}, Deyu Meng^{3}, Dong Liang^{3}, Yandong Tang^{1}

^{1} State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences;
^{2} University of Chinese Academy of Sciences; ^{3} Xi’an Jiaotong University

{luoqiong, hanzhi, chenxiai, ytang}@sia.cn, {dymeng, liangdong}@mail.xjtu.edu.cn, yao.s.wang@gmail.com
Abstract
The RPCA model has achieved good performance in various applications. However, two defects limit its effectiveness. First, it is designed for data in matrix form, and thus fails to exploit the structural information of higher-order tensor data in many practical situations. Second, it adopts the L1-norm to model the noise part, which makes it valid only for sparse noise. In this paper, we propose a tensor RPCA model based on CP decomposition and model the data noise by a Mixture of Gaussians (MoG). Keeping the raw data in tensor form allows us to make full use of its inherent structural priors, and MoG is a universal approximator to any mixture of continuous distributions, which makes our approach capable of recovering the low-dimensional linear subspace under a wide range of noises or their mixtures. The model is solved by a newly proposed algorithm derived under a variational Bayesian framework. The superiority of our approach over existing state-of-the-art approaches is demonstrated by extensive experiments on both synthetic and real data.
1. Introduction
In the field of data analysis, principal component analysis (PCA) has been a classical and prevalent tool with extensive applications [16]. Originally, PCA aims to find the best L2-norm low-rank approximation of a given matrix; owing to the smoothness of this objective, many fast numerical solvers exist [9, 24, 25, 26, 35, 41]. However, the L2-norm is only suitable for Gaussian noise and is too susceptible to outliers and gross noise. To increase the robustness of PCA, a series of works has been conducted in recent years [12, 17, 13, 19].
Inspired by the progress of low-rank matrix analysis [4, 5, 30], robust principal component analysis (RPCA) [40] has been proposed to remedy the deficiency of traditional PCA. In RPCA, a high-dimensional observation matrix is assumed to consist of a low-rank component and a sparse component. Specifically, let $Y \in \mathbb{R}^{m \times n}$ be the observation data matrix, $X \in \mathbb{R}^{m \times n}$ the low-rank matrix, and $E \in \mathbb{R}^{m \times n}$ the sparse noise matrix; then RPCA can be described as the following optimization problem:

$$\min_{X,E} \; \|X\|_* + \lambda \|E\|_1 \quad \text{s.t.} \quad Y = X + E, \qquad (1)$$

where $\|X\|_* = \sum_r \sigma_r(X)$ denotes the nuclear norm of $X$, $\sigma_r(X)$ $(r = 1, 2, \ldots, \min(m, n))$ is the $r$-th singular value of $X$, $\|E\|_1 = \sum_{ij} |e_{ij}|$ denotes the L1-norm of $E$, and $e_{ij}$ is the element in the $i$-th row and $j$-th column of $E$. It has been proved that if $X$ and $E$ satisfy a certain incoherence condition, RPCA can uniquely extract $X$ and $E$ from $Y$ [6]. RPCA has played an important role in handling various problems, including robust matrix recovery [40], face alignment [27], subspace segmentation [21], and so forth.

* Corresponding author.
Recently, it has been noticed that more and more modern applications contain data with a higher-order tensor structure, such as background extraction [7], face recognition and representation [40, 34, 38, 2], structure from motion [36], object recognition [37] and motion segmentation [39]. Matrices can be viewed as second-order tensors; however, moving from matrices to higher-order tensors presents significant new challenges. A direct way to address these challenges is to unfold tensors into matrices and then apply the matrix RPCA model. Unfortunately, as recently pointed out by [7], the multilinear structure is lost in such matricization, and as a result, methods built on these techniques often lead to suboptimal results. It is therefore helpful to handle such raw data with a direct tensor representation, and several studies along this line have appeared in the literature [11, 20].
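The matricization step criticized above can be made concrete with a short sketch. The mode-n unfolding below arranges the mode-n fibers of a tensor as matrix columns (one common convention; exact column ordering varies across libraries): every unfolding keeps one mode and flattens all the others, which is precisely where the multilinear structure is discarded.

```python
import numpy as np

def unfold(tensor, mode):
    # Mode-n matricization: bring the chosen mode to the front,
    # then flatten the remaining modes into the columns.
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

T = np.arange(24).reshape(2, 3, 4)   # a 2x3x4 third-order tensor
print(unfold(T, 0).shape)            # (2, 12)
print(unfold(T, 1).shape)            # (3, 8)
print(unfold(T, 2).shape)            # (4, 6)
```

Matrix RPCA applied to any single unfolding sees only that one mode's low-rank structure; the CP-based model of this paper instead treats all modes jointly.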
Moreover, the L1-norm and L2-norm characterize the specific Laplace and Gaussian distributions, respectively, but real noise generally does not follow any one particular noise configuration, as already shown in [42]. A Mixture of Gaussians (MoG) can approximate a much wider range of distributions owing to its universal approximation capability, and both the Laplacian and the Gaussian can be regarded as special cases of MoG [3]. It has been demonstrated that MoG
2017 IEEE International Conference on Computer Vision
DOI 10.1109/ICCV.2017.537