An Efficient Algorithm for Tensor Principal Component Analysis via Proximal Linearized Alternating Direction Method of Multipliers

Linbo Qiao 1,2, Bofeng Zhang 1, Lei Zhuang 3, Jinshu Su 1,2

1 College of Computer, National University of Defense Technology, Changsha 410073, China
2 National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha 410073, China
3 Department of Statistics, Shanghai University of Finance and Economics, Shanghai 201800, China
1 {qiao.linbo,bfzhang,sjs}@nudt.edu.cn, 3 shirleyshufe@gmail.com
Abstract—In this paper, we focus on the computation of the principal components of a general tensor, known as the tensor principal component analysis (PCA) problem. It has been proven that the general tensor PCA problem is reducible to its matricization form when the order is even, in which case it is usually treated theoretically as a low-rank matrix completion problem. It is common to use the nuclear norm as a surrogate for the rank operator, since it is the tightest convex lower bound of the rank operator under certain conditions. However, most nuclear-norm-minimization-based approaches involve a number of singular value decomposition (SVD) operations. Given a matrix X ∈ R^{m×n}, the time complexity of an SVD operation is O(mn^2), which imposes a prohibitive computational burden when applying these methods to real applications. With non-convex penalties, the problem becomes non-convex, and the proximal mapping associated with non-convex regularization is not easy to compute; such problems are usually solved by the Linearized Alternating Direction Method of Multipliers (LADMM). Despite the success of LADMM in practice, it remains unknown whether LADMM converges when solving such non-convex compositely regularized optimization. In this paper, we first present a detailed convergence analysis of the LADMM algorithm for solving non-convex compositely regularized optimization with a large class of non-convex penalties. Furthermore, we propose a new efficient and scalable algorithm for matrix principal component analysis called Proximal Linearized Alternating Direction Method of Multipliers for Principal Component Analysis (PLADMPCA). Different from traditional matrix factorization methods, PLADMPCA utilizes the linearization technique to formulate the matrix as an outer product of vectors, which greatly improves computational efficiency compared to matrix factorization methods. We empirically evaluate the proposed PLADMPCA algorithm on synthetic tensor data of different orders. The results show that PLADMPCA has a much lower computational cost than matrix-factorization-based methods. At the same time, it outperforms state-of-the-art SVD-based matrix completion algorithms, achieving similar or better reconstruction accuracy with an enormous advantage in efficiency.
Index Terms—Tensor; Principal Component Analysis; Proximal Linearized Alternating Direction Method of Multipliers
I. INTRODUCTION
A tensor is a multidimensional array. More formally, an N-way or Nth-order tensor is an element of the tensor product of N vector spaces, each of which has its own coordinate system. For example, a first-order tensor is a vector, a second-order tensor is a matrix, and tensors of order three or higher are called higher-order tensors. There are two main classes of tensor decompositions: the "CP" decomposition [5], [9] and the Tucker decomposition [26]. Canonical decomposition (CANDECOMP) was proposed by Carroll and Chang [5] and parallel factors (PARAFAC) by Harshman [9]; "CP" abbreviates CANDECOMP/PARAFAC.
Principal component analysis (PCA) finds a few linear
combinations of the original variables. It is a powerful tool
to compress data along the direction of maximum variance
with minimum information loss. The PCA plays an important
role in dimension reduction and data analysis related research
areas. Specifically, let ε = (ε_1, ···, ε_m) be an m-dimensional random vector. Then, for a given data matrix M ∈ R^{m×n}, which consists of n samples of the m variables, finding the principal components which explain the largest variance of the variables (ε_1, ···, ε_m) corresponds to the following optimization problem:
(λ*, x*, y*) := min_{λ,x,y} ‖M − λ x ⊗ y‖,        (1)

where λ ∈ R, x ∈ R^m, y ∈ R^n, and the symbol ⊗ denotes the outer product of two vectors.
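For intuition, problem (1) asks for the best rank-one approximation of M, which for matrices has a closed-form solution given by the leading singular triplet. The following NumPy sketch (our own illustration with hypothetical names, not the PLADMPCA algorithm developed in this paper) computes it directly:

    import numpy as np

    def rank_one_pca(M):
        """Best rank-one approximation of M: (lam, x, y) minimizing ||M - lam * outer(x, y)||
        with unit-norm x and y, obtained from the leading singular triplet of M."""
        U, s, Vt = np.linalg.svd(M, full_matrices=False)  # the O(mn^2) step discussed above
        return s[0], U[:, 0], Vt[0, :]

    rng = np.random.default_rng(0)
    M = rng.standard_normal((50, 30))
    lam, x, y = rank_one_pca(M)
    print(np.linalg.norm(M - lam * np.outer(x, y)))  # value of the objective in (1)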
Although PCA and the eigenvalue problem for matrices have been well studied in the literature, little work has been done on PCA for tensors. Nevertheless, tensor PCA is of great importance in practice and has many applications. Similar to the matrix form, the problem of finding the principal component that explains the most variance of a tensor F can be formulated as:
min_{x_1, x_2, ···, x_m} ‖F − λ x_1 ⊗ x_2 ⊗ ··· ⊗ x_m‖_F^2,
s.t. λ ∈ R, ‖x_i‖ = 1, i = 1, 2, ···, m.        (2)
Here the outer product ⊗ of two tensors F_1 (of order m) and F_2 (of order l) is defined by

(F_1 ⊗ F_2)_{i_1 i_2 ··· i_{m+l}} = (F_1)_{i_1 i_2 ··· i_m} (F_2)_{i_{m+1} ··· i_{m+l}}.
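As a concrete reading of this definition and of problem (2), the following NumPy sketch (our own illustration with hypothetical helper names, not code from this paper) builds x_1 ⊗ x_2 ⊗ ··· ⊗ x_m by repeated outer products and evaluates the objective of (2):

    import numpy as np

    def outer(*vectors):
        """Outer product x_1 (x) x_2 (x) ... (x) x_m of m vectors: an order-m tensor whose
        (i_1, ..., i_m) entry is the product of the corresponding vector entries."""
        T = vectors[0]
        for v in vectors[1:]:
            T = np.multiply.outer(T, v)  # appends v's indices, as in the definition above
        return T

    def objective(F, lam, vectors):
        """Value of ||F - lam * x_1 (x) ... (x) x_m||_F^2 from problem (2)."""
        return np.linalg.norm(F - lam * outer(*vectors)) ** 2

    rng = np.random.default_rng(0)
    xs = [rng.standard_normal(n) for n in (4, 5, 6)]
    xs = [x / np.linalg.norm(x) for x in xs]  # the unit-norm constraints ||x_i|| = 1
    F = 3.0 * outer(*xs) + 0.01 * rng.standard_normal((4, 5, 6))
    print(objective(F, 3.0, xs))  # small residual: F is close to a rank-one tensor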