variant of NMF), which requires the obtained basis vectors to be
close to the original data, thereby guaranteeing the real sparse
representation of the data. Their method solves the problem that
for the $\ell_1$ and $\ell_{1/2}$ constraints, the new representations of some data samples are still highly dense, although, on average, the
new representation is very sparse. In addition, label constraints
[16] are added to the NMF framework to introduce the label
information from the training data samples. Regarding the cost functions that quantify the quality of the approximation, the Earth Mover's distance between the data and the matrix product [17] has been proposed to quantify the error in image or histogram matching more effectively. New tools, such as multiple kernel
learning, are also applied to NMF [18], which leads to better
performance.
Among the regularizations, the manifold regularization, which
considers the geometry of the data space, has recently been incorporated into the original NMF framework, resulting in a novel low-rank matrix factorization [19] named GNMF, where a p-nearest
neighbor simple graph is constructed to encode the geometrical
information. This modified NMF framework outperforms the
original NMF method. The minimization problem is formulated as
$$\min_{B,F}\ \|X-BF\|_F^2+\alpha\,\mathrm{Tr}(FLF^{T})\quad \text{s.t.}\ B\ge 0,\ F\ge 0. \qquad (5)$$
An alternating iterative algorithm with the following multiplicative updating rules,
$$B_{ik} \leftarrow B_{ik}\,\frac{(XF^{T})_{ik}}{(BFF^{T})_{ik}}, \qquad F_{kj} \leftarrow F_{kj}\,\frac{(B^{T}X+\alpha FS)_{kj}}{(B^{T}BF+\alpha FD)_{kj}},$$
is applied to solve the optimization problem (5), where $S$ is the graph weight matrix, $D$ is the diagonal degree matrix with $D_{jj}=\sum_{l}S_{jl}$, and $L=D-S$.
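A minimal NumPy sketch of these multiplicative updates is given below; the function name, the random initialization, the iteration count, and the small constant added to the denominators are illustrative assumptions rather than details taken from [19].

```python
import numpy as np

def gnmf_updates(X, r, S, alpha=0.1, n_iter=200, eps=1e-10):
    """Illustrative multiplicative updates for problem (5).

    X : (m, n) nonnegative data matrix
    S : (n, n) symmetric nonnegative graph weight matrix
    r : target rank; alpha : regularization weight.
    """
    m, n = X.shape
    D = np.diag(S.sum(axis=1))      # diagonal degree matrix of the graph
    rng = np.random.default_rng(0)
    B = rng.random((m, r))          # nonnegative basis matrix
    F = rng.random((r, n))          # nonnegative coefficient (coding) matrix
    for _ in range(n_iter):
        # B_ik <- B_ik (X F^T)_ik / (B F F^T)_ik
        B *= (X @ F.T) / (B @ F @ F.T + eps)
        # F_kj <- F_kj (B^T X + alpha F S)_kj / (B^T B F + alpha F D)_kj
        F *= (B.T @ X + alpha * F @ S) / (B.T @ B @ F + alpha * F @ D + eps)
    return B, F
```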
However, the performance of Eq. (5) is sensitive to the construction of the graph. The performance differs for various graphs with different weighting schemes and parameter selections. Because an infinite number of graphs are embodied within the intrinsic manifold, an exhaustive search for the optimal graph for a specific task (such as matrix factorization) is very time consuming, expensive, difficult or even impossible to carry out by hand, and easily prone to overfitting the training set. To address this problem, a multiple graph regularized NMF [12] is proposed, in which graph selection and nonnegative matrix factorization are completed jointly. Tests on the datasets demonstrate that it outperforms the GNMF method.
As for the TSVD framework, many improvements have been
applied to face recognition [20– 22], document classification [23],
etc. However, the improvements made to the TSVD framework are
meager compared with those made to NMF in the data analysis field. Recently, Zhang and Zhao [13] proposed a novel low-rank matrix factorization method (named MMF) that differs
from the GNMF method in that their graph regularization is
incorporated into the original TSVD framework. Their optimization
problem is formulated as
$$\min_{B,F}\ \|X-BF\|_F^2+\alpha\,\mathrm{Tr}(F\Phi F^{T})\quad \text{s.t.}\ B^{T}B=I_r, \qquad (6)$$
where $\Phi$ is a symmetric, positive semi-definite matrix that characterizes the data manifold; the most widely used choice is the graph Laplacian matrix. A direct algorithm and an alternating iterative algorithm are proposed for solving this minimization problem. Testing demonstrates that the MMF method is superior to GNMF, which adds the manifold regularization term to the original NMF framework.
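For illustration, one plausible alternating scheme for (6) updates $F$ in closed form (using $B^{T}B=I_r$) and updates $B$ by an orthogonal Procrustes step; the sketch below follows this idea and is not claimed to be the exact direct or iterative algorithm of [13].

```python
import numpy as np

def mmf_alternating(X, r, Phi, alpha=0.1, n_iter=50):
    """Illustrative alternating scheme for problem (6).

    X   : (m, n) data matrix
    Phi : (n, n) symmetric matrix characterizing the manifold (e.g., a graph Laplacian)
    r   : target rank; alpha : regularization weight.
    """
    n = X.shape[1]
    # Initialize B with orthonormal columns from a thin SVD of X.
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    B = U[:, :r]
    A = np.eye(n) + alpha * Phi              # fixed matrix of the F-step system
    for _ in range(n_iter):
        # F-step: with B^T B = I_r, setting the gradient to zero gives
        # F (I + alpha * Phi) = B^T X, a linear system solved for F.
        F = np.linalg.solve(A, (B.T @ X).T).T    # A is symmetric, so A^T = A
        # B-step: orthogonal Procrustes, B = U V^T with X F^T = U S V^T.
        U, _, Vt = np.linalg.svd(X @ F.T, full_matrices=False)
        B = U @ Vt
    return B, F
```

Each B-step keeps the factor exactly on the constraint set $B^{T}B=I_r$, while the F-step only requires solving one linear system with a fixed coefficient matrix.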
2.2. Hypergraph learning
Different from the simple graph, in which an edge connects two vertices, a hyperedge in the Hypergraph connects more than two vertices and indicates a high-order relationship among data samples. Suppose that the Hypergraph $G=(V,E,W)$ is composed of the vertex set $V$, the hyperedge set $E$, where each hyperedge $e$ is a subset of $V$, and the hyperedge weights $W$. Denote the weight corresponding to the hyperedge $e$ as $w(e)$. If we use the incidence matrix $H$ to represent the Hypergraph $G$, then $H(v,e)=1$ if $v\in e$, and $H(v,e)=0$ otherwise.
Based on the matrix $H$, the degree of each vertex is defined as $d(v)=\sum_{e\in E}w(e)H(v,e)$. The degree of a hyperedge $e$ is defined as $\delta(e)=\sum_{v\in V}H(v,e)$. The weight matrix $W$, with $W_{ij}=\sum_{e\in E}\sum_{(i,j)\in e}(w(e)/\delta(e))$, gives the weight between any two vertices in the Hypergraph. If we use $D_v$, $D_e$ and $W_e$ to denote the diagonal matrices of the vertex degrees, the hyperedge degrees, and the hyperedge weights, respectively, then the un-normalized Hypergraph Laplacian matrix is formulated as $L_{Hyper}=D_v-S$, where $S=HW_eD_e^{-1}H^{T}$.
Hypergraph learning has already been applied to many tasks
such as classification, clustering, retrieval and embedding. For
instance, Agarwal et al. [24] used a Hypergraph for clustering by using a clique average to transform a Hypergraph into a simple graph. Zass and Shashua [25] adopted the Hypergraph for image matching by using convex optimization. Hypergraphs have also been applied to the problems of multi-label learning [26] and video
segmentation [27]. Yu et al. [28] proposed a novel Hypergraph-
based semi-supervised learning method for image classification, in
which the weights of hyperedges were adaptively coordinated.
Huang et al. [29] formulated the image clustering task as a
problem of Hypergraph partition. In [30], a Hypergraph-based
image retrieval approach is proposed. This approach builds a
Hypergraph by generating a hyperedge from each sample and its
neighbors; Hypergraph-based ranking is then performed. Gao
et al. [31] integrated Hypergraph learning into the framework of
sparse coding. The Hypergraph manifold enhanced the geometric
structure of the sparse codes. The aim of this paper is to integrate
the Hypergraph manifold into low-rank factorization.
3. The proposed method
3.1. Hypergraph regularization term
The local invariance assumption states that nearby data samples should be embedded similarly. That is, if the feature vectors of two data samples are close in the intrinsic manifold of the data distribution, then the corresponding coding vectors with respect to the new basis are also close, and vice versa. Owing to the high-order information captured by the Hypergraph, the regularization term that measures the smoothness of the low-dimensional coding vectors in F can be formulated as
$$\mathcal{O}_{HGr}(F)=\frac{1}{2}\sum_{i,j}W_{ij}\,\|F_i-F_j\|^{2}=\frac{1}{2}\sum_{e\in E}\sum_{(i,j)\in e}\frac{w(e)}{\delta(e)}\,\|F_i-F_j\|^{2}=\mathrm{Tr}(FL_{Hyper}F^{T}), \qquad (7)$$
where $F=[F_i]$ and $F_i$ is the $i$th coding vector, and $W_{ij}=\sum_{e\in E}\sum_{(i,j)\in e}(w(e)/\delta(e))$ is the weight between two vertices in the Hypergraph.
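The equality between the pairwise form and the trace form in (7) can be checked numerically; the short NumPy snippet below does so for a small, arbitrary Hypergraph (the incidence matrix, weights, and coding vectors are all illustrative):

```python
import numpy as np

# A small Hypergraph with 6 vertices and 3 hyperedges (incidence matrix H).
H = np.array([[1, 0, 1],
              [1, 1, 0],
              [0, 1, 0],
              [1, 0, 1],
              [0, 1, 1],
              [0, 0, 1]], dtype=float)
w = np.array([0.5, 1.0, 2.0])          # hyperedge weights w(e)
rng = np.random.default_rng(0)
F = rng.random((2, 6))                 # coding vectors F_i as columns of F

delta = H.sum(axis=0)                  # hyperedge degrees
S = H @ np.diag(w) @ np.diag(1.0 / delta) @ H.T   # pairwise weights W_ij
L_hyper = np.diag(H @ w) - S           # L_Hyper = D_v - S

# Pairwise form: (1/2) sum_ij W_ij ||F_i - F_j||^2
pairwise = 0.5 * sum(S[i, j] * np.sum((F[:, i] - F[:, j]) ** 2)
                     for i in range(6) for j in range(6))
# Trace form: Tr(F L_Hyper F^T)
trace_form = np.trace(F @ L_hyper @ F.T)
print(np.isclose(pairwise, trace_form))   # expected: True
```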
By minimizing (7), the coding vectors corresponding to data points within the same hyperedge are forced to be close to each other whenever the data points themselves are close; consequently, the locality information among the data points within the same hyperedge is preserved. We emphasize that the Hypergraph regularization differs from the graph regularization framework [12], which guarantees only the smoothness of the low-dimensional data representation through pair-wise relationships between two data samples.