Danaher et al. [25] proposed the fused graphical Lasso (FGL) by applying the fused Lasso penalty [30]
\[
P(\Theta^1, \ldots, \Theta^K) = \sum_{k < k'} \sum_{i \neq j} \big|\theta^k_{ij} - \theta^{k'}_{ij}\big|
\]
to (1), which encourages the $K$ precision matrices to have identical element values. In addition, they also proposed the group graphical Lasso (GGL) by applying the group Lasso penalty [31]
\[
P(\Theta^1, \ldots, \Theta^K) = \sum_{i \neq j} \Big( \sum_{k=1}^{K} (\theta^k_{ij})^2 \Big)^{1/2}
\]
to (1), which encourages the $K$ precision matrices to have a common pattern of sparsity.
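To make the two penalties concrete, a minimal NumPy sketch is given below, assuming the $K$ precision matrices are supplied as a list of $p \times p$ arrays; the helper names are ours, not code from [25], [30], or [31].

```python
import numpy as np

def fgl_penalty(thetas):
    """Fused Lasso penalty: sum over pairs k < k' and off-diagonal entries
    (i, j) of |theta^k_ij - theta^k'_ij|."""
    K = len(thetas)
    p = thetas[0].shape[0]
    off = ~np.eye(p, dtype=bool)                  # mask selecting i != j
    return sum(np.abs(thetas[k] - thetas[l])[off].sum()
               for k in range(K) for l in range(k + 1, K))

def ggl_penalty(thetas):
    """Group Lasso penalty: sum over off-diagonal entries (i, j) of the
    l2 norm of (theta^1_ij, ..., theta^K_ij) taken across the K matrices."""
    stacked = np.stack(thetas)                    # shape (K, p, p)
    off = ~np.eye(stacked.shape[1], dtype=bool)
    return np.sqrt((stacked ** 2).sum(axis=0))[off].sum()
```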
B. Node-Based Joint Graphical Lasso
First, we review the RCON given in [26].
Definition 1: The RCON induced by a matrix norm $\|\cdot\|$ is defined as
\[
\Omega(\Theta^1, \Theta^2, \ldots, \Theta^N) = \min_{V^1, \ldots, V^N} \left\| \begin{bmatrix} V^1 \\ V^2 \\ \vdots \\ V^N \end{bmatrix} \right\| \quad \text{s.t. } \Theta^n = V^n + (V^n)^T \text{ for } n = 1, 2, \ldots, N.
\]
Indeed, $\Omega(\cdot)$ is a norm for all matrix norms $\|\cdot\|$; thus, it is convex. In this paper, we consider only a particular class of RCON, where $\|\cdot\|$ is an $\ell_1/\ell_r$ norm, given by $\|V\| = \sum_{j=1}^{p} \|V_j\|_r$, with $V = [V_1, V_2, \ldots, V_p]$ and $1 \le r \le \infty$. In the following, $\Omega(\cdot)$ is denoted by $\Omega_r(\cdot)$.
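Since the RCON is defined through a minimization over the matrices $V^n$, evaluating it requires solving a small convex program. The following is a rough sketch for the $\ell_1/\ell_2$ case ($r = 2$) using CVXPY; the helper name rcon is ours and is not code from [26].

```python
import cvxpy as cp

def rcon(thetas):
    """Omega_2(Theta^1, ..., Theta^N): minimize the sum of column l2 norms
    of the stacked [V^1; ...; V^N] subject to Theta^n = V^n + (V^n)^T."""
    p = thetas[0].shape[0]
    Vs = [cp.Variable((p, p)) for _ in thetas]
    stacked = cp.vstack(Vs)                        # (N*p) x p block matrix
    objective = cp.Minimize(cp.sum(cp.norm(stacked, 2, axis=0)))
    constraints = [V + V.T == T for V, T in zip(Vs, thetas)]
    problem = cp.Problem(objective, constraints)
    problem.solve()
    return problem.value
```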
Based on the RCON, the perturbed-node joint graphical Lasso (PNJGL) [26] is proposed to detect the perturbed nodes$^{2}$ among multiple networks using the structure penalty
\[
P(\Theta^1, \ldots, \Theta^K) = \sum_{k < k'} \Omega_r(\Theta^k - \Theta^{k'}).
\]
When $r = 2$ or $r = \infty$, PNJGL encourages the differences between the precision matrices $\{\Theta^k\}_{k=1}^{K}$ to be supported on a union of a few rows and the corresponding columns, which can be interpreted as a set of perturbed nodes across the $K$ networks.
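Using the rcon helper sketched above, the PNJGL structure penalty is simply a sum of RCONs of pairwise differences; a minimal illustration (not the authors' implementation):

```python
def pnjgl_penalty(thetas):
    """PNJGL structure penalty: sum of Omega_r(Theta^k - Theta^k') over k < k'."""
    K = len(thetas)
    return sum(rcon([thetas[k] - thetas[l]])
               for k in range(K) for l in range(k + 1, K))
```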
Moreover, the common hub (cohub) node joint graphical Lasso (CNJGL) [26] is proposed to detect the cohub nodes$^{3}$ among multiple networks using the structure penalty
\[
P(\Theta^1, \ldots, \Theta^K) = \Omega_r\big(\Theta^1 - \mathrm{diag}(\Theta^1), \ldots, \Theta^K - \mathrm{diag}(\Theta^K)\big).
\]
CNJGL encourages the supports of $\{\Theta^k\}_{k=1}^{K}$ to be the same and to form a union of a few rows and the corresponding columns shared by the $K$ precision matrices, which can be interpreted as a set of common hub nodes among the $K$ networks.
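Analogously, the CNJGL structure penalty applies the RCON jointly to the $K$ matrices with their diagonals removed; a short sketch reusing the rcon helper above:

```python
import numpy as np

def cnjgl_penalty(thetas):
    """CNJGL structure penalty:
    Omega_r(Theta^1 - diag(Theta^1), ..., Theta^K - diag(Theta^K))."""
    return rcon([T - np.diag(np.diag(T)) for T in thetas])
```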
$^{2}$The perturbed nodes are those whose connectivity pattern to the other nodes differs across the multiple networks [26].
$^{3}$The cohub nodes serve as hubs in each of the multiple networks [26].
III. JOINT MATRIX GRAPHICAL MODELS
In this section, we propose the joint matrix graphical
Lasso for learning multiple MGGMs sharing the same matrix
variable under distinct conditions. Depending on the application, we propose the edge-based and the node-based joint matrix graphical Lasso, respectively.
A. Problem Formulation
Suppose that a random matrix $Y \in \mathbb{R}^{p \times q}$ follows the matrix normal distribution $\mathcal{MN}_{p,q}(M, \Sigma, \Psi)$, whose density function is defined as
\[
P(Y \mid M, \Sigma, \Psi) = \frac{1}{(2\pi)^{pq/2} |\Sigma|^{q/2} |\Psi|^{p/2}} \exp\!\Big( -\frac{1}{2}\,\mathrm{tr}\big( (Y - M)^T \Sigma^{-1} (Y - M) \Psi^{-1} \big) \Big)
\]
where $M \in \mathbb{R}^{p \times q}$ is the mean matrix; $\Sigma \in \mathbb{R}^{p \times p}$ and $\Psi \in \mathbb{R}^{q \times q}$ are the row and column covariance matrices, respectively. The row precision matrix $\Sigma^{-1}$ encodes the conditional independence among the rows of the matrix variable, while the column precision matrix $\Psi^{-1}$ encodes that among the columns.
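For concreteness, a naive NumPy sketch of this log-density under the parameterization above (assuming Sigma and Psi are symmetric positive definite; no attempt is made at numerical robustness, and the function name is ours):

```python
import numpy as np

def matrix_normal_logpdf(Y, M, Sigma, Psi):
    """log P(Y | M, Sigma, Psi) with row covariance Sigma (p x p) and
    column covariance Psi (q x q), following the density above."""
    p, q = Y.shape
    R = Y - M
    _, logdet_Sigma = np.linalg.slogdet(Sigma)
    _, logdet_Psi = np.linalg.slogdet(Psi)
    # tr((Y - M)^T Sigma^{-1} (Y - M) Psi^{-1})
    quad = np.trace(np.linalg.solve(Psi, R.T @ np.linalg.solve(Sigma, R)))
    return -0.5 * (p * q * np.log(2 * np.pi)
                   + q * logdet_Sigma + p * logdet_Psi + quad)
```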
Further suppose that $Y^k_i \in \mathbb{R}^{p \times q}$ ($i = 1, 2, \ldots, n_k$) are sampled i.i.d. from $\mathcal{MN}_{p,q}(O_{p \times q}, \Sigma^k, \Psi^k)$ for $k = 1, 2, \ldots, K$ with $K \ge 2$. Here $n_k$ is the number of samples in the $k$th class, and the features are shared among the $K$ classes. For convenience, let $A^k = (\Sigma^k)^{-1}$ and $B^k = (\Psi^k)^{-1}$ ($k = 1, 2, \ldots, K$), and further let $\{A\} = \{A^1, \ldots, A^K\}$ and $\{B\} = \{B^1, \ldots, B^K\}$.
Then the negative log likelihood for the data takes the form
\[
L(\{A\}, \{B\}) = \sum_{k=1}^{K} \left[ \frac{1}{n_k p q} \sum_{l=1}^{n_k} \mathrm{tr}\big( A^k Y^k_l B^k (Y^k_l)^T \big) - \frac{1}{p} \log|A^k| - \frac{1}{q} \log|B^k| \right].
\]
Meanwhile, we can also consider the weighted negative
log likelihood. Clearly, minimizing L({A}, {B}) leads to the
maximum likelihood estimates (MLEs). However, the MLEs
are usually dense. The $\ell_1$-regularization has been employed to induce sparsity, resulting in sparse precision estimation.
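A minimal sketch of the scaled negative log likelihood $L(\{A\}, \{B\})$ above, assuming the samples of class $k$ are stored as an array Ys[k] of shape (n_k, p, q); the function name is ours.

```python
import numpy as np

def neg_log_likelihood(As, Bs, Ys):
    """L({A}, {B}): for each class k, (1/(n_k p q)) sum_l tr(A^k Y_l^k B^k (Y_l^k)^T)
    minus (1/p) log|A^k| minus (1/q) log|B^k|, summed over k."""
    total = 0.0
    for A, B, Y in zip(As, Bs, Ys):
        n_k, p, q = Y.shape
        trace_term = sum(np.trace(A @ Yl @ B @ Yl.T) for Yl in Y)
        total += (trace_term / (n_k * p * q)
                  - np.linalg.slogdet(A)[1] / p
                  - np.linalg.slogdet(B)[1] / q)
    return total
```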
In this paper, we propose the joint matrix graphical Lasso for estimating multiple MGGMs by minimizing the penalized negative log likelihood
\[
\min_{\{A^k \in S^p_{++},\, B^k \in S^q_{++}\}_{k=1}^{K}} \; L(\{A\}, \{B\}) + \lambda_1 \sum_{k=1}^{K} \|A^k\|_1 + \rho_1 \sum_{k=1}^{K} \|B^k\|_1 + \lambda_2 P_1(\{A\}) + \rho_2 P_2(\{B\}) \quad (2)
\]
where $\lambda_1$, $\lambda_2$, $\rho_1$, and $\rho_2$ are nonnegative tuning parameters. Here, $P_1(\{A\})$ and $P_2(\{B\})$ are convex structure penalty functions, which aim at preserving the common structures in the row and the column precision matrices, respectively.
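Putting the pieces together, the penalized objective in (2) can be sketched as follows, reusing the neg_log_likelihood helper above; P1 and P2 stand for whichever structure penalties are chosen, and this illustrates only the objective value, not the authors' optimization algorithm.

```python
import numpy as np

def joint_objective(As, Bs, Ys, lam1, rho1, lam2, rho2, P1, P2):
    """Value of the penalized negative log likelihood in (2)."""
    l1_A = sum(np.abs(A).sum() for A in As)        # sum_k ||A^k||_1
    l1_B = sum(np.abs(B).sum() for B in Bs)        # sum_k ||B^k||_1
    return (neg_log_likelihood(As, Bs, Ys)
            + lam1 * l1_A + rho1 * l1_B
            + lam2 * P1(As) + rho2 * P2(Bs))
```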
When $A^k = I_p$ or $B^k = I_q$ ($k = 1, 2, \ldots, K$), our model reduces to