Fig. 3. Paradigms for self-supervised learning. Top: in unsupervised representation learning, only graphs are used to train the encoder through the self-supervised task. The learned representations are fixed and used in downstream tasks such as linear classification and clustering. Middle: unsupervised pre-training trains the encoder with unlabeled graphs by the self-supervised task. The pre-trained encoder's parameters are then used as the initialization of the encoder used in supervised fine-tuning for downstream tasks. Bottom: in auxiliary learning, an auxiliary task with self-supervision is included to help learn the supervised main task. The encoder is trained through both the main task and the auxiliary task simultaneously.
training graphs, contrastive learning aims to learn one or more encoders such that representations of similar graph instances agree with each other and representations of dissimilar graph instances disagree with each other. We unify existing approaches to constructing contrastive learning tasks into a general framework that learns to discriminate jointly sampled view pairs (e.g., two views belonging to the same instance) from independently sampled view pairs (e.g., views belonging to different instances). In particular, we obtain multiple views from each graph in the training dataset by applying different transformations. Two views generated from the same instance are usually considered a positive pair, while two views generated from different instances are considered a negative pair. The agreement between two representations is usually measured by metrics related to their mutual information.
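As a concrete illustration of this pairing scheme, the following minimal sketch scores all view pairs in a batch and marks which pairs are positive; the cosine-similarity scoring is one common choice and is used here purely as an assumption, not as the objective of any particular method.

```python
import torch
import torch.nn.functional as F

def score_view_pairs(h_a, h_b):
    """h_a, h_b: [n, d] tensors holding two view representations per graph,
    where row i of h_a and row i of h_b come from the same graph.
    Returns pairwise agreement scores and a mask of positive pairs."""
    # Agreement between every pair of views (here: cosine similarity).
    scores = F.cosine_similarity(h_a.unsqueeze(1), h_b.unsqueeze(0), dim=-1)  # [n, n]
    # Views of the same instance form positive pairs (diagonal);
    # views of different instances form negative pairs (off-diagonal).
    positives = torch.eye(h_a.size(0), dtype=torch.bool)
    return scores, positives
```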
One major difference among graph contrastive learning methods lies in (a) the objective of the discrimination task given the view representations. In addition, due to the unique data structure of graphs, graph contrastive learning methods also differ in (b) the approaches by which views are obtained, and (c) the graph encoders that compute the representations of views. A graph contrastive learning method can be determined by specifying its components (a)–(c). In this section, we summarize graph contrastive learning methods in a unified framework and then introduce components (a)–(c) as individually used in existing studies.
3.1 Overview of Contrastive Learning Framework
In general, the key components that specify a contrastive learning framework include the transformations that compute multiple views from each given graph, the encoders that compute the representation of each view, and the learning objective that optimizes the parameters of the encoders. An overview of the framework is shown in Figure 4. Concretely, given a graph $(A, X)$ as a random variable distributed from $\mathcal{P}$, multiple transformations $T_1, \cdots, T_k$ are applied to obtain different views $w_1, \cdots, w_k$ of the graph. Then, a set of encoding networks $f_1, \cdots, f_k$ take the corresponding views as their inputs and output the representations $h_1, \cdots, h_k$ of the graph from each view. Formally, we have
$$ w_i = T_i(A, X), \qquad (6) $$
$$ h_i = f_i(w_i), \quad i = 1, \cdots, k. \qquad (7) $$
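To make Eqs. (6)–(7) concrete, here is a minimal sketch of the view-generation and encoding pipeline; the transformation and encoder callables are hypothetical placeholders for the components discussed later in this section.

```python
def compute_view_representations(A, X, transforms, encoders):
    """Apply Eq. (6) and Eq. (7): each transformation T_i turns the input
    graph (A, X) into a view w_i = (A_i, X_i), and the matching encoder f_i
    maps that view to a representation h_i. `transforms` and `encoders` are
    equal-length lists of callables (hypothetical placeholders)."""
    assert len(transforms) == len(encoders)
    representations = []
    for T_i, f_i in zip(transforms, encoders):
        A_i, X_i = T_i(A, X)   # Eq. (6): view w_i = T_i(A, X)
        h_i = f_i(A_i, X_i)    # Eq. (7): representation h_i = f_i(w_i)
        representations.append(h_i)
    return representations
```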
We assume $w_i = (\hat{A}_i, \hat{X}_i) = T_i(A, X)$ in this survey since existing contrastive methods consider their views as graphs. However, note that not all views $w_i$ are necessarily graphs or sub-graphs in a general sense. In addition, certain encoders can be identical to each other or share their weights.
During training, the contrastive objective aims to train encoders to maximize the agreement between view representations computed from the same graph instance. The agreement is usually measured by the mutual information $I(h_i, h_j)$ between a pair of representations $h_i$ and $h_j$. We formalize the contrastive objective as
$$ \max_{\{f_i\}_{i=1}^{k}} \; \frac{1}{\sum_{i \neq j} \sigma_{ij}} \sum_{i \neq j} \sigma_{ij} \, I(h_i, h_j), \qquad (8) $$
where $\sigma_{ij} \in \{0, 1\}$, with $\sigma_{ij} = 1$ if the mutual information is computed between $h_i$ and $h_j$ and $\sigma_{ij} = 0$ otherwise, and $h_i$ and $h_j$ are considered as two random variables following either the joint distribution or the product of the two marginals. To enable efficient computation of the mutual information, certain estimators $\hat{I}$ of the mutual information are usually used instead as the learning objective. Note that some contrastive methods apply projection heads [9, 42] to the representations. For the sake of uniformity, we consider such projection heads as part of the computation in the mutual information estimation.
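A minimal sketch of how the objective in Eq. (8) might be evaluated is given below; `mi_estimate` stands in for any tractable estimator $\hat{I}$ (e.g., one of the lower-bound estimators introduced in the following subsections), and any projection heads are assumed to be folded into it.

```python
def contrastive_objective(reps, sigma, mi_estimate):
    """reps: list of view representations [h_1, ..., h_k], each of shape [n, d].
    sigma: k x k binary matrix; sigma[i][j] = 1 if I(h_i, h_j) is estimated.
    mi_estimate: callable returning a scalar MI estimate for two view tensors.
    Returns the objective of Eq. (8): the average of the selected pairwise
    MI estimates, to be maximized (or its negation minimized) w.r.t. the encoders."""
    k = len(reps)
    total, count = 0.0, 0
    for i in range(k):
        for j in range(k):
            if i != j and sigma[i][j] == 1:
                total = total + mi_estimate(reps[i], reps[j])
                count += 1
    return total / count
```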
During inference, one can either use a single trained encoder to compute the representation, or combine multiple view representations, for example by linear combination or concatenation, to obtain the final representation of a given graph. Three examples of using encoders in different ways during inference are illustrated in Figure 5.
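For illustration, here is a minimal sketch of two of these inference options; the function names and the concatenation choice are assumptions for the example, not prescriptions from any specific method.

```python
import torch

def infer_single_encoder(A, X, encoder):
    """Option 1: apply one trained encoder directly to the original graph."""
    return encoder(A, X)

def infer_concatenated_views(A, X, transforms, encoders):
    """Option 2: concatenate the representations of all views into the
    final representation of the graph."""
    reps = [f_i(*T_i(A, X)) for T_i, f_i in zip(transforms, encoders)]
    return torch.cat(reps, dim=-1)
```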
3.2 Contrastive Objectives
3.2.1 Mutual Information Estimation
Given a pair of random variables $(x, y)$, the mutual information $I(x, y)$ measures the information that $x$ and $y$ share, given by
$$ I(x, y) = D_{\mathrm{KL}}\big(p(x, y) \,\|\, p(x)p(y)\big) \qquad (9) $$
$$ = \mathbb{E}_{p(x,y)}\left[ \log \frac{p(x, y)}{p(x)p(y)} \right], \qquad (10) $$