Trans-DLR: A Simple Model Surpasses GAN and Improves Knowledge Representation Learning Performance


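This document, "Defeats GAN: A Simpler Model Outperforms in Knowledge Representation Learning", addresses an important problem in the knowledge graph (KG) field: knowledge representation learning. The goal of knowledge representation learning is to map entities and relations into a low-dimensional, continuous vector space, which matters in many practical applications such as recommender systems, information retrieval, and question answering. The authors propose a simple and elegant method, Trans-DLR, which introduces a dynamic learning rate control strategy during training and thereby markedly improves model performance. Compared with current models based on generative adversarial networks (GANs), Trans-DLR achieves better results. The method abandons the conventional single-mode negative sampling and adopts a new negative sampling trick that corrupts both entities and relations with different probabilities, which helps the model capture complex relational structure and learn richer representations. In addition, to make evaluation on link prediction tasks more efficient, the authors develop a parallel computing method that makes full use of multiprocessing and substantially speeds up evaluation. This is especially valuable for large-scale knowledge graphs, where processing usually demands efficient management of computing resources. Through a series of experiments, the authors demonstrate the effectiveness of Trans-DLR: it performs well on knowledge graph benchmarks, surpassing existing competitors in accuracy while also improving efficiency and scalability. The work suggests that, while keeping the model simple, sound training strategies and optimization techniques are key to better knowledge representation learning.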

Defeats GAN: A Simpler Model Outperforms in Knowledge Representation Learning

Heng Wang

School of Data and Computer Science

Sun Yat-sen University

Guangzhou, China

e-mail: wangh376@mail2.sysu.edu.cn

Mingzhi Mao

School of Data and Computer Science

Sun Yat-sen University

Guangzhou, China

e-mail: mcsmmz@mail.sysu.edu.cn

Abstract—The goal of knowledge representation learning is to embed entities and relations into a low-dimensional, continuous vector space. How to push a model to its limit and obtain better results is of great significance in applications of knowledge graphs. We propose a simple and elegant method, Trans-DLR, whose main idea is dynamic learning rate control during training. Our method achieves remarkable improvement compared with recent GAN-based methods. Moreover, we introduce a new negative sampling trick that corrupts not only entities but also relations, with different probabilities. We also develop an efficient way to speed up evaluation of the model in link prediction tasks, one that fully utilizes multiprocessing and parallel computing. Experiments show that our method is effective.

Keywords: knowledge representation; dynamic learning rate; negative sampling; multiprocessing; parallel computing

I. INTRODUCTION

A knowledge graph is a directed graph composed of various kinds of real-world entities and the relations among them. Typical knowledge graphs include WordNet [1], Freebase [2], and YAGO [3], to name a few. Knowledge graphs play a pivotal role in many NLP applications, such as relation extraction [4], question answering [5], and social network mining [6].

Facts in a knowledge graph are commonly represented as triples (head entity, relation, tail entity), abbreviated as (h, r, t). They are obtained by human labor, rules, or distant supervision [7], and are usually far from complete. Knowledge graph representation aims to represent entities and relations as symbols, numbers, or vectors, which helps complete missing links and find new facts for a knowledge graph. Inspired by [8], a great deal of effort has been made to embed entities and relations into a low-dimensional, continuous vector space, such as [9-12], with different loss functions adopted. Let $\Delta$ denote all the triples in a knowledge base. A triple $(h, r, t)$ is positive if $(h, r, t) \in \Delta$, and negative if $(h, r, t) \notin \Delta$. The basic idea behind these models is that the loss of negative triples should be at least $\gamma$ greater than the loss of positive triples, which is known as margin loss. Readers can refer to Section II for a more detailed introduction.
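The margin idea can be illustrated with a minimal sketch. It assumes a TransE-style distance score, one of several scoring functions used by the cited models; the function names `transe_score` and `margin_loss` and the toy vectors are hypothetical and do not come from the paper.

```python
import math

def transe_score(h, r, t):
    """Distance ||h + r - t||_2; lower means more plausible.

    Assumption: a TransE-style score, chosen only for illustration.
    """
    return math.sqrt(sum((hi + ri - ti) ** 2 for hi, ri, ti in zip(h, r, t)))

def margin_loss(pos_score, neg_score, margin=1.0):
    """Hinge loss that is zero once the negative triple scores
    at least `margin` above the positive triple."""
    return max(0.0, margin + pos_score - neg_score)

# Toy 2-dimensional embeddings (made-up values).
h, r = [0.0, 0.0], [1.0, 0.0]
t_true = [1.0, 0.0]   # satisfies h + r = t exactly
t_far = [4.0, 4.0]    # corrupted tail far from h + r
t_near = [1.5, 0.0]   # corrupted tail still inside the margin

pos = transe_score(h, r, t_true)
loss_far = margin_loss(pos, transe_score(h, r, t_far))    # no penalty
loss_near = margin_loss(pos, transe_score(h, r, t_near))  # still penalized
```

Only negatives that fall inside the margin contribute to the loss, which is why the quality of negative samples matters for training.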

During training, all the models mentioned above suffer from the problem of local optima and are unable to make further progress in performance (see Fig. 1). How to push a model to its limit and learn a better representation is of great significance in downstream applications of knowledge graphs. Recently, [13] proposed a knowledge embedding framework called Trans-GAN, which uses a GAN for negative sampling to mine the potential of models by generating high-level negative samples. However, it has several drawbacks. First, GANs often suffer from non-convergence or collapse during training, which leads to poor results when it happens. Second, a GAN consists of generator and discriminator networks, which requires more parameters.

Figure 1. Illustration of local optimum in training. After training for a while, the model pushes nearly the same number of negative triples out of and into the margin, resulting in no improvement in performance.

In this paper, we propose a simpler and more elegant method whose main idea is dynamic learning rate (DLR). Experiments show that our DLR-based methods outperform GAN-based methods remarkably under most circumstances.
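This excerpt does not specify how the learning rate is adjusted. As a rough illustration only, the sketch below assumes one common form of dynamic control: shrinking the rate when a monitored loss stops improving. The class name `PlateauLearningRate` and its parameters (`factor`, `patience`, `min_lr`) are hypothetical and should not be read as the authors' schedule.

```python
class PlateauLearningRate:
    """Hypothetical plateau-based schedule (an assumption, not the paper's rule)."""

    def __init__(self, lr=0.01, factor=0.5, patience=2, min_lr=1e-5):
        self.lr = lr                # current learning rate
        self.factor = factor        # multiplicative decay applied on a plateau
        self.patience = patience    # non-improving checks tolerated before decay
        self.min_lr = min_lr        # lower bound on the learning rate
        self.best = float("inf")    # best (lowest) loss seen so far
        self.stale = 0              # consecutive checks without improvement

    def step(self, loss):
        """Record one evaluation and return the learning rate to use next."""
        if loss < self.best:
            self.best = loss
            self.stale = 0
        else:
            self.stale += 1
            if self.stale >= self.patience:
                self.lr = max(self.lr * self.factor, self.min_lr)
                self.stale = 0
        return self.lr

# Made-up loss values: the rate is halved each time the loss stalls twice in a row.
schedule = PlateauLearningRate(lr=0.01, factor=0.5, patience=2)
history = [schedule.step(loss) for loss in [1.0, 0.8, 0.8, 0.81, 0.7, 0.7, 0.7]]
```

A smaller step size after a plateau is one way a model stuck as in Fig. 1 might keep improving; other schedules would fit the same description.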

Our contributions in this paper are as follows:

• We incorporate DLR into knowledge representation learning. DLR dynamically adjusts the learning rate of a model, pushing the model to a better optimum.

• We propose a new negative sampling method that corrupts not only entities but also relations, with different probabilities, so that the model can learn better representations for both entities and relations.
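The negative sampling idea in the second contribution can be sketched as follows. The function `corrupt_triple`, the probability `p_relation`, and the toy vocabulary are hypothetical; the excerpt does not give the actual probabilities or the exact procedure.

```python
import random

def corrupt_triple(triple, entities, relations, p_relation=0.2, rng=random):
    """Return a corrupted copy of (h, r, t) that differs in exactly one position.

    With probability `p_relation` the relation is replaced; otherwise the head
    or the tail (chosen with equal probability) is replaced. These probabilities
    are illustrative assumptions. Needs at least two entities and two relations.
    """
    h, r, t = triple
    if rng.random() < p_relation:
        r = rng.choice([x for x in relations if x != r])
    elif rng.random() < 0.5:
        h = rng.choice([x for x in entities if x != h])
    else:
        t = rng.choice([x for x in entities if x != t])
    return (h, r, t)

# Toy vocabulary (made up for illustration).
entities = ["Paris", "France", "Berlin", "Germany"]
relations = ["capital_of", "located_in"]
rng = random.Random(0)  # seeded for reproducibility
positive = ("Paris", "capital_of", "France")
negatives = [corrupt_triple(positive, entities, relations, rng=rng) for _ in range(5)]
```

This sketch does not check whether a corrupted triple happens to be a true fact in the knowledge base; a fuller implementation would typically filter such false negatives.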
