使用对抗学习改进远程监督关系抽取

88 浏览量更新于2024-07-15 收藏 635KB PDF 举报

"Adversarial learning for distant supervised relation extraction" 这篇研究论文主要探讨了对抗性学习在远程监督关系抽取（Distant Supervised Relation Extraction, DSRE）中的应用。DSRE是自然语言处理领域的一个重要任务，目标是从大规模无标注文本中自动抽取出实体之间的关系。由于标注数据的获取成本高昂，远程监督方法通过映射已知的关系到知识库中的实体对，以此来生成大量的训练数据，但这种方法往往引入了大量的噪声。传统的方法通常采用基于神经网络的模型，结合softmax分类器和交叉熵损失函数进行学习。然而，这种策略会将人工类别NA（表示无关系或未定义的关系）的噪声引入到分类过程中，从而影响模型的性能。为了解决这个问题，论文提出采用排名损失函数（Ranking Loss）替代传统的交叉熵损失。排名损失通过比较正确关系与错误关系的得分，来提高模型区分正负样本的能力。但是，随机选择或者根据错误关系的最高得分来生成负样本的方式可能会导致生成的负样本过于简单，对模型训练的贡献有限。受到生成对抗网络（Generative Adversarial Networks, GANs）的启发，论文作者设计了一种新的方法，用神经网络作为负类生成器来协助训练过程。这个生成器的目标是生成足够逼真的负样本，以欺骗主模型，使其难以区分这些负样本和真实的正样本。通过这种方式，模型被迫学习更精细的特征，以区分高仿真度的负样本，从而提高对真实关系的识别能力。在实验部分，论文对比了提出的对抗性学习方法与其他基线方法的性能，可能包括基于softmax的模型、基于排名损失的模型等。结果表明，通过引入对抗性学习，模型在处理远程监督数据集上的性能得到了显著提升，尤其是在处理噪声较大的数据时，其抗干扰能力和关系识别准确性都有所增强。这篇论文为解决DSRE中的噪声问题提供了一个创新的解决方案，通过引入对抗性学习机制，增强了模型在处理大规模无标注文本中提取关系的能力。这种方法不仅对DSRE领域有重要贡献，也为其他需要处理大量噪声数据的机器学习任务提供了新的思路。

Zeng et al. [Zeng, Liu, Chen et al. (2015)] incorporate multi-instance learning with neural

network model, which can build relation extractor based on distant supervision data.

Although the method achieves significant improvement in relation extraction, it only

selects the most likely sentence for each entity pair in their multi-instance learning

paradigm. To address this issue, Lin et al. [Lin, Shen, Liu et al. (2016)] propose sentence

level attention over multiple instances in order to utilize all informative sentences. Jiang

et al. [Jiang, Wang, Li et al. (2016)] employ cross-sentence max-pooling to select

features across different instances and then aggregates the most significant features for

each entity pair.

The aforementioned works, especially neural networks, have greatly promoted the

development of relation extraction. However, these works do not pay attention to the

noise of artificial class NA, which are unfortunately very common in DSRE. Zeng et al.

[Zeng, Zeng and Dai (2017)] proposed ranking loss and cost-sensitive to address the

noise of NA. They select the highest score among all incorrect relations as the negative

label. This approach is not ideal, because the quality of the selected label is often poor. In

this paper, we propose a novel pair-wise ranking loss whose negative samples are

provided by a generator of the GAN.

2.2 Generative adversarial networks

GANs [Goodfellow, Pouget-Abadie, Mirza et al. (2014)] was originally proposed for

generating samples in a continuous space such as images. A GAN consists of two parts,

the generator, and the discriminator. The generator accepts a noise input and outputs an

image. The discriminator is a classifier which classifies images as “true” (from the

ground truth set) or “fake” (generated by the generator). When training a GAN, the

generator and the discriminator play a minimax game, in which the generator tries to

generate “real” images to deceive the discriminator, and the discriminator tries to tell

them apart from ground truth images. GANs are also capable of generating samples

satisfying certain requirements, such as conditional GAN [Mirza and Osindero (2014)]. It

is not possible to use GANs in its original form for generating discrete samples like

natural language sentences or knowledge graph triples, because the discrete sampling step

prevents gradients from propagating back to the generator. SeqGAN [Yu, Zhang, Wang

et al. (2017)] is one of the first successful solutions to this problem by using

reinforcement learning, which trains the generator using policy gradient. Likewise, our

framework relies on policy gradient to train the generator which provides discrete

negative labels.

3 Task definition

DSRE is usually considered as a multi-instance learning problem. In multi-instance

learning paradigm, all sentences labeled by a relation triplet constitute a bag and each

sentence is called an instance.

Suppose that there are

bags

{ , , , }

B B B

in the training set and that the

-th bag

contains

instances

{ , , , }( 1, , )

i i i

B b b b i N

.The objective of multi-instance

剩余15页未读，继续阅读

weixin_38731553

粉丝: 4
资源: 899

使用对抗学习改进远程监督关系抽取

Adversarial Learning for Weakly-Supervised Social Network Alignment

Doubly Semi-supervised Multimodal Adversarial Learning for Classification, Generation and Retrieval

Pytorch实战4：(win10 +ubuntu)对抗语义分割源码调试《Adversarial Learning for Semi-supervised Semantic Segmentation》-附件资源

什么是Domain-Adversarial Learning

adversarial learning

Graph Convolutional Adversarial Networks for Spatiotemporal Anomaly Detection

Towards Deep Learning Models Resistant to Adversarial Attacks

Feature Representation Learning for Unsupervised Cross-domain Image Retrieval

# Adversarial Supervise Architecture E_Hat = self.generator_aux(Z) H_hat = self.supervisor(E_Hat) Y_fake = self.discriminator(H_hat) self.adversarial_supervised = Model(inputs=Z, outputs=Y_fake, name='AdversarialSupervised')

介绍Fast-ganfit: Generative adversarial network for high fidelity 3d face reconstruction的内容

最新资源