利用侧信息提升远程监督的神经关系抽取性能

需积分: 10 143 浏览量更新于2024-09-08 收藏 1.42MB PDF 举报

"这篇论文提出了一种名为RESIDE的新方法，用于改进基于远程监督的神经关系提取。通过利用知识库中的额外侧信息，如实体类型和关系别名，该方法能够在预测关系时施加软约束，从而提高关系提取的准确性。" 在自然语言处理（NLP）领域，关系抽取（Relation Extraction，简称RE）是一项关键任务，其目标是从非结构化的文本中识别并提取实体之间的关系。远程监督（Distant Supervision）是一种常用的技术，它能自动将知识库（KB）中的关系实例与文本对齐，以此训练关系提取器，无需人工标注大量数据。然而，这种方法通常忽略了知识库中可能存在的其他相关信息。论文“RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information”关注了这一问题，并提出了一种新的解决方案。RESIDE是一种利用知识库中额外侧信息（如实体类型和关系别名）的远程监督神经关系提取方法。这些侧信息可以显著提升模型对关系的识别能力，尤其是在存在同义关系或者模糊关系表述的情况下。首先，论文提到实体类型信息对于关系抽取的重要性。实体类型可以帮助模型理解实体间的潜在关系模式，例如，公司名称和人名常常与“创立者”关系相关联。通过利用这种类型信息，模型可以在预测过程中引入类型一致性，使得预测更加准确。其次，关系别名是另一个被忽视但至关重要的资源。知识库中同一关系可能有多种表达方式（例如，“founded”和“co-founded”都可表示“founderOfCompany”）。在RESIDE中，模型会考虑这些关系别名，通过软约束来指导关系预测，使得模型能够适应不同的语境和表达形式。为了有效地融合这些侧信息，论文中提到了RESIDE采用图卷积网络（Graph Convolutional Networks, GCNs），这是一种能够处理复杂网络结构的深度学习技术。GCNs允许模型在实体和关系的图结构上进行信息传播和聚合，从而捕捉到实体和它们关系的上下文信息。此外，论文可能会详细探讨实验部分，展示RESIDE在各种基准数据集上的性能提升，与其他现有方法进行对比，证明其有效性和优势。同时，可能还会讨论模型的泛化能力、参数优化策略以及未来的研究方向。这篇论文的贡献在于提供了一种新的方法，利用知识库的丰富信息来增强远程监督的关系抽取模型，提高了关系识别的准确性和鲁棒性，对于NLP领域的关系抽取研究具有重要价值。

RESIDE: Improving Distantly-Supervised Neural Relation Extraction

using Side Information

Shikhar Vashishth

Rishabh Joshi

2 ∗

Sai Suman Prayaga

Chiranjib Bhattacharyya

Partha Talukdar

Indian Institute of Science

Birla Institute of Technology and Science, Pilani

{shikhar,chiru,ppt}@iisc.ac.in

f2014102@pilani.bits-pilani.ac.in, suman.sai14@gmail.com

Abstract

Distantly-supervised Relation Extraction (RE)

methods train an extractor by automatically

aligning relation instances in a Knowledge

Base (KB) with unstructured text. In addi-

tion to relation instances, KBs often contain

other relevant side information, such as aliases

of relations (e.g., founded and co-founded are

aliases for the relation founderOfCompany).

RE models usually ignore such readily avail-

able side information. In this paper, we pro-

pose RESIDE, a distantly-supervised neural

relation extraction method which utilizes ad-

ditional side information from KBs for im-

proved relation extraction. It uses entity type

and relation alias information for imposing

soft constraints while predicting relations. RE-

SIDE employs Graph Convolution Networks

(GCN) to encode syntactic information from

text and improves performance even when

limited side information is available. Through

extensive experiments on benchmark datasets,

we demonstrate RESIDE’s effectiveness. We

have made RESIDE’s source code available to

encourage reproducible research.

1 Introduction

The construction of large-scale Knowledge Bases

(KBs) like Freebase (Bollacker et al., 2008) and

Wikidata (Vrande

c and Kr

otzsch, 2014) has

proven to be useful in many natural language pro-

cessing (NLP) tasks like question-answering, web

search, etc. However, these KBs are not exhaus-

tive. Relation Extraction (RE) attempts to ﬁll this

gap by extracting semantic relationships between

entity pairs from plain text. This task can be mod-

eled as a simple classiﬁcation problem after the

entity pairs are speciﬁed. Formally, given an en-

tity pair (e

) from the KB and an entity anno-

tated sentence (or instance), we aim to predict the

∗

This research was conducted during the author’s intern-

ship at Indian Institute of Science.

relation r, from a predeﬁned relation set, that ex-

ists between e

and e

. If no relation exists, we

simply label it NA.

Most supervised relation extraction methods re-

quire large labeled training data which is expen-

sive to construct. Distant Supervision (DS) (Mintz

et al., 2009) helps with the construction of this

dataset automatically, under the assumption that

if two entities have a relationship in a KB, then

all sentences mentioning those entities express the

same relation. While this approach works well in

generating large amounts of training instances, the

DS assumption does not hold in all cases. Riedel

et al. (2010); Hoffmann et al. (2011); Surdeanu

et al. (2012) propose multi-instance based learn-

ing to relax this assumption. However, they use

NLP tools to extract features, which can be noisy.

Recently, neural models have demonstrated

promising performance on RE. Zeng et al. (2014,

2015) employ Convolutional Neural Networks

(CNN) to learn representations of instances. For

alleviating noise in distant supervised datasets, at-

tention has been utilized by (Lin et al., 2016; Jat

et al., 2018). Syntactic information from depen-

dency parses has been used by (Mintz et al., 2009;

He et al., 2018) for capturing long-range depen-

dencies between tokens. Recently proposed Graph

Convolution Networks (GCN) (Defferrard et al.,

2016) have been effectively employed for en-

coding this information (Marcheggiani and Titov,

2017; Bastings et al., 2017). However, all the

above models rely only on the noisy instances

from distant supervision for RE.

Relevant side information can be effective for

improving RE. For instance, in the sentence, Mi-

crosoft was started by Bill Gates., the type infor-

mation of Bill Gates (person) and Microsoft (or-

ganization) can be helpful in predicting the cor-

rect relation founderOfCompany. This is because

every relation constrains the type of its target en-

arXiv:1812.04361v1 [cs.CL] 11 Dec 2018

下载后可阅读完整内容，剩余9页未读，立即下载

chengsl_2010

粉丝: 0
资源: 13

利用侧信息提升远程监督的神经关系抽取性能

"多模式CAD信息智能提取算法培训及实践

SIGIR2018 & WWW2018：知识图谱研究关键亮点

"基于MATLAB的扩频通信系统仿真

Improving-Deep-Neural-Networks-Hyperparameter-tuning-Regularization-and-Optimization:我从不断完善的深度神经网络进行编程作业的解决方案

Improving-FRT-Capability-of-DFIG-Based-WT-Using-S_dfig fault rid

Improving-Healthcare-Using-NLP---COMP-5360-Final

Clone Detection in Secure Messaging- Improving Post-Compromise

Secure layers for improving security-开源

Clone Detection in Secure Messaging- Improving Post-Compromise S

Improving Resampling-based Ensemble in Churn Prediction

最新资源