Graph-based reasoning model for multiple relation extraction
Heyan Huang a, Ming Lei a,*, Chong Feng a
a 5 South Zhongguancun Street, Haidian District, Beijing, China
Article info
Article history:
Received 7 November 2019
Revised 19 June 2020
Accepted 8 September 2020
Available online 28 September 2020
Communicated by Wu Jia

Keywords: Relation extraction; Information extraction; Neural networks; Natural language processing

Abstract
Linguistic knowledge is useful for various NLP tasks, but the difficulty lies in its representation and application. We consider that linguistic knowledge is implied in a large-scale corpus, while classification knowledge, the knowledge related to the definitions of entity and relation types, is implied in the labeled training data. Therefore, a corpus subgraph is proposed to mine more linguistic knowledge from the easily accessible unlabeled data, and sentence subgraphs are used to acquire classification knowledge. They jointly constitute a relation knowledge graph (RKG) to extract relations from sentences in this paper. On RKG, entity recognition can be regarded as a property-value filling problem and relation classification can be regarded as a link prediction problem. Thus, multiple relation extraction can be treated as a reasoning process for knowledge completion. We combine statistical reasoning and neural network reasoning to segment sentences into entity chunks and non-entity chunks, then propose a novel Chunk Graph LSTM network to learn the representations of entity chunks and infer the relations among them. Experiments on two standard datasets demonstrate that our model outperforms previous models for multiple relation extraction.

© 2020 Elsevier B.V. All rights reserved.
1. Introduction
Relation extraction (RE) is the task of assigning appropriate relation types to entity pairs in sentences. It is helpful for web mining, information retrieval, question answering, machine translation and other natural language processing (NLP) tasks [1,2]. In addition, it is an essential step for constructing knowledge bases automatically [3,4]. RE is therefore an important research topic in information extraction. Generally, a triplet, (entity 1, relation type, entity 2), is used as the structured representation of a relation. A sentence may contain multiple entities and relation triplets, and an entity may belong to multiple different triplets. Thus there are C(n, 2) candidate relations to be classified in a sentence with n entities in the multiple relation extraction task. As shown in Fig. 1, 7 entities and 6 relation triplets are labeled in the example sentence. PER (person), WEA (weapon) and GPE (geographical/political) are entity types. PHYS (physical), ART (agent-artifact), ORG-AFF (organization-affiliation) and GEN-AFF (general affiliation) are relation types.
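To make the combinatorics concrete, the number of candidate relations grows quadratically with the number of entities in a sentence. The following is a minimal illustrative sketch; the entity identifiers are hypothetical placeholders (the actual surface forms from Fig. 1 are not reproduced here), only the entity types PER, WEA and GPE come from the example:

```python
from itertools import combinations
from math import comb

# Hypothetical typed entities standing in for the 7 entities of the
# example sentence in Fig. 1 (identifiers e1..e7 are placeholders).
entities = [
    ("e1", "PER"), ("e2", "PER"), ("e3", "WEA"), ("e4", "WEA"),
    ("e5", "GPE"), ("e6", "GPE"), ("e7", "PER"),
]

# Every unordered entity pair is a candidate relation: C(n, 2) in total.
candidates = list(combinations(entities, 2))
assert len(candidates) == comb(len(entities), 2)

print(len(candidates))  # 21 candidate pairs; only 6 hold a labeled relation
```

With 7 entities there are C(7, 2) = 21 candidate pairs, of which only 6 carry one of the labeled relation types, so a multiple-relation extractor must reject most candidate pairs.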
We can see that, according to the positions of the entity pairs, the triplets form overlapping, nested, intersected and other complex structures. These structures are difficult for sequence-based models [5–8] to handle. Recently, several effective graph-based models have been proposed for multiple relation extraction. The work [9] presented a model that linked newly identified entities to previous ones and used a feature matrix to learn graph structures. The graph model for n-ary relation extraction [10], a special case of overlapping relation extraction, partitioned sentences into two directed acyclic graphs and classified relations with a Graph LSTM. The model [11] defined a series of entity and relation transition actions and treated the extraction task as a dynamic generative process of a graph. The work [12] first adopted a bi-RNN and a GCN (graph convolutional network) to extract both sequential and regional dependency word features, and then applied a relation-weighted GCN to extract implicit features among all word pairs.
These graph-based models have achieved great success in multiple relation extraction. However, they mainly exploit the labeled training data to learn classification knowledge but neglect the easily accessible unlabeled corpus. They leverage semantic information from word embeddings produced by generative pre-training on unlabeled corpora [13–16], or gain linguistic knowledge from auxiliary toolkits such as dependency parsers, POS taggers, NER tools and so forth. We argue that a large-scale corpus contains abundant linguistic knowledge. So a corpus subgraph is proposed to mine task-related linguistic knowledge from the unlabeled corpus. It is combined with sentence subgraphs to constitute a relation knowledge graph (RKG) for multiple relation extraction. On RKG, entity
https://doi.org/10.1016/j.neucom.2020.09.025
0925-2312/© 2020 Elsevier B.V. All rights reserved.

* Corresponding author.
E-mail address: 66529158@qq.com (M. Lei).

Neurocomputing 420 (2021) 162–170