Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 1201–1207
August 1–6, 2021. ©2021 Association for Computational Linguistics
Fusing Context Into Knowledge Graph for Commonsense Question Answering
Yichong Xu∗, Chenguang Zhu∗, Ruochen Xu, Yang Liu, Michael Zeng, Xuedong Huang
Microsoft Cognitive Services Research Group
{yicxu,chezhu,ruox,yaliu10,nzeng,xdh}@microsoft.com
Abstract
Commonsense question answering (QA) requires a model to grasp commonsense and factual knowledge to answer questions about world events. Many prior methods couple language modeling with knowledge graphs (KG). However, although a KG contains rich structural information, it lacks the context needed for a more precise understanding of the concepts. This creates a gap when fusing knowledge graphs into language modeling, especially when there is insufficient labeled data. Thus, we propose to employ external entity descriptions to provide contextual information for knowledge understanding. We retrieve descriptions of related concepts from Wiktionary and feed them as additional input to pre-trained language models. The resulting model achieves state-of-the-art results on the CommonsenseQA dataset and the best results among non-generative models on OpenBookQA.
1 Introduction
One critical aspect of human intelligence is the ability to reason over everyday matters based on observation and knowledge. This capability is usually shared by most people as a foundation for communication and interaction with the world. Therefore, commonsense reasoning has emerged as an important task in natural language understanding, with various datasets and models proposed in this area (Ma et al., 2019; Talmor et al., 2018; Wang et al., 2020; Lv et al., 2020).
While massive pre-trained models (Devlin et al., 2018; Liu et al., 2019) are effective in language understanding, they lack modules to explicitly handle knowledge and commonsense. Also, structured data such as knowledge graphs are much more efficient at representing commonsense than unstructured text. Therefore, there have been multiple
∗Equal contribution
methods coupling language models with various forms of knowledge graphs (KG) for commonsense reasoning, including knowledge bases (Sap et al., 2019; Yu et al., 2020b), relational paths (Lin et al., 2019), graph relation networks (Feng et al., 2020) and heterogeneous graphs (Lv et al., 2020). These methods combine the merits of language modeling and structural knowledge, improving performance on commonsense reasoning and question answering.
However, there is still a non-negligible gap between the performance of these models and humans. One reason is that, although a KG can encode topological information between concepts, it lacks rich contextual information. For instance, for a graph node representing the entity “Mona Lisa”, the graph depicts its relations to multiple other entities. But even given this neighborhood information, it is still hard to infer that it is a painting. On the other hand, we can retrieve the precise definition of “Mona Lisa” from external sources; e.g., its definition in Wiktionary is “A painting by Leonardo da Vinci, widely considered as the most famous painting in history”. To represent structured data so that it can be seamlessly integrated into language models, we need to provide a panoramic view of each concept in the knowledge graph, including its neighboring concepts, its relations to them, and a definitive description of it.
Thus, we propose the DEKCOR model, i.e., DEscriptive Knowledge for COmmonsense question answeRing, to tackle multiple-choice commonsense questions. Given a question and a choice, we first extract the contained concepts. Then, we extract the edge between the question concept and the choice concept in ConceptNet (Speer et al., 2017). If such an edge does not exist, we compute a relevance score for each knowledge triple (subject, relation, object) containing the choice concept, and select the one with the highest score. Next, we