Compositional Language Understanding with
Text-based Relational Reasoning
Koustuv Sinha∗
Mila
McGill University, Canada
koustuv.sinha@mail.mcgill.ca
Shagun Sodhani
Mila
Université de Montréal, Canada
sshagunsodhani@gmail.com
William L. Hamilton
Mila
McGill University, Canada
Facebook AI Research (FAIR), Montreal
will.leif.hamilton@gmail.com
Joelle Pineau
Mila
McGill University, Canada
Facebook AI Research (FAIR), Montreal
jpineau@cs.mcgill.ca
Abstract
Neural networks for natural language reasoning have largely focused on extractive,
fact-based question-answering (QA) and common-sense inference. However, it is
also crucial to understand the extent to which neural networks can perform rela-
tional reasoning and combinatorial generalization from natural language—abilities
that are often obscured by annotation artifacts and the dominance of language
modeling in standard QA benchmarks. In this work, we present a novel bench-
mark dataset for language understanding that isolates performance on relational
reasoning. We also present a neural message-passing baseline and show that this
model, which incorporates a relational inductive bias, is superior at combinatorial
generalization compared to a traditional recurrent neural network approach.
1 Introduction
Neural language understanding systems have been extremely successful at information extraction
tasks, such as question answering (QA). An array of existing datasets are available, which test
a system’s ability to extract factual answers from text [1–5], as well as datasets that emphasize simple,
commonsense inference (e.g., entailment between sentences) [6, 7]. However, it is difficult to evaluate
a model’s reasoning ability in isolation using existing datasets. Most datasets combine several chal-
lenges of language processing into one, such as co-reference / entity resolution, incorporating world
knowledge, and semantic parsing. Moreover, the state-of-the-art on all these existing benchmarks
relies heavily on large, pre-trained language models [8, 9], highlighting that the primary difficulty in
these datasets is incorporating the statistics of natural language, rather than reasoning.
In this work, we seek to directly evaluate and improve the compositional reasoning ability of
a QA system. Inspired by CLEVR [10], a synthetic computer vision dataset that isolates the
challenges of relational reasoning, we propose a text-based dataset for Compositional Language
Understanding with Text-based Relational Reasoning (CLUTRR). Our initial version, CLUTRR v0.1,
requires reasoning and generalizing about kinship relationships, and we plan to use our proposed data
generation pipeline to extend the set of tasks in the future. We develop and evaluate strong baselines
on CLUTRR v0.1, including a recurrent LSTM model and a message-passing graph neural network
∗Work done while an intern at Samsung Advanced Institute of Technology (SAIT), Montreal.
Proceedings of the Relational Representation Learning Workshop, 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montréal, Canada.
arXiv:1811.02959v2 [cs.CL] 8 Nov 2018