Stacked Learning Improves Implicit Discourse Relation Recognition


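Stacked learning for implicit discourse relation recognition aims to improve the automatic detection of the relation that holds between two adjacent arguments (Arg). Existing systems each have distinct strengths, such as strong classification models, reliable feature selection, or rich training data, which suggests that it is feasible to make several systems collaborate within a unified framework. The core contribution of the paper is a collaborative strategy based on stacked learning.

Stacked learning is a multi-level machine learning method in which the predictions of lower-level models serve as the input to an upper-level model, with the goal of improving overall performance. For implicit discourse relation recognition, a two-level scheme is used: the individual recognition systems first perform the task independently, and an upper-level model then uses their confidence scores to decide the discourse relation, so that relations missed by any single system are more likely to be captured.

The authors, Yang Xu, Huibin Ruan, and Yu Hong, evaluate on the Penn Discourse Treebank (PDTB) v2.0, a standard benchmark for discourse relation recognition systems. PDTB v2.0 contains both explicit and implicit discourse relations, which places high demands on the accuracy and coverage of an algorithm. The reported experiments show that stacked learning improves system performance, with the gain said to be most visible on implicit relations that depend on multiple features and contextual information.

The work highlights how the strengths of individual systems complement each other and shows how stacked learning can coordinate several models to produce more accurate and more complete results on implicit discourse relation recognition. The approach is broadly applicable and is relevant to semantic understanding and text mining in natural language processing.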
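The two-level scheme described in this summary can be illustrated with a minimal sketch. It is not the authors' implementation: the excerpt does not say which base systems or which upper-level learner the paper uses, so a small logistic regression is swapped in as the level-1 model, the three base systems are hypothetical, and every confidence value and label below is made up for illustration. The level-1 training rows are assumed to be confidences produced on held-out argument pairs, which is the usual precaution in stacking.

```python
import math

# Hypothetical held-out confidences from three base systems (level 0).
# Each row is one argument pair; each column is one system's confidence
# that the pair holds a Comparison relation. All numbers are made up.
HELD_OUT_CONFIDENCES = [
    [0.9, 0.8, 0.4],
    [0.8, 0.6, 0.7],
    [0.7, 0.9, 0.2],
    [0.6, 0.7, 0.6],
    [0.3, 0.2, 0.5],
    [0.2, 0.4, 0.8],
    [0.4, 0.1, 0.3],
    [0.1, 0.3, 0.6],
]
# Made-up gold labels: 1 = Comparison, 0 = any other relation.
GOLD_LABELS = [1, 1, 1, 1, 0, 0, 0, 0]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def meta_confidence(weights, bias, confidences):
    """Level-1 score: logistic combination of the base-system confidences."""
    return sigmoid(bias + sum(w * c for w, c in zip(weights, confidences)))


def train_meta_classifier(rows, labels, epochs=2000, learning_rate=1.0):
    """Fit the level-1 logistic regression by batch gradient descent."""
    weights = [0.0] * len(rows[0])
    bias = 0.0
    n = len(rows)
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for row, label in zip(rows, labels):
            error = meta_confidence(weights, bias, row) - label
            for j, value in enumerate(row):
                grad_w[j] += error * value
            grad_b += error
        weights = [w - learning_rate * g / n for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / n
    return weights, bias


# Level 1 learns how much to trust each base system.
WEIGHTS, BIAS = train_meta_classifier(HELD_OUT_CONFIDENCES, GOLD_LABELS)


def stacked_predict(confidences):
    """Return 1 (Comparison) or 0 (other) for one new argument pair."""
    return 1 if meta_confidence(WEIGHTS, BIAS, confidences) >= 0.5 else 0


print(stacked_predict([0.85, 0.70, 0.55]))
print(stacked_predict([0.25, 0.30, 0.65]))
```

In this toy data the third system is deliberately uninformative, so the learned weights lean on the first two systems. That is the point of the second level: it learns from data how much each system's confidence should count instead of averaging them equally.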

Stacked Learning for Implicit Discourse Relation Recognition

Yang Xu, Huibin Ruan, and Yu Hong


Natural Language Processing Lab, School of Computer Science and Technology, Soochow University, Suzhou 215006, China

andreaxu41@gmail.com, huibinnguyen@gmail.com, tianxianer@gmail.com

Abstract. The existing discourse relation recognition systems have distinctive advantages, such as superior classification models, reliable feature selection, or holding rich training data. This shows the feasibility of making the systems collaborate with each other within a uniform framework. In this paper, we propose a stacked learning based collaborative approach. By the two-level learning, it facilitates the application of the confidence of different systems for the discourse relation determination. Experiments on PDTB show that our method yields promising improvement.

1 Introduction

Discourse relation recognition aims to automatically classify the discourse relations between two adjacent arguments (abbr., Arg). In the Penn Discourse Treebank 2.0 corpus (PDTB v2.0) [16], discourse relations fall into explicit and implicit cases. See the examples below:

(1) Shorter maturities are considered a sign of rising rates (Arg1) [Because] portfolio managers can capture higher rates sooner (Arg2)

(2) The woman has “psychic burns” on her back from the confrontation (Arg1) [?] She declines to show them (Arg2)

Example (1) shows an explicit Causality relation signaled directly by the discourse connective “Because”. Example (2) shows an implicit Comparison relation: there is no explicit connective between the arguments, though a possible connective such as “but” can be inferred. In this paper, we focus on implicit discourse relation recognition.

Great effort has been put into the exploration of effective linguistic and structural features for relation recognition, such as polarity, verbs, Inquirer tags, Brown cluster pairs, word pairs, etc. [10,13–15,20,24]. Meanwhile, the validity of different classification models has been evaluated [6]. Hong et al. [4] and Wang et al. [20] mine high-quality comparable samples to enrich the reference data for estimating the relations. Li and Nenkova [8] propose a novel feature representation method to fulfill multidimensional aggregation of sparse data. Ji and Eisenstein [5] propose a solution to recognize implicit discourse relation

© Springer International Publishing AG 2017

J. Wen et al. (Eds.): CCIR 2017, LNCS 10390, pp. 161–169, 2017.

https://doi.org/10.1007/978-3-319-68699-8_13
