使用双向循环神经网络提升中文语义角色标注效果

197 浏览量更新于2024-08-26 收藏 316KB PDF 举报

"这篇研究论文探讨了如何使用双向递归神经网络（Bi-RNN）与长短期记忆（LSTM）模型来改进中文语义角色标注（SRL）的方法。传统方法在处理中文SRL时严重依赖特征工程，并且难以捕捉句子中的长距离依赖关系。论文提出的新方法旨在通过Bi-RNN-LSTM结构来有效地捕捉句子中的双向和长距离依赖，同时减少对特征工程的依赖。实验证明，在中文命题银行（CPB）数据集上，该方法相较于现有最佳方法有显著的性能提升。" 本文是关于自然语言处理领域的一篇研究论文，重点关注的是中文语义角色标注任务的优化。语义角色标注是自然语言处理中的一个重要任务，它的目标是识别出句子中的动词及其相关的论元，如主语、宾语等，并对这些论元的角色进行分类，以便更好地理解句子的意义。传统的中文SRL方法通常需要大量的特征工程工作，包括手工设计各种句法和语义特征，这既费时又容易受到人为因素的影响。此外，这些方法往往难以处理句子中的长距离依赖，即一个词的意义可能取决于远离它的其他词。这种问题在处理复杂句子结构时尤为突出。为了克服这些问题，该论文引入了双向递归神经网络，尤其是配备了长短期记忆单元的Bi-RNN-LSTM模型。RNN是一种能够处理序列数据的深度学习模型，它能够记住之前的信息，但原始RNN在处理长序列时可能会遇到“梯度消失”或“梯度爆炸”的问题。LSTM则通过门控机制解决了这个问题，能更好地捕获长距离依赖。而双向RNN则同时考虑了词序的前向和后向信息，增强了模型对上下文的理解能力。论文在中文命题银行数据集上进行了实验，结果显示，Bi-RNN-LSTM模型在SRL任务上的表现显著优于现有的最佳方法，这表明这种方法在处理中文语义角色标注时具有更高的准确性和效率，减少了对人工特征工程的依赖，而且能够有效处理复杂的句子结构和长距离依赖关系。这对于提升自然语言处理系统在理解和解析中文文本时的性能有着重要的意义。

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1626–1631,

Lisbon, Portugal, 17-21 September 2015.

2015 Association for Computational Linguistics.

Chinese Semantic Role Labeling with Bidirectional Recurrent Neural

Networks

Zhen Wang, Tingsong Jiang, Baobao Chang, Zhifang Sui

Key Laboratory of Computational Linguistics, Ministry of Education

School of Electronics Engineering and Computer Science, Peking University

Collaborative Innovation Center for Language Ability, Xuzhou 221009 China

wzpkuer@gmail.com, {tingsong, chbb, szf}@pku.edu.cn

Abstract

Traditional approaches to Chinese Seman-

tic Role Labeling (SRL) almost heavily re-

ly on feature engineering. Even worse,

the long-range dependencies in a sentence

can hardly be modeled by these method-

s. In this paper, we introduce bidirection-

al recurrent neural network (RNN) with

long-short-term memory (LSTM) to cap-

ture bidirectional and long-range depen-

dencies in a sentence with minimal fea-

ture engineering. Experimental results on

Chinese Proposition Bank (CPB) show a

signiﬁcant improvement over the state-of-

the-art methods. Moreover, our model

makes it convenient to introduce hetero-

geneous resource, which makes a further

improvement on our experimental perfor-

mance.

1 Introduction

Semantic Role Labeling (SRL) is deﬁned as the

task to recognize arguments for a given predicate

and assign semantic role labels to them. Because

of its ability to encode semantic information, there

has been an increasing interest in SRL on many

languages (Gildea and Jurafsky, 2002; Sun and Ju-

rafsky, 2004). Figure 1 shows an example in Chi-

nese Proposition Bank (CPB) (Xue and Palmer,

2003), which is a Chinese corpus annotated with

semantic role labels.

Traditional approaches to Chinese SRL often

extract a large number of handcrafted features

from the sentence, even its parse tree, and feed

these features to statistical classiﬁers such as CRF,

MaxEnt and SVM (Sun and Jurafsky, 2004; Xue,

2008; Ding and Chang, 2008; Ding and Chang,

2009; Sun, 2010). However, these methods suf-

fer from three major problems. Firstly, their per-

formances are heavily dependent on feature engi-

Figure 1: A sentence with semantic roles labeled

from CPB.

neering, which needs domain knowledge and la-

borious work of feature extraction and selection.

Secondly, although sophisticated features are de-

signed, the long-range dependencies in a sentence

can hardly be modeled. Thirdly, a speciﬁc anno-

tated dataset is often limited in its scalability, but

the existence of heterogenous resource, which has

very different semantic role labels and annotation

schema but related latent semantic meaning, can

alleviate this problem. However, traditional meth-

ods cannot relate distinct annotation schemas and

introduce heterogeneous resource with ease.

Concerning these problems, in this paper, we

propose bidirectional recurrent neural network

(RNN) with long-short-term memory (LSTM) to

solve the problem of Chinese SRL. Our approach

makes the following contributions:

• We formulate Chinese SRL with bidirection-

al LSTM RNN model. With bidirectional

RNN, the dependencies in a sentence from

both directions can be captured, and with L-

STM architecture, long-range dependencies

can be well modeled. The test results on the

bechmark dataset CPB show a signiﬁcant im-

provement over the state-of-the-art methods.

• Compared with previous work that relied on

a huge number of handcrafted features, our

model can achieve much better performance

only with minimal feature engineering.

• The framework of our model makes the intro-

duction of heterogeneous resource efﬁcient

1626

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38625184

粉丝: 4

使用双向循环神经网络提升中文语义角色标注效果

基于改进深度注意神经网络的语义角色标注.pdf

基于递归神经网络的文本分类研究_黄磊1

基于双向门限递归单元神经网络的维吾尔语形态切分

3维卷积递归神经网络的高光谱图像分类方法.pdf

点云语义分割新方法：上下文融合三维递归神经网络

递归神经网络的深度研究与技术创新

基于BERT特征的双向LSTM神经网络在中文电子病历输入推荐中的应用研究

递归神经网络：长短期记忆网络（LSTM）

递归神经网络（RNN）在自然语言处理中的应用

递归神经网络(RNN)：揭秘语言模型背后的革命力量

最新资源