Deep Multi-Task Learning for Aspect Term Extraction
with Memory Interaction
Xin Li and Wai Lam∗
Key Laboratory on High Confidence Software Technologies (Sub-Lab, CUHK),
Ministry of Education, and
Department of Systems Engineering and Engineering Management
The Chinese University of Hong Kong, Hong Kong
{lixin, wlam}@se.cuhk.edu.hk
Abstract
We propose a novel LSTM-based deep multi-task learning framework for aspect term extraction from user review sentences. Two LSTMs equipped with extended memories and neural memory operations are designed to jointly handle the extraction of aspects and opinions via memory interactions. A sentimental sentence constraint is also added, via another LSTM, for more accurate prediction. Experimental results on two benchmark datasets demonstrate the effectiveness of our framework.
1 Introduction
The aspect-based sentiment analysis (ABSA) task is to identify opinions expressed towards specific entities, such as laptop, or attributes of entities, such as price (Liu, 2012a). This task involves three subtasks: Aspect Term Extraction (ATE), Aspect Polarity Detection, and Aspect Category Detection. As a fundamental subtask of ABSA, the goal of the ATE task is to identify opinionated aspect expressions. One of the most important characteristics of this task is that opinion words can provide indicative clues for aspect detection, since opinion words tend to co-occur with aspect words. Most publicly available datasets contain gold-standard annotations for opinionated aspects, but the ground truth of the corresponding opinion words is not commonly provided. Some works tackling the ATE task ignore opinion words altogether and focus only on aspect term modeling and learning (Jin
∗ The work described in this paper is substantially supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project Code: 14203414). We thank Lidong Bing and Piji Li for their helpful comments on this draft and the anonymous reviewers for their valuable feedback.
et al., 2009; Jakob and Gurevych, 2010; Toh and Wang, 2014; Chernyshevich, 2014; Manek et al., 2017; San Vicente et al., 2015; Liu et al., 2015; Poria et al., 2016; Toh and Su, 2016; Yin et al., 2016). They thus fail to leverage opinion information, which can provide useful clues.
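To make the task concrete, ATE is commonly framed as sequence labeling over review tokens with BIO tags. The following sketch is purely illustrative (it is not part of our model, and the sentence and tags are invented for exposition): it shows how aspect term spans are recovered from a BIO-labeled sentence.

```python
# Illustrative only: ATE as BIO sequence labeling.
# B = beginning of an aspect term, I = inside, O = outside.
tokens = ["The", "battery", "life", "is", "great", "but", "the",
          "price", "is", "too", "high", "."]
labels = ["O", "B", "I", "O", "O", "O", "O",
          "B", "O", "O", "O", "O"]

def extract_aspects(tokens, labels):
    """Recover aspect term spans from per-token BIO labels."""
    aspects, current = [], []
    for tok, lab in zip(tokens, labels):
        if lab == "B":
            if current:                      # close the previous span
                aspects.append(" ".join(current))
            current = [tok]                  # open a new span
        elif lab == "I" and current:
            current.append(tok)              # extend the open span
        else:
            if current:                      # an O tag closes any open span
                aspects.append(" ".join(current))
            current = []
    if current:                              # flush a span ending the sentence
        aspects.append(" ".join(current))
    return aspects

print(extract_aspects(tokens, labels))  # ['battery life', 'price']
```

Note that the opinion words ("great", "too high") are not annotated here, which is exactly the gap discussed above.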
Some works tackling the ATE task do consider opinion information (Hu and Liu, 2004a,b; Popescu and Etzioni, 2005; Zhuang et al., 2006; Qiu et al., 2011; Liu et al., 2012b, 2013a,b, 2014), in an unsupervised or partially supervised manner. Qiu et al. (2011) proposed Double Propagation (DP) to collectively extract aspect terms and opinion words based on information propagation over a dependency graph. One drawback is that it heavily relies on the dependency parser, which is prone to mistakes when applied to informal online reviews. Liu et al. (2014) modeled the relation between aspects and opinions by constructing a bipartite heterogeneous graph. It cannot perform well without a high-quality phrase chunker and POS tagger, which reduces its flexibility. As unsupervised or partially supervised frameworks cannot take full advantage of the aspect annotations commonly found in the training data, the above methods underutilize the available data. Recently, Wang et al. (2016) considered the relation between opinion words and aspect words in a supervised model named RNCRF. However, RNCRF tends to suffer from parsing errors since the structure of its recursive network hinges on the dependency parse tree. CMLA (Wang et al., 2017a) used a multi-layer neural model where each layer consists of an aspect attention and an opinion attention. However, CMLA merely employs standard GRUs without extended memories.
We propose MIN (Memory Interaction Network), a novel LSTM-based deep multi-task learning framework for the ATE task. Two LSTMs with extended memories are designed for handling