A Large-Scale Multi-Aspect Sentiment Analysis Challenge Dataset and Models


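Summary: This research paper presents a challenge dataset and effective models for aspect-based sentiment analysis (ABSA). It was published in the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing; its authors are Qingnan Jiang, Lei Chen, Ruifeng Xu, Xiang Ao, and Min Yang. The paper observes that in existing ABSA datasets most sentences contain only one aspect, or multiple aspects with the same sentiment polarity, which reduces the task to sentence-level sentiment analysis. The authors therefore present a new large-scale Multi-Aspect Multi-Sentiment (MAMS) dataset intended to push research in this field forward.

ABSA is an important research direction in natural language processing and has attracted growing attention because of its broad applications, such as product review and social media analysis. Traditional sentiment analysis targets the sentiment of a whole text, whereas ABSA goes further and identifies the sentiment polarity towards a specific aspect, for example a particular feature of a product.

Existing ABSA datasets are limited in that most of their sentences contain a single aspect, or several aspects that share one polarity. In that setting the task amounts to judging the sentiment of the whole sentence rather than analysing each aspect independently. To overcome this limitation, the paper introduces the MAMS dataset, in which each sentence contains at least two different aspects with different sentiment polarities, so that it better reflects the complexity of real-world sentiment analysis.

The paper also proposes two simple yet effective models, CapsNet and CapsNet-BERT, which combine the strengths of recent NLP advances. Experiments on the new dataset show that the proposed model significantly outperforms state-of-the-art baseline methods, which gives future work a useful reference point and benchmark.

In short, the paper contributes both a new, more demanding ABSA dataset and effective models for it. These results are relevant to improving the accuracy and practical usefulness of natural language processing systems, for instance in business decision-making and user feedback analysis.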

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pages 6280–6285, Hong Kong, China, November 3–7, 2019. ©2019 Association for Computational Linguistics


A Challenge Dataset and Effective Models for Aspect-Based Sentiment Analysis

Qingnan Jiang 1, Lei Chen 1, Ruifeng Xu 2,3, Xiang Ao 4, Min Yang 1∗

1 Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
2 Department of Computer Science, Harbin Institute of Technology (Shenzhen)
3 Peng Cheng Laboratory
4 Institute of Computing Technology, Chinese Academy of Sciences

jqnthomask@gmail.com, lei.chen@siat.ac.cn, xuruifeng@hit.edu.cn, aoxiang@ict.ac.cn, min.yang@siat.ac.cn

Abstract

Aspect-based sentiment analysis (ABSA) has attracted increasing attention recently due to its broad applications. In existing ABSA datasets, most sentences contain only one aspect or multiple aspects with the same sentiment polarity, which makes the ABSA task degenerate to sentence-level sentiment analysis. In this paper, we present a new large-scale Multi-Aspect Multi-Sentiment (MAMS) dataset, in which each sentence contains at least two different aspects with different sentiment polarities. The release of this dataset would push forward the research in this field. In addition, we propose simple yet effective CapsNet and CapsNet-BERT models which combine the strengths of recent NLP advances. Experiments on our new dataset show that the proposed model significantly outperforms the state-of-the-art baseline methods.

1 Introduction

Aspect-based sentiment analysis (ABSA) aims at identifying the sentiment polarity towards a specific aspect in a sentence. A target aspect refers to a word or a phrase describing an aspect of an entity. For example, in the sentence “The decor is not special at all but their amazing food makes up for it”, there are two aspect terms, “decor” and “food”, which are associated with negative and positive sentiment, respectively.
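As an illustration, the annotated example above can be held in a small data structure. This is only a minimal sketch: the `AspectTerm` and `AnnotatedSentence` names and the polarity strings are assumptions made for illustration, and they do not reflect the file format of the released dataset.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class AspectTerm:
    """One aspect term and the sentiment polarity expressed towards it."""
    term: str
    polarity: str  # assumed label set: "positive", "negative", "neutral"


@dataclass
class AnnotatedSentence:
    """A sentence together with its aspect-level annotations."""
    text: str
    aspects: List[AspectTerm]

    def polarity_of(self, term: str) -> str:
        """Return the polarity annotated for the given aspect term."""
        for aspect in self.aspects:
            if aspect.term == term:
                return aspect.polarity
        raise KeyError(f"no aspect term {term!r} in this sentence")


# The example sentence from the text, with its two aspect terms.
example = AnnotatedSentence(
    text=("The decor is not special at all but their amazing food "
          "makes up for it"),
    aspects=[
        AspectTerm("decor", "negative"),
        AspectTerm("food", "positive"),
    ],
)

for aspect in example.aspects:
    print(f"{aspect.term}: {aspect.polarity}")
```

A single sentence-level label cannot express this annotation, because the two aspect terms carry opposite polarities.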

∗ Min Yang is the corresponding author.

Data and code can be found at: https://github.com/siat-nlp/MAMS-for-ABSA

Recently, neural network methods have dominated the study of ABSA, since these methods can be trained end-to-end and automatically learn important features. Wang et al. (2016) proposed to learn an embedding vector for each aspect, and these aspect embeddings were used to calculate the attention weights to capture important information with regard to the given aspects. Tang et al. (2016b) developed the deep memory network to compute the importance degree and text representation of each context word with multiple attention layers. Ma et al. (2017) introduced the interactive attention networks (IAN) to interactively learn attentions in contexts and targets, and generated the representations for target and context words separately. Xue and Li (2018) proposed to extract sentiment features with convolutional neural networks and selectively output aspect-related features for classification with gating mechanisms. Subsequently, Transformer (Vaswani et al., 2017) and BERT-based methods (Devlin et al., 2018) have shown high potential on the ABSA task. There are also several studies attempting to simulate the process of human reading cognition to further improve the performance of ABSA (Lei et al., 2019; Yang et al., 2019).
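The attention-based approaches surveyed here share one idea: context words are scored against an aspect representation, and the scores are normalised into weights. The sketch below illustrates that idea with plain dot-product scoring over toy two-dimensional vectors. It is not the exact formulation of any cited model, and the function names and vector values are made up for illustration.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aspect_attention(word_vectors: Sequence[Sequence[float]],
                     aspect_vector: Sequence[float]) -> List[float]:
    """Score each context word against the aspect (dot product),
    then normalise the scores into attention weights."""
    scores = [sum(w * a for w, a in zip(word, aspect_vector))
              for word in word_vectors]
    return softmax(scores)


def attend(word_vectors: Sequence[Sequence[float]],
           weights: Sequence[float]) -> List[float]:
    """Attention-weighted sum of the word vectors."""
    dim = len(word_vectors[0])
    return [sum(wt * word[d] for wt, word in zip(weights, word_vectors))
            for d in range(dim)]


# Toy vectors (assumed values, not learned embeddings).
words = ["decor", "special", "amazing", "food"]
word_vectors = [[1.0, 0.0], [0.8, 0.1], [0.1, 0.9], [0.0, 1.0]]
aspect_vectors = {"decor": [1.0, 0.0], "food": [0.0, 1.0]}

for aspect, vector in aspect_vectors.items():
    weights = aspect_attention(word_vectors, vector)
    top_word = words[weights.index(max(weights))]
    print(aspect, [round(w, 3) for w in weights], "->", top_word)
```

With these toy vectors the same sentence yields a different weight distribution for each aspect, which is what allows an aspect-aware model to predict a different polarity per aspect.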

So far, several ABSA datasets have been constructed, including the SemEval-2014 Restaurant Review dataset and Laptop Review dataset (Pontiki et al., 2014) and the Twitter dataset (Dong et al., 2014). Although these three datasets have since become the benchmark datasets for the ABSA task, most sentences in these datasets consist of only one aspect or multiple aspects with the same sentiment polarity (see Table 1), which makes aspect-based sentiment analysis degenerate to sentence-level sentiment analysis. Based on our empirical observation, sentence-level sentiment classifiers that do not consider aspects can still achieve results competitive with many recent ABSA methods (see TextCNN and LSTM in Table 3). On the other hand, even advanced ABSA methods trained on these datasets can hardly distinguish the sentiment polarities towards different aspects in sentences that contain multiple aspects and multiple sentiments.
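The degeneration argument can be made concrete with a small calculation. A sentence-level classifier assigns one label to a whole sentence, so on a sentence whose aspects disagree in polarity it must get at least one aspect wrong. The sketch below computes the best aspect-level accuracy such a classifier could reach. The toy corpora and the function names are hypothetical and are not drawn from any of the datasets discussed.

```python
from collections import Counter
from typing import Dict, List

# One sentence = a mapping from aspect to polarity label (toy data).
Annotations = Dict[str, str]


def is_multi_sentiment(sentence: Annotations) -> bool:
    """True if the sentence has at least two aspects with different polarities."""
    return len(sentence) >= 2 and len(set(sentence.values())) >= 2


def sentence_level_upper_bound(corpus: List[Annotations]) -> float:
    """Best aspect-level accuracy reachable when every aspect in a
    sentence receives the same (sentence-level) label."""
    correct = 0
    total = 0
    for sentence in corpus:
        if not sentence:
            continue  # skip sentences without annotated aspects
        counts = Counter(sentence.values())
        correct += max(counts.values())  # pick the majority polarity
        total += len(sentence)
    if total == 0:
        raise ValueError("corpus contains no annotated aspects")
    return correct / total


# Mostly single-polarity sentences: one label per sentence goes a long way.
mostly_uniform = [
    {"service": "positive"},
    {"food": "positive", "price": "positive"},
    {"decor": "negative", "food": "positive"},
]

# Every sentence mixes polarities: one label per sentence is capped low.
all_mixed = [
    {"decor": "negative", "food": "positive"},
    {"staff": "negative", "menu": "neutral", "wine": "positive"},
]

print(sentence_level_upper_bound(mostly_uniform))
print(sentence_level_upper_bound(all_mixed))
print([is_multi_sentiment(s) for s in mostly_uniform])
```

On the first toy corpus a sentence-level labeller can be right on four of five aspects, while on the second it can be right on at most two of five, which is why a corpus built only from mixed-polarity sentences forces a model to use the aspect.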

ATSA and ACSA represent aspect-term and aspect-category sentiment analysis, respectively.

