Multi-granularity Data Augmentation Fusion Neural Network: A New Method for Short-Text Sentiment Analysis

Research paper

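This research paper proposes a fusion neural network model based on multi-granularity data augmentation for short-text sentiment analysis. Sentiment analysis is a challenging task in natural language processing, and short texts make it harder still because of the complexity of language structure and semantic structure and the scarcity of labeled data and context information. To address the data sparseness and over-fitting that deep learning models can suffer from on such problems, the authors propose multi-granularity text-oriented data augmentation techniques that generate large amounts of training data for neural networks. They also introduce a "confusion" strategy intended to improve the model's generalization ability.

Detailed summary:

Sentiment analysis is a key task in natural language processing that aims to determine the sentiment polarity expressed in a text, such as positive, negative, or neutral. The task is especially difficult for short texts such as social media posts or comments, because short texts usually lack rich context and their language can be highly informal and ambiguous. Traditional machine learning methods may struggle with this kind of data, and deep learning models, although strong on complex tasks, need large amounts of labeled data for training, which is exactly what short-text sentiment analysis lacks.

To overcome these challenges, the study proposes a fusion neural network model with multi-granularity data augmentation. Data augmentation is a common technique that enlarges the training set by creating variants of existing text in order to improve generalization. In this paper, augmentation is not limited to a single granularity (such as the word level) but is applied at multiple granularities, which may include transformations at the phrase, sentence-structure, and semantic levels. This approach can mimic the diversity and flexibility of real-world language and help the model understand textual information at different levels.

The paper also mentions a "confusion" strategy, a form of adversarial training in which slight perturbations or noise are added to the training data, so that during learning the model must not only recognize the correct sentiment but also learn to distinguish similar, misleading inputs. This strategy helps prevent the model from relying too heavily on particular features or patterns, which improves its adaptability to unseen data and reduces the risk of over-fitting.

The fusion neural network model takes the augmented data as input and combines several types of neural network components (such as convolutional neural networks (CNN), recurrent neural networks (RNN), or Transformers) to capture information at different granularities and with different temporal dependencies. This design can combine the strengths of the individual networks and improve the accuracy of short-text sentiment recognition.

Through its data augmentation techniques and fusion neural network model, the study offers an effective approach to short-text sentiment analysis that helps improve model performance and generalization when labeled data is limited. The method has practical value for understanding and processing large volumes of short text from social media, online reviews, and similar sources.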


2017 Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)

A Multi-granularity Data Augmentation based Fusion Neural Network Model for Short Text Sentiment Analysis

Xiao Sun

School of Computer and Information

Hefei University of Technology

Hefei, Anhui, 230009

Email: sunx@hfut.edu.cn

Jiajin He

School of Computer and Information

Hefei University of Technology

Hefei, Anhui, 230009

Email: hejudgin@mail.hfut.edu.cn

Changqin Quan

Department of Computational Science

Kobe University

Kobe, Japan, 6578501

Email: quanchqin@gold.kobe-u.ac.jp

Abstract: Sentiment analysis is a challenging task in Natural Language Processing due to the complexity of language structure and semantic structure and the relative scarcity of labeled data and context information, especially in short-text processing. To overcome data sparseness and the over-fitting problem when adopting a deep learning model, we propose multi-granularity text-oriented data augmentation technologies to generate large amounts of data for neural network training. We propose a novel confused model (LSCNN) that, combined with the proposed data augmentation technology, outperforms other effective neural network models. The proposed data augmentation method enhances the generalization ability of the proposed model. We also show that the proposed data augmentation method, in combination with the neural network model, achieves strong performance on cross-domain sentiment analysis without any handcrafted features, making it an efficient technology for detecting sentiment in comments.

1. Introduction

Sentiment analysis [1] is commonly used to mine users' perceptions of a product and to detect user sentiment for chat robots [2]. Effective sentiment analysis can capture users' subjective feelings toward a product so that the service can be adjusted in a timely manner.

Recently, neural network-based sentiment analysis models have become popular. A large amount of data is necessary to train effective models. However, extremely deep neural networks may lead to over-fitting. To address these problems, the idea of data augmentation is introduced into the neural network model. Neural network-based architectures such as Convolutional Neural Networks (CNN) [3], Recurrent Neural Networks (RNN) [4], [5], and Long Short-Term Memory (LSTM) [6] have achieved great success in the field of Natural Language Processing (NLP); however, these efforts were adversely affected by the lack of large-scale data for training.

We introduce a data augmentation method, a technique that has been widely applied in image processing [7], [8] and sound processing [11], to generate a larger dataset for pre-training and training neural networks for sentiment classification. The proposed data augmentation technology has been applied to neural network-based models such as Convolutional Neural Networks [3] and Long Short-Term Memory [6], [9], as well as to a BOW-based SVM model [10]. We show that the proposed neural network models with data augmentation outperform both the same models without it and the BOW-based model. The main contributions are as follows:

(1) We propose multi-granularity text-oriented data augmentation technologies that automatically generate artificial data to overcome the problem of data sparseness in NLP.

(2) We propose, for the first time, a data augmentation-oriented hybrid neural network model called LSCNN, apply it successfully to sentiment analysis, and obtain significant improvements in performance and generalization ability.

(3) The proposed LSCNN model is almost fully automatic and independent of manual features and other resources.

The remainder of the paper is organized as follows. Section 2 introduces the proposed data augmentation technologies. Section 3 describes the proposed model. Section 4 reports the experiments and evaluation results with and without the data augmentation method. Section 5 provides the conclusion and future work.

2. Data Augmentation

Data augmentation has been successfully applied to image classification [7], [8]. The most convenient and common way to avoid over-fitting when training a neural network model is to automatically enlarge the dataset using data augmentation technologies. There are many data augmentation methods for image data, such as rotation/reflection, flip, zoom, shift, changing the scale, color, or contrast, and introducing noise. Motivated by the great success achieved in image classification, our approach applies data augmentation technologies to text sentiment analysis. In this paper, we propose, for the first time, a multi-granularity data augmentation method that includes word-level, phrase-level, and sentence-level data augmentation. We exploit some special techniques for text data by leveraging the characteristics of text, as follows.
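The authors' specific operations are not included in this excerpt. As a rough illustration only of what augmentation at the three named granularities could look like, the Python sketch below uses synonym replacement at the word level, reordering of comma-separated phrases at the phrase level, and reordering of sentences at the sentence level. These three operations, the `SYNONYMS` table, and every function name are assumptions made for this example and should not be read as the paper's method. The sketch also assumes that each transformation leaves the sentiment label unchanged, which may not hold for every text.

```python
import random

# Hypothetical synonym table (an assumption for illustration only);
# a real system would draw on a thesaurus or word embeddings.
SYNONYMS = {
    "good": ["great", "nice"],
    "bad": ["poor", "awful"],
    "movie": ["film"],
}


def word_level(text, rng):
    """Replace one whitespace-delimited token with a synonym, if any token matches."""
    tokens = text.split()
    # Tokens with attached punctuation (e.g. "good,") simply do not match.
    candidates = [i for i, tok in enumerate(tokens) if tok.lower() in SYNONYMS]
    if not candidates:
        return text
    i = rng.choice(candidates)
    tokens[i] = rng.choice(SYNONYMS[tokens[i].lower()])
    return " ".join(tokens)


def phrase_level(sentence, rng):
    """Shuffle the comma-separated phrases of a single sentence."""
    phrases = [p.strip() for p in sentence.split(",") if p.strip()]
    rng.shuffle(phrases)
    return ", ".join(phrases)


def sentence_level(text, rng):
    """Shuffle the order of the period-separated sentences in a text."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    rng.shuffle(sentences)
    return ". ".join(sentences) + "."


def augment(text, label, seed=0):
    """Return one labeled variant per granularity; the label is copied unchanged."""
    rng = random.Random(seed)
    first_sentence = text.split(".")[0].strip()
    variants = [
        word_level(text, rng),
        phrase_level(first_sentence, rng),
        sentence_level(text, rng),
    ]
    return [(variant, label) for variant in variants]


if __name__ == "__main__":
    sample = "the movie was good, the plot was bad at times. I would watch it again."
    for variant, label in augment(sample, "positive"):
        print(label, "|", variant)
```

Because the shuffles are random, a variant can occasionally come out identical to its input; a practical pipeline would probably deduplicate the generated examples before training.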

978-1-5386-0680-3/17/$31.00

2017 IEEE
