基于协调CNN-LSTM-注意力模型的情感文本分类

14 浏览量更新于2024-08-28 收藏 1.94MB PDF 举报

“A Text Sentiment Classification Modeling Method Based on Coordinated CNN-LSTM-Attention Model”描述了一种利用协调的卷积神经网络（CNN）、长短期记忆网络（LSTM）和注意力机制（Attention）进行文本情感分类建模的方法，旨在解决捕捉文本内在语义、情感依赖信息以及情感表达关键部分的挑战。在文本情感分析中，理解文本的深层情感和语义关系是关键。传统的机器学习方法可能难以捕捉到这些复杂信息，而深度学习模型如CNN和LSTM因其在处理序列数据上的优势，已被广泛应用于自然语言处理任务。CNN擅长捕捉局部特征，而LSTM则擅长捕获长期依赖性。然而，单独使用它们可能无法全面地理解文本的情感信息。该CCLA模型通过结合CNN和LSTM，能够同时考虑文本的局部结构和时间序列信息。在CCLA单元中，CNN用于提取句子的局部特征，LSTM则用于捕捉句子内部的时间依赖性。此外，引入的注意力机制使得模型能够自动聚焦到文本中的关键信息，增强对情感表达重要部分的识别。通过将这些组件协调工作，CCLA模型能够自适应地编码句子的语义和情感信息，并将其转化为文档的向量表示。这种表示方式能够有效地捕获句子之间的关联性。最后，使用softmax回归分类器对文本的情感倾向进行识别，从而实现情感分类。实验结果表明，与其它方法相比，CCLA模型在捕捉文本的局部和长期情感模式方面表现优秀，提高了情感分类的准确性和效率。这种方法对于处理中文文本，尤其是长文本中的情感分析具有重要的应用价值，有助于提升社交媒体监控、客户反馈分析等领域的性能。总结来说，CCLA模型是一种创新的文本情感分类技术，它结合了CNN的局部特征提取、LSTM的长期依赖性学习以及注意力机制的焦点引导，从而在理解和分类文本情感时展现出强大的能力。这一方法为自然语言处理中的情感分析提供了新的思路和工具，有助于推动相关领域的研究进展。

Chinese Journal of Electronics

Vol.28, No.1, Jan. 2019

A Text Sentiment Classiﬁcation Modeling

Method Based on Coordinated

CNN-LSTM-Attention Model

∗

ZHANG Yangsen

1,2

, ZHENG Jia

, JIANG Yuru

1,2

, HUANG Gaijuan

1,2

and CHEN Ruoyu

1,2

(1. Institute of Intelligent Information Processing, Beijing Information Science and Technology University,

Beijing 100192, China)

(2. Beijing Laboratory of National Economic Security Early-Warning Engineering,Beijing 100192,China)

Abstract — The major challenge that text sentiment

classiﬁcation modeling faces is how to capture the

intrinsic semantic, emotional dependence information and

the key part of the emotional expression of text. To

solve this problem, we proposed a Coordinated CNN-

LSTM-Attention(CCLA) model. We learned the vector

representations of sentence with CCLA unit. Semantic and

emotional information of sentences and their relations are

adaptively encoded to vector representations of document.

We used softmax regression classiﬁer to identify the

sentiment tendencies in the text. Compared with other

methods, the CCLA model can well capture the local

and long distance semantic and emotional information.

Experimental results demonstrated the eﬀectiveness of

CCLA model. It shows superior performances over several

state-of-the-art baseline methods.

Key words — Coordinated CNN-LSTM-Attention,

Sentiment analysis, Text modeling, Semantic information.

I. Introduction

Text sentiment classiﬁcation modeling is a funda-

mental problem in the ﬁeld of Nature language processing

(NLP) and is a crux to understand user intention

in product reviews or social networks

[1,2]

. The core

of text sentiment classiﬁcation modeling is to capture

semantic features from variable-length text units. As a

traditional method, the bag-of-words model

[3]

is the most

common and popular vector representations method for

texts because of its eﬃciency, simplicity and surprising

accuracy. But the bag-of-words model treats sentence or

document as an unordered collection of words. Lacking

word order, diﬀerent sentences can have the exactly same

representation, given that the same words are used.

Until now, some machine learning algorithms have

achieved good results on text sentiment classiﬁcation

modeling

[4]

, but with the deep learning models have

achieved remarkable eﬀects in the ﬁeld of speech

recognition and computer vision in recent years, order-

sensitive models based on the neural networks model such

as Recursive neural networks (RNNs), Recurrent neural

networks (RNN), Convolutional neural networks (CNN),

Long short-term memory (LSTM) and attention model

are becoming increasingly popular due to their ability

to capture word order information and further learn

the semantic and emotional information from text. Deep

learning comes from traditional neural network models.

It is not just a multi-layer network but emphasizes the

extraction of hidden features and higher-level abstract

features.

II. Related Work

1. Deep learning model

RNNs have been proved eﬀective in modeling text

semantics

[5−7]

. However, it need to construct semantic

tree and its performance depends on the accuracy of the

semantic tree. But, the semantic relationship between

two sentences may not be able to form a tree structure.

RNN do not need to build the semantic tree

[8]

and it

can capture the context information over long distances.

However, RNN is a bias model, or to be more speciﬁc, a

positive model, in which the relatively backward words

in the text occupy a more dominant position. At the

same time, RNN also have the problem of exploding and

vanishing gradient.

In order to solve the semantic bias problem of RNN,

it is proposed to use CNN for text semantic modeling.

∗

Manuscript Received Aug. 4, 2017; Accepted May 29, 2018. This work is supported by the National Natural Science

Foundation of China (No.61772081, No.61602044) and the Science and Technology Development Project of Beijing Municipal Education

Commission(No.KM201711232014).

下载后可阅读完整内容，剩余6页未读，立即下载

weixin_38688097

粉丝: 5

基于协调CNN-LSTM-注意力模型的情感文本分类

利用CNN,LSTM,CNN-LSTM,TextCNN,Bi-LSTM和传统的机器学习算法进行情感分析.zip

Attention-based LSTM for Aspect-level Sentiment Classification 论文代码

Text-Classification-Sentiment-Analysis-with-LSTM:使用LSTM进行文本分类情感分析

pytorch-sentiment-analysis-classification:情感分析分类的PyTorch教程（RNN，LSTM，Bi-LSTM，LSTM + Attention，CNN）

Amazon-review-sentiment-analysis-webscraping-nlp-lstm-deployment

Transformer模型与情感分析：结合BERT和CNN-LSTM的深度学习方法

cnn-lstm参考文献

cnn-lstm用于情感分析的结构图

Attention-based LSTM for aspect-level sentiment classification主要技术

sentiment-analysis-IMDB-Review-using-LSTM

最新资源