注意力LSTM-CNN：识别中国社交媒体文本的不确定性

需积分: 10 44 浏览量更新于2024-08-13 收藏 1.09MB PDF 举报

本文主要探讨了在中文社交媒体文本识别中，基于注意力机制的长短时记忆网络（LSTM）与卷积神经网络（CNN）的结合（Attention-based LSTM-CNNs）在不确定性识别中的应用。不确定性识别是一项关键的语义处理任务，对于诸如主题检测、问答系统等技术中的信息真实性评估至关重要。由于社交媒体文本通常是非正式的，且在各种应用场景中广泛使用，确保信息的真实性显得尤为重要。作者们针对这一问题，首先介绍了注意力机制在处理文本信息时的优势，它能够对输入序列中的不同部分赋予不同的权重，从而更好地捕捉文本的关键特征。在传统的LSTM模型中，通过门控单元控制信息的流进流出，而注意力机制的引入则允许模型动态地关注文本的不同部分，提高了模型对复杂和不规则文本结构的适应性。 LSTM-CNN模型结合了循环神经网络的长期依赖处理能力（LSTM）和卷积神经网络的局部感知特性，这使得模型能够同时捕捉到词语级别的局部特征和句子级别的全局上下文。通过这种方式，模型能够更准确地识别出社交媒体文本中的不确定性，如模糊性、矛盾性或信息不完整性。文章详细阐述了模型的架构设计、训练过程以及如何通过注意力机制优化不确定性识别的性能。研究者可能使用了诸如注意力加权的隐藏状态更新、多层网络结构或者注意力与深度学习相结合的方法来提高模型的性能。此外，文中还可能讨论了实验结果，包括在标准数据集上进行的评估，对比了他们的方法与其他现有不确定性识别方法的性能。可能涉及到了准确率、召回率、F1分数等评价指标，以及模型在实际社交媒体情境中的效果分析。最后，文章提出了未来的研究方向，可能包括如何进一步提高模型的鲁棒性，应对社交媒体文本中的噪声和多样性，或者探索如何将不确定性识别应用到更多的自然语言处理任务中。这篇研究论文深入探讨了如何利用注意力机制强化LSTM-CNN模型，在复杂的中文社交媒体文本中有效地识别不确定性，为提高信息真实性和可信度提供了新的理论和技术支持。

Attention-based LSTM-CNNs for Uncertainty

Identification on Chinese Social Media Texts

Binyang Li

School of Information

Science and Technology

University of

International Relations

Beijing, China

byli@uir.edu.cn

Kaiming Zhou

School of Information

Science and Technology

University of

International Relations

Beijing, China

kmzhou@uir.edu.cn

Wei Gao

School of Information

Management

Victoria University of

Wellington

Wellington, New Zealand

wei.gao@vuw.ac.nz

Xu Han



School of Information

Science and Technology

Capital Normal

University

Beijing, China

csxhan@cnu.edu.cn

Linna Zhou

School of Information

Science and Technology

University of

International Relations

Beijing, China

lnzhou@uir.edu.cn

Abstract—Uncertainty identification is an important semantic

processing task, which is crucial to the quality of information in

terms of factuality in many techniques, e.g. topic detection,

question answering. Especially in social media, the texts are

written informally which are widely used in many applications, so

the factuality has become a premier concern. However, existing

approaches that still rely on lexical cues suffer greatly from the

casual or word-of-mouth peculiarity of social media, in which the

cue phrases are often expressed in sub-standard form or even

omitted from sentences. To tackle these problems, this paper

proposes the attention-based LSTM-CNNs for the uncertainty

identification on social media texts, named ALUNI. ALUNI

incorporates attention-based LSTM networks to represent the

semantics of words, and convolutional neural networks to capture

the most important semantics of uncertainty for identification.

Experiments are conducted on both Chinese Weibo and news

datasets, and 78.19% and 73.95% of F1-measure scores are

achieved with 11% and 3% improvement over the baseline,

respectively.

Keywords—LSTM, CNN, uncertainty identification, social

media

I. INTRODUCTION

“Uncertainty - in its most general sense - can be interpreted as

lack of information: the receiver of the information (i.e., the

hearer or the reader) cannot be certain about some pieces of

information” [1]. The identification of uncertainty is significant

to the trustworthiness of many natural language processing

techniques and applications, such as question answering,

information extraction, and so on [2].

The CoNLL-2010 Shared Task aimed at identifying

uncertainty in biological papers and Wikipedia articles written

in English [3] [4]. Most participants utilized linguistics features,

e.g., lexical cues, Part-Of-Speech (POS), to detect the uncertain

sentences from the texts.

Recently, with the growing popularity of social media, there

exist more and more texts consisting of casual or word-of-mouth

expressions. The quality of information in social media in terms

of factuality has become a premier concern [5]. The generation

and propagation of uncertain information will lead to rumor

flooding among social media and even influence the real world.

For example, the 2011 London Riots occurred owing to the

spread of uncertain in-formation among social media, such as



Corresponding author

Twitter or Facebook. Therefore, uncertainty identification, i.e.,

identifying uncertain sentences is becoming increasingly critical

for users to synthesize information to derive reliable

interpretation.

However, unlike the biological papers and Wikipedia

articles, the texts in social media are usually short and informal.

Due to the word count limit and casual expression, many cue

phrases are expressed in substandard shape or even omitted from

sentences. In this case, the uncertain semantics will be implicitly

conveyed by the whole sentence rather than explicitly by cue

phrases. Existing approaches based on cue phrases for

uncertainty identification are ineffective for social media texts,

and they are also not good enough for formal text uncertain

identification. It is noteworthy that in the CoNLL-2010 Shared

Task, the participants all achieved better results on biological

dataset than wiki dataset. It indicated the more formal the article

is, the easier it is to judge the sentence uncertainty. As a result,

uncertainty identification on Chinese social media texts has

become a big challenge which needs more semantics

information to solve.

We tried to judge the uncertainty of the Chinese text of social

media based on semantics, so we turned to deep learning which

could express the semantics of words and sentences well.

Bahdanau et al. apply the RNN with attention mechanism to

machine translation [6], their model makes the words’ semantics

and the relation between words in both languages clearer. Kim

utilizes CNNs to classify sentences and achieves good results [7],

which shows CNNs have a unique advantage in both image and

text classifying issues. Considering these above researches we

decided to combine the two model structures to solve

uncertainty identification problem.

This paper proposes Attention-based Long Short-Term

Memory-Convolutional Neural Networks (LSTM-CNNs) for

Uncertainty Identification on social media texts, named ALUNI.

ALUNI incorporates attention mechanisms into LSTM

networks to represent the semantics of the context in a sentence,

and uses CNNs for the uncertainty identification. Benefitted

from the attention mechanisms, the key elements of sentences

can be highlighted and the hidden semantics can be captured,

which will enable us to detect uncertainty based on the context

of the whole sentence instead of depending on the cue-phrases.

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38632006

粉丝: 3
资源: 939

注意力LSTM-CNN：识别中国社交媒体文本的不确定性

基于CNN-BiLSTM-Attention模型的网络入侵检测方法的python实现源码.zip

基于卷积双向长短期记忆网络结合注意力机制(CNN-BILSTM-Attention)时间序列预测（Matlab完整源码和数据）

twittercommunities:使用文本信息在社交媒体上检索社区成员

情感分析与命名实体识别：使用LSTM进行文本分类

【图像识别中的LSTM】：探索前沿应用，技术实践揭秘

【基于多层次注意力机制的深度学习模型设计方法研究】： 研究基于多层次注意力机制的深度学习模型设计方法

【Python自然语言处理入门】：从文本分析到情感识别的案例解析

社交媒体数据挖掘：海量信息提取价值的终极技术

社交媒体情感分析：机器学习技术的高效应用案例

基于机器学习的文本信息抽取方法详解

最新资源

【基于多层次注意力机制的深度学习模型设计方法研究】：研究基于多层次注意力机制的深度学习模型设计方法