An Exploration of the Varied Linguistic Functions of Emoji on Twitter


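"Varying linguistic purposes of emoji in Twitter context" explores the varied linguistic functions of emoji on Twitter. The paper notes that early research focused mainly on high-frequency emoji usage and on ambiguity of interpretation. Closer investigation shows that emoji serve at least two very different purposes: as content and function words, or as multimodal affective markers. Noa Na'aman, Hannah Provenza, and Orion Montoya of Brandeis University analyze Twitter data to show how emoji enrich textual communication. They stress that knowing whether an emoji replaces textual content matters for natural language processing (NLP) tools, because it lets those tools parse the emoji either as a lexical item or as an expressive marker.

Content words are words that carry specific meaning in a sentence, such as nouns, verbs, and adjectives, while function words such as prepositions and conjunctions mainly serve structure and grammar. Used as content words, emoji convey concrete information and may stand for an object, an emotion, or a situation: for example, sending "🎉" to celebrate a birthday, or "🍔" to mean food. Used as function words, by contrast, emoji fill a grammatical role, standing in for a preposition or a punctuation mark. Multimodal affective markers, on the other hand, bring visual and other non-textual cues into the message to express emotion; "😄", for example, may signal a cheerful tone. This makes emoji a powerful communication tool on social media, able to cross cultural and language boundaries and to make messages more expressive and easier to understand.

The study also points out that identifying these functions is essential to understanding and analyzing conversations on social media. This has far-reaching implications for algorithms and models that must handle non-traditional forms of text, in tasks such as sentiment analysis, semantic understanding, and social network analysis. A better understanding of what emoji are doing lets us capture the social dynamics and user sentiment behind tweets more accurately.

The study shows the complexity and importance of emoji in modern communication: they have moved beyond simple pictorial expression to become an important part of linguistic and affective exchange. The finding offers guidance for the development of NLP technology and for social media analysis, and helps us adapt to and make use of an increasingly rich world of digital expression.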


Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics- Student Research Workshop, pages 136–141

Vancouver, Canada, July 30 - August 4, 2017.

2017 Association for Computational Linguistics

https://doi.org/10.18653/v1/P17-3022


MojiSem: Varying linguistic purposes of emoji in (Twitter) context

Noa Na’aman, Hannah Provenza, Orion Montoya

Brandeis University

{nnaaman,hprovenza,obm}@brandeis.edu

Abstract

Early research into emoji in textual communication has focused largely on high-frequency usages and ambiguity of interpretations. Investigation of a wide range of emoji usage shows these glyphs serving at least two very different purposes: as content and function words, or as multimodal affective markers. Identifying where an emoji replaces textual content lets NLP tools parse it as they would any other word or phrase. Recognizing the import of non-content emoji can be a significant part of understanding a message as well.

We report on an annotation task on English Twitter data with the goal of classifying emoji uses by these categories, and on the effectiveness of a classifier trained on these annotations. We find that it is possible to train a classifier to tell the difference between those emoji used as linguistic content words and those used as paralinguistic or affective multimodal markers even with a small amount of training data, but that accurate sub-classification of these multimodal emoji into specific classes like attitude, topic, or gesture will require more data and more feature engineering.

1 Background

Emoji characters were first offered on Japanese mobile phones around the turn of the 21st century. These pictographic elements reached global language communities after being added to Unicode 6.0 in 2010, and then being offered within software keyboards on smartphones. In the ensuing half-decade, digitally-mediated language users have evolved diverse and novel linguistic uses for emoji.

The expressive richness of emoji communication would, on its own, be sufficient reason to seek a nuanced understanding of its usage. But our initial survey of emoji on Twitter reveals many cases where emoji serve direct semantic functions in a tweet or where they are used as a grammatical function such as a preposition or punctuation. Early work on Twitter emoticons (Schnoebelen, 2012) pre-dated the wide spread of Unicode emoji on mobile and desktop devices. Recent work (Miller et al., 2016) has explored the cross-platform ambiguity of emoji renderings; (Eisner et al., 2016) created word embeddings that performed competitively on emoji analogy tasks; (Ljubešić and Fišer, 2016) mapped global emoji distributions by frequency; (Barbieri et al., 2017) used LSTMs to predict them in context.

We feel that a lexical semantics of emoji characters is implied in these studies without being directly addressed. Words are not used randomly, and neither are emoji. But even when they replace a word, emoji are used for different purposes than words. We believe that work on emoji would be better informed if there were an explicit typology of the linguistic functions that emoji can serve in expressive text. The current project offered annotators a framework and heuristics to classify uses of emoji by linguistic and discursive function. We then used a model based on this corpus to predict the grammatical function of emoji characters in novel contexts.
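To make the idea of predicting an emoji's function from its context more concrete, the following is a minimal sketch of how each emoji in a tweet might be located and paired with its neighbouring tokens before classification. It is an illustration only, not the feature set or tokenizer used in this work: the names `EMOJI_RANGES`, `is_emoji`, `tokenize`, and `emoji_contexts` are hypothetical, the code-point ranges are a rough approximation that misses some emoji, and the window size of two tokens is an arbitrary assumption.

```python
# Rough heuristic (an assumption for illustration): these code-point ranges
# cover many common emoji, but not the full Unicode emoji set.
EMOJI_RANGES = [(0x1F300, 0x1FAFF), (0x2600, 0x27BF)]


def is_emoji(ch):
    """Return True if the single character falls in one of the ranges above."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in EMOJI_RANGES)


def tokenize(text):
    """Split on whitespace, then split each emoji off as its own token."""
    tokens = []
    for chunk in text.split():
        word = ""
        for ch in chunk:
            if is_emoji(ch):
                if word:
                    tokens.append(word)
                    word = ""
                tokens.append(ch)
            else:
                word += ch
        if word:
            tokens.append(word)
    return tokens


def emoji_contexts(text, window=2):
    """Return one dict per emoji: the emoji, its index, and nearby tokens."""
    tokens = tokenize(text)
    contexts = []
    for i, tok in enumerate(tokens):
        if len(tok) == 1 and is_emoji(tok):
            contexts.append({
                "emoji": tok,
                "position": i,
                "left": tokens[max(0, i - window):i],
                "right": tokens[i + 1:i + 1 + window],
            })
    return contexts
```

In a tweet such as "I want a 🍔 for lunch", the emoji sits in a noun slot between "a" and "for", which is the kind of contextual cue a classifier could use to separate content-word uses from a tweet-final affective marker.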

2 Annotation task

Although recognizing the presence of emoji characters is trivial, the linguistic distinctions we sought to annotate were ambiguous and seemed prone to disagreement. Therefore in our annotation guidelines we structured the process to minimize cognitive load and lead the annotators to in-
