Cross-modality Consistent Regression for Joint Visual-Textual Sentiment
Analysis
Anonymous CVPR submission
Paper ID 262
Abstract
Sentiment analysis of online user-generated content is important for many social media analytics tasks. Researchers have largely relied on textual sentiment analysis to develop systems that predict political elections, measure economic indicators, and so on. Recently, however, social media users have increasingly been attaching images and videos to their posts to express their opinions and share their experiences. Sentiment analysis of such large-scale textual and visual content can help better extract user sentiments toward events or topics. Motivated by the need to leverage large-scale social multimedia content for sentiment analysis, we propose a cross-modality consistent regression (CCR) model that utilizes both state-of-the-art visual and textual sentiment analysis techniques. We first fine-tune a convolutional neural network (CNN) for image sentiment analysis and train a paragraph vector model for textual sentiment analysis. On top of them, we train our multi-modality regression model. We use sentiment-related queries to obtain half a million training samples from Getty Images. We have conducted extensive experiments on both weakly (machine) labeled and manually labeled image tweets. The results show that the proposed model achieves better performance than state-of-the-art textual and visual sentiment analysis algorithms alone.
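To make the pipeline concrete, the following is a minimal, hypothetical sketch (in PyTorch) of how a CCR-style model could sit on top of pre-trained image and text features. The layer sizes, the averaging fusion, and the MSE-plus-consistency objective are all assumptions made for illustration; the paper's actual CCR formulation is defined in later sections.

import torch
import torch.nn as nn
import torch.nn.functional as F

class CCRHead(nn.Module):
    # Regression heads on top of pre-computed features: CNN features for
    # the image and paragraph-vector features for the text (both feature
    # extractors are assumed trained beforehand, as the abstract describes).
    def __init__(self, img_dim=4096, txt_dim=400):
        super().__init__()
        self.img_reg = nn.Linear(img_dim, 1)  # image sentiment score
        self.txt_reg = nn.Linear(txt_dim, 1)  # text sentiment score

    def forward(self, img_feat, txt_feat):
        s_img = self.img_reg(img_feat)
        s_txt = self.txt_reg(txt_feat)
        s_fused = 0.5 * (s_img + s_txt)  # simple average fusion (assumption)
        return s_img, s_txt, s_fused

def ccr_loss(s_img, s_txt, y, lam=0.1):
    # Per-modality regression losses plus a cross-modality consistency
    # penalty that pulls the two modality scores toward each other
    # (our reading of "cross-modality consistent"; weights are illustrative).
    return (F.mse_loss(s_img, y) + F.mse_loss(s_txt, y)
            + lam * F.mse_loss(s_img, s_txt))

Under this sketch, the half-million Getty Images samples mentioned above would supply (img_feat, txt_feat, y) triples, and lam would trade off per-modality accuracy against cross-modality agreement.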
1. Introduction
The increasing popularity of social networks has attracted more and more people to share their experiences and express their opinions on virtually all events and subjects on online social network platforms. Each day, billions of messages and posts are generated. In this study, we focus on deriving people's opinions or sentiments toward topics and events happening in the real world. In other words, we are interested in the automatic detection of sentiment from online user-generated content.
[Figure 1. Examples of image tweets from Twitter. (a) "PD Achilles meets a new friend. Special post for one of our followers who I met last night and had a good chat to"; (b) "If anyone woke up in edinburgh this morning to discover their car missing i think i know where it is"; (c) "Hello there sweetie. :)"]

Figure 1 shows several example image tweets from Twitter. Image tweets refer to tweets that contain images.
If we take a look at these three example image tweets, we can observe the following: in (a), both the image and the text indicate that the tweet carries a positive sentiment; in (b), while it is difficult to tell the sentiment from the image, the text tells us that the tweet expresses a positive sentiment; in (c), on the contrary, it is hard to tell the sentiment from the text, but the worn-out car in the image suggests an overall negative sentiment. These examples illustrate the motivation for our work: we would like to learn people's overall sentiment toward the same object from the different modalities the user provides. In particular, we focus on inferring people's sentiment from the available images and the accompanying short, informal text.
Many researchers have contributed to sentiment analysis. For instance, there are related works on detecting users' sentiment and applying sentiment analysis to predict box-office revenues for movies [1], political elections [22, 28], and economic indicators [3, 31]. In particular, recently published works have started to focus on analyzing the sentiment of informal user-generated content from online social networks. However, current techniques mostly detect sentiment through analysis of textual content alone. On the other hand, visual content, including both images and videos, is becoming increasingly popular on all mainstream online