受限玻尔兹曼机驱动的 sentiment-aspect 提取：一种无监督方法

研究论文

199 浏览量更新于2024-08-27 收藏 378KB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

本文主要探讨了"基于受限玻尔兹曼机的sentiment-aspect extraction"这一主题，它在意见挖掘领域中具有重要的应用价值。受限玻尔兹曼机（Restricted Boltzmann Machines, RBMs）是一种无监督学习模型，通常用于深度学习中的概率建模和特征学习。在这个研究中，作者林琳·王、康磊、朱超和姜俊等人提出了一种新颖的模型，旨在联合解决情感分析和 aspect extraction 两个任务，无需依赖标注数据。传统的观点挖掘通常涉及从用户评论中识别出特定的产品或服务的优点（aspect）及其相关的情感倾向（sentiment）。然而，单独进行这两个任务可能会导致信息丢失或者效率低下。通过引入受限玻尔兹曼机，研究人员构建了一个具有异质性结构的隐藏层，这允许模型更好地捕捉文本数据的复杂性。此外，他们还考虑了包含有益先验知识的集成，进一步提高了模型的性能。受限玻尔兹曼机在模型设计中扮演了关键角色，它的无监督学习特性使得模型能够自动学习到输入数据中的潜在特征表示，这对于处理未标记的评论数据非常有效。实验结果表明，与先前的最先进的方法相比，该模型在情感分析和 aspect extraction 的准确性和效果上取得了显著的提升，证明了其在实际应用中的优越性。在介绍部分，文章提到随着互联网的发展，人们越来越倾向于在网上分享对各类实体的看法，这促使了情感分析和 aspect extraction 成为必不可少的研究课题。通过这种方法，研究人员不仅能够更好地理解消费者的需求和反馈，还可以帮助企业优化产品和服务，提高市场竞争力。这篇研究论文提出了一种创新的机器学习框架，将情感分析和 aspect extraction 融合在受限玻尔兹曼机中，展示了在无监督条件下处理大规模用户评论的有效途径。这项工作的成功表明，结合深度学习模型和合理的模型架构对于提升自然语言处理任务的性能具有重要意义。

资源详情

资源推荐

Sentiment-Aspect Extraction based on Restricted Boltzmann Machines

Linlin Wang

, Kang Liu

2⇤

, Zhu Cao

, Jun Zhao

and Gerard de Melo

Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, China

National Laboratory of Pattern Recognition, Institute of Automation,

Chinese Academy of Sciences, Beijing, China

{ll-wang13, cao-z13}@mails.tsinghua.edu.cn

{kliu, jzhao}@nlpr.ia.ac.cn, gdm@demelo.org

Abstract

Aspect extraction and sentiment analysis

of reviews are both important tasks in

opinion mining. We propose a novel senti-

ment and aspect extraction model based on

Restricted Boltzmann Machines to jointly

address these two tasks in an unsupervised

setting. This model reﬂects the gener-

ation process of reviews by introducing

a heterogeneous structure into the hidden

layer and incorporating informative priors.

Experiments show that our model outper-

forms previous state-of-the-art methods.

1 Introduction

Nowadays, it is commonplace for people to ex-

press their opinion about various sorts of entities,

e.g., products or services, on the Internet, espe-

cially in the course of e-commerce activities. Ana-

lyzing online reviews not only helps customers ob-

tain useful product information, but also provide

companies with feedback to enhance their prod-

ucts or service quality. Aspect-based opinion min-

ing enables people to consider much more ﬁne-

grained analyses of vast quantities of online re-

views, perhaps from numerous different merchant

sites. Thus, automatic identiﬁcation of aspects of

entities and relevant sentiment polarities in Big

Data is a signiﬁcant and urgent task (Liu, 2012;

Pang and Lee, 2008; Popescu and Etzioni, 2005).

Identifying aspect and analyzing sentiment

words from reviews has the ultimate goal of dis-

cerning people’s opinions, attitudes, emotions, etc.

towards entities such as products, services, orga-

nizations, individuals, events, etc. In this con-

text, aspect-based opinion mining, also known as

feature-based opinion mining, aims at extracting

and summarizing particular salient aspects of enti-

ties and determining relevant sentiment polarities

⇤

Corresponding Author: Kang Liu (kliu@nlpr.ia.ac.cn)

from reviews (Hu and Liu, 2004). Consider re-

views of computers, for example. A given com-

puter’s components (e.g., hard disk, screen) and

attributes (e.g., volume, size) are viewed as aspects

to be extracted from the reviews, while sentiment

polarity classiﬁcation consists in judging whether

an opinionated review expresses an overall posi-

tive or negative opinion.

Regarding aspect identiﬁcation, previous meth-

ods can be divided into three main categories:

rule-based, supervised, and topic model-based

methods. For instance, association rule-based

methods (Hu and Liu, 2004; Liu et al., 1998)

tend to focus on extracting product feature words

and opinion words but neglect connecting product

features at the aspect level. Existing rule-based

methods typically are not able to group the ex-

tracted aspect terms into categories. Supervised

(Jin et al., 2009; Choi and Cardie, 2010) and semi-

supervised learning methods (Zagibalov and Car-

roll, 2008; Mukherjee and Liu, 2012) were intro-

duced to resolve certain aspect identiﬁcation prob-

lems. However, supervised training requires hand-

labeled training data and has trouble coping with

domain adaptation scenarios.

Hence, unsupervised methods are often adopted

to avoid this sort of dependency on labeled data.

Latent Dirichlet Allocation, or LDA for short,

(Blei et al., 2003) performs well in automatically

extracting aspects and grouping corresponding

representative words into categories. Thus, a num-

ber of LDA-based aspect identiﬁcation approaches

have been proposed in recent years (Brody and El-

hadad, 2010; Titov and McDonald, 2008; Zhao et

al., 2010). Still, these methods have several im-

portant drawbacks. First, inaccurate approxima-

tions of the distribution over topics may reduce the

computational accuracy. Second, mixture models

are unable to exploit the co-occurrence of topics

to yield high probability predictions for words that

are sharper than the distributions predicted by in-

下载后可阅读完整内容，剩余9页未读，立即下载

weixin_38695159

粉丝: 5
资源: 942

受限玻尔兹曼机驱动的 sentiment-aspect 提取：一种无监督方法

sentiment-analysis-on-movie-reviews.zip

PyPI 官网下载 | aspect-based-sentiment-analysis-2.0.0.tar.gz

stanford-sentiment-treebank数据

pytorch-sentiment-classification 数据下载

exploiting bert for end-to-end aspect-based sentiment analysis

aspect-based sentiment analysis

Self-attention-based BGRU and CNN for Sentiment Analysis

generative aspect-based sentiment analysis with contrastive learning and exp

Attention-based LSTM for aspect-level sentiment classification主要技术

情感分析 torch

can you give me a tuotrial on using models on huggingface?

Generative Pre-trained Transformer

tpthon lstm 参考文献

model.predict 用的cpu

from transformers import pipeline

tweet sentiment extraction

最新资源