Advances and Challenges in Conversational Recommender Systems: A Survey
propose to integrate a knowledge graph into the interactive
recommendation to solve these problems.
However, directly asking about items is inefficient for building the user profile, because the candidate item set is large. In
real-world CRS applications, users will get bored as the num-
ber of conversation turns increases. It is more practical to
ask attribute-centric questions, i.e., to ask users whether they
like an attribute (or topic/category in some works), and then
make recommendations based on these attributes [207, 88].
Therefore, the estimation and utilization of a user’s prefer-
ences towards attributes become a key research issue.
2.2. Asking about Attributes
Asking about attributes is more efficient because whether
users like or dislike an attribute can significantly reduce the
recommendation candidates. The challenge is to determine
a sequence of attributes to ask so as to minimize the uncer-
tainty of current user needs [119, 164]. The aforementioned
critiquing-based methods fall into this category. Besides,
there are other kinds of methods; we introduce some mainstream branches below.
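To make the uncertainty-reduction idea concrete, here is a minimal sketch (ours, not any cited system's implementation): it greedily asks about the attribute whose answers split the remaining candidates most evenly (maximum entropy over the candidate set), then filters the candidates by the user's answer. All item data and function names are hypothetical.

```python
import math
from collections import Counter

def pick_attribute(candidates, attributes):
    """Pick the attribute whose value distribution over the remaining
    candidates has the highest entropy, i.e., whose answer is expected
    to shrink the candidate set the most (a common uncertainty heuristic)."""
    def entropy(attr):
        counts = Counter(item.get(attr) for item in candidates)
        total = len(candidates)
        return -sum((c / total) * math.log2(c / total) for c in counts.values())
    return max(attributes, key=entropy)

def filter_candidates(candidates, attr, liked_value):
    """Keep only items consistent with the user's stated preference."""
    return [item for item in candidates if item.get(attr) == liked_value]

items = [
    {"color": "red", "price": "low"},
    {"color": "red", "price": "high"},
    {"color": "blue", "price": "low"},
    {"color": "black", "price": "high"},
]
attr = pick_attribute(items, ["color", "price"])   # "color" splits 2/1/1
remaining = filter_candidates(items, "color", "red")
```

A full CRS would interleave such questions with recommendations and stop once the candidate set, or the expected gain of another question, is small enough.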
2.2.1. Fitting Patterns from Historical Interaction
A conversation can be deemed as a sequence of entities
including consumed items and mentioned attributes, and the
objective is to learn to predict the next attribute to ask or the
next item to recommend. Therefore, sequential neural networks such as the gated recurrent unit (GRU) model [29] and the long short-term memory (LSTM) model [62] can be naturally adopted in this setting, thanks to their ability to capture long- and short-term dependencies in user behavioral patterns.
An exemplar work is the question & recommendation
(Q&R) model proposed by Christakopoulou et al. [31], where
the interaction between the system and a user is implemented
as a selection system. In each turn, the system asks the user
to choose one or more distinct topics (e.g., NBA, Comics, or
Cooking) from the given list, and then recommends items in
these topics to the user. It contains a trigger module to de-
cide whether to ask a question about attributes or to make
a recommendation. The triggering mechanism can be as simple as a random mechanism or more sophisticated, e.g., using criteria that capture the user's state, or it can even
be user-initiated. At the t-th time step, the next topic q that the user clicks can be predicted from the user's watching history e_1, …, e_T as P(q ∣ e_1, …, e_T). After the user clicks a topic q, the model can recommend an item r based on the conditional probability P(r ∣ e_1, …, e_T, q).
Both conditional probabilities are implemented with the GRU architecture [29]. This algorithm is deployed on YouTube to obtain preferences from cold-start users.
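The two conditional probabilities can be sketched as follows, assuming a single GRU encoder over the watched-entity sequence and two softmax heads. This is an illustrative NumPy toy with randomly initialized, untrained parameters, not the deployed Q&R model; all parameter names are ours.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM, N_TOPICS, N_ITEMS, N_ENT = 16, 10, 50, 100

# Hypothetical, untrained parameters (names are ours, not from Q&R).
E = rng.normal(size=(N_ENT, DIM))         # watched-entity embeddings
T_emb = rng.normal(size=(N_TOPICS, DIM))  # topic embeddings
Wz, Uz = rng.normal(size=(DIM, DIM)), rng.normal(size=(DIM, DIM))
Wr, Ur = rng.normal(size=(DIM, DIM)), rng.normal(size=(DIM, DIM))
Wh, Uh = rng.normal(size=(DIM, DIM)), rng.normal(size=(DIM, DIM))
W_topic = rng.normal(size=(DIM, N_TOPICS))
W_item = rng.normal(size=(2 * DIM, N_ITEMS))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def gru_encode(history):
    """Run a GRU cell over the watched-entity sequence e_1, ..., e_T."""
    h = np.zeros(DIM)
    for idx in history:
        x = E[idx]
        z = sigmoid(Wz @ x + Uz @ h)             # update gate
        r = sigmoid(Wr @ x + Ur @ h)             # reset gate
        h_cand = np.tanh(Wh @ x + Uh @ (r * h))  # candidate state
        h = (1 - z) * h + z * h_cand
    return h

history = [3, 17, 42, 8]                  # indices of watched entities
h = gru_encode(history)
p_topic = softmax(h @ W_topic)            # P(q | e_1, ..., e_T)
q = int(p_topic.argmax())                 # the topic the user picks (toy choice)
p_item = softmax(np.concatenate([h, T_emb[q]]) @ W_item)  # P(r | e_1..e_T, q)
```

In training, both heads would be fit with cross-entropy against observed clicks; here the point is only the shared history encoding and the topic-conditioned item distribution.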
Zhang et al. [207] propose a “System Ask User Response”
(SAUR) paradigm. For each item, they utilize the rich re-
view information and convert a sentence containing an aspect-
value pair to a latent vector via the GRU model. Then they
adopt a memory module with attention mechanism [158, 83,
118] to perform both the next question generation task (determining which attribute to ask) and the next item recommendation task. Like Q&R, they also develop a heuristic trigger to decide whether it is time to display the top-n recommended items to users or to keep asking questions about attributes. One limitation of this work is the assumption that all the information in reviews supports the purchasing behavior; this does not always hold, since users may complain about certain aspects of the purchased items, e.g., a user may write "64 Gigabytes is not enough". Using such information indiscriminately will mislead the model and deteriorate the performance.
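The memory-attention read at the heart of this design can be sketched as follows: review sentences, each encoded as an aspect-value vector, are stored in a memory, and a query derived from the conversation state attends over them. This is a generic dot-product attention sketch with hypothetical dimensions, not SAUR's exact architecture.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(query, memory):
    """Dot-product attention over a memory of sentence vectors:
    weights = softmax(memory @ query); read-out = weighted sum of memory rows."""
    weights = softmax(memory @ query)
    return weights @ memory, weights

rng = np.random.default_rng(1)
memory = rng.normal(size=(6, 8))  # 6 review sentences, each an 8-dim aspect-value vector
query = rng.normal(size=8)        # current conversation state
read, weights = attend(query, memory)
# `read` would feed both heads: which attribute to ask next and which items to rank.
```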
The utterances produced by the system, i.e., the questions, are constructed with predefined language patterns or templates, so the system only needs to attend to the aspect and the value. This is a common setting in state-of-the-art CRS studies, because the core task here is recommendation rather than language generation [31, 88, 89].
Note that these kinds of methods share a common disadvantage: learning from historical user behaviors does not help the model understand the logic behind the interaction. As interactive systems, these models do not consider how to react when users reject a recommendation; they merely fit the preferences observed in historical interactions, without an explicit strategy for handling different kinds of feedback.
2.2.2. Reducing Uncertainty
Unlike sequential neural network-based methods that do
not have an explicit strategy to handle all kinds of user feed-
back, some studies try to build a straightforward logic to nar-
row down item candidates.
Critiquing-based Methods. The aforementioned critiquing models are typically equipped with a heuristic tactic to elicit user preferences on attributes [23, 187, 107, 106]. In traditional critiquing models, the critique on an attribute value (e.g., "not red" for color or "less expensive" for price) is used to reconstruct the candidate set by removing the items with unsatisfied attributes [23, 116, 154, 171, 12, 153].
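A minimal sketch of this candidate-set reconstruction (hypothetical item data, not a specific cited system): an equality critique drops items matching the rejected value, and a comparative critique such as "less expensive" keeps items under a bound.

```python
def apply_critique(candidates, attr, disliked_value):
    """Equality critique, e.g., "not red": remove every candidate whose
    `attr` matches the value the user rejected."""
    return [item for item in candidates if item.get(attr) != disliked_value]

def apply_unit_critique(candidates, attr, bound):
    """Comparative critique, e.g., "less expensive": keep items below the bound."""
    return [item for item in candidates if item[attr] < bound]

items = [
    {"name": "A", "color": "red", "price": 120},
    {"name": "B", "color": "blue", "price": 80},
    {"name": "C", "color": "red", "price": 60},
]
remaining = apply_critique(items, "color", "red")     # user says "not red"
cheaper = apply_unit_critique(items, "price", 100)    # user says "less expensive"
```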
Neural vector-based methods instead take the critique into a latent vector, which is responsible for generating both the recommended items and the explained attributes. For example, Wu et al. [187] propose a critiquable-explainable neural collaborative filtering (CE-NCF) model. They use the
neural collaborative filtering model [60] to encode the preference of a user i for an item j as a latent vector z_{i,j}; then z_{i,j} is used to produce the rating score r_{i,j} as well as the explained attribute vector s_{i,j}. The attributes are composed of a set of key-phrases such as "golden, copper, orange, black, yellow," and each dimension of s_{i,j} corresponds to a certain attribute. When a user dislikes an attribute and critiques it in real-time feedback, the system updates the explained attribute vector s_{i,j} by setting the corresponding dimension to zero. The updated vector s̃_{i,j} is then used to update the latent vector z_{i,j} to z̃_{i,j}; consequently, the recommendation score is updated to r̃_{i,j}. Following this setting, Luo et al.
[107] change the base NCF model to be a variational autoen-
coder (VAE) model, and this generative model can help the
Gao et al.: Preprint submitted to Elsevier Page 6 of 30