Next Item Recommendation with Self-Attention
Shuai Zhang
UNSW and Data61, CSIRO
Sydney, NSW 2052, Australia
shuai.zhang@student.unsw.edu.au
Yi Tay
Nanyang Technological University
Singapore
ytay017@e.ntu.edu.sg
Lina Yao
University of New South Wales
Sydney, NSW 2052, Australia
lina.yao@unsw.edu.au
Aixin Sun
Nanyang Technological University
Singapore
axsun@ntu.edu.sg
ABSTRACT
In this paper, we propose a novel sequence-aware recommendation
model. Our model utilizes the self-attention mechanism to infer the
item-item relationship from a user's historical interactions. With
self-attention, it is able to estimate the relative weights of each item
in the user's interaction trajectories to learn better representations of
the user's transient interests. The model is finally trained in a metric
learning framework, taking both short-term and long-term intentions
into consideration. Experiments on a wide range of datasets from
different domains demonstrate that our approach outperforms
the state-of-the-art by a wide margin.
KEYWORDS
Recommender Systems; Sequential Recommendation; Self-Attention
ACM Reference format:
Shuai Zhang, Yi Tay, Lina Yao, and Aixin Sun. 2018. Next Item Recommendation with Self-Attention. In Proceedings of Conference Submission, 2018, 10 pages.
1 INTRODUCTION
Anticipating a user's next interaction lies at the heart of making
personalized recommendations. The importance of such systems
cannot be overstated, especially given the ever-growing amount
of data and choices that consumers are faced with each day [26].
Across a diverse plethora of domains, a wealth of historical interaction
data exists, e.g., click logs, purchase histories, and views, which
have, over the years, enabled many highly effective recommender
systems.
Exploiting historical data to make future predictions has been
the cornerstone of many machine learning based recommender
systems. After all, it is both imperative and intuitive that a user's
past interactions are generally predictive of their next. To this end,
many works have leveraged this structural co-occurrence,
along with the rich sequential patterns, to make informed decisions.
Our work is concerned with building highly effective sequential
recommender systems by leveraging these auto-regressive tendencies.
In recent years, neural models such as recurrent neural
networks (RNNs) and convolutional neural networks (CNNs) have become
popular choices for the problem at hand [9, 23]. In recurrent models,
the interactions between consecutive items are captured by a
recurrent matrix, and long-term dependencies are persisted in the
recurrent memory while reading. On the other hand, convolution
implicitly captures interactions by sliding parameterized transformations
across the input sequence [7]. However, when applied to
recommendation, both models suffer from a shortcoming. That is,
they fail to explicitly capture item-item interactions¹ across the
entire user history. The motivation for modeling item-item relationships
within a user's context history is intuitive, as it is more often
than not crucial to understand fine-grained relationships between
individual item pairs instead of simply glossing over them. All in
all, we hypothesize that providing such an inductive bias for our models
would lead to improved representation quality, eventually resulting
in a significant performance improvement within the context of
sequential recommender systems.
To this end, this paper proposes a new neural sequential recommender
system where sequential representations are learned by
modeling not only consecutive items but all user interactions
in the current window. As such, our model can be considered
a 'local-global' approach. Overall, our intuition manifests in
the form of an attention-based neural model that explicitly invokes
item-item interactions across the entire user's historical transaction
sequence. This not only enables us to learn global/long-range representations,
but also short-term information between k-consecutive
items. Based on this self-matching matrix, we learn to attend over
the interaction sequence to select the most relevant items to form
the final user representation. Our experiments show that the proposed
model outperforms the state-of-the-art sequential recommendation
models by a wide margin, demonstrating the effectiveness
of not only modeling local dependencies but also going global.
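To make this concrete, the following is a minimal sketch of scaled dot-product self-attention over the embeddings of a user's recent items, aggregated into a single short-term representation. The tanh query/key projections, the mean aggregation, and all names here are illustrative assumptions rather than the paper's exact parameterization.

```python
# Minimal sketch (not the exact parameterization of this paper): scaled
# dot-product self-attention over the embeddings of the L most recent items,
# aggregated into one short-term user representation.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention_user_repr(X, W_q, W_k):
    """X: (L, d) item embeddings; W_q, W_k: (d, d) projections (assumed)."""
    Q = np.tanh(X @ W_q)                    # queries, (L, d)
    K = np.tanh(X @ W_k)                    # keys, (L, d)
    scores = Q @ K.T / np.sqrt(X.shape[1])  # item-item affinity matrix, (L, L)
    A = softmax(scores, axis=-1)            # row-wise attention weights
    attended = A @ X                        # each item re-expressed via all others
    return attended.mean(axis=0)            # aggregate into one user vector, (d,)

# Toy usage
rng = np.random.default_rng(0)
L, d = 5, 8
X = rng.normal(size=(L, d))
W_q, W_k = rng.normal(size=(d, d)), rng.normal(size=(d, d))
m_u = self_attention_user_repr(X, W_q, W_k)  # short-term intent representation
```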
Our model takes the form of a metric learning framework in
which the distance between the self-attended representation of
a user and the prospective (golden) item is drawn closer during
training. To the best of our knowledge, this is the first proposed
¹ In RNNs, this is captured via memory persistence, while in CNNs, it is only weakly
captured by the sliding-window concatenated transformations. In both cases, there is
no explicit interaction.
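As a rough illustration of the metric learning objective described above, the sketch below scores a (user, next-item) pair by squared Euclidean distance and applies a pairwise hinge loss against a sampled negative item. The margin value, the weighting between long-term and short-term components, and the function names are assumptions for illustration, not the paper's exact formulation.

```python
# Hedged sketch of the metric-learning idea: push the distance between the
# user's representation and the true next item below its distance to a sampled
# negative item by a margin. The margin and the blending weight w are assumed.
import numpy as np

def hinge_metric_loss(user_vec, attended_vec, pos_item, neg_item, margin=0.5, w=0.5):
    """Pairwise hinge loss on squared Euclidean distances (illustrative only)."""
    u = w * user_vec + (1.0 - w) * attended_vec  # blend long- and short-term intent
    d_pos = np.linalg.norm(u - pos_item) ** 2    # distance to the true next item
    d_neg = np.linalg.norm(u - neg_item) ** 2    # distance to a sampled negative
    return max(0.0, d_pos - d_neg + margin)      # pull the true item closer by a margin
```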