Variational Autoencoders for Collaborative Filtering
Dawen Liang
Netflix
Los Gatos, CA
dliang@netflix.com
Rahul G. Krishnan
MIT
Cambridge, MA
rahulgk@mit.edu
Matthew D. Hoffman
Google AI
San Francisco, CA
mhoffman@google.com
Tony Jebara
Netflix
Los Gatos, CA
tjebara@netflix.com
ABSTRACT
We extend variational autoencoders (VAEs) to collaborative filtering
for implicit feedback. This non-linear probabilistic model enables us
to go beyond the limited modeling capacity of linear factor models
which still largely dominate collaborative filtering research. We
introduce a generative model with multinomial likelihood and use
Bayesian inference for parameter estimation. Despite widespread
use in language modeling and economics, the multinomial likelihood
receives less attention in the recommender systems literature.
We introduce a different regularization parameter for the learning
objective, which proves to be crucial for achieving competitive
performance. Remarkably, there is an efficient way to tune the
parameter using annealing. The resulting model and learning algorithm
have information-theoretic connections to maximum entropy
discrimination and the information bottleneck principle. Empirically,
we show that the proposed approach significantly outperforms several
state-of-the-art baselines, including two recently-proposed neural
network approaches, on several real-world datasets. We also provide
extended experiments comparing the multinomial likelihood
with other commonly used likelihood functions in the latent factor
collaborative filtering literature and show favorable results. Finally,
we identify the pros and cons of employing a principled Bayesian
inference approach and characterize settings where it provides the
most significant improvements.
KEYWORDS
Recommender systems, collaborative filtering, implicit feedback,
variational autoencoder, Bayesian models
ACM Reference Format:
Dawen Liang, Rahul G. Krishnan, Matthew D. Hoffman, and Tony Jebara.
2018. Variational Autoencoders for Collaborative Filtering. In Proceedings of
The 2018 Web Conference (WWW 2018). ACM, New York, NY, USA, 10 pages.
https://doi.org/10.1145/3178876.3186150
This paper is published under the Creative Commons Attribution-NonCommercial-
NoDerivs 4.0 International (CC BY-NC-ND 4.0) license. Authors reserve their rights to
disseminate the work on their personal and corporate Web sites with the appropriate
attribution.
WWW 2018, April 23–27, 2018, Lyon, France
© 2018 IW3C2 (International World Wide Web Conference Committee), published
under Creative Commons CC BY-NC-ND 4.0 License.
ACM ISBN 978-1-4503-5639-8/18/04.
https://doi.org/10.1145/3178876.3186150
1 INTRODUCTION
Recommender systems are an integral component of the web. In
a typical recommendation system, we observe how a set of users
interacts with a set of items. Using this data, we seek to show users
a set of previously unseen items they will like. As the web grows
in size, good recommendation systems will play an important part
in helping users interact more effectively with larger amounts of
content.
Collaborative filtering is among the most widely applied approaches
in recommender systems. Collaborative filtering predicts
what items a user will prefer by discovering and exploiting the
similarity patterns across users and items. Latent factor models
[13, 19, 38] still largely dominate the collaborative filtering research
literature due to their simplicity and effectiveness. However, these
models are inherently linear, which limits their modeling capacity.
Previous work [27] has demonstrated that adding carefully crafted
non-linear features into linear latent factor models can significantly
boost recommendation performance. Recently, a growing
body of work involves applying neural networks to the collaborative
filtering setting with promising results [14, 41, 51, 54].
Here, we extend variational autoencoders (VAEs) [24, 37] to collaborative
filtering for implicit feedback. VAEs generalize linear
latent-factor models and enable us to explore non-linear probabilistic
latent-variable models, powered by neural networks, on
large-scale recommendation datasets. We propose a neural generative
model with multinomial conditional likelihood. Despite
being widely used in language modeling and economics [5, 30],
multinomial likelihoods appear less studied in the collaborative
filtering literature, particularly within the context of latent-factor
models. Recommender systems are often evaluated using ranking-based
measures, such as mean average precision and normalized
discounted cumulative gain [21]. Top-N ranking loss is difficult to
optimize directly, and previous work on direct ranking loss minimization
resorts to relaxations and approximations [49, 50]. Here,
we show that the multinomial likelihoods are well-suited for modeling
implicit feedback data, and are a closer proxy to the ranking
loss relative to more popular likelihood functions such as the Gaussian
and logistic.
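To make the intuition concrete, the following is a minimal sketch (the function name and toy numbers are our own illustration, not from the paper): for a user's binary click vector x over the item catalog, the model outputs a vector of item logits, and the multinomial log-likelihood sums the log-softmax probabilities of the clicked items. Because the softmax normalizes over all items, probability mass assigned to unclicked items directly lowers the objective, which is why this likelihood behaves like a soft proxy for a ranking loss.

```python
import numpy as np

def multinomial_log_likelihood(x, logits):
    """Multinomial log-likelihood of a binary click vector x
    under predicted item logits:
        log p(x | z) = sum_i x_i * log softmax(logits)_i
    """
    # Numerically stable log-softmax over the item catalog.
    m = np.max(logits)
    log_softmax = logits - m - np.log(np.sum(np.exp(logits - m)))
    return float(np.sum(x * log_softmax))

# Toy example: 5 items, user clicked items 0 and 3.
x = np.array([1.0, 0.0, 0.0, 1.0, 0.0])
logits = np.array([2.0, -1.0, 0.0, 1.5, -0.5])
print(multinomial_log_likelihood(x, logits))
```

Raising the logit of an unclicked item steals probability mass from the clicked items and strictly decreases this objective, mirroring how a ranking loss penalizes placing irrelevant items above relevant ones.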
Though recommendation is often considered a big-data problem
(due to the huge numbers of users and items typically present in a
recommender system), we argue that, in contrast, it represents a
uniquely challenging “small-data” problem: most users only inter-
act with a tiny proportion of the items and our goal is to collectively
arXiv:1802.05814v1 [stat.ML] 16 Feb 2018