Wide & Deep Learning for Recommender Systems
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra,
Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil,
Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, Hemal Shah
Google Inc.
Corresponding author: hengtze@google.com
ABSTRACT
Generalized linear models with nonlinear feature transformations
are widely used for large-scale regression and classification
problems with sparse inputs. Memorization of feature interactions
through a wide set of cross-product feature transformations is
effective and interpretable, while generalization requires more
feature engineering effort. With less
feature engineering, deep neural networks can generalize bet-
ter to unseen feature combinations through low-dimensional
dense embeddings learned for the sparse features. However,
deep neural networks with embeddings can over-generalize
and recommend less relevant items when the user-item inter-
actions are sparse and high-rank. In this paper, we present
Wide & Deep learning—jointly trained wide linear models
and deep neural networks—to combine the benefits of
memorization and generalization for recommender systems. We
productionized and evaluated the system on Google Play,
a commercial mobile app store with over one billion active
users and over one million apps. Online experiment results
show that Wide & Deep significantly increased app acquisi-
tions compared with wide-only and deep-only models. We
have also open-sourced our implementation in TensorFlow.
CCS Concepts
• Computing methodologies → Machine learning; Neural
networks; Supervised learning; • Information systems →
Recommender systems;
Keywords
Wide & Deep Learning, Recommender Systems.
1. INTRODUCTION
A recommender system can be viewed as a search ranking
system, where the input query is a set of user and contextual
information, and the output is a ranked list of items. Given
a query, the recommendation task is to find the relevant
items in a database and then rank the items based on certain
objectives, such as clicks or purchases.
One challenge in recommender systems, similar to the gen-
eral search ranking problem, is to achieve both memorization
and generalization. Memorization can be loosely defined as
learning the frequent co-occurrence of items or features and
exploiting the correlation available in the historical data.
Generalization, on the other hand, is based on transitivity
of correlation and explores new feature combinations that
have never or rarely occurred in the past. Recommenda-
tions based on memorization are usually more topical and
directly relevant to the items on which users have already
performed actions. Compared with memorization, general-
ization tends to improve the diversity of the recommended
items. In this paper, we focus on the apps recommendation
problem for the Google Play store, but the approach should
apply to generic recommender systems.
For massive-scale online recommendation and ranking sys-
tems in an industrial setting, generalized linear models such
as logistic regression are widely used because they are sim-
ple, scalable and interpretable. The models are often trained
on binarized sparse features with one-hot encoding. For example,
the binary feature “user_installed_app=netflix” has value 1
if the user installed Netflix. Memorization can be achieved
effectively using cross-product transformations over sparse
features, such as AND(user_installed_app=netflix, impression_app=pandora),
whose value is 1 if the user installed Netflix and is later
shown Pandora. This explains how the co-occurrence of a feature
pair correlates with the target label. Generalization can be
added by using features that are
less granular, such as AND(user_installed_category=video,
impression_category=music), but manual feature engineer-
ing is often required. One limitation of cross-product trans-
formations is that they do not generalize to query-item fea-
ture pairs that have not appeared in the training data.
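As a concrete illustration (not the production pipeline), the
following minimal Python sketch shows one-hot encoding and a
cross-product transformation over two hypothetical app
vocabularies; all names and values are illustrative.

from itertools import product

def one_hot(value, vocabulary):
    """Binary indicator vector: 1 at the position of `value`, 0 elsewhere."""
    return [1 if v == value else 0 for v in vocabulary]

# Hypothetical 3-app vocabulary, for illustration only.
apps = ["netflix", "pandora", "spotify"]
installed = one_hot("netflix", apps)    # user_installed_app=netflix
impression = one_hot("pandora", apps)   # impression_app=pandora

# Crossing the two one-hot vectors yields 3 x 3 = 9 binary interaction
# features; exactly one is 1 here, corresponding to
# AND(user_installed_app=netflix, impression_app=pandora).
crossed = [a & b for a, b in product(installed, impression)]

Each crossed feature fires only when every component feature is 1,
which is why such a feature can never fire for a query-item pair
absent from the training data.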
Embedding-based models, such as factorization machines
[5] or deep neural networks, can generalize to previously un-
seen query-item feature pairs by learning a low-dimensional
dense embedding vector for each query and item feature,
with less burden of feature engineering. However, it is
difficult to learn effective low-dimensional representations for
queries and items when the underlying query-item matrix is
sparse and high-rank, such as users with specific preferences
or niche items with a narrow appeal. In such cases, there
should be no interactions between most query-item pairs,
but dense embeddings will lead to nonzero predictions for all
query-item pairs, and thus can over-generalize and make less
relevant recommendations. On the other hand, linear models
with cross-product feature transformations can memorize these
“exception rules” with far fewer parameters.
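The following minimal NumPy sketch illustrates the point: a
dot-product score over low-dimensional dense embeddings is defined
for every query-item pair, so even pairs with no training
interactions receive nonzero predictions. The sizes and random
values here are purely illustrative stand-ins for learned
parameters.

import numpy as np

rng = np.random.default_rng(0)
n_queries, n_items, dim = 1000, 500, 8   # dim << n: low-dimensional, dense

# In a real model these embeddings are learned; random values stand in.
query_emb = rng.normal(scale=0.1, size=(n_queries, dim))
item_emb = rng.normal(scale=0.1, size=(n_items, dim))

def score(q, i):
    """Dot-product score, defined for *every* query-item pair,
    including pairs never observed in training."""
    return float(query_emb[q] @ item_emb[i])

# An arbitrary unseen pair still gets a (generally nonzero) prediction,
# which is the over-generalization behavior described above.
print(score(42, 321))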
In this paper, we present the Wide & Deep learning frame-
work to achieve both memorization and generalization in one
model, by jointly training a linear model component and a
neural network component as shown in Figure 1.
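Concretely, the joint model can be sketched as a single logistic
output whose logit sums the wide and deep contributions. The
minimal NumPy sketch below assumes a ReLU feed-forward network for
the deep component; all names are illustrative and not the
released TensorFlow API.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def wide_deep_predict(x_wide, x_deep, w_wide, hidden, w_deep, b):
    """x_wide: raw plus cross-product binary features;
    x_deep: concatenated dense embeddings of the sparse features;
    hidden: list of (W, c) weight/bias pairs for the ReLU layers.
    Shapes and names are illustrative assumptions."""
    h = x_deep
    for W, c in hidden:                       # deep component (MLP)
        h = np.maximum(0.0, W @ h + c)
    logit = w_wide @ x_wide + w_deep @ h + b  # single shared logit
    return sigmoid(logit)

Because both components feed one shared logit, their gradients are
backpropagated simultaneously from the same loss, which is what
distinguishes joint training from ensembling separately trained
models.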
The main contributions of the paper include:
• The Wide & Deep learning framework for jointly train-
ing feed-forward neural networks with embeddings and