Learning Tree-based Deep Model for Recommender Systems
Han Zhu, Xiang Li, Pengye Zhang, Guozheng Li, Jie He, Han Li, Kun Gai
Alibaba Group
{zhuhan.zh,yushi.lx,pengye.zpy,guozheng.lgz,jay.hj,lihan.lh,jingshi.gk}@alibaba-inc.com
ABSTRACT
Model-based methods for recommender systems have been studied extensively in recent years. In systems with a large corpus, however, the calculation cost for the learnt model to predict all user-item preferences is tremendous, which makes full corpus retrieval extremely difficult. To overcome this calculation barrier, models such as matrix factorization resort to the inner product form (i.e., modeling user-item preference as the inner product of user and item latent factors) and indexes to facilitate efficient approximate k-nearest neighbor search. However, it remains challenging to incorporate more expressive interaction forms between user and item features, e.g., interactions through deep neural networks, because of the calculation cost.
In this paper, we focus on the problem of introducing arbitrary advanced models to recommender systems with a large corpus. We propose a novel tree-based method which can provide logarithmic complexity w.r.t. corpus size even with more expressive models such as deep neural networks. Our main idea is to predict user interests from coarse to fine by traversing tree nodes in a top-down fashion and making a decision for each user-node pair. We also show that the tree structure can be jointly learnt towards better compatibility with users' interest distribution and hence facilitate both training and prediction. Experimental evaluations on two large-scale real-world datasets show that the proposed method significantly outperforms traditional methods. Online A/B test results on the Taobao display advertising platform also demonstrate the effectiveness of the proposed method in production environments.
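To make the idea concrete, below is a minimal Python sketch of the coarse-to-fine retrieval described above; the Node structure, the score(node) stand-in for the learned user-node preference model, and the beam width k are illustrative assumptions, not the exact implementation evaluated in this paper.

class Node:
    def __init__(self, item_id=None, children=()):
        self.item_id = item_id          # set only for leaf nodes
        self.children = list(children)

    @property
    def is_leaf(self):
        return not self.children

def retrieve_top_k(root, score, k):
    # Traverse top-down, keeping the k highest-scoring nodes per level;
    # score(node) stands in for a learned user-node preference model.
    candidates, frontier = [], [root]
    while frontier:
        children = [c for n in frontier for c in n.children]
        children.sort(key=score, reverse=True)
        frontier = []
        for node in children[:k]:
            (candidates if node.is_leaf else frontier).append(node)
    return sorted(candidates, key=score, reverse=True)[:k]

Since at most k nodes are expanded per level, the number of user-node model evaluations grows with the tree depth, i.e., logarithmically in the corpus size.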
CCS CONCEPTS
• Computing methodologies → Classification and regression trees; Neural networks; • Information systems → Recommender systems;
KEYWORDS
Tree-based Learning, Recommender Systems, Implicit Feedback
1 INTRODUCTION
Recommendation has been widely used by various kinds of content providers. Personalized recommendation methods, based on the intuition that users' interests can be inferred from their historical behaviors or from other users with similar preferences, have been proven effective at YouTube [7] and Amazon [22].
Designing such a recommendation model to predict the best candidate set from the entire corpus for each user poses many challenges. In systems with an enormous corpus, even well-performing recommendation algorithms may fail to predict from the entire corpus, since linear prediction complexity w.r.t. the corpus size is unacceptable. Deploying such a large-scale recommender system requires that the amount of calculation to predict for each single user be limited. Besides preciseness, the novelty of recommended items also matters for user experience: results that contain only items homogeneous to the user's historical behaviors are not desirable.
To reduce the amount of calculation and handle the enormous corpus, memory-based collaborative filtering methods are widely deployed in industry [22]. As a representative method in the collaborative filtering family, item-based collaborative filtering [31] can recommend from a very large corpus with relatively few computations, relying on the pre-calculated similarity between item pairs and using the user's historical behaviors as triggers to recall the most similar items. However, this restricts the scope of the candidate set: not all items, but only items similar to the triggers, can ultimately be recommended. This restriction prevents the recommender system from jumping out of historical behavior to explore potential user interests, which limits the accuracy of recalled results, and in practice the recommendation novelty is also criticized. Another way to reduce calculation is coarse-grained recommendation. For example, the system recommends a small number of item categories for each user and picks out all corresponding items, followed by a ranking stage. However, for a large corpus, the calculation problem is still not solved: if the category number is large, the category recommendation itself meets the calculation barrier; if not, some categories will inevitably include too many items, making the subsequent ranking calculation impracticable. Besides, the categories used are usually not designed for the recommendation problem, which can seriously harm recommendation accuracy.
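For concreteness, the following minimal sketch illustrates the item-based CF recall described above, assuming an item-to-item similarity table precomputed offline; the function and variable names are illustrative, not from [31].

from collections import defaultdict

def cf_recall(user_history, sim_table, top_n=100):
    # user_history: list of (item_id, weight) trigger behaviors.
    # sim_table: item_id -> list of (similar_item_id, similarity).
    scores = defaultdict(float)
    for trigger, weight in user_history:
        for item, sim in sim_table.get(trigger, []):
            scores[item] += weight * sim   # accumulate over triggers
    # Only items similar to some trigger can appear here, which is
    # exactly the candidate-scope restriction discussed above.
    return sorted(scores, key=scores.get, reverse=True)[:top_n]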
In the literature of recommender systems, model-based methods are an active topic. Models such as matrix factorization (MF) [19, 30] try to decompose pairwise user-item preferences (e.g., ratings) into user and item factors, then recommend to each user its most preferred items. Factorization machine (FM) [28] further proposes a unified model that can mimic different factorization models with any kind of input data. In some real-world scenarios that have no explicit preference but only implicit user feedback (e.g., user behaviors like clicks or purchases), Bayesian personalized ranking [29] gives a solution that formulates the preference in triplets with partial order and applies it to MF models. In industry, YouTube uses a deep neural network [7] to learn both user and item embeddings, where the two kinds of embeddings are generated from their corresponding features separately. In all the above kinds of methods, the preference of a user-item pair can be formulated as the inner product of the user and item vector representations. The prediction stage is thus equivalent to retrieving the user vector's nearest neighbors in inner product space. For this vector search problem, indexes like hashing or quantization [18] for approximate k-nearest neighbor (kNN) search can ensure the efficiency of retrieval.
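As an illustration of this prediction stage, the sketch below scores items by inner product and retrieves the top k by a brute-force partial sort; in practice the brute-force scan is replaced by an approximate kNN index such as hashing or quantization [18]. The random embeddings are placeholders for learned user and item factors.

import numpy as np

def top_k_items(user_vec, item_matrix, k=10):
    # item_matrix: (num_items, dim); user_vec: (dim,).
    scores = item_matrix @ user_vec           # O(num_items * dim)
    top = np.argpartition(-scores, k)[:k]     # unordered top-k indices
    return top[np.argsort(-scores[top])]      # rank the k winners

rng = np.random.default_rng(0)
items = rng.normal(size=(100_000, 64)).astype(np.float32)
user = rng.normal(size=64).astype(np.float32)
print(top_k_items(user, items, k=10))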
However, the inner product interaction form between user and item vector representations severely limits the model's capability. There