Image Matters: Visually modeling user behaviors using
Advanced Model Server
Tiezheng Ge, Liqin Zhao, Guorui Zhou, Keyu Chen, Shuying Liu
Huiming Yi, Zelin Hu, Bochao Liu, Peng Sun, Haoyu Liu, Pengtao Yi, Sui Huang
Zhiqiang Zhang, Xiaoqiang Zhu, Yu Zhang, Kun Gai
Alibaba Inc.
{tiezheng.gtz, zhang.zhiqiang, jingshi.gk}@alibaba-inc.com
ABSTRACT
In Taobao, the largest e-commerce platform in China, billions of
items are provided and typically displayed with their images. For
better user experience and business effectiveness, Click Through Rate (CTR) prediction in the online advertising system exploits abundant user historical behaviors to identify whether a user is interested in a candidate ad. Enhancing behavior representations with user behavior images captures the user's visual preference and can greatly help CTR prediction. We therefore propose to model user preference jointly with user behavior ID features and behavior images.
However, compared with utilizing the candidate ad image in CTR prediction, which introduces only one image per sample, training with user behavior images brings tens to hundreds of images into one sample, giving rise to a great challenge in both communication and computation. With the well-known Parameter Server (PS) framework, implementing such a model requires communicating the raw image features, leading to an unacceptable communication load. This indicates that PS is not suitable for this scenario. In this paper, we propose
a novel and efficient distributed machine learning paradigm called Advanced Model Server (AMS). In AMS, the forward/backward process can also happen on the server side, and only high-level semantic features with a much smaller size need to be sent to the workers. AMS thus dramatically reduces the communication load, which makes the otherwise arduous joint training feasible. Based on AMS, we carefully study methods of effectively combining the images and ID features, and then propose a Deep Image CTR Model. Our approach achieves significant improvements in both online and offline evaluations, and has been deployed in the Taobao display advertising system, serving the main traffic.
CCS CONCEPTS
• Information systems → Online advertising; Recommender systems;
KEYWORDS
Online advertising; User modeling; Computer vision
1 INTRODUCTION
Taobao is the largest e-commerce platform in China, serving hun-
dreds of millions of users with billions of items through both mobile
app and PC website. Users come to Taobao to browse these items
through search or personalized recommendation. Each item is usually displayed with an item image along with some descriptive text. When interested in an item, users can click that image to see the details. Fig 1(a) shows an example of recommended items in the Taobao mobile app.
Taobao also established one of the world’s leading display adver-
tising systems, helping millions of advertisers to connect to users.
Display advertising is an indispensable form of online advertising. By identifying user interests, it can be presented in various spots like Guess What You Like and efficiently delivers marketing messages to the right customers. The cost-per-click (CPC) pricing method is adopted for Taobao display advertising and is sufficiently effective [32]. In CPC mode, the ad publisher ranks the candidate ads by effective cost per mille (eCPM), which can be estimated by multiplying the bid price by the estimated click through rate (CTR). Such a strategy makes CTR prediction the core task in the advertising system.
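Concretely, for a candidate ad with per-click bid price $\mathrm{bid}$ and predicted click through rate $\mathrm{pCTR}$, the ranking score follows (scaled to a per-thousand-impression basis, the usual eCPM convention):
\[
\mathrm{eCPM} = \mathrm{bid} \times \mathrm{pCTR} \times 1000 .
\]
With bids fixed by advertisers, the accuracy of the estimated pCTR directly determines ranking quality and, in turn, both user experience and revenue.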
CTR prediction scores a user's preference for an item, and largely
relies on understanding user interests from historical behaviors.
Users browse and click items billions of times in Taobao every day, and these visits bring a huge amount of log data weakly reflecting user interests. Traditional research on CTR prediction focuses on carefully designed feedback features [1, 28] and shallow models, e.g., Logistic Regression [23]. In recent years, deep learning based CTR prediction systems have emerged overwhelmingly [30]. These methods mainly involve sparse ID features, e.g., ad ID, user-interacted item ID, etc. However, when an ID occurs infrequently in the data, its parameters may not be well trained. Images provide intrinsic visual descriptions and thus bring better generalization to the model. Considering that item images are what users directly interact with, these images can provide more visual information about user interests. We propose to naturally describe each behavior by such images, and to jointly model them with ID features in CTR prediction.
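To make the idea of joint modeling concrete, the following is a minimal sketch, not the architecture proposed in this paper; the pooling scheme, layer sizes, and class names are illustrative assumptions. It shows how sparse behavior ID embeddings and behavior image features (e.g., extracted by a pretrained CNN) could be combined in a deep CTR model, written in Python with PyTorch:

import torch
import torch.nn as nn

class JointIDImageCTR(nn.Module):
    """Illustrative sketch: fuse behavior ID embeddings with behavior
    image features to predict CTR. Not the paper's actual model."""
    def __init__(self, num_ids, id_dim=16, img_dim=128, hidden=200):
        super().__init__()
        self.id_emb = nn.Embedding(num_ids, id_dim)    # sparse behavior ID features
        self.img_proj = nn.Linear(img_dim, id_dim)     # project image features to a shared space
        self.mlp = nn.Sequential(
            nn.Linear(2 * id_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1))

    def forward(self, behavior_ids, behavior_imgs):
        # behavior_ids:  (batch, T)            IDs of the user's T behaviors
        # behavior_imgs: (batch, T, img_dim)   image features of the same behaviors
        id_vec = self.id_emb(behavior_ids).mean(dim=1)     # simple average pooling over behaviors
        img_vec = self.img_proj(behavior_imgs).mean(dim=1)
        logit = self.mlp(torch.cat([id_vec, img_vec], dim=-1))
        return torch.sigmoid(logit)                        # predicted CTR

In the system studied in this paper, each sample carries tens to hundreds of such behavior image features, which is precisely what makes naive PS-style distributed training impractical and motivates AMS.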
Training CTR models with image data requires huge computation and storage consumption. There are pioneering works [3, 21] dedicated to representing the ad with image features in CTR prediction. These studies did not explore user behavior images. Modeling user behavior images can help understand user visual preference and improve the accuracy of CTR prediction. Moreover, combining both user visual preference and ad visual information could further benefit CTR prediction. However, modeling user preference with interacted images is more challenging. Because the number of a typical user's behaviors ranges from tens to hundreds, the consumption is tens to hundreds of times that of modeling only the ad image. Considering that Taobao is serving hundreds of millions of users with billions of items, it is a non-trivial problem