Modeling Parameter Interactions in Ranking SVM
Yaogong Zhang†§∗, Jun Xu‡, Yanyan Lan‡, Jiafeng Guo‡, Maoqiang Xie†§, Yalou Huang†§, Xueqi Cheng‡
†College of Computer and Control Engineering, Nankai University
‡CAS Key Lab of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences
§College of Software, Nankai University
ygzhang@mail.nankai.edu.cn, {junxu, lanyanyan, guojiafeng}@ict.ac.cn, {xiemq, huangyl}@nankai.edu.cn, cxq@ict.ac.cn
ABSTRACT
Ranking SVM, which formalizes the problem of learning a
ranking model as that of learning a binary SVM on prefer-
ence pairs of documents, is a state-of-the-art ranking model
in information retrieval. The dual-form solution of the Ranking SVM model can be written as a linear combination of the preference pairs, i.e., $w = \sum_{(i,j)} \alpha_{ij} (x_i - x_j)$, where $\alpha_{ij}$ denotes the Lagrange multiplier associated with the pair $(i, j)$. Significant interactions obviously exist among the document pairs, because two preference pairs may share the same document as one of their items. It is thus natural to ask whether interactions also exist over the model parameters $\alpha_{ij}$, which we might leverage to build a better ranking model. This paper aims to answer this question. First,
we found that a low-rank structure exists over the Ranking SVM model parameters $\alpha_{ij}$, which indicates that the interactions do exist. Then, based on this discovery, we modified the original Ranking SVM model by explicitly applying a low-rank constraint to the parameters. Specifically, each parameter $\alpha_{ij}$ is decomposed as the inner product of two low-dimensional vectors, i.e., $\alpha_{ij} = \langle v_i, v_j \rangle$, where the vectors $v_i$ and $v_j$ correspond to documents $i$ and $j$, respectively. The learning process thus becomes optimizing the modified dual-form objective function with respect to the low-dimensional vectors. Experimental results on three LETOR datasets show that our method, referred to as Factorized Ranking SVM, outperforms state-of-the-art baselines including the conventional Ranking SVM.
Categories and Subject Descriptors: H.3.3 [Informa-
tion Systems Applications]: Information Search and Re-
trieval – Retrieval Models
Keywords: Parameter interaction; Ranking SVM
∗The work was conducted when Yaogong Zhang was visiting CAS Key Lab of Network Data Science and Technology.
1. INTRODUCTION
Learning to rank has been widely used in information
retrieval and recommender systems. Among the learning
to rank models, Ranking SVM is a representative pairwise
ranking model, evolving from the popular support vector
machines (SVM) [1] for classification problems. In training,
Ranking SVM first constructs the preference pairs of the
documents based on their relevance labels (or click-through
data [7]). Then, a binary SVM model is learned based on
the preference pairs to capture the differences between docu-
ments with different relevance labels. In ranking, each doc-
ument is assigned a relevance score based on the learned
ranking model. Usually, the ranking model can be written in its dual form, in which the solution is a linear combination of the training preference pairs, i.e., $w = \sum_{(i,j)} \alpha_{ij} (x_i - x_j)$, where $x_i$ and $x_j$ are the first and second documents in the pair $(i, j)$, and $\alpha_{ij}$ is the corresponding Lagrange multiplier.
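To make the dual-form solution concrete, the following minimal sketch assembles $w$ from a handful of preference pairs. This is our own illustration in numpy; the toy feature matrix, pairs, and multiplier values are hypothetical and not taken from the paper.

```python
import numpy as np

# Hypothetical toy data: 4 documents with 3 features each, and
# 3 preference pairs (i, j) meaning document i should rank above j.
X = np.array([[0.9, 0.1, 0.3],
              [0.4, 0.8, 0.2],
              [0.2, 0.5, 0.7],
              [0.1, 0.2, 0.1]])
pairs = [(0, 3), (1, 3), (2, 3)]
alpha = {(0, 3): 0.5, (1, 3): 0.2, (2, 3): 0.8}  # Lagrange multipliers

# Dual-form solution: w = sum_{(i,j)} alpha_ij * (x_i - x_j)
w = sum(alpha[(i, j)] * (X[i] - X[j]) for (i, j) in pairs)

# Ranking: score each document by <w, x>; higher score ranks higher.
scores = X @ w
print(np.argsort(-scores))  # document indices from best to worst
```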
The constructed preference pairs obviously have significant interactions, since two preference pairs may share the same document as one of their items. In the dual-form solution of Ranking SVM, each preference pair is associated with a Lagrange multiplier $\alpha_{ij}$. Therefore, it is natural to ask whether there also exist interactions among these Lagrange multipliers. If the answer is yes, how can we utilize the interactions to improve Ranking SVM?
This paper tries to answer the above questions by analyzing the Lagrange multipliers of trained Ranking SVM models. Specifically, we arranged the Lagrange multipliers into a block diagonal matrix $A$, where $A(i, j) = \alpha_{ij}$ if $\alpha_{ij}$ appears in the Ranking SVM model and zero otherwise. Then, we performed singular value decomposition (SVD) on each block of $A$ and sorted the singular values in descending order. We found that for all queries, only 40% of the dimensions are needed to capture 90% of the energy, whereas capturing 100% of the energy requires at least 80% of the dimensions for almost all queries. This means that a low-rank structure exists in the matrix, which indicates strong interactions among the Lagrange multipliers.
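This analysis can be sketched as below for a single query's block. The helper name energy_profile, the synthetic low-rank-plus-noise block, and the squared-singular-value definition of "energy" are our assumptions for illustration; the paper does not spell out these implementation details.

```python
import numpy as np

def energy_profile(A_block):
    """Cumulative fraction of spectral energy captured by the
    top-k singular values of one query's block of A."""
    s = np.linalg.svd(A_block, compute_uv=False)  # descending order
    energy = np.square(s)
    return np.cumsum(energy) / energy.sum()

# Hypothetical block: A[i, j] = alpha_ij if the pair (i, j) appears
# in the trained model and zero otherwise; here we synthesize a
# low-rank matrix plus small noise to mimic the observed structure.
rng = np.random.default_rng(0)
A_block = (rng.normal(size=(20, 4)) @ rng.normal(size=(4, 20))
           + 0.05 * rng.normal(size=(20, 20)))

profile = energy_profile(A_block)
# Smallest number of dimensions needed to capture 90% of the energy.
k90 = np.searchsorted(profile, 0.90) + 1
print(f"{k90}/{len(profile)} dimensions capture 90% of the energy")
```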
Based on this discovery, we propose to improve the original Ranking SVM by explicitly modeling the parameter interactions in the training process. Specifically, we apply a low-rank constraint over the Lagrange multipliers in the dual-form solution of Ranking SVM. Each Lagrange multiplier $\alpha_{ij}$ in the dual-form objective function is factorized as the dot product of two $K$-dimensional latent vectors, i.e., $\alpha_{ij} = \langle v_i, v_j \rangle$, where $v_i$ and $v_j$ correspond to the first and second document in the preference pair, respectively. In this way,