Nonparametric Bayesian Multi-Task
Large-margin Classification
Changying Du 1,2, Jia He 1, Fuzhen Zhuang 1, Yuan Qi 3, Qing He 1

1 Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing 100190, China
2 University of Chinese Academy of Sciences, Beijing 100049, China, email: ducy@ics.ict.ac.cn
3 Departments of CS and Statistics, Purdue University, IN, USA
Abstract. In this paper, we present a nonparametric Bayesian multi-task large-margin classification model which can cluster tasks into the most appropriate number of groups and induce flexible model sharing within each task group simultaneously. Specifically, we first show a very simple method to integrate large-margin learning with hierarchical Bayesian models by employing an important variant of the standard SVM, i.e., the proximal SVM (PSVM), whose loss function is used to define a novel likelihood function. We then assume that the model parameter of each task consists of two parts: one is shared within each task group (the group-level parameter) while the other is specific to each distinct task (the task rescaling parameter). A Dirichlet process prior is imposed on the group-level parameter, while the task rescaling parameter is assigned a one-mean Laplace prior. The parameter of a task is then the corresponding group-level parameter times its task-specific rescaling parameter. We develop an efficient Markov chain Monte Carlo (MCMC) algorithm for model inference. Experiments on the Landmine detection data and the UCI Yeast data demonstrate the effectiveness of our method.
1 INTRODUCTION
Machine learning lies at the heart of artificial intelligence and has been extensively studied over the past decades. While traditional machine learning is approaching its performance limit, a new learning scenario called multi-task learning (MTL) [6] has attracted more and more attention in the machine learning and data mining communities [25, 2, 7, 26, 8, 15, 9, 21]. Multi-task learning learns multiple related tasks together so as to improve the performance of each task relative to learning them separately. Over the past decade, MTL has been successfully applied to many important areas including computer vision [24, 15], natural language processing [1], bioinformatics [20, 26] and landmine detection [25, 14].
It has been shown that the performance-boosting merit of MTL is mainly due to information sharing among tasks, which is the key aspect in the design of MTL algorithms. To uncover latent task structure and alleviate harmful information sharing, task grouping is a common practice in MTL [3, 25, 15, 16, 19]. Existing methods typically assume that tasks in the same cluster share the same model [3, 25], though it is more reasonable to allow some flexibility within each task group.
Meanwhile, large-margin classifiers such as SVMs are among the most popular classification models in traditional learning scenarios, yet there are still few successful multi-task large-margin classification models, especially ones capable of finding latent task groups automatically.
In this paper, we present a nonparametric Bayesian multi-task large-margin classification model which can cluster tasks into the most appropriate number of groups and induce flexible model sharing within each group simultaneously. Specifically, we first show a very simple method to integrate large-margin learning with hierarchical Bayesian models by employing an important variant of the standard SVM, i.e., the proximal SVM (PSVM) [11], whose empirical loss function can be used to define a novel likelihood function.
We then assume that the model parameter of each task consists of two parts: one is shared within each task group (the group-level parameter) while the other is specific to each distinct task (the task rescaling parameter). A Dirichlet process (DP) [10, 23] prior is imposed on the group-level parameter, while each dimension of the task rescaling parameter is assumed to have a one-mean Laplace prior. Due to the nonparametric clustering nature of the DP, we can automatically cluster the tasks into separate groups without pre-specifying the number of groups, which is hard to determine in advance. In each group all tasks share the same group-level parameter, while each task has its own small task-specific rescaling over the group parameter. By imposing a one-mean Laplace prior, the rescaling is sparse, and the parameter of a task is finally the group parameter times its task-specific rescaling parameter. This means that, within each task group, the task models are identical in most dimensions but may differ in a few, which constitutes a flexible model sharing scheme.
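In symbols, the construction can be sketched as follows; the notation here (theta_t for the group-level parameter drawn for task t, s_t for its rescaling vector, b for the Laplace scale) is our own shorthand for illustration rather than the paper's definitions:

% A sketch of the hierarchical prior under assumed notation:
G \sim \mathrm{DP}(\alpha, G_0), \qquad \boldsymbol{\theta}_t \mid G \sim G
  % tasks that draw the same atom of G form a group
s_{t,d} \sim \mathrm{Laplace}(1, b), \qquad d = 1, \ldots, D
  % one-mean prior: rescaling factors concentrate at 1
\mathbf{w}_t = \boldsymbol{\theta}_t \circ \mathbf{s}_t
  % task parameter: group parameter times elementwise rescaling

Here \circ denotes the elementwise (Hadamard) product, so a task's model agrees with its group's model exactly in every dimension where the rescaling factor equals one.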
We develop an efficient Markov chain Monte Carlo (MCMC) algorithm for model inference. Experiments on the Landmine detection data set and the UCI Yeast data set demonstrate that our method can not only outperform state-of-the-art MTL algorithms but also discover the task-clustering structure very well.
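The sampler itself is presented later, but for DP mixtures the group assignments are commonly resampled with a collapsed Gibbs step that follows the Chinese restaurant process. The Python sketch below illustrates this generic step; the helper task_log_marginal is a hypothetical placeholder for the marginal likelihood of task t's data under group k (with k = -1 denoting a fresh group drawn from the base measure), not a function from the paper:

import numpy as np

def sample_group_assignment(t, z, alpha, task_log_marginal):
    """One collapsed Gibbs step: resample the group label of task t.

    t                  index of the task being updated
    z                  integer array of current group labels, one per task
    alpha              DP concentration parameter
    task_log_marginal  hypothetical callable returning log p(data_t | group k);
                       k == -1 stands for a brand-new group
    """
    # Groups that remain non-empty once task t is taken out.
    groups = [k for k in np.unique(z) if np.sum(z == k) - (z[t] == k) > 0]

    log_probs = []
    for k in groups:
        n_k = np.sum(z == k) - (z[t] == k)   # other tasks currently in group k
        log_probs.append(np.log(n_k) + task_log_marginal(t, k))
    # Under the CRP, a new group is opened with prior weight alpha.
    log_probs.append(np.log(alpha) + task_log_marginal(t, -1))

    log_probs = np.asarray(log_probs)
    probs = np.exp(log_probs - log_probs.max())   # stabilized normalization
    probs /= probs.sum()

    choice = np.random.choice(len(probs), p=probs)
    z[t] = groups[choice] if choice < len(groups) else z.max() + 1
    return z

Sweeping this update over all tasks, interleaved with updates of the group-level and rescaling parameters, lets the number of groups grow and shrink automatically during inference.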
The remainder of this paper is organized as follows. Section 2 briefly covers the necessary preliminaries. In Section 3 we propose our nonparametric Bayesian multi-task large-margin classification model, starting from the definition of a novel likelihood function. Experimental results are presented in Section 4 and related work is discussed in Section 5. Finally, we conclude the paper in Section 6.