learning. Thus they proposed a regularized convex formulation to learn the relationships between different tasks, where the proposed formulation was viewed as a novel generalization of single-task learning.
2.2. The family of HK algorithms
2.2.1. MHKS
The original HK algorithm was expected to obtain a good classification performance, but it was sensitive to outliers [16]. In order to solve this problem, Leski proposed a modified HK algorithm named MHKS [16]. MHKS is based on regularized least squares and tries to maximize the separation margin [27–29]. To be more specific, MHKS constrains its separating hyperplane as follows:
\[ Yw \geq \mathbf{1}_{N\times 1}. \tag{1} \]
Consequently, the criterion function of MHKS is changed to
\[ \min_{w \in \mathbb{R}^{d+1},\, b \geq 0} L(w, b) = (Yw - \mathbf{1}_{N\times 1} - b)^{T}(Yw - \mathbf{1}_{N\times 1} - b) + c\,\tilde{w}^{T}\tilde{w}, \tag{2} \]
where c ≥ 0 is the regularized hyper-parameter that adjusts the tradeoff between the model complexity and the training error.
The procedure of MHKS is almost the same as that of the original HK classifier. The difference between MHKS and HK is that the augmented weight vector $w_k$ in MHKS becomes
\[ w_k = (Y^{T}Y + c\tilde{I})^{-1} Y^{T}(b_k + \mathbf{1}_{N\times 1}), \tag{3} \]
where $\tilde{I}$ is an identity matrix with the last element on the main diagonal set to zero.
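To make the update (3) concrete, the following is a minimal sketch of the MHKS training loop in Python/NumPy. It adopts the usual Ho–Kashyap convention of updating the margin vector b with the positive part of the residual; the function and parameter names (e.g. rho, n_iter) and default values are illustrative and not taken from [16].

```python
import numpy as np

def mhks_fit(X, y, c=0.1, rho=0.5, n_iter=100, tol=1e-4):
    """Minimal sketch of MHKS training (Eqs. (2)-(3)).

    X: (N, d) data matrix; y: (N,) labels in {+1, -1}.
    The rows of the label-signed augmented matrix Y are phi_i * [x_i, 1].
    Parameter names and defaults are illustrative only.
    """
    N, d = X.shape
    Y = y[:, None] * np.hstack([X, np.ones((N, 1))])   # (N, d+1)
    I_tilde = np.eye(d + 1)
    I_tilde[-1, -1] = 0.0             # identity with last diagonal entry zeroed
    P = np.linalg.inv(Y.T @ Y + c * I_tilde) @ Y.T     # fixed part of Eq. (3)
    b = np.zeros(N)                   # margin vector, kept non-negative
    w = P @ (b + 1.0)
    for _ in range(n_iter):
        w = P @ (b + 1.0)             # Eq. (3): w_k = (Y^T Y + c*I~)^{-1} Y^T (b_k + 1)
        e = Y @ w - 1.0 - b           # residual of the constraint Yw >= 1 + b
        b_new = b + rho * (e + np.abs(e))   # Ho-Kashyap-style update keeps b >= 0
        if np.linalg.norm(b_new - b) < tol:
            break
        b = b_new
    return w                          # predict with the sign of [x, 1] @ w
```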
2.2.2. MatMHKS
Since the vector representation of patterns fails in some image-based learning tasks, matrix-based algorithms were proposed in terms of both feature extraction [30–32] and classifier design [9]. MatMHKS is a typical matrixized classifier and can directly classify patterns represented as matrices. As a consequence, MatMHKS can be viewed as a matrixized improvement of MHKS.
In the matrix case, suppose that there is a binary-class classification problem with N matrix samples $(A_i, \varphi_i)$, $i = 1, \ldots, N$, where $A_i \in \mathbb{R}^{m\times n}$ and its corresponding class label $\varphi_i \in \{+1, -1\}$. The decision function of MatMHKS for the binary problem is given as
\[ g(A_i) = u^{T} A_i \tilde{v} \;\begin{cases} > 0, & \text{if } \varphi_i = +1, \\ < 0, & \text{if } \varphi_i = -1, \end{cases} \tag{4} \]
where both $u \in \mathbb{R}^{m}$ and $\tilde{v} \in \mathbb{R}^{n}$ are the weight vectors. The corresponding optimization function of MatMHKS is defined as follows:
\[ \min_{u \in \mathbb{R}^{m},\, \tilde{v} \in \mathbb{R}^{n},\, v_0,\, b \geq 0} J(u, \tilde{v}, v_0, b) = \sum_{i=1}^{N} \left( \varphi_i (u^{T} A_i \tilde{v} + v_0) - 1 - b_i \right)^{2} + c\left( u^{T} S_1 u + \tilde{v}^{T} S_2 \tilde{v} \right), \tag{5} \]
where $S_1 = m I_{m\times m}$ and $S_2 = n I_{n\times n}$ are the two regularized matrices corresponding to the weight vectors $u$ and $\tilde{v}$ respectively, and the regularized parameter $c$ ($c \in \mathbb{R}$, $c \geq 0$) controls the generalization ability of the classifier by making a tradeoff between the classifier complexity and the training error. The vectors $u$, $\tilde{v}$, and the bias $v_0$ can be obtained by gradient optimization of the formulation (5) with respect to $u$, $\tilde{v}$, and $v_0$ respectively. The detailed optimization procedure can be found in the literature [9].
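As an illustration of Eqs. (4) and (5), the sketch below computes the per-pattern MatMHKS output and one closed-form update of u obtained by setting the gradient of (5) with respect to u to zero for fixed $\tilde{v}$, $v_0$ and $b$, assuming $S_1 = mI$ as stated above; the analogous updates of $\tilde{v}$, $v_0$ and $b$ are detailed in [9]. Function and variable names are ours, not from the paper.

```python
import numpy as np

def matmhks_decision(A, u, v_tilde, v0):
    """Per-pattern output u^T A v~ + v0 (cf. Eqs. (4)-(5)); its sign gives the class."""
    return float(u @ A @ v_tilde + v0)

def matmhks_update_u(A_list, labels, v_tilde, v0, b, c):
    """Closed-form update of u for fixed (v~, v0, b), obtained by zeroing the
    gradient of Eq. (5) in u; uses S1 = m * I as in the text."""
    m = A_list[0].shape[0]
    lhs = c * m * np.eye(m)                    # c * S1
    rhs = np.zeros(m)
    for A, phi, b_i in zip(A_list, labels, b):
        Av = A @ v_tilde                       # A_i v~, an (m,) vector
        lhs += np.outer(Av, Av)
        rhs += Av * (phi * (1.0 + b_i) - v0)   # uses phi_i^2 = 1
    return np.linalg.solve(lhs, rhs)
```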
2.2.3. MultiV-MHKS
In the literature [8], MHKS was regarded as a single-view classifier that could be multiviewized into multiple matrixized MatMHKSs. We then adopted a joint learning of the different MatMHKSs and proposed a multi-view learning machine, MultiV-MHKS. Mathematically, suppose that there is an original vector pattern $x_i \in \mathbb{R}^{d}$. The $x_i$ can be represented by different matrices $A_i^{q} \in \mathbb{R}^{m_q \times n_q}$, $q = 1, \ldots, M$, where $d$ is equal to $m_q \times n_q$.
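In practice, the different matrix representations can be obtained simply by reshaping the original vector, as in the following sketch; the concrete shapes used here (1x64, 8x8, 64x1) are only an illustrative choice and are not prescribed by the text.

```python
import numpy as np

def matrix_views(x, shapes):
    """Build M matrix views A^q of one vector pattern x by reshaping,
    with d = m_q * n_q required for every view q."""
    views = []
    for (m_q, n_q) in shapes:
        assert m_q * n_q == x.size, "each view must satisfy d = m_q * n_q"
        views.append(x.reshape(m_q, n_q))
    return views

# e.g. a 64-dimensional pattern viewed as 1x64, 8x8 and 64x1 matrices:
A1, A2, A3 = matrix_views(np.arange(64.0), [(1, 64), (8, 8), (64, 1)])
```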
In MultiV-MHKS, we set $Y^{q} = [y_1^{q}, \ldots, y_N^{q}]^{T}$, $y_i^{q} = \varphi_i [u^{qT} A_i^{q}, 1]^{T}$, $i = 1, \ldots, N$, $b^{q} = [b_1^{q}, \ldots, b_N^{q}]^{T}$, and $v^{q} = [\tilde{v}^{qT}, v_0^{q}]^{T}$, where $q$ denotes the index number of the view in MultiV-MHKS. Then the criterion function of MultiV-MHKS is given as follows:
\[ \min_{\substack{u^{q} \in \mathbb{R}^{m_q},\, v^{q} \in \mathbb{R}^{n_q+1} \\ q = 1, \ldots, M}} J' = \sum_{q=1}^{M} \Big( (Y^{q} v^{q} - \mathbf{1}_{N\times 1} - b^{q})^{T} (Y^{q} v^{q} - \mathbf{1}_{N\times 1} - b^{q}) + c^{q} \big( u^{qT} S_1 u^{q} + v^{qT} \tilde{S}_2 v^{q} \big) \Big) + \gamma \sum_{q=1}^{M} \left( Y^{q} v^{q} - \frac{1}{M} \sum_{p=1}^{M} Y^{p} v^{p} \right)^{T} \left( Y^{q} v^{q} - \frac{1}{M} \sum_{p=1}^{M} Y^{p} v^{p} \right), \tag{6} \]
where $S_1 = m_q I_{m_q\times m_q}$ and $S_2 = n_q I_{n_q\times n_q}$, $\tilde{S}_2 = \begin{pmatrix} S_2 & 0 \\ 0 & 0 \end{pmatrix}$ is a matrix with a dimensionality of $(n_q+1)\times(n_q+1)$, $c^{q}$ is the regularized parameter for each view, and $\gamma$ is the coupling parameter. In the formulation (6), the weight value of each view is simply set to $1/M$. In this case, each MatMHKS plays an equal role in the whole classification. Then, in order to optimize the criterion function (6), we set the gradient of $J'$ with respect to both $u^{q}$ and $v^{q}$ to zero. Therefore we can get the following optimal results:
\[ u^{q} = \left( \left( 1 + \gamma \frac{M-1}{M^{2}} \right) \sum_{i=1}^{N} A_i^{q} \tilde{v}^{q} (A_i^{q} \tilde{v}^{q})^{T} + c^{q} S_1 \right)^{-1} \sum_{i=1}^{N} A_i^{q} \tilde{v}^{q} \left( \varphi_i (b_i^{q} + 1) - \left( 1 + \gamma \frac{M-1}{M^{2}} \right) v_0^{q} + \gamma \frac{M-1}{M^{2}} \sum_{p=1, p\neq q}^{M} \big( u^{pT} A_i^{p} \tilde{v}^{p} + v_0^{p} \big) \right), \tag{7} \]
\[ v^{q} = \left( \left( 1 + \gamma \frac{M-1}{M^{2}} \right) Y^{qT} Y^{q} + c^{q} \tilde{S}_2 \right)^{-1} Y^{qT} \left( \mathbf{1}_{N\times 1} + b^{q} + \gamma \frac{M-1}{M^{2}} \sum_{p=1, p\neq q}^{M} Y^{p} v^{p} \right). \tag{8} \]
The iteration for both $u^{q}$ and $v^{q}$ is the same as that in MatMHKS. Since MultiV-MHKS is a joint learning of multiple views, its decision function integrates the multiple MatMHKSs and is given as follows:
\[ g(z) = \frac{1}{M} \sum_{q=1}^{M} \left( u^{qT} Z^{q} \tilde{v}^{q} + v_0^{q} \right) \;\begin{cases} > 0, & \text{then } z \in \text{class } +1, \\ < 0, & \text{then } z \in \text{class } -1, \end{cases} \tag{9} \]
where $z$ is the test sample and $Z^{q}$ is the $q$th matrix representation of $z$.
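A minimal sketch of the decision rule (9) is given below. It assumes that each view $q$ is obtained by reshaping the test vector $z$, as in the reshaping sketch of Section 2.2.3, and that the per-view parameters $(u^{q}, \tilde{v}^{q}, v_0^{q})$ have already been trained; all names are illustrative.

```python
import numpy as np

def multiv_mhks_decision(z, shapes, params):
    """Decision rule of Eq. (9): average the M per-view MatMHKS outputs.

    z      : (d,) test vector, reshaped into Z^q of shape (m_q, n_q) per view.
    shapes : list of (m_q, n_q) pairs with d = m_q * n_q.
    params : list of trained per-view triples (u_q, v_tilde_q, v0_q).
    """
    g = 0.0
    for (m_q, n_q), (u_q, v_tilde_q, v0_q) in zip(shapes, params):
        Z_q = z.reshape(m_q, n_q)              # q-th matrix representation of z
        g += u_q @ Z_q @ v_tilde_q + v0_q      # q-th MatMHKS output
    g /= len(shapes)                           # equal weight 1/M per view
    return +1 if g > 0 else -1                 # the sign of g(z) gives the class
```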
3. Proposed regularized multi-view learning machine
(RMultiV-MHKS)
MultiV-MHKS was expected to make full use of the advantages of the different matrix representations. But the equal weight of $1/M$ was a simple setting for each MatMHKS in MultiV-MHKS, which might not be sensible in some real-world cases. For example, a certain matrix representation may supply little or even no useful information for discrimination, yet the decision function (9) would still take this less useful matrix representation into the final classification just like the other, more useful ones. This urges us to assign a different weight to each matrix representation. In order to realize such an