where $\xi_i$ is the penalty for violating the constraints, and $C$ is a regularization parameter that makes a tradeoff between the margin and the penalties incurred.
If we focus on the constraints in (2), we can immediately capture the following insight about SVM, which generalizes easily to the soft-margin (relaxed) version.
Proposition 1: SVM constrains the separation between classes as $w^T S_b w \geq 4$, where $S_b = (\mu_1 - \mu_2)(\mu_1 - \mu_2)^T$ and $\mu_i$ is the mean of class $i$ $(i = 1, 2)$.
Proof: Without loss of generality, we assume that class one has the class label $y_i = 1$ and the other class has $y_j = -1$. Then we reformulate the constraints as $w^T x_i + b \geq 1$, where $x_i$ belongs to class one, and $w^T x_j + b \leq -1$, where $x_j$ belongs to class two.
Let the numbers of the samples in the two classes be $n_1$ and $n_2$, respectively. Then we have
$$\frac{1}{n_1} \sum_{i=1}^{n_1} \left( w^T x_i + b \right) = w^T \mu_1 + b \geq 1 \tag{4}$$
$$-\frac{1}{n_2} \sum_{j=1}^{n_2} \left( w^T x_j + b \right) = -\left( w^T \mu_2 + b \right) \geq 1. \tag{5}$$
Adding the two inequalities (4) and (5), we obtain
$$w^T (\mu_1 - \mu_2) \geq 2. \tag{6}$$
Squaring the inequality (6), we further have
$$w^T (\mu_1 - \mu_2)(\mu_1 - \mu_2)^T w \geq 4. \tag{7}$$
That is, $w^T S_b w \geq 4$.
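As a quick numerical check of Proposition 1, the following minimal sketch (ours, not part of the paper; the synthetic data, scikit-learn's SVC, and the large value of C that approximates the hard-margin problem are all assumptions) trains a linear SVM and evaluates $w^T S_b w$:

```python
# Numerical check of Proposition 1 (illustrative sketch):
# for a (nearly) hard-margin linear SVM, w^T S_b w >= 4,
# where S_b = (mu1 - mu2)(mu1 - mu2)^T and mu_i are the class means.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Two linearly separable Gaussian blobs (assumed toy data).
X1 = rng.normal(loc=[2.0, 2.0], scale=0.5, size=(50, 2))
X2 = rng.normal(loc=[-2.0, -2.0], scale=0.5, size=(50, 2))
X = np.vstack([X1, X2])
y = np.hstack([np.ones(50), -np.ones(50)])

# A large C approximates the hard-margin SVM in (2).
clf = SVC(kernel="linear", C=1e6).fit(X, y)
w = clf.coef_.ravel()

mu1, mu2 = X1.mean(axis=0), X2.mean(axis=0)
S_b = np.outer(mu1 - mu2, mu1 - mu2)

separation = w @ S_b @ w  # equals (w^T (mu1 - mu2))^2
print(separation >= 4.0)  # True when all constraints are satisfied
```

For linearly separable data every constraint in (2) holds at the optimum, so the printed value is at least 4; with a soft margin and overlapping classes the bound holds only approximately.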
Consequently, the above proposition makes clear that SVM imposes a natural lower bound on the separation between classes, exactly in line with its original motivation of margin maximization. However, it is likely to neglect the prior data structural information within classes, which is also vital for classification.
A linear classifier example is illustrated in Fig. 1, where ‘*’ and ‘.’ denote the two classes, respectively. Here each class is generated via a mixture of two Gaussian distributions with approximately perpendicular trends of data occurrence. As we mentioned before, SVM does not sufficiently utilize this obvious structural information, and the derived decision plane, denoted by the dashed line in Fig. 1(a), lies approximately in the middle of the three support vectors [4]–[6] in the training set, which leads to inaccurate classification on the testing set [Fig. 1(b)]. A more reasonable decision plane is the one denoted by the solid line in Fig. 1. This boundary is oriented almost parallel to the data trend of the ‘.’ class and, at the same time, lies relatively far from the ‘*’ class owing to the approximately perpendicular trend of the corresponding data. Consequently, SRSVM achieves better classification performance on both the training and testing sets.
B. Structural Granularity
Definition 1: Given a dataset $T = \{x_i, y_i\}_{i=1}^{n}$, let $S_1, S_2, \ldots, S_t$ be a partition of $T$ according to some relation measure, where the partition characterizes the whole data in the form of some structures such as clusters, and $S_1 \cup S_2 \cup \cdots \cup S_t = T$. Here $S_i$ $(i = 1, 2, \ldots, t)$ is called a structural granularity.
Fig. 1. Illustration of the importance of the structural information within classes in SRSVM and SVM. (a) Discriminant boundaries in the training set. (b) Discriminant boundaries in the testing set.
Fig. 2. Illustration of structural granularity (global, class, cluster, and point granularity) for classes I and II.
Clearly, structural granularity relies on the different assumptions about the actual data structures in real-world problems. In our view, it involves four layers, as illustrated in Fig. 2. The data in class I (“◦”) are generated by three Gaussian distributions, and the data in class II are obtained from two Gaussian distributions.
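To make the within-class structure of Fig. 2 concrete, the following minimal sketch (ours, not the paper's code; the component means and the use of scikit-learn's GaussianMixture are assumptions) fits a Gaussian mixture model [20] to each class separately, so that each fitted component, a centroid together with a covariance ellipsoid, plays the role of one granule $S_i$:

```python
# Recover within-class cluster structure by fitting one GMM per class,
# mimicking the Fig. 2 setting (illustrative sketch).
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Class I: mixture of three Gaussians; class II: mixture of two.
X1 = np.vstack([rng.normal(m, 0.3, size=(40, 2))
                for m in ([0, 0], [2, 1], [1, 3])])
X2 = np.vstack([rng.normal(m, 0.3, size=(40, 2))
                for m in ([4, 0], [5, 2])])

# Each fitted component (mean + covariance matrix) is one ellipsoidal
# cluster, i.e., one structural granule S_i of the corresponding class.
gmm1 = GaussianMixture(n_components=3, covariance_type="full").fit(X1)
gmm2 = GaussianMixture(n_components=2, covariance_type="full").fit(X2)
print(gmm1.means_)              # three cluster centroids for class I
print(gmm2.covariances_.shape)  # (2, 2, 2): two covariance ellipsoids
```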
According to the Gaussian mixture model [20] for a mixture of Gaussian distributions, we can characterize the structural granularity of the training data by ellipsoids (or clusters), whose centroids (or means) and covariance matrices reflect the properties of the Gaussian distributions. As a result, four granularity layers can be differentiated:
Global Granularity: The granularity refers to the dataset $T$. With this granularity, the whole data are characterized or enclosed by a single ellipsoid, as shown by the solid-line ellipsoid in Fig. 2, whose centroid $\mu_{\mathrm{global}}$ and covariance matrix $\Sigma_{\mathrm{global}}$ can be obtained by minimizing the volume of the ellipsoid [9]
$$\min_{\Sigma_{\mathrm{global}},\, \mu_{\mathrm{global}}} \ \ln \left| \Sigma_{\mathrm{global}} \right| \quad \text{s.t.} \quad \left( x_i - \mu_{\mathrm{global}} \right)^T \Sigma_{\mathrm{global}}^{-1} \left( x_i - \mu_{\mathrm{global}} \right) \leq 1, \quad \Sigma_{\mathrm{global}} \succeq 0. \tag{8}$$
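Problem (8) is the classical minimum-volume enclosing ellipsoid and is convex. As a hedged illustration (ours, not the paper's implementation; the parameterization $\{x : \|Ax + b\| \leq 1\}$ and the use of cvxpy are assumptions), it can be solved as follows, with $\mu_{\mathrm{global}} = -A^{-1}b$ and $\Sigma_{\mathrm{global}} = (A^T A)^{-1}$:

```python
# Minimum-volume enclosing ellipsoid for (8) via convex optimization
# (illustrative sketch). Ellipsoid: {x : ||A x + b|| <= 1}.
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))      # assumed toy data, one row per sample
d = X.shape[1]

A = cp.Variable((d, d), PSD=True)  # symmetric PSD shape matrix
b = cp.Variable(d)

# vol(ellipsoid) is proportional to 1/det(A), so minimizing -log det A
# minimizes the volume; the constraints keep every sample inside.
constraints = [cp.norm(A @ x + b) <= 1 for x in X]
cp.Problem(cp.Minimize(-cp.log_det(A)), constraints).solve()

mu_global = -np.linalg.solve(A.value, b.value)
Sigma_global = np.linalg.inv(A.value.T @ A.value)
```

Since $\Sigma_{\mathrm{global}} = (A^T A)^{-1}$, minimizing $-\ln\det A$ is equivalent to minimizing $\ln|\Sigma_{\mathrm{global}}|$ in (8).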
The corresponding classifier, such as EKM, aims to utilize this global data structure, or more precisely the global data scatter, in its design.