数据结构引导的判别分析新框架

47 浏览量更新于2024-07-15 收藏 2.56MB PDF 举报

本文献《结构导向的判别分析框架》探讨了近年来判别分析（Discriminant Analysis, DA）算法的显著发展。作者Bo Yang、Songcan Chen和Xindong Wu在2009年8月接收并于2011年6月接受，最终于同年7月在线发表。这篇研究论文旨在填补一项重要的理论空白，即对数据中隐藏的结构粒度进行系统性分析。首先，作者指出尽管各种DA算法在设计上可能源于不同的动机，但它们的核心目标都是将数据中的结构信息注入到类内和类间散度矩阵（scatter matrices）中。这表明，理解数据的内在结构对于提升分类性能至关重要。然而，之前尚未有一个统一的框架来明确： 1. 数据中存在的哪些结构粒度是关键因素？ 2. 在DA算法中，如何利用这些结构粒度构建散度矩阵？ 3. 是否可以通过现有结构粒度开发出新的DA算法？本文提出了一种被称为“结构导向”（Structurally Motivated, SM）的框架，这是一种理论工具，它提供了对上述问题的清晰答案。SM框架通过将数据结构划分为不同粒度，如特征空间划分、聚类结构或潜在变量模型，来分类现有的DA算法。这样，它为评估各类算法在不同数据结构类型下的适用性提供了一个统一的数学形式，即通过比对结构粒度与散度矩阵的比例关系。具体来说，文章可能探讨了以下内容： - 对于线性判别分析（Linear Discriminant Analysis, LDA），它通常假设数据服从高斯分布，利用的是样本均值和协方差矩阵，反映出全局的平均结构和变异结构。 - 非线性判别分析（如判别聚类分析或核方法）可能基于局部特征空间或通过核函数转换，捕捉非线性结构，适用于复杂的数据分布。 - 结构学习驱动的DA，如稀疏或低秩分解，可能考虑数据的稀疏性和内在关联性，以此降低计算复杂性并提取更有效的特征。此外，SM框架还可能提供了理论基础，指导如何结合不同结构粒度来创建混合模型，或者发展新的DA变体，以适应不同类型的数据和应用需求。这篇论文为深入理解和改进判别分析算法提供了结构导向的视角，对于理解和开发具有更高精度和鲁棒性的机器学习方法具有重要意义。

is always used to characterize a manifold [21], for example,

the popular k-nearest-neighbor graph in manifold learning

just embodies the local neighborhood relation between

each sample and its k neighbors. Besides, the locality can

vividly reﬂect the size of structure granularity. Thus, the

locality structure, i.e., the manifestation of manifold

assumption, is introduced in the forthcoming SM frame-

work. Another conception, i.e., the class, is frequently

mentioned in supervised learning. Yet, it nearly is not

regarded as one of structures, because once the training

samples are given, the class just is ﬁxed and thus naturally

is separately addressed. However, the so-called class

essentially is a given rather than assumed prior. So, com-

pared with the structures of cluster and locality, the class is

in nature a structure with the most size of granularity.

These structures mentioned above are summarized to form

the structure granularity spectrum of SM framework,

namely, from class to cluster and then to locality. Figure 2

illustrates these structure granularities in a two-dimen-

sional space.

For the given samples of two classes ‘‘

’’ a nd ‘‘ ’’, the

formation of class granularity seems natural, that is, one

global structure for each class, as denoted by two big dash-

line ellipses in Fig. 2.MDA[1] is a typical algorithm based

on class granularity. However, it is unavoidable for such

naturally formed simple structure to underﬁt the data at

hand. For example, the big dash-line ellipse for Class 1

leaves a great deal of blank-space that is not covered by

any ‘‘

’’. By contrast, the cluster granularity structure is

more compact, since each class is congregated into multi-

clusters, each of which, respectively, involves a portion of

all samples, and thus little blank-space is left in it. As those

small dash-dot-line ellipses shown in Fig. 2, they, respec-

tively, indicate three clusters of Class 1 and 2 clusters of

Class 2. In each cluster, only some ‘‘

’’ a nd ‘‘ ’’ are

involved. Clearly, each cluster has very little blank-space

and thus its structure is more compact. For the data of

Fig. 1b, if cluster granularity is used as the prior assump-

tion and the number of clusters is set to 3, then the intrinsic

structure of the data is relatively effectively captured,

because the data really consist of three Gaussians. SDA [4]

is one of the algorithms based on cluster granularity. Fur-

ther, the right zoomed part in Fig. 2 shows the locality

structure around point x

, where the six stars connected by

the six dash lines represent its six nearest neighbors. In

fact, each sample has own locality structure and these

locality structures for all sample can be used to charac-

terize the data manifold. For the data in Fig. 1c, if the

locality structure with proper neighbor parameter is adopted,

then the swissroll often can be effectively characterized since

it is just a manifold. MFA [2] exactly utilizes such a structure

to perform manifold learning. Besides, from the three granu-

larities of Class 2 in Fig. 2, it can be found that their sizes

roughly reﬂect such a relation that class granularity C cluster

granularity C locality granularity.

Next, the typical algorithms corresponding to different

granularities will brieﬂy be reviewed and analyzed.

2.1 MDA with class-granularity structure

Multiple discriminant analysis [1] is one of the most pop-

ular linear methods for DA. It aims to ﬁnd an optimal

projection matrix U



that yields the maximum ratio of the

between-class scatter S

to the within-class scatter S

i¼1

ðm

 mÞðm

 mÞ

ð1aÞ

i¼1

and S

x2D

ðx m

Þðx m

;

i ¼ 1; ...; C

ð1bÞ

m ¼

k¼1

ð1cÞ

k¼1

; i ¼ 1; ...; C ð1dÞ

where C is the class size, n and n

are the sizes of all

samples and samples in class i. For LDA, the between-class

scatter S

in (1a) is changed as S

LDA

in (1e) while the other

formulations are still maintained.

LDA

¼ðm

 m

Þðm

 m

ð1eÞ

where m

and m

are, respectively, the means of classes 1

and 2.

Now let us check what structure granularity MDA

adopts. In terms of the construction of its scatter matrices

in (1a) and (1b), it can be found that they use the class-

granularity structure, because the within-class scatter S

the sum of all matrices S

,(i = 1, …, C), and S

is con-

structed by all the samples of class i with the class mean as

the representative point of the class, and is traditionally

called as the within-class scatter for class i. While the

between-class scatter S

also has a similar construction and

Class 1

Class 2

Class granularity

Cluster granularity

Locality granularity

Fig. 2 Different structure granularities of structure granularity

spectrum

352 Pattern Anal Applic (2011) 14:349–367

123

剩余18页未读，继续阅读

weixin_38712899

粉丝: 4
资源: 941

数据结构引导的判别分析新框架

An Iterative Spanning Forest Framework for Superpixel Segmentation

Fabrication of lateral electrodes on semiconductor nanowires through structurally matched insulation for functional optoelectronics

STRUCTURALLY ENHANCED INCREMENTAL NEURAL LEARNING FOR IMAGE CLASSIFICATION WITH SUBGRAPH EXTRACTION

Tetrahedral Coordination in the First Structurally Characterized Dichlorooxovanadium(V) Alkoxide

Preparation and luminescence properties of Na4CaSi3O9:Ce3+ phosphors for solid state lighting

A New Two-dimensional Copper(II) Complex with p-PhDTA Bridges① (2008年)

Synthesis and Crystal Structure, of a Mixed (Phthalocyaninato)(porphyrinato) Yttrium Double-decker Complex① (2011年)

A 3-fold Interpenetrated lvt Cd(Ⅱ) Network Constructed from 4-[(3-pyridyl)methylamino]benzoate Acid① (2014年)

A New Complex [Ni(L)2]・2H2O Containing an Interesting Six-membered Water Ring(HL = 2-Iminomethyl-6-methoxy-phenol)① (2008年)

[Advanced Chapter] Image Deblurring in MATLAB: Using Blind Deblurring Algorithms for Image ...

最新资源