Generalized nonlinear discriminant analysis and its small sample size problems
Li Zhang a,b,⁎, Wei Da Zhou b, Pei-Chann Chang c

a Research Center of Machine Learning and Data Analysis, School of Computer Science and Technology, Soochow University, Suzhou 215006, Jiangsu, China
b Institute of Intelligent Information Processing, Xidian University, Xi'an 710071, Shaanxi, China
c Department of Information Management, Yuan Ze University, Taoyuan 32026, Taiwan, China

⁎ Corresponding author at: Research Center of Machine Learning and Data Analysis, School of Computer Science and Technology, Soochow University, Suzhou 215006, Jiangsu, China. E-mail address: zhangliml@suda.edu.cn (L. Zhang).
Article info

Article history:
Received 25 May 2009
Received in revised form 10 September 2010
Accepted 14 September 2010
Available online 27 October 2010
Communicated by Liang Wang

Keywords:
Fisher discriminant analysis
Kernel trick
Small sample size problem
Abstract

This paper develops a generalized nonlinear discriminant analysis (GNDA) method and deals with its small sample size (SSS) problems. GNDA is a nonlinear extension of linear discriminant analysis (LDA), while kernel Fisher discriminant analysis (KFDA) can be regarded as a special case of GNDA. In LDA, an undersampled or small sample size problem occurs when the sample size is less than the sample dimensionality, which results in a singular within-class scatter matrix. Because GNDA involves a high-dimensional nonlinear mapping, small sample size problems arise rather frequently. To tackle this issue, this paper presents five different schemes for GNDA to solve the SSS problems. Experimental results on real-world data sets show that these schemes are very effective in tackling small sample size problems.
1. Introduction
Discriminant analysis has been widely used for feature
extraction and dimensionality reduction in pattern recognition.
Linear discriminant analysis (LDA), also known as Fisher linear discriminant, is one of the most commonly used methods [1]. The goal of LDA is to find an optimal subspace such that the separability of two classes is maximized. That is, LDA maximizes
$$\frac{\mathrm{tr}(W^{T} S_{b} W)}{\mathrm{tr}(W^{T} S_{w} W)} \qquad (1)$$
where $\mathrm{tr}(\cdot)$ denotes the trace of a matrix, $W$ is a linear projection or transformation matrix, $S_b$ is the between-class scatter matrix, and $S_w$ is the within-class scatter matrix. Maximizing (1) results in the following generalized eigenvalue problem:

$$S_{b} W = \lambda S_{w} W \qquad (2)$$
The optimal discriminant subspace is spanned by the generalized eigenvectors. If $S_w$ is nonsingular, the solution to the generalized eigenvalue problem (2) is obtained by applying eigendecomposition to $S_w^{-1} S_b$. However, for a small sample size (SSS) problem the scatter matrix $S_w$ is singular. For example, face recognition is a typical SSS problem, with high-dimensional data and few training samples. Several methods have been proposed to deal with the singularity of $S_w$, such as Fisherface [2], discriminant common vectors [3], dual space [4], LDA-GSVD (generalized singular value decomposition) [5], LDA-QR [6], PCA+NULL [7], and LDA-FKT (Fukunaga–Koontz transform) [8]. In [8], a unifying framework is proposed to understand these different methods. By using the Fukunaga–Koontz transform (FKT), the whole sample space can be decomposed into four subspaces. The discriminant information in the four subspaces differs, and the performance of each method depends on the subspaces it uses. The authors in [8] also report that LDA-GSVD is equivalent to LDA-FKT, and that LDA-FKT has the best performance.
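To make the criterion in (1) and the eigenproblem in (2) concrete, the following is a minimal sketch of LDA via a generalized eigendecomposition, assuming $S_w$ is nonsingular. It is an illustrative implementation rather than the authors' code; the function name lda_fit and the use of scipy.linalg.eigh are our own choices.

```python
import numpy as np
from scipy.linalg import eigh

def lda_fit(X, y, n_components):
    """Illustrative LDA: solve S_b w = lambda * S_w w as in Eq. (2).
    X: (n_samples, n_features); y: class labels.
    Assumes S_w is nonsingular (no small sample size problem)."""
    classes = np.unique(y)
    overall_mean = X.mean(axis=0)
    d = X.shape[1]
    S_w = np.zeros((d, d))
    S_b = np.zeros((d, d))
    for c in classes:
        Xc = X[y == c]
        class_mean = Xc.mean(axis=0)
        S_w += (Xc - class_mean).T @ (Xc - class_mean)   # within-class scatter
        diff = (class_mean - overall_mean)[:, None]
        S_b += Xc.shape[0] * (diff @ diff.T)             # between-class scatter
    # Generalized symmetric eigenproblem; eigh(S_b, S_w) requires S_w positive definite.
    eigvals, eigvecs = eigh(S_b, S_w)
    order = np.argsort(eigvals)[::-1]                    # largest eigenvalues first
    return eigvecs[:, order[:n_components]]              # columns of W span the subspace
```

When $S_w$ is singular, the factorization in the last step fails, which is exactly the small sample size situation the methods above are designed to handle.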
Unfortunately, LDA can only extract linear features from samples; it fails on data whose discriminative structure is nonlinear [9]. Kernel Fisher discriminant analysis (KFDA), one of the nonlinear discriminant methods, has been developed for extracting nonlinear discriminant features [9]. A work similar to KFDA is presented in [10]. Kernel functions are restricted to positive semi-definite symmetric functions, i.e., Mercer kernels, as in [11,12]. KFDA often encounters SSS problems because $S_w$ in a high-dimensional feature space is always singular. To overcome this computational difficulty in KFDA, a perturbation $\mu I$ is added to $S_w$ in [9], where $I$ is an identity matrix of the same size as $S_w$; kernel Fisherface is proposed in [13].
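As a loose illustration of this perturbation idea (a sketch under our own assumptions, not the exact procedure of [9] or [13]), one can regularize the within-class scatter before solving the generalized eigenproblem; the parameter name mu and the function name below are hypothetical.

```python
import numpy as np
from scipy.linalg import eigh

def regularized_discriminant_directions(S_b, S_w, mu=1e-3, n_components=1):
    """Sketch: cope with a singular S_w by adding the perturbation mu * I,
    then solve S_b w = lambda * (S_w + mu * I) w."""
    d = S_w.shape[0]
    S_w_reg = S_w + mu * np.eye(d)          # ridge-style perturbation makes S_w_reg positive definite
    eigvals, eigvecs = eigh(S_b, S_w_reg)   # generalized symmetric eigenproblem
    order = np.argsort(eigvals)[::-1]       # keep directions with the largest criterion values
    return eigvecs[:, order[:n_components]]
```

The choice of mu trades off numerical stability against fidelity to the original criterion: a larger perturbation guarantees invertibility but biases the estimated directions.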
This paper proposes a generalized nonlinear discriminant analysis (GNDA). GNDA consists of two steps. First, data in a sample space are mapped into a nonlinear mapping space by using some nonlinear mapping function. Then LDA is implemented in the nonlinear mapping space. In GNDA, the nonlinear mapping function can be any real-valued nonlinear function, for instance, empirical mapping functions as in [14,15], Mercer kernel mapping as in [11,12], etc. GNDA is identical