$\ell_{2,1}$-norm regularized Fisher criterion for optimal feature selection☆
Jian Zhang a, Jun Yu b,*, Jian Wan b, Zhiqiang Zeng c

a School of Science and Technology, Zhejiang International Studies University, Hangzhou 310012, China
b School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China
c College of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China
Article info
Article history:
Received 3 November 2014
Received in revised form
6 February 2015
Accepted 19 March 2015
Communicated by Huaping Liu
Available online 9 April 2015
Keywords:
Feature selection
Fisher criterion
$\ell_{2,1}$ norm
Sparsity
Abstract

Feature selection has proved to be an effective way to improve the results of many pattern recognition tasks such as image classification and automatic face recognition. Among existing methods, those based on the Fisher criterion have received considerable attention owing to their efficiency and good generalization over classifiers. However, the original Fisher criterion-based methods ignore the interdependencies between different features. To this end, this paper proposes an optimized feature selection method that incorporates $\ell_{2,1}$-norm regularization into the original Fisher criterion. The $\ell_{2,1}$-norm regularization term ensures the sparsity of the feature selection matrix, which drives the feature selection result close to the globally optimal solution. Owing to this sparsity, a normalization constraint constructed from the inter-class scatter matrix of the Fisher criterion is used to simplify the original problem, so that the solution can be derived by an iterative algorithm whose key step is solving a generalized eigenvalue problem. Experiments on various data sets indicate that the proposed method provides higher accuracy in pattern recognition tasks than several existing approaches.
© 2015 Elsevier B.V. All rights reserved.
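As a reading aid for the abstract, one standard way to write such a regularized Fisher objective is sketched below. The notation is assumed, not necessarily the paper's exact formulation: $S_b$ and $S_w$ denote the inter- and intra-class scatter matrices, $W \in \mathbb{R}^{d \times m}$ the feature selection matrix, and $\lambda > 0$ an assumed trade-off weight.

```latex
% Hedged reconstruction of an l_{2,1}-regularized Fisher objective
% (notation assumed; see the paper's later sections for the exact form):
\min_{W}\ \operatorname{tr}\!\left(W^{\top} S_w W\right)
          + \lambda \,\lVert W \rVert_{2,1}
\quad \text{s.t.}\quad W^{\top} S_b W = I,
\qquad \text{where}\quad
\lVert W \rVert_{2,1} = \sum_{i=1}^{d} \sqrt{\sum_{j=1}^{m} W_{ij}^{2}} .
```

Under this reading, rows of $W$ driven to zero by the $\ell_{2,1}$ term correspond to discarded features, and the constraint built from $S_b$ is what reduces each iteration's key step to a generalized eigenvalue problem, as the abstract states.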
1. Introduction
With the rapid development of the public security business, pattern recognition techniques have found use in various applications such as intelligent video surveillance and automatic entrance control. In these applications, objects with high-level semantics [1] are often represented by quantitative low-level features [2] for classification based on prior information. These features usually have high dimensionality and high redundancy, both of which degrade classification results. To this end, many approaches have been proposed to reduce data dimensionality and redundancy in pattern recognition, and feature selection [3] is one of the most important means of solving this problem.
Feature selection methods can be divided into three categories, unsupervised, supervised, and semi-supervised, according to whether labeled samples are available for training. It is generally believed that unsupervised methods are inferior to supervised and semi-supervised methods, which include filter methods [4,5], wrapper methods [6], and embedded methods [7]. Filter methods evaluate an objective function on each feature independently and select the features with the largest objective values for pattern recognition. Wrapper methods find feature groups according to a certain search strategy and evaluate the objective values of these groups with already trained classifiers to decide which group is more suitable for pattern recognition. Embedded methods combine the search process with classifier construction and achieve higher computational efficiency than wrapper methods. (A minimal sketch after this paragraph contrasts the first two styles.)
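To make the filter/wrapper distinction concrete, the following is an illustrative sketch only, not code from the paper; the helper callables `score_fn` (a per-feature score) and `eval_group` (e.g., cross-validated classifier accuracy) are hypothetical parameters.

```python
import numpy as np
from itertools import combinations

def filter_select(X, y, k, score_fn):
    """Filter style: score every feature independently with score_fn
    and keep the indices of the k highest-scoring features."""
    scores = np.array([score_fn(X[:, j], y) for j in range(X.shape[1])])
    return np.argsort(scores)[::-1][:k]

def wrapper_select(X, y, k, eval_group):
    """Wrapper style: search over whole feature groups (exhaustively here)
    and keep the group that a trained classifier rates best."""
    best_group, best_score = None, -np.inf
    for group in combinations(range(X.shape[1]), k):
        score = eval_group(X[:, group], y)  # e.g. cross-validated accuracy
        if score > best_score:
            best_group, best_score = group, score
    return best_group
```

The contrast explains the efficiency gap: the filter scores each of the $d$ features once, while the wrapper trains and evaluates a classifier for every candidate group its search strategy visits.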
Though wrapper and embedded methods often lead to higher classification accuracy, filter methods are still widely used because they are simple, computationally efficient, and generalize well over various classifiers. Among all filter methods, the Fisher criterion attracts considerable attention owing to the good performance of its objective function. The Fisher criterion evaluates the importance of each feature for classification individually, and is thus not suited to selecting a group of features simultaneously. Recent improvements of the Fisher criterion remove this limitation, but lack the ability to achieve optimal feature-group selection (a sketch of the classical per-feature score follows).
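For concreteness, here is a minimal sketch of the classical per-feature Fisher score in its standard textbook form (assumed here; the paper's matrix-based criterion is developed in later sections):

```python
import numpy as np

def fisher_scores(X, y):
    """Classical per-feature Fisher score: the ratio of between-class
    to within-class variance, computed independently per feature."""
    classes = np.unique(y)
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in classes:
        Xc = X[y == c]
        nc = len(Xc)
        between += nc * (Xc.mean(axis=0) - overall_mean) ** 2
        within += nc * Xc.var(axis=0)
    return between / np.maximum(within, 1e-12)  # guard against zero variance
```

Because each score involves only one feature's statistics, a set of mutually redundant features can all receive high scores; this is precisely the interdependency problem the proposed method targets.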
Another improvement to feature selection is the introduction of a sparsity constraint as a regularization term of the loss function. The intuition behind this is that selecting a minority of
☆ This paper is supported by the National Natural Science Foundation of China (Nos. 61303143 and 61472110), the Program for New Century Excellent Talents in University (NCET-12-0323), the Hong Kong Scholar Programme (XJ2013038), and the Scientific Research Fund of Zhejiang Provincial Education Department (No. Y201326609).
* Corresponding author.
E-mail address: yujun@hdu.edu.cn (J. Yu).