Class specific discriminant dictionary learning with kernels for
face recognition
Bao-Di Liu (thu.liubaodi@gmail.com); College of Information and Control Engineering, China University of Petroleum; Qingdao, 266580, China
Yuting Wang; Department of Informatics, Karlsruhe Institute of Technology; Karlsruhe, 76131, Germany
Liangke Gui; School of Computer Science, Carnegie Mellon University; Pittsburgh, PA 15213, USA
Yu-Xiong Wang; School of Computer Science, Carnegie Mellon University; Pittsburgh, PA 15213, USA
Bin Shen; Department of Computer Science, Purdue University; West Lafayette, IN 47907 USA
Xue Li; Department of Electronic Engineering, Tsinghua University; Beijing 100084, China
Yan-Jiang Wang; College of Information and Control Engineering, China University of Petroleum; Qingdao, 266580, China
Abstract
The past few years have witnessed the impressive perfor-
mance of sparse representation based classification (SRC) for
visual recognition. However, the SRC technique may lead to
high residual error and poor performance because the training
samples in each class contribute equally to the dictionary of the
corresponding class. This has inspired the emergence of class
specific dictionary learning algorithms. In this paper, we propose
a novel approach—class specific dictionary learning combined
with linear discriminant analysis constraints in Reproducing Ker-
nel Hilbert Space (KCSDL-LDA), which modifies and extends the
conventional class specific dictionary learning (CSDL) algorithm
in several aspects. First, we propose a novel class specific dic-
tionary learning scheme that considers the weight of each sam-
ple for each class when generating the dictionary in that class.
Second, we extend the novel class specific dictionary learning
scheme to the Reproducing Kernel Hilbert Space, in which non-
linear structure can be extracted and represented to improve the
classification accuracy. Finally, we further enhance the classification performance by combining class specific dictionary learning
with linear discriminant analysis constraints in Reproducing Ker-
nel Hilbert Spaces. Extensive experimental results on several face
recognition benchmark datasets, such as the Extended YaleB,
CMU PIE, and AR datasets, demonstrate the superior performance of our proposed KCSDL-LDA.
Introduction
The past few years have witnessed the impressive perfor-
mance of dictionary learning for sparse representation in visual
computation areas, such as image annotation [1], image inpaint-
ing [2], image classification [3], face recognition [4] and image
denoising [5]. Different from traditional decomposition frame-
works like Principal Component Analysis (PCA), Non-negative
Matrix Factorization (NMF) [6] and low-rank factorization,
sparse representation is capable of generating sparse codes under
over-complete bases to represent the data more adaptively and
flexibly.
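To make the contrast with PCA and NMF concrete, the following sketch computes a sparse code for a signal under an over-complete dictionary via iterative soft-thresholding (ISTA). The dimensions, the regularization weight `lam`, and the solver choice are illustrative assumptions, not the method proposed in this paper.

```python
import numpy as np

def soft_threshold(z, t):
    """Element-wise soft-thresholding operator."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def sparse_code(D, x, lam=0.1, n_iter=200):
    """Approximately solve min_s 0.5*||x - D s||^2 + lam*||s||_1 with ISTA."""
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    s = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ s - x)           # gradient of the quadratic term
        s = soft_threshold(s - grad / L, lam / L)
    return s

rng = np.random.default_rng(0)
D = rng.standard_normal((20, 50))          # over-complete: 50 atoms in 20 dims
D /= np.linalg.norm(D, axis=0)             # unit-norm atoms
x = D[:, [3, 17]] @ np.array([1.0, -0.5])  # signal built from two atoms
s = sparse_code(D, x)
print(np.sum(np.abs(s) > 1e-3))            # only a few atoms are active
```

Because the dictionary has more atoms than the signal has dimensions, many exact representations exist; the l1 penalty selects one supported on only a few atoms, which is what makes the codes adaptive.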
Face recognition, one of the successful applications of sparse
representation, is a classical yet challenging research topic
in computer vision and pattern recognition [7]. Effective face
recognition usually involves two important stages: 1) feature
extraction; and 2) classifier construction and face prediction. For the
first stage, Turk et al. performed principal component analysis
(PCA) to extract Eigenfaces [8]. He et al. proposed Laplacian-
faces [9] to preserve local information. Belhumeur et al. extracted
Fisherfaces [10] to maximize the ratio of between-class scatter to
within-class scatter. For the latter stage, Richard et al. introduced
a nearest neighbor method [11] to predict the label of a test image
using its nearest neighbors in the training samples. Tao et al. pre-
sented a nearest subspace method [12] to assign the label of a test
image by comparing its reconstruction error for each category.
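The nearest subspace rule described above can be sketched as follows; the data and class structure here are synthetic assumptions for illustration.

```python
import numpy as np

def nearest_subspace_label(classes, x):
    """classes: dict mapping label -> (d, n_c) matrix of training samples.
    Assigns the label whose sample span reconstructs x with least error."""
    best_label, best_err = None, np.inf
    for label, A in classes.items():
        # Least-squares projection of x onto span(A)
        coef, *_ = np.linalg.lstsq(A, x, rcond=None)
        err = np.linalg.norm(x - A @ coef)
        if err < best_err:
            best_label, best_err = label, err
    return best_label

rng = np.random.default_rng(1)
classes = {c: rng.standard_normal((30, 5)) for c in range(3)}
x = classes[2] @ rng.standard_normal(5)     # lies in class 2's subspace
print(nearest_subspace_label(classes, x))   # → 2
```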
Under the nearest subspace framework, Wright et al. [4] de-
scribed a sparse representation based classification (SRC) sys-
tem and achieved an impressive performance for face recognition.
Given a test sample, the sparse representation technique repre-
sents it as a sparse linear combination of the training samples. The
predicted label is determined by the residual error from each class.
Zhang et al. [13] illustrated a collaborative representation based
classification (CRC) system. Similar to SRC, CRC represents a
test sample as the linear combination of almost all the training
samples. Moreover, Zhang et al. demonstrated that it was the
collaborative representation rather than the sparse representation
that makes the nearest subspace method powerful for classifica-
tion. Overall, both SRC and CRC algorithms directly use the
training samples as the dictionary for each class. This may lead
to high residual error and poor performance because the training
samples in each class contribute equally to the dictionary in the
corresponding class. Therefore, class specific dictionary learning
algorithms have attracted the attention of many researchers, who
focus on learning a dictionary enforced by discriminative criteria
that can greatly reduce the residual error and achieve superior
performance on classification tasks.
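A minimal sketch of the SRC-style decision rule discussed above, under simplifying assumptions: synthetic data with mutually orthogonal class subspaces, and a plain ISTA lasso solver standing in for the original l1 solver. The test sample is coded over the concatenation of all training samples, and the label whose class-restricted reconstruction has the smallest residual wins.

```python
import numpy as np

def ista_lasso(D, x, lam=0.05, n_iter=300):
    """Approximate lasso coding of x over dictionary D via ISTA."""
    L = np.linalg.norm(D, 2) ** 2
    s = np.zeros(D.shape[1])
    for _ in range(n_iter):
        z = s - D.T @ (D @ s - x) / L
        s = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)
    return s

def src_predict(D, labels, x):
    """D: (d, n) training matrix; labels: length-n array of class ids."""
    s = ista_lasso(D, x)
    residuals = {}
    for c in np.unique(labels):
        s_c = np.where(labels == c, s, 0.0)   # keep only class-c coefficients
        residuals[c] = np.linalg.norm(x - D @ s_c)
    return min(residuals, key=residuals.get)

rng = np.random.default_rng(2)
# Three classes spanned by mutually orthogonal 4-dim subspaces of R^25
# (an idealization that makes the example's behavior easy to verify).
Q, _ = np.linalg.qr(rng.standard_normal((25, 12)))
bases = [Q[:, 4 * c:4 * (c + 1)] for c in range(3)]
D = np.hstack([B @ rng.standard_normal((4, 8)) for B in bases])
D /= np.linalg.norm(D, axis=0)              # 8 unit-norm atoms per class
labels = np.repeat(np.arange(3), 8)
x = bases[1] @ rng.standard_normal(4)       # test sample from class 1
x /= np.linalg.norm(x)
print(src_predict(D, labels, x))            # prints 1
```

Only class 1's atoms can reconstruct the test sample, so its class-restricted residual is near zero while the other classes' residuals stay near the sample's norm; this residual gap is exactly what the SRC decision exploits.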
So far, existing discriminative dictionary learning approaches
are mainly categorized into three types: shared dictionary
learning, class specific dictionary learning and hybrid dictionary
learning. In shared dictionary learning, the bases are learned with
all the training samples together. The discriminative information
is often embedded into the dictionary learning procedure. Mairal
et al. learned a discriminative dictionary [14] with a linear classi-
fier of coding coefficients. Liu et al. embedded the linear dis-
criminant analysis [15] into the dictionary. Zhang et al. obtained
a discriminative dictionary by integrating the label information [16]
into dictionary learning. The shared dictionary
learning approaches usually lead to a small-sized dictionary and
©2016 Society for Imaging Science and Technology
DOI: 10.2352/ISSN.2470-1173.2016.11.IMAWM-456
IS&T International Symposium on Electronic Imaging 2016
Imaging and Multimedia Analytics in a Web and Mobile World 2016 IMAWM-456.1