无约束相关滤波器在人脸识别中的应用与内核化研究

38 浏览量更新于2024-08-26 收藏 1.28MB PDF 举报

"这篇研究论文探讨了一种有效的无约束相关滤波器——无约束最优原点权衡滤波器（Unconstrained Optimal Origin Tradeoff Filter, UOOTF），并将其应用于人脸识别。与传统的类依赖特征分析（Class-Dependence Feature Analysis, CFA）中的相关滤波器相比，UOOTF在设计时去除了对原始相关输出的硬性限制，从而提高了对未见过的模式的整体性能。为了解决不同类别之间的非线性可分分布问题，作者进一步发展了一种非线性扩展方法，即内核化技术，以增强人脸识别的鲁棒性。" 文章介绍了无约束相关滤波器在人脸识别领域的应用和改进。相关滤波器是一种常用的图像处理工具，它通过计算输入图像与模板之间的相关性来识别特定目标。在人脸识别中，这类滤波器可以用于提取和匹配面部特征。然而，传统的CFA框架下的相关滤波器通常受到一些限制，如滤波器设计时对原始相关输出的约束，这可能影响到对未知样本的识别效果。 UOOTF滤波器的提出，旨在克服这些限制。它采用了一种更灵活的方法，允许原始相关输出的自由变化，从而提高了对未见过的人脸模式的识别准确性和适应性。这种方法的创新之处在于它能够在保持滤波器性能的同时，消除对输出的严格约束，使得滤波器能够更好地适应面部图像的复杂性。面对非线性分布的面部特征，研究者引入了内核化技术。内核方法是一种强大的工具，能够将数据映射到高维特征空间，使得原本在原始空间中难以区分的数据在新的空间中变得线性可分。在人脸识别中，内核化可以处理面部特征之间的非线性关系，提高分类的准确性，特别是在面对光照变化、表情变化和遮挡等挑战时。在实验部分，论文可能对比了UOOTF与传统方法的性能，展示了其在鲁棒性和识别率上的优势。此外，可能还探讨了不同内核函数对结果的影响，以及如何选择合适的内核参数以优化人脸识别系统。这篇论文提供了一个创新的滤波器设计思路，通过无约束的优化和内核化技术，提高了人脸识别的性能，对于理解相关滤波器在复杂视觉任务中的应用和改进具有重要的理论和实践价值。

2.2. Unconstrained correlation ﬁlter design

The traditiona l correlation ﬁlters [21,24–26]in1D-CFAarebased

on the assumption that the correlation peak amplitude should satisfy

aspeciﬁed value (i.e., the origin correlation outputs are restricted to

1foraspeciﬁc class and 0 for the others). However , the overall

performance of those ﬁlters can become worse for unseen patterns if

the correlation peak values are constrained to some speciﬁed

constant values during the ﬁlterdesign,whichmotivatesustodesign

the ﬁlter in the unconstrained form.

UOTF is a traditional unconstrained correlation ﬁlter. The

design criterion of UOTF is to: (i) minimize the average energy

and noise variance of the whole correlation plane for all the

samples, and (ii) maximize the origin correlation outputs for the

intra-class samples. However, the minimization of (i) cannot

guarantee that the origin correlation outputs for the extra-class

samples (used to form the feature) are minimal. As a result,

although UOTF [28] tries to overcome the generalization problem

by removing the hard constraints of OTF, UOTF fails in 1D-CFA (see

Section 4 for the experimental results). Therefore, in this paper, we

propose to directly optimize the origin correlation outputs and

take the extra-class samples and intra-class samples into respec-

tive considerations during the ﬁlter design.

In the following, we describe the details of the proposed

UOOTF. For the clarity of presentation, vectors are denoted by an

arrow on top of the alphabet. Upper case symbols refer to

quantities in the frequency plane terms, while lower case symbols

represent quantities in the space domain.

1D-CFA designs a correlation ﬁlter for each class. Let the ﬁlter

trained for the l-th class be h

, and o

be the output of h

response to y

.Wehave

ðnÞ¼ y

ðnÞ⊙ h

ðnÞ; ð1Þ

where ⊙ is a correlation function; y

is the low-dimensional PCA

feature for the i-th training image; and n is the feature index in the

spatial domain.

Eq. (1) can be expressed in the frequency domain by using the

1D Fourier transform as follows:

ðnÞ¼ ∑

p−1

k ¼ 0

ðkÞ

 H

ðkÞe

j2πkn=p

: ð2Þ

here Y

and H

represent the 1D Fourier transforms of y

and h

respectively; p is the reduced dimensionality of the PCA subspace;

k is the feature index in the frequency domain; and ‘

’ denotes the

conjugate operator. According to (2), the origin correlation output

(n¼0) is the inner product of the input signal and the correlation

ﬁlter in the frequency domain.

The framework of the UOOTF design is shown in Fig. 3. For the

extra-class samples, UOOTF tries to balance the tradeoff between

the origin correlation output energy and the origin correlation

output noise variance. It can be derived by minimizing the

weighted sum of the origin energy j o

ð0Þj

and the origin noise

variance j n

ð0Þj

for the extra-class samples, which is expressed as

min

∑

i ¼ 1

j o

ð0Þj

þ ω

∑

i ¼ 1

j n

ð0Þj

¼ min

∑

i ¼ 1

j Y

Eþ

þ ω

∑

i ¼ 1

j N

Eþ

¼ min

lþ

þ ω

lþ

; ð3Þ

where R

¼ 1=N

∑

i ¼ 1

Eþ

, and Y

ði ¼ 1; …; N

Þ is the 1D

Fourier transform of i-th the extra-class sample for the l-th class.

C ¼ 1=N

∑

i ¼ 1

Eþ

, and N

ði ¼ 1; …; N

Þ is the 1D Fourier

transform of the i-th extra-class noise sample for the l-th class;

C is usually set as a diagonal matrix whose diagonal elements

represent the noise power spectral density (in fact, C can also be

viewed as a regularization term); ‘+’ represents the conjugate

transpose; ω

and ω

ð0≤ω

; ω

≤1Þ are the tradeoff parameters; N is

the number of all the training samples and N

is the number of

training samples for the l-th class; N

¼ N−N

denotes the number

of extra-class training samples for the l-th class.

For the intra-class samples, we try to maximize the average

origin correlation output, which is given by

max

∑

i ¼ 1

Iþ

¼ max

ðM

lþ

Þ; ð4Þ

Fig. 2. Feature extraction in 1D-CFA. Note that FFT is the Fast Fourier Transform

which effectively computes the discrete Fourier transform.

Fig. 3. Framework of the UOOTF design.

Y. Yan et al. / Neurocomputing 119 (2013) 201–21 1 203

剩余10页未读，继续阅读

zcharzon

粉丝: 6
资源: 934

无约束相关滤波器在人脸识别中的应用与内核化研究

基于arm架构的嵌入式人脸识别技术研究.pdf

卷积神经网络在人脸识别上的研究.pdf

基于cnn神经网络人脸识别 实现的考勤demo项目。ui采用pyqt5，实现了简单的录入人脸，人脸检测，人脸识别。.zip

基于cnn的人脸识别与wechat的控制树莓派与uno.zip

基于CNN卷积神经网络实现实时分辨人脸微表情.zip

基于CNN实现对摄像头捕捉的人脸进行性别和年龄的预测.zip

基于CNN的车牌识别程序.zip

基于CNN的图像验证码识别.zip

基于CNN的图像验证码识别，单个验证码识别成功率99%.zip

基于CNN的手写数字识别APP.zip

最新资源

基于cnn神经网络人脸识别实现的考勤demo项目。ui采用pyqt5，实现了简单的录入人脸，人脸检测，人脸识别。.zip