3. In our method, the manifold structure of the image space, modeled by an adjacency graph, is explicitly taken into account.
4. Our method can automatically select a suitable feature dimensionality with which the algorithm obtains comparable recognition performance. This is very important in practice: previous methods need to search over all possible dimensionalities to reach their top recognition performance, which is time-consuming and impractical in a real face recognition system.
2. Laplacian scatter matrix
Let matrix $x$ represent an image with $m \times n$ pixels; then the feature matrix $y$ of image $x$ can be obtained by
$$ y = U^{T} x V \qquad (1) $$
where $U$ and $V$ are the $m \times m'$ ($m' \leqslant m$) column projection matrix and the $n \times n'$ ($n' \leqslant n$) row projection matrix, respectively.
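To make the projection in Eq. (1) concrete, here is a minimal NumPy sketch; the image size and the random matrices standing in for $U$ and $V$ are illustrative assumptions, not the projections learned by our method:

```python
import numpy as np

# Illustrative sizes (assumptions, not taken from the paper)
m, n = 32, 28          # image size
m_p, n_p = 10, 8       # reduced dimensions (m' <= m, n' <= n)

x = np.random.rand(m, n)       # an m x n image matrix
U = np.random.rand(m, m_p)     # m x m' column projection matrix (placeholder)
V = np.random.rand(n, n_p)     # n x n' row projection matrix (placeholder)

# Eq. (1): bidirectional projection of the image matrix
y = U.T @ x @ V                # feature matrix of size m' x n'
print(y.shape)                 # (10, 8)
```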
Suppose we are given $N$ training images $X = [x_1, x_2, \ldots, x_N] = [X_1, \ldots, X_i, \ldots, X_c] = [x^{(1)}_1, \ldots, x^{(i)}_j, \ldots, x^{(c)}_{N_c}]$ belonging to $c$ different classes, where the $i$th class has $N_i$ images ($\sum_{i=1}^{c} N_i = N$) and the matrix $X_i = [x^{(i)}_1, x^{(i)}_2, \ldots, x^{(i)}_{N_i}]$ consists of the image matrices from the $i$th class. By representing each image matrix as an $m$-set of row vectors, the row total scatter matrix can be expressed as:
$$
\begin{aligned}
S_t^{row} &= \frac{1}{N} \sum_{i=1}^{N} \sum_{j=1}^{m} \bigl(x_{ij} - \bar{x}^{(j)}\bigr)^{T} \bigl(x_{ij} - \bar{x}^{(j)}\bigr) & (2) \\
&= \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{x})^{T} (x_i - \bar{x}) & (3) \\
&= \frac{1}{2N^2} \sum_{i=1}^{N} \sum_{j=1}^{N} (x_i - x_j)^{T} (x_i - x_j) & (4)
\end{aligned}
$$
where $\bar{x}$, $x_{ij}$ and $\bar{x}^{(j)}$ are the mean matrix of all training images, the $j$th row vector of the $i$th image matrix and the mean of the $j$th row vectors over all training images, respectively.
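The equivalence of the mean-centered form (3) and the pairwise form (4) can be checked numerically. The following sketch uses randomly generated toy images (sizes are assumptions made for illustration only):

```python
import numpy as np

# A toy set of N image matrices, each m x n (illustrative sizes)
N, m, n = 6, 5, 4
X = np.random.rand(N, m, n)
x_bar = X.mean(axis=0)                      # mean image matrix

# Eq. (3): row total scatter from deviations of each image from the mean
S_t = sum((xi - x_bar).T @ (xi - x_bar) for xi in X) / N        # n x n

# Eq. (4): equivalent pairwise form
S_t_pair = sum((X[i] - X[j]).T @ (X[i] - X[j])
               for i in range(N) for j in range(N)) / (2 * N**2)

print(np.allclose(S_t, S_t_pair))           # True
```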
The row within-class scatter matrix can be defined as:
$$
\begin{aligned}
S_w^{row} &= \frac{1}{N} \sum_{i=1}^{c} \sum_{j=1}^{N_i} \bigl(x^{(i)}_j - m_i\bigr)^{T} \bigl(x^{(i)}_j - m_i\bigr) & (5) \\
&= \frac{1}{N} \sum_{i=1}^{c} \frac{1}{2N_i} \sum_{j,k=1}^{N_i} \bigl(x^{(i)}_j - x^{(i)}_k\bigr)^{T} \bigl(x^{(i)}_j - x^{(i)}_k\bigr) & (6)
\end{aligned}
$$
where $x^{(i)}_j$ and $m_i$ are the $j$th image matrix of the $i$th class and the mean matrix of the $i$th class, respectively.
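A similar numerical check applies to the within-class scatter. The sketch below uses randomly generated toy classes (the class sizes and image dimensions are assumptions for illustration only):

```python
import numpy as np

# Toy data: c classes of m x n image matrices
rng = np.random.default_rng(0)
m, n = 5, 4
classes = [rng.random((Ni, m, n)) for Ni in (4, 6, 5)]   # X_i, one array per class
N = sum(Xi.shape[0] for Xi in classes)

# Eq. (5): deviations of each image from its class mean matrix m_i
S_w = np.zeros((n, n))
for Xi in classes:
    mi = Xi.mean(axis=0)                      # class mean matrix
    for xj in Xi:
        S_w += (xj - mi).T @ (xj - mi)
S_w /= N

# Eq. (6): equivalent pairwise form within each class
S_w_pair = np.zeros((n, n))
for Xi in classes:
    Ni = Xi.shape[0]
    for j in range(Ni):
        for k in range(Ni):
            S_w_pair += (Xi[j] - Xi[k]).T @ (Xi[j] - Xi[k]) / (2 * Ni)
S_w_pair /= N

print(np.allclose(S_w, S_w_pair))             # True
```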
The use of manifold information in feature extraction has achieved state-of-the-art face recognition performance$^{10,23,24}$. According to graph embedding theory$^{25}$, we define an undirected weighted graph $G(X,W)$ to characterize the nonlinear manifold structure of the image set $X$. The real symmetric matrix $W$ measures the similarity between any pair of samples. It can be constructed using various similarity criteria, such as the Gaussian similarity in the Laplacian eigenmap$^{9}$, the local neighborhood relationships in LLE$^{11}$, or prior class information in supervised learning algorithms. Here, the Gaussian similarity is adopted:
$$ w_{ij} = \exp\!\bigl(-\|x_i - x_j\|^{2} / (2\sigma^{2})\bigr) \qquad (7) $$
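A small sketch of building the weight matrix $W$ from Eq. (7); here $\|\cdot\|$ is read as the Frobenius norm of the matrix difference and $\sigma = 1$ is an arbitrary choice, both assumptions made only for this illustration:

```python
import numpy as np

# Toy image set: N images of size m x n (illustrative sizes)
rng = np.random.default_rng(0)
N, m, n = 8, 5, 4
X = rng.random((N, m, n))
sigma = 1.0                         # Gaussian kernel width (a free parameter)

# Eq. (7): Gaussian similarity between every pair of images,
# using the Frobenius norm of the difference of the image matrices
W = np.zeros((N, N))
for i in range(N):
    for j in range(N):
        W[i, j] = np.exp(-np.linalg.norm(X[i] - X[j])**2 / (2 * sigma**2))

print(np.allclose(W, W.T))          # W is real symmetric, as required
```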
Then, in order to incorporate the nonlinear manifold structure of face images, we can define the following row total Laplacian scatter:
$$
\begin{aligned}
LS_t^{row} &= \frac{1}{2N^2} \sum_{i,j=1}^{N} w_{ij} (x_i - x_j)^{T} (x_i - x_j) & (8) \\
&= \frac{1}{N^2} \sum_{i,j=1}^{N} \bigl(w_{ij} x_i^{T} x_i - w_{ij} x_i^{T} x_j\bigr) & (9) \\
&= \frac{1}{N^2} X'^{T} (L \otimes I_m) X' & (10)
\end{aligned}
$$
where $X' = [x_1^{T}, x_2^{T}, \ldots, x_N^{T}]^{T}$, $D$ is a diagonal matrix with $d_{ii} = \sum_{j=1}^{N} w_{ij}$, $L = D - W$ is the Laplacian matrix of graph $G$, $I_m$ is the identity matrix of order $m$ and the operator $\otimes$ denotes the Kronecker product of matrices.
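The identity between the pairwise form (8) and the Kronecker-product form (10) can also be verified numerically. The sketch below reuses the same toy-data conventions as above (the sizes and $\sigma$ are assumptions for illustration):

```python
import numpy as np

# Toy image set and Gaussian weights W
rng = np.random.default_rng(0)
N, m, n = 6, 5, 4
X = rng.random((N, m, n))
sigma = 1.0
W = np.array([[np.exp(-np.linalg.norm(X[i] - X[j])**2 / (2 * sigma**2))
               for j in range(N)] for i in range(N)])

# Eq. (8): weighted pairwise form of the row total Laplacian scatter
LS_pair = sum(W[i, j] * (X[i] - X[j]).T @ (X[i] - X[j])
              for i in range(N) for j in range(N)) / (2 * N**2)

# Eq. (10): compact form with the graph Laplacian and a Kronecker product
D = np.diag(W.sum(axis=1))          # degree matrix, d_ii = sum_j w_ij
L = D - W                           # graph Laplacian
X_stack = X.reshape(N * m, n)       # X' = [x_1^T, ..., x_N^T]^T, size Nm x n
LS_kron = X_stack.T @ np.kron(L, np.eye(m)) @ X_stack / N**2

print(np.allclose(LS_pair, LS_kron))   # True
```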
Similarly, in the row direction, the image within-