In this paper, we propose a unified cost-sensitive framework for conducting label propagation and classifier learning simultaneously. Let $F = [f_1, f_2, \ldots, f_N]$ denote the inferred cost-sensitive label matrix, where each $f_i$ is a one-hot vector, i.e., exactly one of its $c$ elements is one and all the others are zero. In our approach, each label vector $f_i$ for $i = 1, 2, \ldots, N$ is estimated in a cost-sensitive way by regressing the current classification results. The joint optimization problem can be solved by minimizing a misclassification loss function of the general form
$$\min_{W, F}\ \mathrm{loss}\{\phi(X, W), F, C\} \tag{1}$$
where $\phi$ classifies the input training data $X$ with the projection matrix $W$. The classification results are then used to evaluate the label matrix $F$ with the cost matrix $C$. The cost-sensitive label information is used in turn to update the classifier $\phi$ with respect to $W$. This process is iterated until the overall misclassification loss is minimized. In this way, both label propagation and classifier learning are embedded in a cost-sensitive framework.
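To make the alternation behind (1) concrete, the following sketch instantiates it with simple, assumed update rules: a ridge-regression $W$-step and an expected-cost one-hot $F$-step for the unlabeled samples. Both rules and all names are illustrative assumptions for intuition only; the paper's actual updates are derived in Sections IV-A through IV-D.

import numpy as np

def fit_cost_sensitive(X, C, y, n_l, n_iters=20, reg=1e-3):
    # X: (D, N) training data whose first n_l columns are labeled.
    # C: (c, c) cost matrix with C[j, k] the cost of predicting k for true class j.
    # y: (n_l,) integer labels of the labeled samples.
    D, N = X.shape
    c = C.shape[0]
    F = np.zeros((c, N))
    F[y, np.arange(n_l)] = 1.0                    # fixed one-hot labels for labeled data
    for _ in range(n_iters):
        # W-step: regularized least squares fit of F onto X, with phi(X, W) = W^T X.
        W = np.linalg.solve(X @ X.T + reg * np.eye(D), X @ F.T)
        scores = W.T @ X                          # (c, N) current classification results
        # F-step: treat normalized scores as rough class posteriors (assumption)
        # and assign each unlabeled sample the class of minimum expected cost.
        p = np.maximum(scores, 0.0) + 1e-12
        p = p / p.sum(axis=0, keepdims=True)
        exp_cost = C.T @ p                        # expected cost of predicting each class
        F[:, n_l:] = 0.0
        F[np.argmin(exp_cost[:, n_l:], axis=0), np.arange(n_l, N)] = 1.0
    return W, F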
To deal with face feature variations, we further propose to conduct cost-sensitive semi-supervised learning in some latent semantic space of face images. The last two key notations in Table I specify the robust high-level features used in our approach. In particular, $B \in \mathbb{R}^{D \times d}$ spans the learned latent semantic space and $S \in \mathbb{R}^{d \times N}$ accommodates the $d$-dimensional latent semantic representations of $X$.
IV. THE UNIFIED COST-SENSITIVE FRAMEWORK
In this section, we elaborate our unified cost-sensitive framework for semi-supervised face recognition. Section IV-A proposes cost-sensitive latent semantic regression for label propagation and learning of the classifier. Section IV-B introduces cost-sensitive regularization to guide the label propagation process. Section IV-C presents the design of the misclassification loss function for cost-sensitive learning in the latent semantic space. Section IV-D describes the iterative algorithm for solving the unified framework. Section IV-E explains the procedure for inference.
A. Cost-sensitive learning in the latent semantic space
Because facial expressions, lighting, and poses vary across face images taken at different times, it is necessary to extract robust feature representations for cost-sensitive face recognition. To address this issue, we adopt matrix factorization to extract high-level features that reflect the inherent structure of the data [34], [36]–[39]. The latent semantic space $B$ and the high-level features $S$ can be jointly learned from
$$L_1(B, S) = \|X - BS\|_F^2 \tag{2}$$
where $\|\cdot\|_F$ denotes the Frobenius norm. We do not include any sparsity constraint in (2) for matrix factorization because face recognition is not commonly considered a compressive sensing problem [6], [8], [35].
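In isolation, (2) has a closed-form minimizer: by the Eckart-Young theorem, the best rank-$d$ factorization of $X$ under the Frobenius norm is given by the truncated SVD. The stand-alone solver below is only for intuition; in the unified framework, $B$ and $S$ are updated jointly with the other terms (Section IV-D).

import numpy as np

def latent_factorize(X, d):
    # Best rank-d factorization X ~ B S in Frobenius norm (Eckart-Young).
    U, sigma, Vt = np.linalg.svd(X, full_matrices=False)
    B = U[:, :d]                      # (D, d): basis spanning the latent semantic space
    S = np.diag(sigma[:d]) @ Vt[:d]   # (d, N): latent representations of the columns of X
    return B, S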
We then use a linear predictive classifier to project $S$ into the label space, i.e., $\phi(X, W) = W^T S(X)$, where $S(X)$ denotes the latent semantic features learned from (2) with input $X$, and adopt least-squares minimization for the loss function. Note that other classifiers $\phi$ and other optimization rules are possible. In our context, linear regression keeps the update in every iteration simple and yet achieves effective results for the unified framework. Thus, we introduce cost-sensitive latent semantic regression as
$$L_2(W, S, F) = \sum_{i=1}^{N} h(i)\, \|W^T s_i - f_i\|_2^2 \tag{3}$$
where $s_i$ denotes the latent semantic representation of sample $x_i$, and $h(i)$, known as the importance function [3], [6]–[8], depicts the importance of sample $x_i$ in the training process. In supervised learning scenarios [3], [40], the importance function is often defined as the total cost of misclassifying sample $x_i$, whose true class label is denoted by $l(x_i)$. In our context of semi-supervised learning, sample $x_i$ can be either labeled or unlabeled. Accordingly, the importance of sample $x_i$ is evaluated as
$$h(i) = \begin{cases} \sum_{j=1}^{c} C_{l(x_i)j}, & \text{if } i \le N_l \\ \tau, & \text{if } i > N_l \end{cases} \tag{4}$$
where the hyper-parameter $\tau$ is set for the unlabeled training data; its value is chosen empirically to stress the importance of unlabeled data in cost-sensitive learning.
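As a concrete reading of (4), and of the $W$-step of (3) with $S$, $F$, and $h$ held fixed, the sketch below evaluates $h(i)$ from the cost matrix and then solves the resulting weighted least-squares problem in closed form. The small ridge term is an assumption added for numerical stability; the paper's joint update of $W$, $S$, and $F$ is given in Section IV-D.

import numpy as np

def importance_weights(C, y, N, tau):
    # h(i) from (4): total misclassification cost for labeled samples (i <= N_l),
    # the hyper-parameter tau for unlabeled ones (i > N_l).
    n_l = len(y)
    h = np.full(N, float(tau))
    h[:n_l] = C[y].sum(axis=1)        # sum_j C_{l(x_i) j} over the cost-matrix row of l(x_i)
    return h

def solve_w(S, F, h, reg=1e-3):
    # Minimizer of (3) over W alone: W = (S diag(h) S^T + reg I)^{-1} S diag(h) F^T.
    d = S.shape[0]
    Sh = S * h                        # scale each column s_i by its importance h(i)
    return np.linalg.solve(Sh @ S.T + reg * np.eye(d), Sh @ F.T)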
Proposition 1: Assume that $x_i \in X$ for $i = 1, 2, \ldots, N$ are conditionally independent of each other given their label classes $l(x_i) = 1, 2, \ldots, c$, whose densities are multivariate Gaussians with a common covariance matrix. Given the label matrix $F = [f_1, f_2, \ldots, f_N]$, minimizing the least squares criterion $\min_W \|W^T S - F\|_2^2$ results in a solution $\hat{W} = [\hat{w}_1, \hat{w}_2, \ldots, \hat{w}_c]$ that projects the latent semantic feature $s_i$ of each sample $x_i$ into the label space with regressed terms proportional to the posterior class probabilities, i.e., $\hat{w}_k^T s_i \propto p(l(x_i) = k \mid x_i)$ for $k = 1, 2, \ldots, c$, and
$$\|\hat{W}^T s_i - f_i\|_2^2 \propto \sum_{j \le c,\, j \ne l(x_i)} p(j \mid x_i)^2 + \left[1 - p(l(x_i) \mid x_i)\right]^2. \tag{5}$$
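Before the proof, a small simulation under the proposition's assumptions (shared-covariance Gaussian class conditionals) illustrates the claim: the least-squares scores $\hat{w}_k^T s_i$ track the true posteriors, so the two agree on the most probable class for almost every sample. All data below are synthetic.

import numpy as np
rng = np.random.default_rng(0)

# Three Gaussian classes with a common covariance matrix.
c, d, n = 3, 5, 2000
means = rng.normal(size=(c, d))
A = rng.normal(size=(d, d))
cov = A @ A.T / d + np.eye(d)
y = rng.integers(0, c, size=n)
S = means[y] + rng.multivariate_normal(np.zeros(d), cov, size=n)   # (n, d) samples

# Least squares on one-hot targets, with a bias column, as in the proposition.
F = np.eye(c)[y]                                                   # (n, c) one-hot labels
Sb = np.hstack([S, np.ones((n, 1))])
W = np.linalg.lstsq(Sb, F, rcond=None)[0]
scores = Sb @ W                                                    # regressed terms w_k^T s_i

# True posteriors from the Gaussian class-conditional densities (equal priors).
icov = np.linalg.inv(cov)
logp = np.stack([-0.5 * np.einsum('nd,de,ne->n', S - m, icov, S - m)
                 for m in means], axis=1)
post = np.exp(logp - logp.max(axis=1, keepdims=True))
post = post / post.sum(axis=1, keepdims=True)

# Fraction of samples where scores and posteriors pick the same class (close to 1).
print(np.mean(scores.argmax(axis=1) == post.argmax(axis=1)))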
Proof: For each label class $k = 1, 2, \ldots, c$, let $g_k^T$ denote the corresponding row of $F$, so that $g_{ki} = 1$ if $l(x_i) = k$ and $g_{ki} = 0$ otherwise, for all $i = 1, 2, \ldots, N$ in the training dataset. The least squares solution can also be obtained by solving

$$\min_{w_k} \|S^T w_k - g_k\|_2^2 \tag{6}$$

for each label classifier individually [41].
Note that the problem in (6) is a two-class regression between class $k$ and a null class containing all samples that do not belong to class $k$, i.e., those with $l(x_i) \ne k$. Suppose the means of the two classes are $m_k$ and $m_0$, respectively. Since all label classes share the same covariance matrix $\Sigma$, the least squares solution of this two-class regression satisfies the following relationship [41]:
$$\hat{w}_k \propto \Sigma^{-1}(m_k - m_0). \tag{7}$$
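Relationship (7) is easy to check numerically: fitting the two-class regression (6) on synthetic shared-covariance Gaussian data yields a weight vector whose direction all but coincides with $\Sigma^{-1}(m_k - m_0)$. The setup below is synthetic and only illustrates this known least-squares/LDA equivalence.

import numpy as np
rng = np.random.default_rng(1)

# Two Gaussian classes (class k and the null class) sharing a covariance matrix.
d, n = 4, 5000
m0, mk = rng.normal(size=d), rng.normal(size=d)
A = rng.normal(size=(d, d))
cov = A @ A.T / d + np.eye(d)
S = np.vstack([rng.multivariate_normal(m0, cov, size=n),
               rng.multivariate_normal(mk, cov, size=n)])
g = np.r_[np.zeros(n), np.ones(n)]                 # indicator targets g_k

# Two-class least squares with a bias column; keep only the weight part.
Sb = np.hstack([S, np.ones((2 * n, 1))])
w = np.linalg.lstsq(Sb, g, rcond=None)[0][:d]

# Cosine similarity with Sigma^{-1}(m_k - m_0) is close to 1.
ref = np.linalg.solve(cov, mk - m0)
print(w @ ref / (np.linalg.norm(w) * np.linalg.norm(ref)))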
On the other hand, we may use a Gaussian Naive Bayes (GNB) classifier to estimate the posterior class probability