结构化稀疏错误编码：对抗遮挡的人脸识别新方法

111 浏览量更新于2024-08-30 收藏 1.08MB PDF 举报

"Structured Sparse Error Coding for Face Recognition With Occlusion" 这篇论文主要探讨了在实际世界中常见的面部识别问题，特别是面对遮挡时如何提高识别的准确性。受结构化稀疏表示理论的启发，作者们尝试从两个方面探索遮挡引起的错误结构：错误形态和错误分布。他们提出，由于人们通常根据遮挡区域的形状或轮廓来识别遮挡，因此遮挡的形状也是一个重要的特征。为了描述错误的形态结构，论文提出了一个形态学图模型（morphological graph model）。这个模型旨在捕捉和表达由于遮挡造成的图像误差的形态特征。形态学图模型可以有效地分析和表示遮挡导致的面部特征变化，帮助识别系统在遮挡情况下更好地理解面部图像。另一方面，由于遮挡的不确定性，错误的分布也是不确定的。然而，作者观察到，由当前相关熵诱导度量（currentropy induced metric）测量的未被遮挡部分和被遮挡部分的误差分别遵循指数分布。结合这两方面的错误结构信息，他们提出了结构化稀疏错误编码（structured sparse error coding）方法。结构化稀疏错误编码是一种创新的处理手段，它将稀疏表示与遮挡误差的形态和分布特性相结合。通过这种方法，可以更好地分离和估计遮挡部分，从而提高面部识别的鲁棒性。这种方法对于处理恶意遮挡（malicious occlusion）特别有用，因为恶意遮挡可能是为了故意干扰识别系统。此外，该工作还涉及了高断裂点分类（high-breakdown point classification）和异常检测（outlier detection），这些都是在有遮挡的面部识别中确保系统稳定性和抗干扰能力的关键技术。通过利用这些技术，即使在严重遮挡的情况下，也能提升识别的准确性和稳定性。这篇论文提出的结构化稀疏错误编码策略为解决面部识别中的遮挡问题提供了一个新的视角，利用形态学和概率分布的特性来增强系统的鲁棒性。这种方法对于改进现有的面部识别系统，尤其是应对现实世界中的复杂遮挡情况，具有重要的理论和应用价值。

LI et al.: STRUCTURED SPARSE ERROR CODING FOR FR WITH OCCLUSION 1891

II. ERROR STRUCTURE

In this section, we explore the structure of the error from

two aspects: the structure of the error support built on a new

graph model, and the structure of the error distribution under

robust error metric.

A. Morphological Graph Model for Error Support Structure

1) Morphological Graph Model: We ﬁrst study the intrinsic

error structure from the cognitive style of the visual system

of human beings. Why could human beings recognize a facial

occlusion accurately and rapidly? Our visual experience indi-

cates that we recognize the occlusion according to its region

shape or proﬁle even without knowing what the occlusion is

and where it lies. Mathematical morphology shows that the

shape can be described by boundaries, skeleton, or the convex

hull [27, Chap. 9]. In this work, we try to describe occlusion

shape by its boundary, which means the interrelationships

between image pixels should be well modeled. While the

error support s

can be used to indicate if an image pixel

is occluded, s

cannot be used to infer if the image pixels

around y

are occluded. Inspired by the work of [25], Zhou

et al. [9] suggest to model the error support s using a graph

G =

(

V, E

)

. Here, V =

{

1,...,m

}

denotes the vertex set

of m pixels of y ∈ R

and each vertex i ∈ V is labeled by

; E = E

(

)



(

i, j

)

|i, j ∈ V,



− c



≤ r



denotes the

edges connecting neighboring pixels, where r is the maximum

edge length and c

= [c

, c

]

is the coordinate vector of

the vertex i

. Thus, the occlusion probabilities of the image

pixels around y

can be inferred from s

according to the edges

connected with vertex i. Clearly, the graph G deﬁned above

cannot describe the occlusion shape. Since the occlusion on

an image corresponds to the subgraph with vertices labeled

by s

= 1, we then consider how to represent the shape of a

subgraph.

Let L

denote the label set of the graph G. Then,

G can be divided into several subgraphs by L

: G =

{

(

, E

)

, k ∈ L

}

,whereV

{

i|i ∈ V, s

= k

}

and E

{

(

i, j

)

|i, j ∈ V

(

i, j

)

∈ E

}

. Since the shape can

be described by boundaries, we incorporate the boundary set

B of all subgraphs into the classical graph model to form a

new graph model G =

(

V, E, B

)

, dubbed the morphological

graph. Before deﬁning the boundary B, we ﬁrst introduce

two special vertex sets to describe the relationships between

vertices: the outside vertex set and the related vertex set. The

outside vertex set v

of the vertex i consists of the vertices

which are connected but have different labels with the vertex

i: v



(

i, j

)

∈ E, s

= s



.Therelated vertex set v

the vertices i and j consists of the vertex pairs which are

connected by the edge but belong to v

and v

, respectively:



k, l|

(

k, l

)

∈ E, k ∈ v

, l ∈ v



. We then deﬁne the

boundary set B of the morphological graph G as following:

Deﬁnition 1: The boundary set B of the morphologi-

cal graph G =

(

V, E, B

)

is the set of the boundaries

of all subgraphs of G =

(

V, E

)

: B =

{

|k ∈ L

}

where B

is the boundary of G

. B

is also a graph

We use the same spatial coordinate convention as [27, Sec. 2.4.2].





,where



i|i ∈ V

=∅



and



(

i, j

)

|i, j ∈

=∅



Section B of the supplementary appendix explores in detail

how to detect the subgraph boundaries for a given graph

G and how the maximum edge length r of G affects its

discriminability. Since subgraphs with r = 1seemstohave

better discriminability than r = 2, we choose r = 1inthe

following work. In the next subsection, we will show how to

use the morphological graph to describe the priori information

of the error support s.

2) Priori Probability for Error Support: We now consider

how to build the prior probability p

(

)

of the error support

s on the morphological graph G =

(

V, E, B

)

.In[9],p

(

)

built on the graph G =

(

V, E

)

using the classical Ising model:

(

)

∝ exp

⎛

⎝



(i, j)∈E

+ λ



i∈V

⎞

⎠

, (2)

where the smooth cost λ



(

i, j)∈E

(λ

≥ 0) describes

the continuity of the occlusion, and the data cost λ



i∈V

(λ

≥ 0) gives the priori assumptions about the locations of

the erroneous pixels. Comparing with G, G is equipped with

an additional graph B. Since the vertices in B are sensitive

to the change of the subgraphs (occlusions), we try to weaken

the inﬂuence of these vertices in the Ising model (2). Then,

we have the priori probability p

(

)

of the error support s on

G (see Section C of the supplementary appendix for a proof):

p(s) ∝ exp

⎛

⎝



(i, j)∈E

+ λ



i∈V

− λ



i∈

⎞

⎠

. (3)

Note that the boundary cost λ



i∈

= λ



actually

reﬂects the regularity of the occlusion boundary. Actually, for

∀G ∈ G

and ∀

(

i, j

)

∈

,wehave



− c



≤

√

2, and



(

i, j)∈



− c



≈



≈



Although we weaken the inﬂuence of the vertices on the

graph boundaries in (3), it does not mean that they are not

important. On the contrary, they are very important in the

quality assessment model of Section III-B3.

B. Exponential Probabilitic Model for Error Distribution

Structure

When there are occlusions, Yang et al. [11], [19] indicate

that the error ˆe may be far from any speciﬁc distribution.

Nevertheless, with the assumption that the unoccluded region

of the test image y can be approximated sufﬁciently by the cor-

responding regions of the training samples, if the error support

s is known and the error ˆe is coded and measured in a speciﬁc

way, we argue that ˆe might follow a speciﬁc distribution. The

details about the error coding and error metric are discussed

in Section D of the supplementary appendix. In this work,

we pay attention to the local error metric LEM



, ˆy





(

)

− h



−ˆy



, where h

(

)

= exp

(

−

/σ

)

,andthe

dilation invariance metric DIM



, ˆy







log



/ ˆy





.By

combining LEM and DIM together, we form a new error

剩余11页未读，继续阅读

weixin_38713586

粉丝: 3
资源: 933

结构化稀疏错误编码：对抗遮挡的人脸识别新方法

Simultaneous Bayesian Sparse Approximation With Structured Sparse Models

Lossy audio signal compression via structured sparse decomposition and compressed sensing

Sphere-Structured Support Vector Machines for Multi-class Pattern Recognition

Semi-supervised Feature selection Analysis with Structured Multi-view Sparse Regularization

Creative Coding For Kids

sparse-structured-lasso

Structured Parallel Programming Patterns for Efficient Computation 无水印pdf

Binary Coded Structured Light Range Scanner for Shiny Objects

Output Feedback Controllers for Systems with Structured Uncertainty

Structured robust correlation filter with L2,1 norm for object tracking

最新资源