十年来面部检测技术综述：特征提取与学习算法进展

Face

Detection

需积分: 10 119 浏览量更新于2024-07-21 收藏 395KB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

资源详情

资源推荐

Input

• Training examples S = {(x

, z

), i = 1, · · · , N }.

• T is the total number of weak classiﬁers to be trained.

Initialize

• Initialize example score F

) =

−

where N

and N

−

are the number of positive and

negative examples in the training data set.

Adaboost Learning

For t = 1, · · · , T :

1. For each Haar-like feature h(x) in the pool, ﬁnd the

optimal threshold H and conﬁdence score c

and c

to minimize the Z score L

(8).

2. Select the best feature with the minimum L

3. Update F

) = F

t−1

) + f

), i = 1, · · · , N ,

4. Update W

+1j

, W

−1j

, j = 1, 2.

Output Final classiﬁer F

(x).

Figure 3. Adaboost learning pseudo code.

1 2 3

Input

sub-windows

Further

processing

T T

Rejected sub-windows

Figure 4. The attentional cascade.

Eq. (8) is referred as the Z score in [80]. In practice, at

iteration t + 1, for every Haar-like feature h(x), we ﬁnd the

optimal threshold H and conﬁdence score c

and c

in order

to minimize the Z score L

t+1

. A simple pseudo code of the

AdaBoost algorithm is shown in Fig. 3.

2.3. The Attentional Cascade Structure

Attentional cascade is a critical component in the Viola-

Jones detector. The key insight is that smaller, and thus

more efﬁcient, boosted classiﬁers can be built which reject

most of the negative sub-windows while keeping almost all

the positive examples. Consequently, majority of the sub-

windows will be rejected in early stages of the detector,

making the detection process extremely efﬁcient.

The overall process of classifying a sub-window thus

forms a degenerate decision tree, which was called a “cas-

cade” in [92]. As shown in Fig. 4, the input sub-windows

pass a series of nodes during detection. Each node will

make a binary decision whether the window will be kept

for the next round or rejected immediately. The number of

weak classiﬁers in the nodes usually increases as the num-

ber of nodes a sub-window passes. For instance, in [92], the

ﬁrst ﬁve nodes contain 1, 10, 25, 25, 50 weak classiﬁers, re-

spectively. This is intuitive, since each node is trying to

reject a certain amount of negative windows while keeping

all the positive examples, and the task becomes harder at

late stages. Having fewer weak classiﬁers at early stages

also improves the speed of the detector.

The cascade structure also has an impact on the training

process. Face detection is a rare event detection task. Con-

sequently, there are usually billions of negative examples

needed in order to train a high performance face detector.

To handle the huge amount of negative training examples,

Viola and Jones [92] used a bootstrap process. That is, at

each node, a threshold was manually chosen, and the par-

tial classiﬁer was used to scan the negative example set to

ﬁnd more unrejected negative examples for the training of

the next node. Furthermore, each node is trained indepen-

dently, as if the previous nodes does not exist. One argu-

ment behind such a process is to force the addition of some

nonlinearity in the training process, which could improve

the overall performance. However, recent works showed

that it is actually beneﬁcial not to completely separate the

training process of different nodes, as will be discussed in

Section 4.

In [92], the attentional cascade is constructed manually.

That is, the number of weak classiﬁers and the decision

threshold for early rejection at each node are both speciﬁed

manually. This is a non-trivial task. If the decision thresh-

olds were set too aggressively, the ﬁnal detector will be

very fast, but the overall detection rate may be hurt. On the

other hand, if the decision thresholds were set very conser-

vatively, most sub-windows will need to pass through many

nodes, making the detector very slow. Combined with the

limited computational resources available in early 2000’s,

it is no wonder that training a good face detector can take

months of ﬁne-tuning.

3. Feature Extraction

As mentioned earlier, thanks to the rapid expansion in

storage and computation resources, appearance based meth-

ods have dominated the recent advances in face detection.

The general practice is to collect a large set of face and non-

face examples, and adopt certain machine learning algo-

rithms to learn a face model to perform classiﬁcation. There

are two key issues in this process: what features to extract,

and which learning algorithm to apply. In this section, we

ﬁrst review the recent advances in feature extraction.

The Haar-like rectangular features as in Fig. 2 (a-f) are

very efﬁcient to compute due to the integral image tech-

nique, and provide good performance for building frontal

face detectors. In a number of follow-up works, researchers

extended the straightforward features with more variations

in the ways rectangle features are combined.

For instance, as shown in Fig. 5, Lienhart and Maydt[49]

generalized the feature set of [92] by introducing 45 degree

剩余16页未读，继续阅读

leehungxd

粉丝: 10
资源: 13

十年来面部检测技术综述：特征提取与学习算法进展

A survey of recent advances in visual feature detection

deep multimodal learning a survey on recent advances and trends

帮我推荐最新的sailency综述

recent advances in deep learning for object detection

ieee icassp recent advances in nonnegative matrix factorization

调研机器视觉的应用 ，包括国内外现状，发展趋势等，将必要参考文献辅到后面

列举几个近几年图像修复的例子

关于slam的文献综述

有没有近两年的NER综述

基于深度学习的空中运动目标检测与追踪的研究背景与意义相关资料

量子图像处理技术及其应用的参考文献10篇

关于无人驾驶技术的中午参考文献

development of multi-agent reinforcement learning

"Post-Quantum Cryptography: A Ten-Year Survey" by Daniele Micciancio et al. 的引用格式

能帮我找到关于电离层综述的文章吗？

Write a generator function merge that takes in two infinite generators a and b that are in increasing order without duplicates and returns a generator that has all the elements of both generators, in increasing order, without duplicates.

管理领域强化学习的文献概览

关于无人驾驶的参考文献

把上面的这段文字引用几篇相关的参考文献

advances in financial machine learning

最新资源

调研机器视觉的应用，包括国内外现状，发展趋势等，将必要参考文献辅到后面