Boosting链学习：提升目标检测的新方法

需积分: 3 77 浏览量更新于2024-09-09 收藏 506KB PDF 举报

"Boosting Chain Learning for Object Detection" 在计算机视觉领域，对象检测是一个关键任务，它涉及到在图像中定位并识别出特定的目标物体。Boosting Chain是一种针对对象检测问题提出的新颖学习框架，旨在提高检测算法的性能和效率。这篇由Rong Xiao, Long Zhu和Hong-Jiang Zhang (微软亚洲研究院)撰写的论文，提出了一个增强学习的连锁结构，即Boosting Chain，用于学习Boosting级联。传统的Boosting方法，如AdaBoost，通过组合多个弱分类器形成一个强分类器，从而减少错误率。然而，Boosting Chain引入了一个新的“链”结构，这个结构能够将历史知识融入到连续的Boosting学习过程中。这种结构不仅允许模型利用先前学习的经验，还解决了Boosting学习中的冗余问题。论文中提出了一种线性优化方案，该方案旨在解决Boosting学习中的特征冗余问题，并在级联耦合中调整阈值。通过这种方式，Boosting Chain可以构建包含更少弱分类器的模型，同时在训练和测试阶段都能实现比传统Boosting级联更低的错误率。这意味着，尽管模型复杂度降低，但其识别准确性得到提升。实验部分，作者通过人脸识别问题对比了Boosting Chain和Boosting Cascade的性能。结果显示，Boosting Chain展现出显著的优势，证实了其在提高检测效率和准确性的有效性。这些实验结果对于解决对象检测中的挑战，特别是大规模数据集上的实时检测，具有重要的理论和实践意义。 Boosting Chain的学习框架为对象检测提供了一个新的视角，它强调了历史知识的整合和优化，以及如何通过更有效的策略减少计算成本。这种方法对于未来开发更加高效、精确的检测系统具有极大的潜力，特别是在自动驾驶、视频监控和智能安全等应用领域。

Boosting Chain Learning for Object Detection

Rong Xiao, Long Zhu, Hong-Jiang Zhang

Microsoft Research Asia

49 Zhichun Road, Beijing 100080, P.R. China

{t-rxiao, hjzhang}@microsoft.com

Abstract

A general classification framework, called boosting

chain, is proposed for learning boosting cascade. In this

framework, a “chain” structure is introduced to integrate

historical knowledge into successive boosting learning.

Moreover, a linear optimization scheme is proposed to

address the problems of redundancy in boosting learning

and threshold adjusting in cascade coupling. By this

means, the resulting classifier consists of fewer weak

classifiers yet achieves lower error rates than boosting

cascade in both training and test. Experimental

comparisons of boosting chain and boosting cascade are

provided through a face detection problem. The

promising results clearly demonstrate the effectiveness

made by boosting chain.

1. Introduction

Different from the traditional pattern classification

problem where decision is made between well-defined

classes, the detection problem requires discriminate

analysis between the object class and the rest of the world.

As a result, the detection algorithm must accommodate

the intra-class variance without compromising the

discriminability of locating object within cluttered scenes.

On the other hand, typical negative samples are usually

unavailable for building a training set due to large

variance of negative class. Moreover, as the location and

scale of target class are unknown, the computation cost

for exhaustive search can hardly be avoided. To conclude,

there are three issues which are critical for a detection

system: training strategy for negative sample collection,

robust learning algorithm, and computation cost for

evaluation.

Sung and Poggio [10] proposed training schema, called

bootstrap, was applied for negative samples collecting.

During bootstrap procedure, false detections are collected

iteratively into the training set, and a very low false

positive rate is achieved after several iterations of

learning.

Also, various learning algorithm has been applied to

the detection problem. Papageorgiou [1] built a detector

by training a Support Vector Machine (SVM) [12] on an

over-complete wavelet representation of object classes.

Rowley [3] presented a neural network-based face

detection system. Roth [2] used a network of linear units,

called SNoW learning architecture, which is specifically

tailored for learning in the presence of a very large

number of features. Schneiderman

[4] used naive Bayesian

classifier on multi-resolution features from different levels

of wavelet transform.

Although, some works, such as [2] and [4] have

achieved the best detection accuracy in the literature, both

of them are too slow to be applied in real-time

applications due to the computation complexity. Thereby,

hierarchical classification framework is wildly adopted to

build rapid detector. Serra [11] implemented a two-layer

detector. The first layer consists of a fast linear SVM that

removes large parts of the background. The second layer

consists of a more accurate polynomial SVM performs the

final face detection. Viola and Jones [7] built a cascade of

boosting classifiers on an over-complete set of Haar-like

features. In each layer of the cascade, AdaBoost [13] is

adapted to integrate the feature selection and classifier

design in one boosting procedure. By adopting

simple-to-complex strategy, most non-face candidates are

rejected in earlier layer of cascade with little computation

costs. This structure results in extremely rapid object

detector. However, AdaBoost is a sequential forward

search procedure using the greedy selection strategy. Its

heuristic assumption is the monotonicity. The premise

offered by the sequential procedure can be broken-down

when the assumption is violated. Stan Li [8] proposed

FloatBoost algorithm by incorporating the idea of Floating

Search into AdaBoost. Based on FloatBoost, a detector

for multi-view face detection [9] is implemented.

Although the new detector achieves the better

performance with fewer features, the FloatBoost is

unstable and computation extensive for learning

complicated problem.

Proceedings of the Ninth IEEE International Conference on Computer Vision (ICCV 2003) 2-Volume Set

下载后可阅读完整内容，剩余6页未读，立即下载

ture_dream

粉丝: 281
资源: 61

Boosting链学习：提升目标检测的新方法

ImVoteNet_Boosting_3D_Object_Detection_in_Point_Cloud.pdf

Alibaba-Dragonwell-Standard-21.0.4.0.4.7-aarch64-linux.tar

【Unity游戏框架】Prodigy Game Framework快速搭建游戏原型

com.harmonyos4.exception.LoadBalancerFailureException.md

Eclipse代码注释模版-codetemplates.xml

PD虚拟机激活，先下载正版软件在启动这个即可完美激活

ssm母婴用品网站.zip

‘泰迪杯’数据分析技能赛：推动学生数据分析能力及高校合作

诗经数据，包含注释，翻译以及解读

低功耗STM32F411开发板+原理图+PCB源文件+官方例程+驱动等

最新资源