A Discriminatively Trained, Multiscale, Deformable Part Model
Pedro Felzenszwalb
University of Chicago
pff@cs.uchicago.edu
David McAllester
Toyota Technological Institute at Chicago
mcallester@tti-c.org
Deva Ramanan
TTI-C and UC Irvine
dramanan@ics.uci.edu
Abstract
This paper describes a discriminatively trained, multiscale, deformable part model for object detection. Our system achieves a two-fold improvement in average precision over the best performance in the 2006 PASCAL person detection challenge. It also outperforms the best results in the 2007 challenge in ten out of twenty categories. The system relies heavily on deformable parts. While deformable part models have become quite popular, their value had not been demonstrated on difficult benchmarks such as the PASCAL challenge. Our system also relies heavily on new methods for discriminative training. We combine a margin-sensitive approach for data mining hard negative examples with a formalism we call latent SVM. A latent SVM, like a hidden CRF, leads to a non-convex training problem. However, a latent SVM is semi-convex and the training problem becomes convex once latent information is specified for the positive examples. We believe that our training methods will eventually make possible the effective use of more latent information such as hierarchical (grammar) models and models involving latent three dimensional pose.
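To make the semi-convexity claim concrete, the latent SVM formulation can be sketched as follows, in generic notation: $\beta$ for the model parameters, $\Phi(x, z)$ for the features of example $x$ under latent value $z$, and $Z(x)$ for the set of allowed latent values.

$$f_\beta(x) = \max_{z \in Z(x)} \beta \cdot \Phi(x, z), \qquad \min_\beta \ \frac{1}{2}\|\beta\|^2 + C \sum_i \max\bigl(0,\; 1 - y_i f_\beta(x_i)\bigr)$$

Since $f_\beta$ is a maximum of functions linear in $\beta$, it is convex in $\beta$. The hinge loss of each negative example ($y_i = -1$) is therefore a convex function of $\beta$, while the loss of a positive example is not; fixing the latent value $z$ for every positive example makes all terms convex, which is the semi-convexity property referred to above.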
1. Introduction
We consider the problem of detecting and localizing objects of a generic category, such as people or cars, in static images. We have developed a new multiscale deformable part model for solving this problem. The models are trained using a discriminative procedure that only requires bounding box labels for the positive examples. Using these models we implemented a detection system that is both highly efficient and accurate, processing an image in about 2 seconds and achieving recognition rates that are significantly better than previous systems.
Our system achieves a two-fold improvement in average precision over the winning system [5] in the 2006 PASCAL person detection challenge. The system also outperforms the best results in the 2007 challenge in ten out of twenty object categories. Figure 1 shows an example detection obtained with our person model.
Figure 1. Example detection obtained with the person model. The model is defined by a coarse template, several higher resolution part templates and a spatial model for the location of each part.
The notion that objects can be modeled by parts in a deformable configuration provides an elegant framework for representing object categories [1-3, 6, 10, 12, 13, 15, 16, 22]. While these models are appealing from a conceptual point of view, it has been difficult to establish their value in practice. On difficult datasets, deformable models are often outperformed by "conceptually weaker" models such as rigid templates [5] or bag-of-features [23]. One of our main goals is to address this performance gap.
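For orientation, the scores used by this family of part-based models can be sketched in generic notation as an appearance term minus a deformation penalty over a placement $\ell_0, \dots, \ell_n$ of a root and $n$ parts:

$$\mathrm{score}(\ell_0, \dots, \ell_n) = \sum_{i=0}^{n} m_i(\ell_i) - \sum_{i=1}^{n} d_i(\ell_i - \ell_0)$$

where $m_i(\ell_i)$ measures how well the template for part $i$ matches the image at location $\ell_i$, and $d_i$ charges for displacing part $i$ from its preferred position relative to the root $\ell_0$. For tree-structured (star) models, this score can be maximized efficiently over all placements using distance transforms.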
Our models include both a coarse global template covering an entire object and higher resolution part templates. The templates represent histogram of gradient features [5]. As in [14, 19, 21], we train models discriminatively. However, our system is semi-supervised, trained with a max-margin framework, and does not rely on feature detection. We also describe a simple and effective strategy for learning parts from weakly-labeled data. In contrast to computationally demanding approaches such as [4], we can learn a model in 3 hours on a single CPU.
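As a rough illustration of the feature representation, the following is a minimal numpy sketch of cell-level gradient-orientation histograms in the spirit of [5]. The real descriptor adds block-level contrast normalization and interpolation between neighboring bins and cells; every name and parameter below is illustrative rather than the paper's.

```python
import numpy as np

def hog_like_cells(img, cell=8, nbins=9):
    """Simplified HOG-style features: per-cell orientation histograms.

    img: 2D float array (grayscale). Returns (H//cell, W//cell, nbins).
    """
    gy, gx = np.gradient(img)                 # finite-difference gradients
    mag = np.hypot(gx, gy)                    # gradient magnitude
    ang = np.mod(np.arctan2(gy, gx), np.pi)   # unsigned orientation in [0, pi)
    bins = np.minimum((ang / np.pi * nbins).astype(int), nbins - 1)

    ch, cw = img.shape[0] // cell, img.shape[1] // cell
    feats = np.zeros((ch, cw, nbins))
    for i in range(ch):
        for j in range(cw):
            m = mag[i * cell:(i + 1) * cell, j * cell:(j + 1) * cell]
            b = bins[i * cell:(i + 1) * cell, j * cell:(j + 1) * cell]
            # Accumulate gradient magnitude into orientation bins.
            feats[i, j] = np.bincount(b.ravel(), weights=m.ravel(),
                                      minlength=nbins)
    # Per-cell L2 normalization (the full descriptor instead normalizes
    # over overlapping blocks of cells).
    return feats / (np.linalg.norm(feats, axis=2, keepdims=True) + 1e-8)
```

A coarse global template is then a weight vector cross-correlated with such features computed on a coarse grid, while the part templates are applied to features computed at twice the spatial resolution.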
Another contribution of our work is a new methodology for discriminative training. We generalize SVMs for handling latent variables such as part positions, and introduce a new method for data mining "hard negative" examples during training. We believe that handling partially labeled data is a significant issue in machine learning for computer vision. For example, the PASCAL dataset only specifies a bounding box for each positive example of an object. We treat the position of each object part as a latent variable.
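To illustrate the data-mining idea, here is a toy sketch of a margin-sensitive hard-negative mining loop. It uses scikit-learn's LinearSVC as a stand-in for the paper's solver, and every name and constant is illustrative; the key point is the selection rule, which keeps exactly the negatives that violate the margin under the current model.

```python
import numpy as np
from sklearn.svm import LinearSVC

def mine_hard_negatives(X_pos, neg_pool, rounds=5, cache_size=5000, C=0.01):
    """Train an SVM while mining margin-violating negatives from a large pool."""
    rng = np.random.default_rng(0)
    # Seed the cache with a random subset of the negative pool.
    idx = rng.choice(len(neg_pool), size=min(cache_size, len(neg_pool)),
                     replace=False)
    cache = neg_pool[idx]
    clf = LinearSVC(C=C)
    for _ in range(rounds):
        X = np.vstack([X_pos, cache])
        y = np.r_[np.ones(len(X_pos)), -np.ones(len(cache))]
        clf.fit(X, y)
        # A negative is "hard" when it violates the margin: score > -1.
        scores = clf.decision_function(neg_pool)
        hard = neg_pool[scores > -1]
        if len(hard) == 0:
            break                         # no margin violators remain
        cache = hard[:cache_size]         # retrain on the hard ones only
    return clf
```

Since an SVM solution is determined by its margin violators, shrinking the training set to hard examples loses nothing in principle while keeping each training round tractable on a pool far too large to fit in memory at once.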