HOG特征：CVPR 2005论文中的人体检测利器

5星 · 超过95%的资源 | 下载需积分: 9 | PDF格式 | 476KB | 更新于2024-09-14 | 115 浏览量 | 举报

1 收藏

本文档标题为"Histograms of Oriented Gradients for Human Detection"，由Navneet Dalal和Bill Triggs在2005年CVPR（计算机视觉和模式识别）会议上发表。论文探讨了在视觉对象识别，特别是人类检测任务中，特征集选择的重要性，特别关注基于线性支持向量机（SVM）的人类检测方法。作者首先回顾了已有的边缘和梯度基描述符，如SIFT和SURF等，然后实验性地展示了基于方向直方图（HOG）的特征网格在人类检测性能上明显优于传统方法。 HOG是一种广泛应用于计算机视觉领域的特征提取技术，它通过计算图像局部区域的梯度方向直方图来捕获物体的纹理和形状信息。HOG将图像分割成小的局部块（cell），对每个块内的像素进行梯度计算，然后根据预定义的梯度方向将其分配到适当的区间（bins）。接下来，通过对每个区间的计数进行归一化处理，形成一个方向分布的直方图，这样可以抵抗光照变化和图像旋转的影响。研究发现，几个关键因素对于HOG在人类检测中的良好表现至关重要。首先是精细尺度的梯度计算，这有助于捕捉细节；其次，细粒度的方向归并有助于区分不同方向的纹理；再者，较粗的空间划分减少了计算量，同时保持了足够的空间信息；最后，高质量的局部对比度归一化有助于提高描述符块之间的稳健性。论文的亮点在于，使用HOG特征的系统在原始MIT行人数据库上取得了接近完美的检测效果。然而，为了进一步评估方法的泛化能力，作者提出了一个更具挑战性的数据集，包含超过1800张标注的人体图像，这些图像具有更广泛的姿势变化和背景环境。这个新数据集的引入，不仅验证了HOG的有效性，还推动了后续的研究朝着更复杂和多样化的场景适应性发展。 "Histograms of Oriented Gradients for Human Detection"这篇论文对视觉对象检测领域产生了深远影响，尤其是在人脸识别和行人检测方面。它展示了HOG作为一种简单而有效的特征表示方式，至今仍被广泛应用于各种计算机视觉任务中，并且其基本原理和优化策略仍然是现代深度学习模型设计的重要参考。

Histograms of Oriented Gradients for Human Detection

Navneet Dalal and Bill Triggs

INRIA Rhˆone-Alps, 655 avenue de l’Europe, Montbonnot 38334, France

{Navneet.Dalal,Bill.Triggs}@inrialpes.fr, http://lear.inrialpes.fr

Abstract

We study the question of feature sets for robust visual ob-

ject recognition, adopting linear SVM based human detec-

tion as a test case. After reviewing existing edge and gra-

dient based descriptors, we show experimentally that grids

of Histograms of Oriented Gradient (HOG) descriptors sig-

niﬁcantly outperform existing feature sets for human detec-

tion. We study the inﬂuence of each stage of the computation

on performance, concluding that ﬁne-scale gradients, ﬁne

orientation binning, relatively coarse spatial binning, and

high-quality local contrast normalization in overlapping de-

scriptor blocks are all important for good results. The new

approach gives near-perfect separation on the original MIT

pedestrian database, so we introduce a more challenging

dataset containing over 1800 annotated human images with

a large range of pose variations and backgrounds.

1 Introduction

Detecting humans in images is a challenging task owing

to their variable appearance and the wide range of poses that

they can adopt. The ﬁrst need is a robust feature set that

allows the human form to be discriminated cleanly, even in

cluttered backgrounds under difﬁcult illumination. We study

the issue of feature sets for human detection, showing that lo-

cally normalized Histogram of Oriented Gradient (HOG) de-

scriptors provide excellent performance relative to other ex-

isting feature sets including wavelets [17,22]. The proposed

descriptors are reminiscent of edge orientation histograms

[4,5], SIFT descriptors [12] and shape contexts [1], but they

are computed on a dense grid of uniformly spaced cells and

they use overlapping local contrast normalizations for im-

proved performance. We make a detailed study of the effects

of various implementation choices on detector performance,

taking “pedestrian detection” (the detection of mostly visible

people in more or less upright poses) as a test case. For sim-

plicity and speed, we use linear SVM as a baseline classiﬁer

throughout the study. The new detectors give essentially per-

fect results on the MIT pedestrian test set [18,17], so we have

created a more challenging set containing over 1800 pedes-

trian images with a large range of poses and backgrounds.

Ongoing work suggests that our feature set performs equally

well for other shape-based object classes.

We brieﬂy discuss previous work on human detection in

§2, give an overview of our method §3, describe our data

sets in §4 and give a detailed description and experimental

evaluation of each stage of the process in §5–6. The main

conclusions are summarized in §7.

2 Previous Work

There is an extensive literature on object detection, but

here we mention just a few relevant papers on human detec-

tion [18,17,22,16,20]. See [6] for a survey. Papageorgiou et

al [18] describe a pedestrian detector based on a polynomial

SVM using rectiﬁed Haar wavelets as input descriptors, with

a parts (subwindow) based variant in [17]. Depoortere et al

give an optimized version of this [2]. Gavrila & Philomen

[8] take a more direct approach, extracting edge images and

matching them to a set of learned exemplars using chamfer

distance. This has been used in a practical real-time pedes-

trian detection system [7]. Viola et al [22] build an efﬁcient

moving person detector, using AdaBoost to train a chain of

progressively more complex region rejection rules based on

Haar-like wavelets and space-time differences. Ronfard et

al [19] build an articulated body detector by incorporating

SVM based limb classiﬁers over 1

and 2

order Gaussian

ﬁlters in a dynamic programming framework similar to those

of Felzenszwalb & Huttenlocher [3] and Ioffe & Forsyth

[9]. Mikolajczyk et al [16] use combinations of orientation-

position histograms with binary-thresholded gradient magni-

tudes to build a parts based method containing detectors for

faces, heads, and front and side proﬁles of upper and lower

body parts. In contrast, our detector uses a simpler archi-

tecture with a single detection window, but appears to give

signiﬁcantly higher performance on pedestrian images.

3 Overview of the Method

This section gives an overview of our feature extraction

chain, which is summarized in ﬁg. 1. Implementation details

are postponed until §6. The method is based on evaluating

well-normalized local histograms of image gradient orienta-

tions in a dense grid. Similar features have seen increasing

use over the past decade [4,5,12,15]. The basic idea is that

local object appearance and shape can often be characterized

rather well by the distribution of local intensity gradients or

inria-00548512, version 1 - 20 Dec 2010

Author manuscript, published in "International Conference on Computer Vision & Pattern Recognition (CVPR '05) 1 (2005) 886--893"

DOI : 10.1109/CVPR.2005.177

下载后可阅读完整内容，剩余7页未读，立即下载

sjtulyk

粉丝: 0
资源: 1

HOG特征：CVPR 2005论文中的人体检测利器

行人检测新方法：HOG特征在CVPR05论文中的应用

HOG特征检测：Dalal的2005年CVPR论文解析

CVPR2009论文：HOG特征在行人检测中的多线索融合与算法综述

TIM_Lipreading_CVPR2011_统一帧数_视频帧数归一化TIM.zip

hog-feature.rar_HOG描述子_HOG特征 SVM_feature.hog_物体检测_行人 识别

voc-release3.1.rar_DPM_hog算子_svm改进_改进HOG_改进的svm

HOG.rar_HOG特征_hog

CVPR_2:CVPR的第二课

定向梯度直方图：mex 函数，用于计算（定向）梯度的直方图（Dalal & Triggs CVPR 2005）。-matlab开发

最新资源

hog-feature.rar_HOG描述子_HOG特征 SVM_feature.hog_物体检测_行人识别