Human Hand Detection in Complex Environments Using Noise-Invariant Local Features


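"This article addresses human hand detection against complex backgrounds and proposes a new strategy based on feature fusion. It introduces three noise-invariant features: NCHOG (Noise Compensated Histogram of Oriented Gradients), NCLBP (Noise Compensated Local Binary Patterns) and HPCP (Histograms of Pairs of Circumference Pixels). NCHOG and NCLBP are reported to outperform their traditional counterparts, HOG and LBP. By combining the new features with existing ones and using Partial Least Squares (PLS) to determine the feature weights, the authors obtain excellent detection results on their own dataset of hand images with varied and complex backgrounds."

The article describes the challenges of hand detection: variable lighting conditions, the many possible appearances of the hand, and background noise. To address them, the authors propose robust local descriptors. The three features they design, NCHOG, NCLBP and HPCP, aim to increase resistance to noise and thereby improve detection accuracy.

NCHOG is a modification of HOG (Histogram of Oriented Gradients) that applies noise compensation to make the descriptor more robust. HOG is a widely used feature extraction method: it computes the gradient orientation and magnitude at every pixel and then accumulates a histogram of gradient orientations within each small region, which captures the shape of an object. In noisy environments, however, the performance of HOG can degrade, and NCHOG is intended to make up for this weakness.

NCLBP extends LBP (Local Binary Patterns). LBP is a simple and effective texture descriptor that characterizes local texture by comparing each pixel with its neighbors and recording the results as binary values. NCLBP adds noise compensation to stabilize LBP so that it keeps performing well under unstable lighting.

HPCP is the third noise-invariant feature. It builds histograms over pairs of circumference pixels and, according to the paper, is a histogram-based variant of the CCS-POP feature that is used to capture the texture of the hand.

The authors fuse these new features with traditional ones such as HOG and LBP and use PLS analysis to assign the weights of the combined features. PLS is a statistical method that projects the features onto a small number of latent directions chosen to covary strongly with the target variable, which can improve the predictive power of the model. With this approach the authors report a clear improvement in detection performance on their dataset, indicating that the proposed fusion strategy is effective.

In short, the article improves the accuracy and robustness of hand detection by combining noise-compensated local descriptors. The method is relevant to computer vision applications such as security surveillance, human-computer interaction and virtual reality.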
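The descriptions of HOG and LBP in the summary above can be made concrete with a minimal sketch. It shows only the standard building blocks: an orientation histogram for one cell and the 8-bit LBP code for one pixel. The function names `cell_orientation_histogram` and `lbp_code`, the 9-bin unsigned histogram, the central-difference gradient and the 3x3 neighborhood are illustrative assumptions. The noise compensation used by NCHOG and NCLBP is not described in this excerpt and is not reproduced here.

```python
import math


def cell_orientation_histogram(cell, bins=9):
    """Orientation histogram for one grayscale cell (plain HOG building block).

    `cell` is a list of rows of intensities. Gradients are central
    differences on interior pixels only. Each pixel votes with its gradient
    magnitude into one of `bins` bins covering unsigned angles in [0, 180).
    """
    hist = [0.0] * bins
    height, width = len(cell), len(cell[0])
    bin_width = 180.0 / bins
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]
            gy = cell[y + 1][x] - cell[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue  # flat region: no orientation to vote for
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # The trailing modulo guards against a rounded angle of 180.0.
            hist[int(angle // bin_width) % bins] += magnitude
    return hist


def lbp_code(patch):
    """8-bit LBP code for the center pixel of a 3x3 patch (plain LBP).

    Neighbors are visited clockwise from the top-left corner; a neighbor
    that is >= the center sets the corresponding bit.
    """
    center = patch[1][1]
    offsets = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
    code = 0
    for bit, (row, col) in enumerate(offsets):
        if patch[row][col] >= center:
            code |= 1 << bit
    return code
```

In a full descriptor, the cell histograms would be normalized over blocks and concatenated, and the LBP codes of a region would be collected into a histogram. Those steps are left out to keep the sketch short.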

HUMAN HAND DETECTION USING ROBUST LOCAL DESCRIPTORS

Jianwei Niu, Xiaoke Zhao, Muhammad Ali Abdul Aziz, Jiangwei Li, Kongqiao Wang, Aimin Hao

State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing, China

Nokia Research Center, Beijing, China

E-mail: niujianwei@buaa.edu.cn, zhaoke001@126.com, xerox414@hotmail.com, {jiangwei.li, kongqiao.wang}@nokia.com, ham@buaa.edu.cn

ABSTRACT

To date, human hand detection in images remains a challenging task due to the variable lighting conditions, hand appearances and background noise. In this paper, we present an effective strategy based on feature fusion for detecting hands with cluttered surroundings. To form the fusions, we propose three novel noise invariant features, namely: 1) NCHOG (Noise Compensated Histogram of Oriented Gradients), 2) NCLBP (Noise Compensated Local Binary Patterns), and 3) HPCP (Histograms of Pairs of Circumference Pixels). We show the superior performance of the NCHOG and the NCLBP descriptors over their existing traditional counterparts, i.e., HOG and LBP. Merging our novel features with existing features in different permutations, and applying Partial Least Squares (PLS) based feature weighting, yields excellent detection results on our own dataset of hand images with variegated and complex backgrounds.

Index Terms: Hand detection, NCHOG, NCLBP, HPCP, PLS

1. INTRODUCTION

In recent years, a number of approaches for hand detection have been presented. The authors of [1] tackle hand posture recognition with some degree of success by using Haar-like features. Nonetheless, their dataset consists only of images with very simple backgrounds. Skin color segmentation has been used by several approaches, such as [2, 3]. However, these methods are sensitive to quickly changing or mixed lighting conditions. Kölsch and Turk [4] use fanned boosting detection for classification and obtain nearly real-time results. The major drawback of the technique is its constraints on the resolution and aspect ratio of the gesture template.

More recently, the idea of combining different features into a larger feature set has been proposed in areas such as object detection, human detection and face detection. The authors of [5] employ a combination of HOG [6] (Histogram of Oriented Gradients), LBP [7] (Local Binary Patterns; here, Color LBP) and LTP [8] (Local Trinary Patterns) descriptors for object detection. In [9], the authors use HOG, CF (Color Frequency) and texture co-occurrence features for human detection. Moreover, works such as [10, 11] detect humans accurately by using a mixture of features. However, the effectiveness of feature fusion for the task of hand detection remains unexplored. In this paper, we aim to assess the suitability of fusions of heterogeneous and complementary features for hand detection. Our motivation is that a reliable hand detector can facilitate many other tasks in human temporal analysis. We use HOG and our proposed feature NCHOG (Noise Compensated HOG) to encapsulate the robust edges of the hand. Then, to capture the distinct texture of the hand, we use CLBP (Color LBP), LTP and our proposed descriptors: CNCLBP (Color Noise Compensated LBP) and HPCP (Histograms of Pairs of Circumference Pixels). Finally, the color information is further encoded using the CF feature.
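The fusion of heterogeneous descriptors and the PLS-based feature weighting mentioned in the abstract can be sketched as follows. This excerpt does not specify how the fusion or the weighting is actually carried out, so the sketch rests on two assumptions: fusion is modeled as concatenation of per-descriptor L2-normalized vectors, and the weighting is illustrated by the first weight vector of single-response PLS, w = Xᵀy / ‖Xᵀy‖ on mean-centered data. The names `fuse` and `pls_first_weights` are hypothetical.

```python
import math


def fuse(*descriptors):
    """Concatenate descriptor vectors after L2-normalizing each one.

    Per-descriptor normalization (an assumption of this sketch) keeps one
    long or large-valued descriptor from dominating the fused vector.
    """
    fused = []
    for descriptor in descriptors:
        norm = math.sqrt(sum(v * v for v in descriptor)) or 1.0
        fused.extend(v / norm for v in descriptor)
    return fused


def pls_first_weights(X, y):
    """First PLS weight vector for a single response.

    X is a list of fused feature vectors (one per sample), y a list of
    labels such as +1 for hand and -1 for background. Both are mean-centered,
    then w = X^T y is scaled to unit length. A larger |w[j]| means feature j
    covaries more strongly with the label.
    """
    n, p = len(X), len(X[0])
    col_means = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    w = [
        sum((X[i][j] - col_means[j]) * (y[i] - y_mean) for i in range(n))
        for j in range(p)
    ]
    norm = math.sqrt(sum(v * v for v in w)) or 1.0
    return [v / norm for v in w]
```

A complete PLS model would extract further components by deflating X and repeating the step. Only the first component is shown here because it is enough to illustrate how the weights rank the fused features.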

We make the following major contributions in this paper: 1) We propose three new noise invariant features: NCHOG, NCLBP and HPCP. The last feature is a histogram-based variant of the CCS-POP (Circular Center Symmetric-Pairs of Pixels) feature presented in [12]. We show that NCHOG is more discriminative than HOG and, similarly, that NCLBP is better than LBP; 2) Based on our experiments, we find that the feature set incorporating NCHOG, CNCLBP, LTP and CF exhibits better performance than all other fusions in our feature family, including the feature set HOG + CLBP + LTP proposed by Hussain and Triggs [5].

The rest of the paper is structured as follows. Section 2 describes each of our proposed features (i.e., NCHOG, NCLBP, HPCP), the dimensionality reduction technique and the classifier used. Section 3 presents feature realization details and discusses the results of the experiments performed using individual features as well as feature sets. Finally, conclusions are drawn in Section 4.

2. PROPOSED METHOD

2.1. Features

Selection of good visual features is crucial for reliable hand detection as the hand postures are very rich in shape variation
