基于卷积神经网络的鲁棒手势识别策略

120 浏览量更新于2024-08-26 收藏 415KB PDF 举报

本文探讨了卷积神经网络（Convolutional Neural Network, CNN）在手势识别领域的鲁棒性问题。随着非语言交流和自然人机交互的重要性日益凸显，手势识别技术的发展受到了广泛关注。然而，实际应用中，手势的复杂结构、个体差异（如手部大小和姿势的多样性）、以及环境光照等多变因素，导致手势识别的准确率受到严重影响。传统方法往往难以应对这些挑战，识别性能不稳定。本文的主要贡献是提出了一种基于卷积神经网络的鲁棒手势识别方法。作者们认识到，为了提高手势识别系统的鲁棒性，关键在于设计能够有效捕捉手势特征并同时适应不同个体特性和环境条件的模型。CNN由于其在图像处理中的成功应用，被选作核心架构，因为其能够通过局部感受野和权重共享特性，自动学习和提取手势图像中的特征，从而实现对手势的高效识别。首先，文章可能介绍了CNN的基础理论和在计算机视觉中的作用，包括卷积层、池化层和全连接层等组件，以及它们如何帮助模型从原始像素数据中提取低级到高级的特征表示。然后，针对手势识别任务，可能讨论了如何设计和调整网络结构，比如添加了专门针对手部形状和纹理的卷积核，或者采用了多尺度和多视角输入来增强对不同尺寸和姿态的手势识别。针对个体差异，研究者可能考虑了数据增强和迁移学习策略，通过训练一个包含大量不同个体手势样本的数据集，使模型能够泛化到新的个体。此外，光照条件的影响也得到了关注，可能通过光照归一化或使用光照不变性的特征表示来减少环境变化带来的影响。论文可能还包含了实验部分，展示了新方法在各种条件下（包括不同个体、光照变化等）与传统方法的比较，以证明其鲁棒性和优越性能。最后，论文可能总结了研究成果，并对未来的研究方向提出了建议，如结合深度学习和传统机器学习方法的融合，或者进一步探索更深层次的运动捕捉和姿态估计技术，以提升手势识别的精度和鲁棒性。这篇研究论文旨在解决手势识别中的鲁棒性问题，利用卷积神经网络的特性来构建一个能够在复杂环境中稳定且准确识别各种手势的系统，这对于未来人机交互和智能设备的发展具有重要意义。

A Robust Hand Gesture Recognition Method Via Convolutional Neural Network

Xing Yingxin

Beijing Key Laboratory of Multimedia and Intelligent

Software Technology

College of Metropolitan Transportation

Beijing University of Technology

Beijing, China

xingyingxin@emails.bjut.edu.cn

Li Jinghua, Wang Lichun, Kong Dehui

Beijing Key Laboratory of Multimedia and Intelligent

Software Technology

College of Metropolitan Transportation

Beijing University of Technology

Beijing, China

lijinghua, wanglc, kdh@bjut.edu.cn

Abstract—Hand gesture plays an important role in nonverbal

communication and natural human-computer interaction.

However, the complex hand gesture structure and various

environment factors lead to low recognition rate. For instance,

hand gesture depends on individuals, and different individuals’

hands are with different sizes and postures, in addition,

unconstrained environmental illumination also influences hand

gesture recognition performance. Therefore, hand gesture

recognition is still a challenging issue. This paper proposes a

robust method for hand gesture recognition based on

convolutional neural network, which is utilized to

automatically extract the spatial and semantic feature of hand

gesture. Our method consists of a modified Convolutional

Neural Network structure and data preprocessing, which

corporately increase hand gesture recognition performance.

The experimental results on both Cambridge Hand Gesture

Dataset and self-constructed dataset show that the proposed

method is effective and competitive.

Keywords- Hand Gesture Recognition, Convolutional Neural

Network (CNN) , Canny Edge Detection

NTRODUCTION

Today, digital home and intelligent home are making our

life better, of which, natural human machine interaction is

one of core technologies. Different from the traditional

popular keyboard and mouse interaction, hand gesture plays

the most natural and important role in current nonverbal

communication and intelligent interaction. However, hand

gesture recognition still faces challenge on account of its

complexity and variation [1]. As is known to us, different

persons sign a same hand gesture differently, and even the

same person signs a hand gesture differently each time. In

addition, the vision-based hand gesture recognition is also

susceptible to lighting, view and so on [2] [3].

The previous vision-based hand gesture recognition

approaches usually consist of two main steps. The first one is

to extract features, and the second one is to design a

classifier. Among them, robust and effective feature

representation is a major problem. The preceding hand-

crafted feature usually demands the user to have some priori

knowledge and some preprocessing such as image

transformation, segmentation and so on. The recent popular

deep learning method CNN has demonstrated competitive

performance in image representation and classification [4].

The success of CNNs partly lies in its invariance to

translation, rotation and scale, which is also due to its ability

to learn high level semantics. This paper utilizes CNN to

extract robust hand gesture feature, and our focus is the

structure and parameter setting of CNN model for static hand

gesture feature representation. The feature extracted by the

proposal method is easy to compute and able to describe the

hand gesture more excellently. Especially, in order to

enhance the hand gesture representation performance based

on CNN, canny edge detection is introduced beforehand to

remove variable illuminations inherent in the original hand

gesture data. The final experiment results and comparisons

demonstrate the effectiveness of data preprocessing

removing illuminations cooperating with the learned features

via CNN for hand gesture recognition.

The main contributions of this paper are: (1) the

proposed CNN structure and parameter are more suitable for

hand gesture spatial and semantic representation and

discriminative hand gesture understanding; (2) the

preprocessed edge data as CNN model input greatly improve

the robustness of hand gesture recognition with various

illuminations; (3) the learning-based feature representation

approach outperforms existing predefined methods on both

Cambridge Hand Gesture Dataset and self-constructed

dataset.

This paper is organized as follows. Sect. II reviews

related works. Sect. III presents the novel hand gesture

recognition method via CNN. The experimental results and

analysis based on the proposed method are shown in Sect. IV.

The last section summarizes this study and proposes the

future work.

II. R

ELATED

ORKS

With the development of natural human machine

interaction, research on hand gesture recognition is an active

field. A lot of the early works in hand gesture recognition

focused on designing hand-crafted features based on the

prior knowledge. Chen et al. [5] utilized Fourier descriptor

(FD) to extract spatial hand shape, and hand region must be

correctly segmented first. Auephanwiriyakul et al. [6] used

Scale Invariant Feature Transform (SIFT) to describe each

test frame. However, instead of directly comparing with the

training frame, this method constructed a signature library

database in advance, so as to match the test frame with

2016 6th International Conference on Digital Home

DOI 10.1109/ICDH.2016.20

2016 6th International Conference on Digital Home

DOI 10.1109/ICDH.2016.20

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38580959

粉丝: 3
资源: 961

基于卷积神经网络的鲁棒手势识别策略

基于卷积神经网络的手势识别初探.pdf

基于肤色特征和卷积神经网络的手势识别方法.pdf

基于Leap Motion和卷积神经网络的手势识别.pdf

基于mobilenet v3卷积神经网络的手势识别

基于卷积神经网络表面肌电手势识别方法的研究目的

matlab基于卷积神经网络的手势识别

卷积神经网络实现手势识别程序

手势识别_使用cnn(卷积神经网络)和opencv进行手势识别

深度学习复杂背景下手势识别的研究现状

基于深度学习的手势识别系统

最新资源