Data Fusion-based Real-Time Hand Gesture
Recognition with Kinect V2
Yuhai Lan
School of Information Engineering
Nanchang University
Nanchang, 330031, China
Jing Li
School of Information Engineering
Nanchang University
Nanchang, 330031, China
jingli@ncu.edu.cn
Zhaojie Ju
Intelligent Systems and Biomedical
Robotics Group, School of
Computing
University of Portsmouth
Portsmouth, PO1 3HE, U.K.
Abstract—Hand gesture recognition is an important topic in human-computer interaction. However, most current methods are complicated and time-consuming, which limits the use of hand gesture recognition in real-time scenarios. In this paper, we propose a data fusion-based hand gesture recognition model that fuses depth information and skeleton data. Thanks to the accurate segmentation and tracking provided by Kinect V2, the model achieves real-time performance, running 18.7% faster than some state-of-the-art methods. The experimental results show that the proposed model is accurate and robust to rotation, flipping, scale changes, lighting changes, cluttered backgrounds, and distortions, which ensures its applicability to different real-world human-computer interaction tasks.
Keywords—hand gesture recognition; skeleton; Kinect V2; depth image; real-time; data fusion
I. INTRODUCTION
As an important topic in human-computer/robot interaction, hand gesture recognition not only provides reliable information for exploring the meanings of human hand gestures for a friendly and comfortable interaction experience, as in virtual reality [1] and augmented reality [2], but also underpins a wide range of computer vision applications, including sign language recognition [3] and advanced driver assistance systems [4].
Recently, researchers have proposed numerous hand gesture recognition algorithms in the literature [9], most of which can be divided into three levels: 1) static hand gesture recognition; 2) dynamic hand gesture recognition; and 3) 3D hand gesture recognition. Traditionally, the approaches at the first two levels are mostly two-dimensional: they use 2D color images as input while ignoring depth information. By contrast, the technologies at the third level are 3D-based, where the depth information of images is exploited to achieve more effective hand gesture recognition performance. However, 3D depth information cannot be obtained from a single camera and requires a special device. Driven by the rapid development of human-computer interaction and sensing technologies, several well-known depth cameras have been developed, including Microsoft Kinect, Intel RealSense, Leap Motion, and Asus Xtion, among which Kinect is the most widely used device in the computer vision research community. Kinect captures depth information by time of flight (ToF) and provides different kinds of high-quality data, e.g., color, depth, infrared, and skeletons, together with a solid SDK.
Fig. 1. Kinect V2 for Xbox One.
This paper proposes a model to recognize hand gestures of different digits accurately and quickly by fusing skeleton information with depth images obtained by Kinect for Xbox One (‘Kinect V2’ for short). This is the first paper to use Kinect V2 to recognize different human hand gestures. Kinect for Windows (‘Kinect’ for short) is a human-computer interaction device released in 2010. It integrates many advanced vision technologies and has been widely used in various computer vision tasks, such as face recognition [5][6], scene understanding [7], and human gesture recognition [8]. However, since its improved version, Kinect V2, was released only in 2014, there is little related work in the literature. Compared with Kinect, the hardware of Kinect V2 has been largely improved. For example, Kinect V2 can track up to six skeletons, compared with two for Kinect. Skeleton tracking has also been substantially upgraded: the tracked positions are anatomically more accurate and robust, and the tracked area is wider. Moreover, the Kinect for Windows SDK has been continuously updated from SDK 1.8 to SDK 2.0.
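The core of the fusion idea can be sketched in a few lines: the skeleton stream supplies the tracked hand-joint position, which seeds a depth-threshold segmentation of the hand in the depth image. The snippet below is a minimal illustration on synthetic data, not the paper's implementation; the function name `segment_hand` and the 80 mm depth margin are assumptions for the sketch.

```python
import numpy as np

def segment_hand(depth_mm, hand_px, margin_mm=80):
    """Fuse a skeleton hand-joint pixel position (hand_px) with a depth
    frame (depth_mm, values in millimetres): keep the pixels whose depth
    lies within margin_mm of the depth at the tracked joint."""
    x, y = hand_px
    joint_depth = int(depth_mm[y, x])            # depth at the tracked hand joint
    mask = np.abs(depth_mm.astype(np.int32) - joint_depth) <= margin_mm
    mask &= depth_mm > 0                         # discard invalid (zero-depth) pixels
    return mask

# Synthetic example: a 6x6 depth frame with a "hand" at ~600 mm
# in front of a background at ~2000 mm.
depth = np.full((6, 6), 2000, dtype=np.uint16)
depth[1:4, 1:4] = 600                            # 3x3 hand blob
mask = segment_hand(depth, hand_px=(2, 2))
print(mask.sum())                                # → 9 pixels belong to the hand
```

Seeding the threshold from the skeleton joint is what makes the segmentation robust to cluttered backgrounds: only surfaces at the hand's own depth survive, regardless of colour or lighting.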
This simple model can deal with various challenges in hand gesture recognition, such as rotation, scale changes, lighting changes, cluttered backgrounds, and distortions. It consists of two parts: 1) hardware: Kinect V2 is used to obtain depth images with its RGB-D cameras; and 2) software: the Microsoft Kinect SDK combined with Open Source Computer Vision (OpenCV) is adopted for real-time