Convolutional Neural Networks for Multivariate Time Series Classification using
both Inter- & Intra- Channel Parallel Convolutions
G. Devineau¹, W. Xi², F. Moutarde¹, J. Yang²
¹ MINES ParisTech, PSL Research University, Center for Robotics, Paris, France
² Shanghai Jiao Tong University, School of Electronic Information and Electrical Engineering, China
{guillaume.devineau, wang.xi, fabien.moutarde}@mines-paristech.fr
Abstract
In this paper, we study a convolutional neural network we recently introduced in [9], intended to recognize 3D hand gestures via multivariate time series classification.
The Convolutional Neural Network (CNN) we proposed processes sequences of hand-skeletal joints' positions using parallel convolutions. We justify the model's architecture and investigate its performance on hand gesture sequence classification tasks. Our model only uses hand-skeletal data and no depth images. Experimental results show that our approach achieves state-of-the-art performance on a challenging dataset (the DHG dataset from the SHREC 2017 3D Shape Retrieval Contest). Our model achieves a 91.28% classification accuracy in the 14-gesture-class case and an 84.35% classification accuracy in the 28-gesture-class case.
1 Introduction
Gesture is a natural way for a user to interact with their environment. One preferred way to infer the intent of a gesture is to use a taxonomy of gestures and to classify the unknown gesture into one of the existing categories based on the gesture data, e.g. using a neural network to perform the classification. In this paper we present and study a convolutional neural network architecture relying on intra- and inter-channel parallel processing of sequences of hand-skeletal joints' positions to classify complete hand gestures. Whereas most existing deep learning approaches to gesture recognition use RGB-D image sequences to classify gestures [41], our neural network only uses hand (3D) skeletal data sequences, which are quicker to process than image sequences. The rest of this paper is structured as follows. We first review common recognition methods in Section II. We then present the DHG dataset we used to evaluate our network in Section III. We detail our approach in Section IV in terms of motivations, architecture and results. Finally, we conclude in Section VI and discuss how our model can be improved and integrated into a real-time interactive system.
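To make the idea of intra- and inter-channel parallel convolutions concrete before the detailed discussion of Section IV, the short PyTorch sketch below is given for illustration only: the class name, branch widths, kernel size and pooling are assumptions of this note, not the architecture of [9]. Each input channel gets its own small temporal convolution (intra-channel), while one additional convolution mixes all channels jointly (inter-channel); the concatenated, time-pooled features feed a linear classifier.

# Illustrative sketch only (not the exact architecture of [9]): parallel 1D
# convolutions applied per channel ("intra-channel") and across all channels
# ("inter-channel") of a multivariate sequence.
import torch
import torch.nn as nn

class ParallelConvSketch(nn.Module):
    def __init__(self, n_channels, n_classes, hidden=8):
        super().__init__()
        # One small temporal convolution branch per input channel (intra-channel).
        self.intra_branches = nn.ModuleList([
            nn.Sequential(nn.Conv1d(1, hidden, kernel_size=7, padding=3), nn.ReLU())
            for _ in range(n_channels)
        ])
        # One branch convolving over all channels jointly (inter-channel).
        self.inter_branch = nn.Sequential(
            nn.Conv1d(n_channels, hidden, kernel_size=7, padding=3), nn.ReLU()
        )
        self.classifier = nn.Linear(hidden * (n_channels + 1), n_classes)

    def forward(self, x):  # x: (batch, n_channels, time)
        feats = [branch(x[:, i:i + 1, :]) for i, branch in enumerate(self.intra_branches)]
        feats.append(self.inter_branch(x))
        pooled = [f.mean(dim=-1) for f in feats]  # global average pooling over time
        return self.classifier(torch.cat(pooled, dim=1))

# Example call: 66 channels (e.g. 22 joints x 3 coordinates), 14 gesture classes,
# a batch of 4 sequences of 100 frames.
model = ParallelConvSketch(n_channels=66, n_classes=14)
logits = model(torch.randn(4, 66, 100))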
Note that the contents of this paper are highly similar to
that of [9], especially sections 1, 2 and 3, as well as the fi-
gure illustrating the network, however in this article we fo-
cus more on practical tips and on justifying the network ar-
chitecture whereas the original paper focus was more cen-
tered on gesture-related aspects. Readers familiar with [9]
can directly skip to the subsection Architecture Tuning of
section IV, in which the network architecture is justified
more thoroughly.
2 Definition & Related Work
We define a 3D skeletal data sequence $s$ as a vector $s = (p_1 \cdots p_n)^T$ whose components $p_i$ are multivariate time sequences. Each component $p_i = (p_i(t))_{t \in \mathbb{N}}$ is a multivariate sequence with three univariate components $p_i = (x^{(i)}, y^{(i)}, z^{(i)})$ that altogether represent the time sequence of positions $p_i(t)$ of the $i$-th skeletal joint $j_i$. Every skeletal joint $j_i$ corresponds to a distinct and precise articulation or part of one's hand in the physical world.
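As a concrete illustration of this data layout, the following minimal NumPy sketch builds such a sequence and flattens it into parallel univariate channels suitable for 1D temporal convolutions; the 22-joint count and 100-frame length are arbitrary example values, not prescribed by the definition above.

# Minimal sketch of the skeletal sequence layout: each joint j_i contributes
# three univariate channels (x_i, y_i, z_i) over time.
import numpy as np

n_frames, n_joints = 100, 22                        # arbitrary example values
positions = np.random.rand(n_frames, n_joints, 3)   # p_i(t) = (x_i(t), y_i(t), z_i(t))

# Flatten the joints into 3 * n_joints parallel univariate channels
# (one channel per coordinate of each joint), time along the last axis.
channels = positions.reshape(n_frames, n_joints * 3).T   # shape: (66, n_frames)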
In the following subsections, we present a short review of some approaches to gesture recognition. Typical approaches to hand gesture recognition begin with the extraction of spatial and temporal features from raw data. The features are then classified by a Machine Learning algorithm. The feature extraction step can either be explicit, using hand-crafted features known to be useful for classification, or implicit, using (machine-)learned features that describe the data without requiring human labor or expert knowledge. Deep Learning algorithms leverage such learned features to obtain hierarchical representations (features) that often describe the data better than hand-crafted features. As we work on skeletal data only, with a deep-learning perspective, this review pays limited attention to non-deep-learning-based approaches and to depth-based approaches; a survey on the former can be found in [19], while several recent surveys on the latter are listed in Neverova's thesis [21].
2.1 Non-deep-learning methods using hand-
crafted features
Various hand-crafted representations of skeletal data can be used for classification. These representations often describe physical attributes and constraints, or easily interpretable properties and correlations of the data, with an em-