Combining 3D Joints Moving Trend and Geometry
Property for Human Action Recognition
Bangli Liu¹, Hui Yu², Xiaolong Zhou¹,³, Honghai Liu¹
¹School of Computing, University of Portsmouth, UK
²School of Creative Technologies, University of Portsmouth, UK
³College of Computer Science and Technology, Zhejiang University of Technology, China
Abstract—Depth image based human action recognition has attracted much attention due to the popularity of depth sensors. However, accurate recognition remains a challenge because of variations in object appearance, pose and video sequences. In this paper, a novel skeleton joints descriptor based on the 3D Moving Trend and Geometry (3DMTG) property is proposed for human action recognition. Specifically, a histogram of 3D moving directions between consecutive frames is constructed for each joint to represent the 3D moving trend feature in the spatial domain. The geometry information of the joints in each frame is modelled by their motion relative to the initial status. The proposed feature descriptor is evaluated on two popular datasets. The experimental results demonstrate the superior performance of our method over state-of-the-art methods, and in particular higher recognition rates for complex actions.
Index Terms—Human action recognition, 3D Moving Trend,
geometry property.
I. INTRODUCTION
Owing to its immense range of applications in human-machine interaction, video surveillance, elderly care and entertainment, human action recognition has been attracting extensive attention in computer vision. Early strategies mainly recognize human actions from 2D sequences captured by RGB cameras [1][2][3][4]. However, sensitivity to illumination changes and variations in subject texture often degrades recognition accuracy. These problems can be alleviated by using depth information acquired by depth sensors such as the Microsoft Kinect and ASUS Xtion, which have been promoting research on human action recognition. Because depth images provide an additional dimension of information (the depth data), they have encouraged many depth sensor based recognition methods.
With the availability of 3D joint positions extracted by a real-time skeleton tracking algorithm [5], many researchers use these joints to build action representations. For example, a histogram of 3D joint locations (HOJ3D) is proposed in [6] to represent human postures. Gowayyed et al. [7] propose a 2D trajectory descriptor for each skeleton joint, where the 3D joint trajectory is projected onto three planes, and a histogram of oriented displacements (HOD) records the angles of the displacements between consecutive frames in each plane.
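To make the HOD idea concrete, the following is a minimal sketch (not the exact formulation of [7], which additionally weights histogram bins by displacement length): a joint's 3D trajectory is projected onto the xy, yz and xz planes, and the orientation of each between-frame displacement is accumulated into a per-plane histogram. The array shapes and bin count are illustrative assumptions.

```python
import numpy as np

def hod_descriptor(trajectory, n_bins=8):
    """Simplified histogram of oriented displacements (HOD), after [7].

    trajectory: (T, 3) array holding one joint's 3D position in T frames.
    Returns the concatenation of one orientation histogram per 2D plane.
    """
    histograms = []
    for a, b in [(0, 1), (1, 2), (0, 2)]:             # xy, yz, xz planes
        disp = np.diff(trajectory[:, [a, b]], axis=0) # 2D displacements
        angles = np.arctan2(disp[:, 1], disp[:, 0])   # orientations in [-pi, pi]
        hist, _ = np.histogram(angles, bins=n_bins, range=(-np.pi, np.pi))
        histograms.append(hist / max(hist.sum(), 1))  # normalised counts
    return np.concatenate(histograms)
```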
Inspired by, yet quite different from, [7], we partition the moving directions of joints into m even bins according to m reference vectors, and introduce a histogram of 3D directions. The histogram records the moving trend of each joint over the entire sequence. Moreover, we also propose a sequenced motion feature by extracting the geometry property of each joint. The final feature descriptor is the concatenation of these two types of features; a brief sketch of both is given after the list below. The contributions of this paper are as follows.
1) A new histogram projection method is proposed to extract the 3D moving trend of each joint, which describes its specific tendency in 3D space.
2) The geometry property of the joints is constructed from the motion of each frame relative to the initial status, to represent the evolution of actions.
3) A novel scale-invariant skeleton joints feature descriptor based on the 3D Moving Trend and Geometry (3DMTG) property, named the 3DMTG descriptor, is proposed for human action recognition. Experimental results show that the proposed feature descriptor outperforms many leading state-of-the-art methods, and in particular recognizes actions better in cross-subject tests.
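Section III gives the full formulation; the sketch below only illustrates the two feature types under stated assumptions. The moving-trend histogram lets each between-frame displacement of a joint vote for the nearest of m given unit reference direction vectors, and the geometry property records each frame's joint positions relative to the initial frame. The function names, the voting rule and the normalisation are illustrative assumptions rather than the paper's exact method, and the scale-normalisation preprocessing is omitted.

```python
import numpy as np

def moving_trend_histogram(joint_seq, directions):
    """Histogram of 3D moving directions for one joint (sketch).

    joint_seq:  (T, 3) positions of one joint over T frames.
    directions: (m, 3) unit vectors defining the m direction bins.
    Each frame-to-frame displacement votes for its nearest direction.
    """
    disp = np.diff(joint_seq, axis=0)                      # (T-1, 3)
    norms = np.linalg.norm(disp, axis=1, keepdims=True)
    disp = disp / np.maximum(norms, 1e-8)                  # unit displacements
    bins = np.argmax(disp @ directions.T, axis=1)          # max cosine = nearest
    hist = np.bincount(bins, minlength=len(directions))
    return hist / max(hist.sum(), 1)                       # normalised histogram

def geometry_property(joint_seq):
    """Per-frame motion of one joint relative to its initial status (sketch)."""
    return (joint_seq - joint_seq[0]).ravel()

def descriptor_3dmtg(skeleton_seq, directions):
    """Concatenate both feature types over all joints (sketch).

    skeleton_seq: (T, J, 3) positions of J joints over T frames.
    """
    feats = []
    for j in range(skeleton_seq.shape[1]):
        feats.append(moving_trend_histogram(skeleton_seq[:, j], directions))
        feats.append(geometry_property(skeleton_seq[:, j]))
    return np.concatenate(feats)
```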
The remainder of this paper is organized as follows: Section II reviews related work on human action recognition. Section III introduces the process of modelling the 3DMTG feature descriptor. Section IV reports the experimental results as well as comparisons with state-of-the-art methods. Section V summarizes the work of this paper.
II. RELATED WORK
In recent years, an extensive literature has emerged on depth image based human motion recognition. Depending on the feature types used, these methods can be broadly divided into two categories: depth map based methods and skeletal joints/body parts based methods.
Depth map based methods mainly extract spatial features over time [8]. Some authors [9][10] project depth images onto three 2D orthogonal planes to capture action features from diverse viewpoints. In [9], a depth motion map (DMM)
is generated by accumulating motion energy over the whole sequence, and a histogram of oriented gradients (HOG) is computed for each DMM to describe actions.
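As a rough illustration of this construction (a simplified sketch of [9], not its exact projection and thresholding scheme), the following accumulates thresholded frame-to-frame depth differences into a single 2D motion-energy map; the threshold value is an assumption.

```python
import numpy as np

def depth_motion_map(depth_frames, eps=10.0):
    """Simplified depth motion map (DMM) sketch, after [9].

    depth_frames: (T, H, W) depth maps of one projected view.
    eps: assumed motion-energy threshold (illustrative).
    Returns an (H, W) map counting how often each pixel moved.
    """
    diffs = np.abs(np.diff(depth_frames.astype(np.float32), axis=0))
    return (diffs > eps).sum(axis=0)
```

A HOG descriptor would then be computed on each DMM (for instance with skimage.feature.hog) to obtain the final action representation, as in [9].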
Local interest points and occupancy patterns have also been presented as action descriptors [11][12]. Vieira et al. [12] apply space-time occupancy
patterns (STOP), where the depth map sequence is represented
as a 4D grid of same-size cells whose occupancy values are recorded. A saturation scheme is used to enhance the
cells containing more information about either silhouettes or
moving parts of the body. In [13], the 4D spatio-temporal
feature is captured using information from both RGB and