运动微动LLC编码在3D人体姿态估计中的应用

18 浏览量更新于2024-08-29 收藏 453KB PDF 举报

"Pose estimation with motionlet LLC coding是关于3D人体姿态估计的研究，该技术在众多应用中具有重要价值。由于人类姿态空间的高维性和分布的多模态性，从图像描述符到3D人体姿态配置的非线性映射学习是一个难题。为了解决这些问题，文章提出了一种新颖的基于运动小波的LLC编码方法。运动小波由覆盖图像空间、姿态空间和时间流局部区域的训练样本组成。首先，将最具信息性和有益的训练样本聚类成运动小波，然后通过LLC编码学习非线性映射并得到候选姿态，最后选择最合适的姿态作为结果估计。为了进一步消除歧义和提高鲁棒性，该框架还扩展到了多视图设置中。" 在3D人体姿态估计领域，研究者们一直在寻找有效的解决方案来应对高维度姿态空间和数据分布的多模态特性。本文提出的"运动小波LLC编码"方法提供了一个创新的视角。"运动小波"是作者为解决这一问题设计的一种新概念，它结合了图像空间、姿态空间和时间序列的信息，通过对这些关键区域的训练样本进行编码，以捕捉到更丰富的动作特征。局部线性编码（LLC）是一种降维和编码技术，常用于特征表示和学习。在本文中，LLC被应用于运动小波中，以学习从图像描述符到3D姿态的非线性映射。通过这种方式，可以生成一系列可能的姿态候选人。然后，通过比较和分析这些候选，选取最符合输入图像特征的姿势作为最终的估计结果。为了提高估计的准确性和稳定性，作者进一步将单视图方法扩展到多视图设置。多视图方法利用不同角度的图像信息，可以减少姿态估计的不确定性，从而增强系统对光照变化、遮挡或复杂背景的鲁棒性。这种方法有助于减少姿态估计中的歧义，并提高在实际应用中的性能。 "Pose estimation with motionlet LLC coding"提供了一种有效处理3D人体姿态估计挑战的新策略，通过运动小波和LLC编码，结合多视图信息，提高了姿态估计的准确性和鲁棒性，对于计算机视觉和人工智能领域具有重要的理论和实践意义。

Pose Estimation with Motionlet LLC Coding

Li Sun, Mingli Song, Jiajun Bu, and Chun Chen

Zhejiang Provincial Key Laboratory of Service Robot,

College of Computer Science, Zhejiang University

{lsun,brooksong,bjj,chenc}@zju.edu.cn

Abstract. 3D human pose estimation is a challenging but important

research topic with abundant applications. As for discriminative human

pose estimation, the main goal is to learn a nonlinear mapping from im-

age descriptors to 3D human pose conﬁgurations, which is diﬃcult due

to the high-dimensionality of human pose space and the multimodal-

ity of the distribution. To address these problems, we propose a novel

motionlet LLC coding on a discriminative framework. A motionlet con-

sists of training examples covering a local area in terms of image space,

pose space and time stream. We ﬁrst group most informative and help-

ful training examples into motionlets, then perform LLC Coding to learn

the nonlinear mapping and get candidate poses, and ﬁnally choose the

most appropriate pose as the result estimate. To further eliminate am-

biguities and improve robustness, we extend our framework to incorpo-

rate multiviews. We conduct qualitative evaluation on our Taichi data

set and quantitative evaluation on HumanEva data set, which show that

our approach has gained the-state-of-the-art performance and signiﬁcant

improvement against previous approaches.

Keywords: human pose estimation, multimodality, multiview, motion-

let, LLC coding.

1 Introduction

3D human pose estimation from images is a challenging but important research

topic with applications in many areas including Human-Computer Interaction,

robotics, surveillance, computer graphics and sport science. Recent approaches

to 3D human pose estimation can be roughly classiﬁed into two categories, gen-

erative and discriminative. Generative approaches explicitly model human body

appearance and kinematic constraints and usually concentrate on development

of eﬃcient inference methods that are able to handle the high dimensionality of

human pose. Discriminative approaches directly learn the mapping from image

space to pose space.

Discriminative approaches are popular due to their ﬂexibility of choosing

image descriptors, easy adaptation to diﬀerent learning methods, no need for

initialization, and most importantly, the ability of fast inference in real-world

databases. The main goal of discriminative 3D human pose estimation is to

W. Lin et al. (Eds.): PCM 2012, LNCS 7674, pp. 435–443, 2012.

 Springer-Verlag Berlin Heidelberg 2012

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38746926

粉丝: 12
资源: 994

运动微动LLC编码在3D人体姿态估计中的应用

最新资源