Machine Learning for Human Motion Analysis: Theory and Applications

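Machine Learning for Human Motion Analysis: Theory and Practice, edited by Liang Wang, Li Cheng, and Guoying Zhao, examines how machine learning techniques can be used to understand and analyze human motion. It is published in the United States by Medical Information Science Reference, an imprint of IGI Global.

The book covers the application of machine learning to human motion analysis from theory through practice. As a key branch of artificial intelligence, machine learning shows considerable potential for understanding complex biological behavior, human motion patterns in particular. The book introduces how machine learning algorithms such as support vector machines (SVM), neural networks, decision trees, and random forests, as well as deep learning models such as convolutional neural networks (CNN) and recurrent neural networks (RNN), can be used to interpret and predict human movement.

First, the book describes the physiological basis of human motion and the techniques used to capture it, including motion capture systems, sensors, and video analysis. These techniques are the key to obtaining high-quality motion data and provide a reliable source for the subsequent machine learning analysis.

Second, it discusses preprocessing and feature extraction, the steps that largely determine whether a machine learning model succeeds. This material may involve time-series analysis, image processing, and signal processing, and how to extract meaningful features such as joint angles, velocities, and accelerations from raw data.

Next, it introduces a range of machine learning algorithms and explains how they apply to human motion recognition, classification, and prediction: for example, the advantages of support vector machines when training samples are few, the ability of neural networks to handle nonlinear relationships, and the effectiveness of deep learning models on high-dimensional data and pattern recognition.

The book also covers model evaluation and optimization strategies such as cross-validation, hyperparameter tuning, and ensemble learning, which are essential for improving model performance. It discusses challenges met in practice as well, such as imbalanced data, real-time requirements, and privacy protection.

Finally, the book may include case studies showing how the theory is applied in competitive sports, rehabilitation medicine, virtual reality, human-computer interaction, and other fields. These cases illustrate the practical value of machine learning in human motion analysis and offer experience and guidance to researchers and engineers in related areas.

Overall, the book is a comprehensive reference on using machine learning to understand, analyze, and predict human motion, and is valuable to scholars and practitioners in artificial intelligence, biomechanics, sports science, and related fields.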
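The feature extraction step mentioned above (joint angles, velocities, accelerations) can be illustrated with a minimal sketch. It is not taken from the book: the function names are hypothetical, joint positions are assumed to be given as 2D or 3D coordinates, and derivatives are approximated with finite differences.

```python
import numpy as np

def joint_angle(a, b, c):
    """Angle in radians at joint b, formed by the segments b->a and b->c."""
    a, b, c = (np.asarray(p, dtype=float) for p in (a, b, c))
    u, v = a - b, c - b
    cos_theta = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    # Clip to guard against rounding slightly outside [-1, 1].
    return float(np.arccos(np.clip(cos_theta, -1.0, 1.0)))

def velocity_and_acceleration(angles, fps):
    """Finite-difference estimates of angular velocity and acceleration."""
    dt = 1.0 / fps  # time between consecutive frames, in seconds
    velocity = np.gradient(np.asarray(angles, dtype=float), dt)
    acceleration = np.gradient(velocity, dt)
    return velocity, acceleration
```

For example, `joint_angle(shoulder, elbow, wrist)` evaluated on every frame yields an elbow-angle time series that can then be passed to `velocity_and_acceleration`.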

Chapter 7, A Generic Framework for 2D and 3D Upper Body Tracking, targets upper body tracking, the problem of tracking the pose of the human body from video sequences. It is difficult due to such problems as the high dimensionality of the state space, self-occlusion, and appearance changes. In this chapter, the authors propose a generic framework that can be used for both 2D and 3D upper body tracking and can be easily parameterized without heavily depending on supervised training. They first construct a Bayesian Network (BN) to represent the human upper body structure and then incorporate into the BN various generic physical and anatomical constraints on the parts of the upper body. They also explicitly model part occlusion, which makes it possible to automatically detect the occurrence of self-occlusion and to minimize the effect of occlusion-induced measurement errors on tracking accuracy. Using the proposed model, upper body tracking can be performed through probabilistic inference over time. A series of experiments were performed on both monocular and stereo video sequences to demonstrate the effectiveness and capability of the model in improving upper body tracking accuracy and robustness.

Chapter 8, Real-Time Recognition of Basic Human Actions, describes a simple and computationally efficient, appearance-based approach for real-time recognition of basic human actions. The authors apply a technique that depicts the differences between two or more successive frames, accompanied by a threshold filter, to detect the regions of the video frames where some type of human motion is observed. From each frame difference, the algorithm extracts an incomplete and unformed human body shape and generates a skeleton model which represents it in an abstract way. Eventually, the recognition process is formulated as a time-series problem and handled by a very robust and accurate prediction method (Support Vector Regression). The proposed technique could be employed in applications such as vision-based autonomous robots and surveillance systems.
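The frame differencing and thresholding step described for Chapter 8 can be sketched as follows. This is only an illustration under stated assumptions, not the chapter's implementation: frames are assumed to be grayscale NumPy arrays, and the function names and the default threshold of 25 are hypothetical.

```python
import numpy as np

def motion_mask(prev_frame, next_frame, threshold=25):
    """Boolean mask of pixels whose intensity changed by more than threshold."""
    # Use a signed type so that subtracting uint8 frames cannot wrap around.
    prev = np.asarray(prev_frame, dtype=np.int16)
    nxt = np.asarray(next_frame, dtype=np.int16)
    return np.abs(nxt - prev) > threshold

def motion_bounding_box(mask):
    """(top, left, bottom, right) of the moving region, or None if nothing moved."""
    rows, cols = np.nonzero(mask)
    if rows.size == 0:
        return None
    return int(rows.min()), int(cols.min()), int(rows.max()), int(cols.max())
```

The resulting mask marks the region from which a body shape and skeleton model would subsequently be extracted.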

Chapter 9, Fast Categorisation of Articulated Human Motion, explores the problem of visual categorisation of human motion in video clips. Most published methods either analyse an entire video and assign it a single category label, or use relatively large look-ahead to classify each frame. Contrary to these strategies, the human visual system demonstrates that simple categories can be recognised almost instantaneously. Here the authors present a system for categorisation from very short sequences (“snippets”) of 1–10 frames, and systematically evaluate it on several data sets. It turns out that even local shape and optic flow for a single frame are enough to achieve 80-90% correct classification, and snippets of 5-7 frames (0.2-0.3 seconds of video) yield results on par with the ones state-of-the-art methods obtain on entire video sequences.
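To make the snippet idea concrete, the sketch below enumerates sliding snippets over a clip and converts snippet length to duration. The function names are hypothetical, and the default of 25 frames per second is an assumption inferred from the figures above (5-7 frames corresponding to roughly 0.2-0.3 seconds); the chapter's actual frame rate is not stated here.

```python
def snippet_windows(num_frames, snippet_len):
    """(start, end) frame indices, end exclusive, of every sliding snippet."""
    if snippet_len < 1:
        raise ValueError("snippet_len must be at least 1")
    # A clip shorter than the snippet length yields no windows.
    return [(s, s + snippet_len) for s in range(num_frames - snippet_len + 1)]

def snippet_duration(snippet_len, fps=25.0):
    """Duration in seconds of a snippet at the given (assumed) frame rate."""
    return snippet_len / fps
```

Each window would be classified on its own, so a category label becomes available after only `snippet_len` frames rather than after the whole clip.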

Chapter 10, Human Action Recognition with Expandable Graphical Models, proposes an action recognition system that is independent of the subjects who perform the actions, independent of the speed at which the actions are performed, robust against noisy extraction of the features used to characterize the actions, scalable to a large number of actions, and expandable with new actions. In this chapter, the authors describe a recently proposed expandable graphical model of human actions that has the promise to realize such a system. The chapter first presents a brief review of recent developments in human action recognition. Then, the expandable graphical model is presented in detail, and a system that learns and recognizes human actions from sequences of silhouettes using the expandable graphical model is developed.

Chapter 11, Detection and Classification of Interacting Persons, presents a way to classify interactions between people. Examples of the interactions investigated are people meeting one another, walking together, and fighting. A new feature set is proposed along with a corresponding classification method. Results are presented which show the new method performing significantly better than the previous state-of-the-art method proposed by Oliver et al.

Chapter 12, Action Recognition, first reviews current action recognition methods from the following two aspects: action representation and recognition strategy. Then, a novel method for classifying human actions from image sequences is investigated. In this method, the human action is represented by a set of shape context features of the human silhouette, and a dominant sets-based approach is employed to classify the predefined actions. A comparison of the dominant sets-based approach with K-means, mean shift, and fuzzy C-means is also discussed.
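For readers unfamiliar with dominant sets, the sketch below shows one common way to extract a dominant set from a pairwise affinity matrix using replicator dynamics. It is an illustration under assumptions, not the chapter's method: the function name, iteration limit, tolerance, and membership cutoff are hypothetical, and the affinity matrix is assumed to be symmetric and non-negative with a zero diagonal.

```python
import numpy as np

def dominant_set(affinity, iterations=1000, tol=1e-8, cutoff=1e-4):
    """Return (member indices, weights) of one dominant set via replicator dynamics."""
    A = np.asarray(affinity, dtype=float)
    n = A.shape[0]
    x = np.full(n, 1.0 / n)  # start from the barycenter of the simplex
    for _ in range(iterations):
        Ax = A @ x
        denom = x @ Ax
        if denom <= 0:  # no positive affinity left among weighted items
            break
        new_x = x * Ax / denom  # replicator update; weights keep summing to 1
        converged = np.abs(new_x - x).sum() < tol
        x = new_x
        if converged:
            break
    # Items that retain non-negligible weight form the dominant set.
    return np.nonzero(x > cutoff)[0], x
```

Unlike K-means, this procedure does not need the number of clusters in advance: further clusters can be obtained by removing the extracted members and repeating the procedure on the remaining items.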

Chapter 13, Distillation: A Super-Resolution Approach for the Selective Analysis of Noisy and Unconstrained Video Sequences, argues that image super-resolution is one of the most appealing applications of image processing, capable of retrieving a high resolution image by fusing several registered low resolution images depicting an object of interest. However, employing super-resolution in video data is challenging: a video sequence generally contains a lot of scattered information regarding several objects of interest in cluttered scenes. The objective of this chapter is to demonstrate why standard image super-resolution fails on video data, which problems arise, and how these problems can be overcome. The authors propose a novel Bayesian framework for super-resolution of persistent objects of interest in video sequences, called Distillation. With Distillation, they extend and generalize the image super-resolution task, embedding it in a structured framework that accurately distills all the informative bits of an object of interest. They also extend the Distillation process to deal with objects of interest whose transformations in appearance are not (only) rigid. The ultimate product of the overall process is a strip of images that describe at high resolution the dynamics of the video, switching between alternative local descriptions in response to visual changes. The approach is first tested on synthetic data, obtaining encouraging comparative results with respect to known super-resolution techniques, and good robustness against noise. Second, real data coming from different videos are considered, trying to resolve the major details of the objects in motion.

In summary, this book contains an excellent collection of theoretical and technical chapters written by different authors who are world-recognized researchers on various aspects of human motion understanding using machine learning methods. The target audience is mainly researchers, engineers, and graduate students in the areas of computer vision and machine learning. The book is also intended to be accessible to a broader audience, including practicing professionals working with specific vision applications such as video surveillance, sport event analysis, healthcare, video conferencing, and motion video indexing and retrieval. We hope this book will help toward the development of robust yet flexible vision systems.

Liang Wang

University of Bath, UK

Li Cheng

TTI-Chicago, USA

Guoying Zhao

University of Oulu, Finland

May 20, 2009


Acknowledgment

Human motion analysis and understanding is fundamental in many real applications, including surveillance and monitoring, human-machine interfaces, sport event analysis, medical motion analysis and diagnosis, and motion kinematics modeling. The statistical learning approach is one major frontier for computer vision research. In recent years, machine learning, and especially statistical learning theories and techniques, have seen rapid and fruitful developments, and are on the way to making significant contributions to the area of vision-based human motion understanding. This edited book provides a comprehensive treatment of recent developments in the application of modern statistical machine learning approaches for modeling, analyzing, and understanding human motions from video data. We would like to express our sincere thanks to IGI Global for offering us the opportunity to edit such a book on this exciting area.

During the editing of this book, we received much help and support. First of all, we would like to thank all of the authors for submitting their wonderful works, and apologize that not all chapter submissions could be accepted. We are also grateful to all of the chapter reviewers for their remarkable efforts in providing timely reviews of high quality. It was a great honor to have these world-leading experts join the Editorial Advisory Board of this book. They are Prof. Richard Hartley (Australian National University, Australia), Prof. Terry Caelli (National ICT Australia, Australia), Prof. Weiming Hu (Chinese Academy of Sciences, China), Prof. Matti Pietikäinen (University of Oulu, Finland), Prof. Greg Mori (Simon Fraser University, Canada), and Prof. Dit-Yan Yeung (Hong Kong University of Science and Technology, China). We appreciate their valuable suggestions to strengthen the overall quality of this book and their help in promoting this publication.

This book would not have been possible without the help of the people at IGI Global. As a full-service publishing company, IGI Global handles all tasks related to production, registration, marketing and promotion, overseas distribution, and so on. In addition to thanking IGI Global for its financial and technical support, we give special thanks to Tyler Heath (Assistant Development Editor) for his assistance in guiding us each step of the way.

Liang Wang

University of Bath, UK

Li Cheng

TTI-Chicago, USA

Guoying Zhao

University of Oulu, Finland

May 20, 2009

Chapter 1

Human Motion Tracking in Video: A Practical Approach

Tony Tung

Kyoto University, Japan

Takashi Matsuyama

Kyoto University, Japan

1. Introduction

Human motion tracking is a common requirement for many real world applications, such as video surveillance, games, and cultural and medical applications (e.g. for motion and behavior study). The literature has provided successful algorithms to detect and track objects of a predefined class in image streams or videos. Simple objects can be detected and tracked using various image features such as color regions, edges, contours, or texture. On the other hand, complex objects such as human faces require more sophisticated features to handle the multiple possible instances of the object

Abstract

This chapter presents a new formulation for the problem of human motion tracking in video. Tracking is still a challenging problem when strong appearance changes occur, as in videos of humans in motion. Most trackers rely on a predefined template or on a training dataset to achieve detection and tracking. Therefore they are not efficient at tracking objects whose appearance is not known in advance. A solution is to use an online method that iteratively updates a subspace of reference target models. In addition, we propose to integrate color and motion cues in a particle filter framework to track human body parts. The algorithm consists of two modes, switching between detection and tracking. The detection steps involve trained classifiers to update estimated positions of the tracking windows, whereas tracking steps rely on an adaptive color-based particle filter coupled with optical flow estimations. The Earth Mover distance is used to compare color models in a global fashion, and constraints on flow features avoid drifting effects. The proposed method has proven efficient at tracking body parts in motion and can cope with full appearance changes. Experiments were performed on challenging real world videos with poorly textured models and non-linear motions.
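As a rough illustration of how the Earth Mover distance compares two color models, the sketch below computes it for the special case of one-dimensional histograms with equally spaced bins, where it reduces to the summed absolute difference of the cumulative distributions. This is an assumption-laden simplification, not the chapter's formulation: the function name is hypothetical, and the chapter's color models may well be multi-dimensional and require a general transport solver.

```python
import numpy as np

def emd_1d(hist_p, hist_q):
    """Earth Mover distance between two 1D histograms with unit bin spacing."""
    p = np.asarray(hist_p, dtype=float)
    q = np.asarray(hist_q, dtype=float)
    if p.shape != q.shape:
        raise ValueError("histograms must have the same number of bins")
    # Normalize so both histograms carry the same total mass.
    p = p / p.sum()
    q = q / q.sum()
    # In 1D, the cost is the mass that must cross each bin boundary.
    return float(np.abs(np.cumsum(p - q)).sum())
```

Unlike a bin-by-bin comparison, this distance grows with how far mass has to move, so a slight shift in hue is penalized less than a jump to a very different color.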

DOI: 10.4018/978-1-60566-900-7.ch001
