dynamic nature of facial actions, individually recognizing each AU intensity is neither accurate nor reliable for spontaneous facial expressions. Understanding spontaneous facial expression requires not only improving facial motion observations but also, more importantly, exploiting the spatiotemporal interactions among facial motions, since the coherent, coordinated, and synchronized interactions among AUs produce a meaningful facial expression.
Some previous studies focused on exploiting the semantic and dynamic relationships among facial actions [35,36,25]. Lien et al. [35] and Valstar et al. [36] employed a set of Hidden Markov Models (HMMs) to represent the evolution of facial actions in time. Tong et al. [25,37] constructed a Dynamic Bayesian Network (DBN) based model to further exploit the semantic and temporal dependencies among facial actions. However, these works are all limited to detecting the presence or absence of AUs, mainly in posed expressions. Recently, some works have exploited the interactions among AU intensity values. Sandbach et al. [58] employed a Markov Random Field to model the static correlations among AU intensities, which improves the recognition accuracy compared to regressors, i.e., SVRs and Directed Acyclic Graph Support Vector Machines. Conditional Ordinal Random Fields (CORF) were extended in [59] for application to AU intensity estimation. However, this method [59] can only estimate the intensity of an AU when the presence of the AU is already known. Baltrušaitis et al. [57] combined ANNs and Continuous Conditional Random Fields (CCRF) into Continuous Conditional Neural Fields (CCNF) for structured regression in AU intensity estimation, where each hidden component consists of a neural layer.
In this work, we construct a DBN to systematically model the
spatiotemporal interactions among multi-level AU intensities.
Advanced machine learning techniques are employed to train
the framework from both subjective prior knowledge and training
data. The proposed method differs from previous works [25,37] in
both theory and applications. Theoretically, this paper focuses on
exploiting the AU intensity correlations and modeling the spatiotemporal dependencies among multi-level AU intensities. In terms
of applications, the focus of this paper is to design and develop an
automatic system to measure the intensity of spontaneous facial
action units, which is much more challenging than detecting
posed facial action units.
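To make the modeling concrete, the sketch below shows how a two-slice DBN over a few AU intensity variables could be assembled with the pgmpy library. The particular AUs and edges are illustrative placeholders chosen for exposition; they are not the structure learned by our method in Section 4.

# A minimal two-slice DBN over AU intensity variables using pgmpy.
# The AUs and edges below are hypothetical placeholders, not the
# structure learned from data in Section 4.
from pgmpy.models import DynamicBayesianNetwork as DBN

dbn = DBN()
# Intra-slice ("spatial") edges: dependencies among AU intensities
# within one frame, e.g., cheek raiser (AU6) with lip corner puller
# (AU12) in a smile.
dbn.add_edges_from([
    (("AU6", 0), ("AU12", 0)),
    (("AU1", 0), ("AU2", 0)),
])
# Inter-slice ("temporal") edges: each AU's intensity in slice 1
# depends on its own intensity in slice 0.
dbn.add_edges_from([
    (("AU1", 0), ("AU1", 1)),
    (("AU2", 0), ("AU2", 1)),
    (("AU6", 0), ("AU6", 1)),
    (("AU12", 0), ("AU12", 1)),
])
print(dbn.edges())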
Fig. 2 gives the flowchart of the proposed online AU intensity measuring system, which consists of two independent but collaborative phases: image observation extraction and DBN inference.
First, we detect the face and 66 facial landmark points in videos
automatically. Then we register the images according to the
detected facial landmark points. HOG and Gabor features are
employed to describe local appearance changes of the face. After
that, SVM classification produces an observation score for each AU
intensity individually. Given the image observations for all AUs, the
AU intensity recognition is accomplished through probabilistic
inference by systematically integrating the image observation with
the proposed DBN model.
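To summarize the data flow, a minimal Python skeleton of this two-phase pipeline is given below. The stage callables (detect_landmarks, register_face, extract_features, dbn_infer) and the per-AU SVMs are passed in as placeholders standing for the components described in Sections 3 and 4; this is an illustrative sketch, not our released implementation.

import numpy as np

def measure_au_intensities(frames, detect_landmarks, register_face,
                           extract_features, svms, dbn_infer):
    """Estimate AU intensities for each frame of a video (list of images).
    All stage callables are placeholders for the components of Fig. 2."""
    estimates = []
    for frame in frames:
        landmarks = detect_landmarks(frame)      # 66 facial landmark points
        face = register_face(frame, landmarks)   # warp into reference frame
        features = extract_features(face)        # HOG + Gabor descriptor
        # Phase 1: an independent observation score for each AU intensity.
        observations = {au: svm.decision_function(features[np.newaxis])
                        for au, svm in svms.items()}
        # Phase 2: probabilistic inference fuses these observations with
        # the DBN's spatiotemporal AU dependencies.
        estimates.append(dbn_infer(observations))
    return estimates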
The remainder of the paper is organized as follows. Section 3 describes the AU observation extraction method. In Section 4, we build the DBN model for AU intensity recognition, including BN model structure learning (Section 4.1), DBN model parameter learning (Section 4.3), and DBN inference (Section 4.4). Section 5 presents our experimental results and discussion, and Section 6 concludes the paper.
3. AU intensity observation extraction
In this section we describe our AU intensity image observation extraction method, which consists of face registration, facial image representation, dimensionality reduction, and SVM classification.
3.1. Face registration
Image registration is a commonly used technique for aligning similar data (i.e., a reference and a sensed image). To register two images, a set of landmark points is often used to represent and align them. In our study, we used the 66 landmark points of the DISFA database to represent the locations of the important facial components [9]. To obtain the reference landmark points, we averaged the 66 landmark points over the entire training set. A 2D similarity transformation and bilinear interpolation were then used to transform each new image into the reference coordinate system. The registered images are masked to extract the facial regions and resized to 128 × 108 pixels.
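As an illustration of this step, the sketch below uses scikit-image to estimate the 2D similarity transform from a frame's detected landmarks to the mean reference landmarks, and to warp the frame with bilinear interpolation. The function and variable names, and the choice of scikit-image, are our assumptions for exposition rather than a description of the original implementation.

import numpy as np
from skimage.transform import SimilarityTransform, warp

def register_face(image, landmarks, reference_landmarks,
                  out_shape=(128, 108)):
    """Warp `image` so its 66 landmarks align with the reference shape."""
    tform = SimilarityTransform()
    tform.estimate(landmarks, reference_landmarks)  # detected -> reference
    # warp() maps each output pixel back through the inverse transform;
    # order=1 selects bilinear interpolation.
    return warp(image, tform.inverse, output_shape=out_shape, order=1)

# The reference shape is the mean of the 66 landmark points over the
# training set; with train_landmarks of shape (n_images, 66, 2):
# reference_landmarks = train_landmarks.mean(axis=0)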
3.2. Facial image representation
After registering the facial images, we utilized two well-known feature extraction techniques that are capable of representing appearance information: Histograms of Oriented Gradients (HOG) and localized Gabor features, which are described below.
Fig. 2. The flowchart of the proposed online AU intensity recognition system.
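As a concrete (if simplified) sketch of these two descriptors, the code below computes a HOG vector and pooled Gabor filter responses with scikit-image and SciPy. The parameter values (orientations, cell sizes, Gabor frequencies) and the mean/variance pooling are illustrative assumptions, not the exact settings used in our experiments.

import numpy as np
from scipy.ndimage import convolve
from skimage.feature import hog
from skimage.filters import gabor_kernel

def extract_features(face):
    """face: registered grayscale image (e.g., 128 x 108 floats in [0, 1])."""
    hog_feat = hog(face, orientations=8, pixels_per_cell=(8, 8),
                   cells_per_block=(2, 2))
    gabor_feats = []
    for theta in np.arange(0, np.pi, np.pi / 4):   # 4 orientations
        for freq in (0.1, 0.25):                   # 2 spatial frequencies
            kernel = np.real(gabor_kernel(freq, theta=theta))
            response = convolve(face, kernel, mode='wrap')
            # Simple global pooling of each filter response; the localized
            # Gabor features described below pool over facial regions instead.
            gabor_feats.extend([response.mean(), response.var()])
    return np.concatenate([hog_feat, np.asarray(gabor_feats)])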