ROI深度卷积神经网络驱动的高效表情识别提升策略

需积分: 9 60 浏览量更新于2024-08-26 收藏 538KB PDF 举报

本文主要探讨了"基于ROI深度卷积神经网络的改进表情识别方法"，该研究发表于2017年的第七届情感计算与智能交互国际会议(ACII)。作者们，来自合肥工业大学计算机与信息学院的肖松、吕曼、以及来自日本神户大学计算科学系的泉畅勤和来自同一学院的任富机，共同提出了这一创新性的面部表情识别技术。传统的面部表情识别往往依赖于全局特征提取，然而，这种方法可能忽视了表情关键区域的信息。为解决这个问题，研究者们引入了Region of Interest (ROI)的概念，即关注与表情变化紧密相关的特定面部区域。通过ROI指导深度卷积神经网络（CNN），这种方法能够增强训练数据的有效性，因为不同ROI区域之间的关系可以被利用来强化预测结果的可靠性。在具体实施阶段，研究者采用了两种不同的识别方法进行比较和优化。一种可能是利用ROI的分割策略，将原始图像分解成多个包含表情特征的小区域，然后分别输入到CNN中，从而捕捉每个区域的特征并整合。另一种可能是设计特殊的ROI注意力机制，让CNN能够自动学习并突出显示那些对表情识别至关重要的区域，提高识别精度。此外，文章可能还讨论了如何处理ROI选择的问题，比如动态ROI选取、ROI区域大小的调整、以及如何平衡全局信息和局部特征的融合。同时，他们可能也探讨了在有限的数据集上应用ROI-CNN的挑战，如过拟合的抑制和数据增强技术的使用。为了验证方法的有效性，研究者可能进行了大量的实验，包括但不限于不同表情数据库的测试，对比与传统方法的性能差异，以及在不同光照、姿态和表情复杂度条件下的鲁棒性评估。论文最后可能会总结出ROI深度卷积神经网络在表情识别中的优势，以及对未来工作的展望，例如如何进一步提升模型的泛化能力和实时性。这篇研究论文旨在通过ROI引导的深度学习策略，提升面部表情识别的准确性和效率，为相关领域的研究和技术应用提供了新的思路和实践方案。

2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII)

Improved Facial Expression Recognition Method Based on ROI Deep Convolutional

Neutral Network

Xiao Sun

School of Computer and Information

Hefei University of Technology

Hefei, Anhui, 230009

Email: sunx@hfut.edu.cn

Man Lv

School of Computer and Information

Hefei University of Technology

Hefei, Anhui, 230009

Email: lvxman@foxmail.com

Changqin Quan

Department of Computational Science

Kobe University

Kobe, Japan, 6578501

Email: quanchqin@gold.kobe-u.ac.jp

Fuji Ren

School of Computer and Information

Hefei University of Technology

Hefei, Anhui, 230009

Email: ren2fuji@gmail.com

Abstract—This paper, we proposed an improved facial expres-

sion recognition (FER) method based on region of interesting

(ROI) to guide the convolutional neutral networks (CNN) focus

on the areas associated with the expression. This method can

not only augment the training data, the relationship between

the different ROI areas is helpful to intensify the reliability of

the predicted targets. In test stage, we investigated two recog-

nition methods: identify the test image directly; implemented

decision fusion strategy on ROI areas. The model we used

is ﬁne-tuned from pre-trained deep CNN instead of training

from scratch. In addition, we presented an innovative region-

based image augmentation method named artiﬁcial face to

increase the limited database. This method using expression

retargeting as an expression-preserving data augmentation

which is speciﬁc for FER. The performance of the proposed

method has been validated on the public CK+ databases.

1. Introduction

Facial expression recognition (FER) has received a great

deal of attention during the last decade because it is an

important tool when automatic interactions between humans

and machines, such as in developing hospital nurse robot

assistants, automatic animation, and intelligent tutoring sys-

tems. Despite efforts made in developing various methods

for FER [1], existing approaches traditionally lack gen-

eralizability and ﬂexibility when classify images captured

in wild, therefore present a misleading high-accuracy. So,

recognizing facial expression in real time with high accuracy

is still a challenging problem due to image variations caused

by pose, illumination, age and occlusion.

Recently, convolutional neural network (CNN) has been

successfully used in a wide variety of image classiﬁcation

tasks [2]. However, learning CNNs, amounts to estimating

millions of parameters and requires a very large number of

annotated image samples. This property currently prevents

application of CNNs to FER because the public databases

are limited. So, how to use a small amount of raw data to ef-

fectively expand dataset is a worthy of study. This paper, we

proposed an innovative data augmentation method named

artiﬁcial face to increase the limited database. This method

using expression retargeting as an expression-preserving

data augmentation which is speciﬁc for FER.

Almost all of the CNNs based method employed whole

face region as input and every part of the face is treated

equally no matter if it is relevant to the facial expression.

Studies in psychological showed that facial features of

expressions are located around the subjects mouth, nose

and eyes. In [3], Paul Ekman proposed the Facial Action

Coding System (FACS) which enumerated these regions and

described how every facial expression can be described as

the combination of multiple action units (AUs). Inspired

by the locations of AUs, we proposed an improved facial

expression recognition method based on region of interesting

(ROI) to guide the CNN focus on the areas associated with

the expression. This method can not only augment the train-

ing data, the relationships between the different ROI areas is

helpful to intensify the reliability of the predicted targets. In

test stage, we investigated two recognition methods: identify

the test image directly; implemented decision fusion strategy

on ROI areas. We employed the pre-trained AlexNet ﬁne-

tuned on FER database to evaluation the performance of our

method.

2. Data Preparation

In order to solve the problem of insufﬁcient data, we

proposed an innovative data augmentation method named

artiﬁcial face to increase the limited FER database. Then,

some preprocessing methods were implemented in all im-

ages we generated.

978-1-5386-0563-9/17/$31.00

2017 IEEE

256

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38579899

粉丝: 2
资源: 979

ROI深度卷积神经网络驱动的高效表情识别提升策略

ROI Proposal实现过程总结

基于ROI-KNN卷积神经网络的面部表情识别.pdf

一种基于场景规则与深度卷积神经网络的行人检测方法.pdf

基于卷积神经网络的掌纹识别方法.pdf

连接卷积神经网络人脸表情识别算法.pdf

基于深度卷积神经网络的糖尿病视网膜病变分期及病灶检测.pdf

改进深度学习块卷积神经网络的人脸表情识别.pdf

基于R-FCN深度卷积神经网络的机器人疏果前苹果目标的识别.pdf

基于区域卷积神经网络的车辆检测方法.pdf

基于卷积神经网络的目标检测方法.pdf

最新资源