胶囊网络驱动的NAO机器人面部表情识别

65 浏览量更新于2024-08-26 收藏 235KB PDF 举报

本文介绍了一种基于胶囊网络(Capsule Network)的人形机器人面部表情识别方法，该方法在处理面部表情识别任务时，相比于传统的卷积神经网络(CNN)，能够使用较少的网络层并更快地达到收敛。胶囊网络的核心是将神经元从标量转变为向量，形成胶囊，通过动态路由选择合适的胶囊作为最终输出。实验中，利用经过数据增强处理的Cohn-Kanade Database Plus (CK+)数据库进行训练，并将识别结果应用到NAO机器人上，使机器人能够通过改变眼睛颜色来表达识别出的情绪。正文: 在当前的人工智能领域，深度学习技术已经在图像识别、语音识别等多个方面取得了显著成就。其中，卷积神经网络(CNN)因其在图像处理上的强大能力而备受青睐。然而，CNN存在某些局限性，如对局部特征的捕捉过于依赖，以及丢失物体的结构信息等。为了解决这些问题，Hinton提出了胶囊网络(Capsule Network)。胶囊网络保留了CNN的卷积特性，但将单个神经元升级为胶囊，每个胶囊包含多个维度的信息，可以捕获对象的属性和姿态，更好地保持了对象的完整性。胶囊网络的关键在于动态路由算法，它允许网络自适应地决定哪些下层胶囊应该连接到上层胶囊。这种机制使得网络在没有大量层次的情况下也能有效地学习和区分复杂特征。在面部表情识别任务中，胶囊网络可以更精确地捕捉到人脸的不同表情特征，如眉毛的形状变化、眼睛的开合程度等。为了进一步提升模型的性能，本文采用了反卷积(deconvolution)技术来恢复图像，以优化原始图像和恢复图像之间的误差。反卷积层在胶囊网络中起到上采样的作用，帮助恢复图像的细节，有助于提高识别精度。实验部分，研究者使用了Cohn-Kanade Database Plus (CK+)数据库，这是一个广泛用于面部表情识别研究的数据集，包含了丰富的情感状态。通过数据增强技术，如旋转、缩放和翻转等，增加了数据的多样性，使模型能够更好地泛化到不同条件下的面部表情。最后，实验结果被应用到NAO机器人上，这是一个先进的人形机器人平台。当胶囊网络识别出特定的表情后，NAO机器人会通过改变眼睛颜色的方式来直观地展示识别出的情绪，实现了理论与实践的结合，增强了人机交互的体验。这项研究展示了胶囊网络在面部表情识别领域的潜力，尤其是在减少网络复杂性和提高识别准确性方面。未来的研究可能会进一步探索胶囊网络与其他深度学习架构的融合，以及如何将其应用于更多实际的人机交互场景。

Capsule Network-based Facial Expression Recognition

Method for a Humanoid Robot

Jingru Zhang

Nanfeng Xiao

School of Computer Science and Engineering

South China University of Technology, GuangZhou, CHINA

E-mail: 305109567@qq.com

Abstract. Compared to the classical Convolutional Neural Network(CNN), the

capsule net Hinton put forward can use fewer network layers to achieve the

classification tasks very well and arrive at the convergence with a faster speed.

The principle of the capsule net is based on the CNN, and it is just that the

neuron form is converted from the scalar to the vector, which is a capsule, and

then choose the suitable capsule for the final output through the dynamic

routing method

. In this paper, on the basis of the capsule net, use

deconvolution to restore images and optimize the error between original images

and restored images. The classical facial emotions database named Cohn-

Kanade Database Plus (CK+) that is processed through Data Augmentation is

used to conduct experiments. Lately, the classification results are combined

with the NAO robot. The NAO robot is able to visualize the emotion by

changing its eyes colors and speaking the results, achieving the purpose of

combining theory with practice.

Keywords: Capsule, Convolution Neural Network (CNN), Facial expression,

NAO robot

1 Introduction

With the development of various technologies and theories, more and more work

can be done by robots. Along with the rapid development of artificial intelligence,

robots also become smarter

. Facial recognition will be an integral part of a service

robot if it is to perform functions such as emotional substitution. This function needs

the basis of emotion classification model such as the hidden Markov model, which

can transform the state of emotion from a calm state to the corresponding emotional

state under the stimulus through a series of calculation of transition of probability

matrix

Nowadays, various kinds of CNNs are proliferating, and their abilities are varied

such as Inception v3 and ResNet-50 and so on. The idea of the capsule net is based on

cognitive neurologic and CNN, put forward by Hinton, who wants to realize the

recognition process of human eyes. Hinton noticed an interesting fact that most

current neural anatomy studies support (most of the mammals, especially primates)

that in cerebral cortex there exists micro-column structures called Cortical mini-

下载后可阅读完整内容，剩余6页未读，立即下载

weixin_38571104

粉丝: 3
资源: 944

胶囊网络驱动的NAO机器人面部表情识别

理解胶囊网络 Understanding Capsule Networks

胶囊网络介绍

胶囊网络代码

基于双重定向胶囊网络的低分辨率人脸识别方法

基于胶囊神经网络的车型精细识别研究.pdf

基于胶囊网络的指静脉识别研究

基于胶囊神经网络的交通标志识别算法研究.pdf

网络游戏-一种基于无线网络的胶囊可见光缺陷识别方法.zip

基于胶囊网络的工业互联网入侵检测方法.docx

基于图像暗区自转方位的双半球胶囊机器人导航方法.pdf

最新资源