Column of CCF NCCA 2020

Fully Expression Frame Localization and Recognition Based on Dynamic Face Image Sequences

Expand
  • 1. Institute of Artificial Intelligence Application, Central South University of Forestry and Technology, Changsha 410004, Hunan, China;
    2. Hunan Key Laboratory of Intelligent Logistics Technology, Central South University of Forestry and Technology, Changsha 410004, Hunan, China;
    3. School of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha 410004, Hunan, China

Received date: 2020-08-20

  Online published: 2021-06-08

Abstract

Considering that the evolution of facial expressions is a continuous process, compared to static images, dynamic image sequences are more suitable as the research objects for facial expression recognition. This paper proposes a sequence frame positioning model based on embedding network. The pre-trained Inception ResNet v1 network extracts the feature vectors of each frame, and then calculates the Euclidean distance between the feature vectors to position the complete frame with the maximum expression intensity, so a standardized facial expression sequences are obtained. In order to further verify the accuracy of the positioning model, we adopt VGG16 network and ResNet50 network to perform facial expression recognition on the positioned complete frame, respectively. Experiments were conducted on the CK+ and MMI facial expression databases. The average accuracy of the sequential frame positioning model proposed in this paper reached 98.31% and 98.08%, respectively. As using the VGG16 network and ResNet50 network to perform expression recognition on the positioned complete frame, the recognition accuracies on the two databases reached 96.32% and 96.5%, 87.23% and 87.88%, respectively. These experimental results show that the proposed model can pick up the complete frame from the facial expression sequence accurately and achieve better performance on facial expression recognition as well.

Cite this article

SIMA Yi, YI Jizheng, CHEN Aibin, ZHOU Mengna . Fully Expression Frame Localization and Recognition Based on Dynamic Face Image Sequences[J]. Journal of Applied Sciences, 2021 , 39(3) : 357 -356 . DOI: 10.3969/j.issn.0255-8297.2021.03.002

References

[1] Sirai E A A, Aran L R, Wong F. A Review of methods in speech and facial expressions recognition for human-computer interaction[J]. Advanced Science Letters, 2017, 23(10): 10236-10240.
[2] Aung N, Tewogbola P. The impact of emotional labor on the health in the workplace: a narrative review of literature from 2013–2018[J]. AIMS public health, 2019, 6(3): 268. DOI: 10.3934/publichealth.2019.3.268
[3] Assari M A, Rahmati M. Driver drowsiness detection using face expression recognition[C]//2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA). IEEE, 2011: 337-341.
[4] 张金刚, 方圆, 袁豪, 等. 一种识别表情序列的卷积神经网络[J]. 西安电子科技大学学报, 2018, 45(1): 150-155. Zhang J G, Fang Y, Yuan H, et al. Multiple convolutional neural networks for facial expression sequence recognition[J]. Journal of Xidian University, 2018, 45(1): 150-155. (in Chinese)
[5] 王素琴, 张峰, 高宇豆, 等. 基于图像序列的学习表情识别[J]. 系统仿真学报, 2020(7): 1322-1330. Wang S Q, Zhang F, Gao Y D, et al. Learning expression recognition based on image sequence[J]. Journal of System Simulation, 2020(7): 1322-1330. (in Chinese)
[6] 王晓华, 潘丽娟, 彭穆子, 等. 基于层级注意力模型的视频序列表情识别[J]. 计算机辅助设计与图形学学报, 2020, 32(1): 27-35. Wang X H, Pan L J, Peng M Z, et al. Video emotion recognition based on hierarchical attention model[J]. Journal of Computer-Aided Design & Computer Graphics, 2020, 32(1): 27-35. (in Chinese)
[7] Yu M, Zheng H, Peng Z, et al. Facial expression recognition based on a multi-task global-local network[J]. Pattern Recognition Letters, 2020, 131: 166-171.
[8] Qiu Y, Zhao J, Wang Y. Facial expression recognition using temporal relations among facial movements[J]. Acta Electronica Sinica, 2016, 44(6): 1307-1313.
[9] 邵洁, 董楠. RGB-D动态序列的人脸自然表情识别[J]. 计算机辅助设计与图形学学报, 2015, 27(5): 847-854. Shao J, Dong N. Spontaneous facial expression recognition based on RGB-D dynamic sequences[J]. Journal of Computer-Aided Design & Computer Graphics, 2015, 27(5): 847-854. (in Chinese)
[10] Yi J, Sima Y, Zhou M, et al. Facial expression sequence interception based on feature point movement[C]//Proceedings of the 2019 IEEE 11th International Conference on Advanced Infocomm Technology (ICAIT). IEEE, Jinan, China, 2019: 58-62.
[11] Yi J, Chen A, Cai Z, et al. Facial expression recognition of intercepted video sequences based on feature point movement trend and feature block texture variation[J]. Applied Soft Computing, 2019, 82: 105540.
[12] Parkhi O M, Vedaldi A, Zisserman A. Deep face recognition[C]//British Machine Vision Conference (BMVC), 2015, 1: 6.
[13] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778.
[14] Chen J, Lü Y, Xu R, et al. Automatic social signal analysis: facial expression recognition using difference convolution neural network[J]. Journal of Parallel and Distributed Computing, 2019, 131: 97-102.
[15] Cai J, Meng Z, Khan A S, et al. Island loss for learning discriminative features in facial expression recognition[C]//2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018). IEEE, 2018: 302-309.
Outlines

/