Video facial emotion recognition based on local enhanced motion history image and CNN-CTSLSTM networks

Min Hu1,2, Haowen Wang1,2, Xiaohua Wang1,2, Juan Yang2, Ronggui Wang2
1Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, 230009 Hefei, China
2School of Computer and Information of Hefei University of Technology, China

Tóm tắt

Từ khóa


Tài liệu tham khảo

Ranjan, 2018, Deep learning for understanding faces: machines may be just as good, or better, than humans, IEEE Signal Process Mag., 35, 66, 10.1109/MSP.2017.2764116

W. Liu, Y. Wen, Z. Yu, et al., Sphere face: deep hypersphere embedding for face recognition, arXiv preprint arXiv:1704.08063, 2017.

Liu, 2002, Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition, IEEE Trans. Image Process., 11, 467, 10.1109/TIP.2002.999679

Ahonen, 2006, Face description with local binary patterns: Application to face recognition, IEEE Trans. Pattern Anal. Mach. Intell., 12, 2037, 10.1109/TPAMI.2006.244

Qi, 2014, Pairwise rotation invariant co-occurrence local binary pattern, TPAMI, 36, 2199, 10.1109/TPAMI.2014.2316826

S. Leutenegger, M. Chli, R. Siegwart, BRISK: Binary robust invariant scalable keypoints. In ICCV, pages 2548–2555, 2011.

Yueqi, 2018, Learning Deep Binary Descriptor with Multi-Quantization[J], IEEE Trans. Pattern Anal. Mach. Intell.

Lu, 2015, Learning compact binary face descriptor for face recognition, IEEE Trans. Pattern Anal. Mach. Intell, 37, 2041, 10.1109/TPAMI.2015.2408359

Duan, 2018, Context-aware local binary feature learning for face recognition, IEEE Trans. Pattern Anal. Mach. Intell., 40, 1139, 10.1109/TPAMI.2017.2710183

Lu, 2018, Simultaneous local binary feature learning and encoding for homogeneous and heterogeneous face recognition, IEEE Trans. Pattern Anal. Mach. Intell., 40, 1979, 10.1109/TPAMI.2017.2737538

M. Wang, W. Deng, Deep face recognition: a survey, arXiv preprint arXiv:1804.06655, 2018.

Y. LeCun, F.J. Huang, L. Bottou, Learning methods for generic object recognition with invariance to pose and lighting, in: Computer Vision and Pattern Recognition, 2004, CVPR 2004, Proceedings of the 2004 IEEE Computer Society Conference on. IEEE, 2004, 2: II-104.

Liu, 2016, Video-based emotion recognition using CNN-RNN and C3D hybrid networks, 445

Donahue, 2017, Long-term recurrent convolutional networks for visual recognition and description, IEEE Trans. Pattern Anal. Mach. Intell., 39, 677, 10.1109/TPAMI.2016.2599174

Kim, 2015, Hierarchical committee of deep CNNs with exponentially-weighted decision fusion for static facial expression recognition, 427

G. Pons, D. Masip, Multi-task, multi-label and multi-domain learning with residual convolutional networks for emotion recognition, arXiv preprint arXiv:1802.06664, 2018.

S. Hosseini, S.H. Lee, N.I. Cho, Feeding hand-crafted features for enhancing the performance of convolutional neural networks, arXiv preprint arXiv:1801.07848, 2018.

Koelstra, 2010, A dynamic texture-based approach to recognition of facial actions and their temporal models, IEEE Trans. Pattern Anal. Mach. Intell., 32, 1940, 10.1109/TPAMI.2010.50

Hasani, 2017, Facial expression recognition using enhanced deep 3D convolutional neural networks, 2278

Schuster, 2002, Bidirectional recurrent neural networks, IEEE Trans. Signal Process., 45, 2673, 10.1109/78.650093

Kim, 2017, Discrimination between genuine versus fake emotion using long-short term memory with parametric bias and facial landmarks, 3065

C.Y. Ma, M.H. Chen, Z. Kira, et al., TS-LSTM and temporal-inception: exploiting spatiotemporal dynamics for activity recognition, 2017.

Bergstra, 2012, Random search for hyper-parameter optimization, J. Mach. Learning Res., 13

Davis, 1996, Appearance-based motion recognition of human actions, Massachusetts Institute of Technology

Dhall, 2011, Static facial expression analysis in tough conditions: Data, evaluation protocol and benchmark, 2106

Goodfellow, 2015, Challenges in representation learning: A report on three machine learning contests, Neural Networks, 64, 59, 10.1016/j.neunet.2014.09.005

Pantic, 2005, Web-based database for facial expression analysis, 5

Lucey, 2010, The extended cohn-kanade dataset (CK+): A complete dataset for action unit and emotion-specified expression, 94

Dhall, 2012, Collecting large, richly annotated facial-expression databases from movies, IEEE Multimedia, 19, 34, 10.1109/MMUL.2012.26

Mayer, 2014, Cross-database evaluation for facial expression recognition, Pattern Recognit Image Anal., 24, 124, 10.1134/S1054661814010106

Lee, 2017, Intra-class variation reduction using training expression images for sparse representation based facial expression recognition, IEEE Trans. Affective Comput., 5, 340, 10.1109/TAFFC.2014.2346515

Taheri, 2014, Structure-preserving sparse decomposition for facial expression analysis, IEEE Trans Image Process, 23, 3590, 10.1109/TIP.2014.2331141

Liu, 2014, Deeply learning deformable facial action parts model for dynamic expression analysis, 143

Liu, 2014, Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition, 1749, 10.1007/978-1-4471-6515-6

Shan, 2009, Facial expression recognition based on Local Binary Patterns: A comprehensive study, Image Vis. Comput., 27, 803, 10.1016/j.imavis.2008.08.005

Mohammadi, 2014, PCA-based dictionary building for accurate facial expression recognition via sparse representation, J. Vis. Commun. Image Represent, 25, 1082, 10.1016/j.jvcir.2014.03.006

Fan, 2017, A dynamic framework based on local Zernike moment and motion history image for facial expression recognition, Pattern Recogn., 64, 399, 10.1016/j.patcog.2016.12.002

Yao, 2015, Capturing AU-aware facial features and their latent relations for emotion recognition in the wild, 451

Schmidhuber, 2012, Multi-column deep neural networks for image classification, 3642