Video facial emotion recognition based on local enhanced motion history image and CNN-CTSLSTM networks
Tóm tắt
Từ khóa
Tài liệu tham khảo
Ranjan, 2018, Deep learning for understanding faces: machines may be just as good, or better, than humans, IEEE Signal Process Mag., 35, 66, 10.1109/MSP.2017.2764116
W. Liu, Y. Wen, Z. Yu, et al., Sphere face: deep hypersphere embedding for face recognition, arXiv preprint arXiv:1704.08063, 2017.
Liu, 2002, Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition, IEEE Trans. Image Process., 11, 467, 10.1109/TIP.2002.999679
Ahonen, 2006, Face description with local binary patterns: Application to face recognition, IEEE Trans. Pattern Anal. Mach. Intell., 12, 2037, 10.1109/TPAMI.2006.244
Qi, 2014, Pairwise rotation invariant co-occurrence local binary pattern, TPAMI, 36, 2199, 10.1109/TPAMI.2014.2316826
S. Leutenegger, M. Chli, R. Siegwart, BRISK: Binary robust invariant scalable keypoints. In ICCV, pages 2548–2555, 2011.
Yueqi, 2018, Learning Deep Binary Descriptor with Multi-Quantization[J], IEEE Trans. Pattern Anal. Mach. Intell.
Lu, 2015, Learning compact binary face descriptor for face recognition, IEEE Trans. Pattern Anal. Mach. Intell, 37, 2041, 10.1109/TPAMI.2015.2408359
Duan, 2018, Context-aware local binary feature learning for face recognition, IEEE Trans. Pattern Anal. Mach. Intell., 40, 1139, 10.1109/TPAMI.2017.2710183
Lu, 2018, Simultaneous local binary feature learning and encoding for homogeneous and heterogeneous face recognition, IEEE Trans. Pattern Anal. Mach. Intell., 40, 1979, 10.1109/TPAMI.2017.2737538
M. Wang, W. Deng, Deep face recognition: a survey, arXiv preprint arXiv:1804.06655, 2018.
Y. LeCun, F.J. Huang, L. Bottou, Learning methods for generic object recognition with invariance to pose and lighting, in: Computer Vision and Pattern Recognition, 2004, CVPR 2004, Proceedings of the 2004 IEEE Computer Society Conference on. IEEE, 2004, 2: II-104.
Liu, 2016, Video-based emotion recognition using CNN-RNN and C3D hybrid networks, 445
Donahue, 2017, Long-term recurrent convolutional networks for visual recognition and description, IEEE Trans. Pattern Anal. Mach. Intell., 39, 677, 10.1109/TPAMI.2016.2599174
Kim, 2015, Hierarchical committee of deep CNNs with exponentially-weighted decision fusion for static facial expression recognition, 427
G. Pons, D. Masip, Multi-task, multi-label and multi-domain learning with residual convolutional networks for emotion recognition, arXiv preprint arXiv:1802.06664, 2018.
S. Hosseini, S.H. Lee, N.I. Cho, Feeding hand-crafted features for enhancing the performance of convolutional neural networks, arXiv preprint arXiv:1801.07848, 2018.
Koelstra, 2010, A dynamic texture-based approach to recognition of facial actions and their temporal models, IEEE Trans. Pattern Anal. Mach. Intell., 32, 1940, 10.1109/TPAMI.2010.50
Hasani, 2017, Facial expression recognition using enhanced deep 3D convolutional neural networks, 2278
Schuster, 2002, Bidirectional recurrent neural networks, IEEE Trans. Signal Process., 45, 2673, 10.1109/78.650093
Kim, 2017, Discrimination between genuine versus fake emotion using long-short term memory with parametric bias and facial landmarks, 3065
C.Y. Ma, M.H. Chen, Z. Kira, et al., TS-LSTM and temporal-inception: exploiting spatiotemporal dynamics for activity recognition, 2017.
Bergstra, 2012, Random search for hyper-parameter optimization, J. Mach. Learning Res., 13
Davis, 1996, Appearance-based motion recognition of human actions, Massachusetts Institute of Technology
Dhall, 2011, Static facial expression analysis in tough conditions: Data, evaluation protocol and benchmark, 2106
Goodfellow, 2015, Challenges in representation learning: A report on three machine learning contests, Neural Networks, 64, 59, 10.1016/j.neunet.2014.09.005
Pantic, 2005, Web-based database for facial expression analysis, 5
Lucey, 2010, The extended cohn-kanade dataset (CK+): A complete dataset for action unit and emotion-specified expression, 94
Dhall, 2012, Collecting large, richly annotated facial-expression databases from movies, IEEE Multimedia, 19, 34, 10.1109/MMUL.2012.26
Mayer, 2014, Cross-database evaluation for facial expression recognition, Pattern Recognit Image Anal., 24, 124, 10.1134/S1054661814010106
Lee, 2017, Intra-class variation reduction using training expression images for sparse representation based facial expression recognition, IEEE Trans. Affective Comput., 5, 340, 10.1109/TAFFC.2014.2346515
Taheri, 2014, Structure-preserving sparse decomposition for facial expression analysis, IEEE Trans Image Process, 23, 3590, 10.1109/TIP.2014.2331141
Liu, 2014, Deeply learning deformable facial action parts model for dynamic expression analysis, 143
Liu, 2014, Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition, 1749, 10.1007/978-1-4471-6515-6
Shan, 2009, Facial expression recognition based on Local Binary Patterns: A comprehensive study, Image Vis. Comput., 27, 803, 10.1016/j.imavis.2008.08.005
Mohammadi, 2014, PCA-based dictionary building for accurate facial expression recognition via sparse representation, J. Vis. Commun. Image Represent, 25, 1082, 10.1016/j.jvcir.2014.03.006
Fan, 2017, A dynamic framework based on local Zernike moment and motion history image for facial expression recognition, Pattern Recogn., 64, 399, 10.1016/j.patcog.2016.12.002
Yao, 2015, Capturing AU-aware facial features and their latent relations for emotion recognition in the wild, 451
Schmidhuber, 2012, Multi-column deep neural networks for image classification, 3642