Emotion recognition and school violence detection from children speech

Tian Han1,2, Jincheng Zhang1, Zhu Zhang2,3, Guobing Sun2,4, Liang Ye2, Hany Ferdinando2,5, Esko Alasaarela2, Tapio Seppänen6, Xiaoyang Yu7, Shuchang Yang1
1Department of Internet of Things Engineering, Harbin University of Science and Technology, Harbin, China
2Optoelectronics and Measurement Technique Unit, University of Oulu, Oulu, Finland
3Department of Communication Engineering, Harbin University of Science and Technology, Harbin, China
4Department of Automation, Heilongjiang University, Harbin, China
5Department of Electrical Engineering, Petra Christian University, Surabaya, Indonesia
6Physiological Signal Analysis Team, University of Oulu, Oulu, Finland
7Department of Measurement-control technology and instrumentation, Harbin University of Science and Technology, Harbin, China

Tóm tắt

School violence is a serious problem all over the world, and violence detection is significant to protect juveniles. School violence can be detected from the biological signals of victims, and emotion recognition is an important way to detect violence events. In this research, a violence simulation experiment was designed and performed for school violence detection system. Emotional voice from the experiment was extracted and analyzed. Consecutive elimination process (CEP) algorithm was proposed for emotion recognition in this paper. After parameters optimization, SVM was chosen as the classifier and the algorithm was validated by Berlin database which is an emotional speech database of adults, and the mean accuracy for seven emotions was 79.05%. The emotional speech database of children extracted in violence simulation was also classified by SVM classifier with proposed CEP algorithm, and the mean accuracy was 66.13%. The results showed that high classification performance could be achieved with the CEP algorithm. The classification result was also compared with database of adults, and the results indicated that children and adults’ voice should be treated differently in speech emotion recognition researches. The accuracy of children database is lower than adult database; the accuracy of violence detection will be improved by other signals in the system.

Tài liệu tham khảo

D. Olweus, School bullying: development and some important challenges. Annu. Rev. Clin. Psychol. 9, 751–780 (2013). https://doi.org/10.1146/annurev-clinpsy-050212-185516 E. Menesini, C. Salmivalli, Bullying in schools: the state of knowledge and effective interventions. Psychol. Health Med. 22, 240–253 (2017). https://doi.org/10.1080/13548506.2017.1279740 J. Wang, R.J. Iannotti, T.R. Nansel, School bullying among adolescents in the United States: physical, verbal, relational, and cyber. J. Adolesc. Health 45(4), 368–375 (2009). https://doi.org/10.1016/j.jadohealth.2009.03.021 T. Vaillancourt, R. Faris, F. Mishna, Cyberbullying in children and youth: implications for health and clinical practice. Can. J. Psychiatry 62(6), 368–373 (2017). https://doi.org/10.1177/0706743716684791 A. Foteini, H. Dimitris, K. Anderson Adam, ECG pattern analysis for emotion detection. IEEE Trans. Affect. Comput. 3(1), 102–115 (2012). https://doi.org/10.1109/T-AFFC.2011.28 L. Ye, H. Ferdinando, T. Seppanen, An Instance-Based Physical Violence Detection Algorithm for School Bullying Prevention (11th IEEE International Wireless Communications and Mobile Computing Conference, Dubrovnik, 2015), pp. 24–25 M.G. Borg, The extent and nature of bullying among primary and secondary schoolchildren. Educ. Res. 41(2), 137–153 (1999). https://doi.org/10.1080/0013188990410202 L.R. Barhight, J.A. Hubbard, C.T. Hyde, Children’s physiological and emotional reactions to witnessing bullying predict bystander intervention. Child Dev. 84(1), 375–380 (2013). https://doi.org/10.1111/j.1467-8624.2012.01839.x M. Giménez Gualdo Ana, C. Hunter Simon, D. Kevin, The emotional impact of cyberbullying: differences in perceptions and experiences as a function of role. Comput. Educ. 82, 228–235 (2015). https://doi.org/10.1016/j.compedu.2014.11.013 M.A. Quiros-Ramirez, T. Onisawa, Considering cross-cultural context in the automatic recognition of emotions. Int. J. Mach. Learn. Cybern. 6(1), 119–127 (2015). https://doi.org/10.1007/s13042-013-0192-2 M. Waseem, M. Ryan, C.B. Foster, Assessment and management of bullied children in the emergency department. Pediatr. Emerg. Care 29(3), 389–398 (2013). https://doi.org/10.1097/PEC.0b013e31828575d7 H. Saarimaki, A. Gotsopoulos, I.P. Jaaskelainen, Discrete neural signatures of basic emotions. Cereb. Cortex 26(6), 2563–2573 (2016). https://doi.org/10.1093/cercor/bhv086 H. Schlosberg, Three dimensions of emotion. Psychol. Rev. 61(2), 81–88 (1954). https://doi.org/10.1037/h0054570 X.D. Wu, V. Kumar, J.R. Quinlan, Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008). https://doi.org/10.1007/s10115-007-0114-2 A. Keskinarkaus, S. Huttunen, A. Siipo, MORE - a multimodal observation and analysis system for social interaction research. Multimed. Tools Appl. 75(1), 6321–6345 (2016). https://doi.org/10.1007/s11042-015-2574-9