Sound event recognition through expectancy-based evaluation ofsignal-driven hypotheses

Pattern Recognition Letters - Tập 31 - Trang 1552-1559 - 2010
J.D. Krijnders1, M.E. Niessen1, T.C. Andringa1
1Artificial Intelligence, University of Groningen, P.O. Box 407, 9700 AK Groningen, The Netherlands

Tài liệu tham khảo

Aucouturier, 2007, The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes …, J. Acoust. Soc. Amer., 122, 881, 10.1121/1.2750160 Ballas, 1987, Interpreting the language of environmental sounds, Environ. Behav., 19, 91, 10.1177/0013916587191005 Barker, 2005, Decoding speech in the presence of other sources, Speech Comm., 45, 5, 10.1016/j.specom.2004.05.002 Bregman, 1990 Cowling, 2003, Comparison of techniques for environmental sound recognition, Pattern Recognition Lett., 24, 2895, 10.1016/S0167-8655(03)00147-8 Crestani, 1997, Application of spreading activation techniques in information retrieval, Artif. Intell. Rev., 11, 453, 10.1023/A:1006569829653 Ellis, 1999, Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures, Speech Comm., 27, 281, 10.1016/S0167-6393(98)00083-1 Griffiths, 2004, What is an auditory object?, Nature Rev. Neurosci., 5, 887, 10.1038/nrn1538 Guastavino, 2007, Categorization of environmental sounds, Can. J. Exp. Psychol., 61, 54, 10.1037/cjep2007006 Irino, 1997, A time-domain, level-dependent auditory filter: The gammachirp, J. Acoust. Soc. Amer., 101, 412, 10.1121/1.417975 McClelland, 1981, An interactive activation model of context effects in letter perception: I. An account of basic findings, Psychol. Rev., 88, 375, 10.1037/0033-295X.88.5.375 Moore, 1996, A revision of Zwicker’s loudness model, Acta Acust. United Acust., 82, 335 Niessen, M.E., Kootstra, G., De Jong, S., Andringa, T.C., 2009. Expectancy-based robot localization through context evaluation. In: Proceedings of the ICAI 2009, Las Vegas, pp. 371–377. Niessen, 2008, Disambiguating sound through context, Internat. J. Semantic Comput., 2, 327, 10.1142/S1793351X08000506 O’Shaughnessy, 2008, Invited paper: Automatic speech recognition: History, methods and challenges, Pattern Recognition, 41, 2965, 10.1016/j.patcog.2008.05.008 Quillian, 1968, Semantic memory, 216 Roman, 2006, Pitch-based monaural segregation of reverberant speech, J. Acoust. Soc. Amer., 120, 458, 10.1121/1.2204590 Salton, 1988, Term-weighting approaches in automatic text retrieval, Inform. Process. Manage., 24, 513, 10.1016/0306-4573(88)90021-0 Schafer, 1977 Shinn-Cunningham, 2008, Object-based auditory and visual attention, Trends Cognit. Sci., 12, 182, 10.1016/j.tics.2008.02.003 van Hengel, P., Andringa, T.C., 2007. Verbal aggression detection in complex social environments. In: Proceedings of AVSS 2007. London, pp. 15–20. Van Maanen, L., Van Rijn, H., Van Grootel, M., Kemna, S., Klomp, M., Scholtens, E., in press. Personal publication assistant: Abstract recommendation by a cognitive model. Cognit. Syst. Res. doi:10.1016/j.cogsys.2008.08.002. Witten, 2005 Zajdel, W., Krijnders, J.D., Andringa, T.C., Gavrila, D., 2007. Cassandra: Audio–video sensor fusion for aggression detection. In: Proceedings of AVSS 2007. London, pp. 200–205.