Global statistical features-based approach for Acoustic Event Detection

Applied Acoustics - Tập 139 - Trang 113-118 - 2018
S.L. Jayalakshmi1, S. Chandrakala2, R. Nedunchelian3
1Department of CSE, Velammal Engineering College, Chennai, Tamil Nadu, India
2School of Computing, SASTRA University, Thanjavur, Tamil Nadu, India
3Department of CSE, Sri Venkateswara College of Engineering, Pennalur, Sriperumbudur, India

Tài liệu tham khảo

Bengio, 2013, Representation learning: a review and new perspectives, IEEE Trans Pattern Anal Mach Intell, 35, 1798, 10.1109/TPAMI.2013.50 Temko, 2009, Acoustic event detection and classification, Comput Hum Interact Loop, 61, 10.1007/978-1-84882-054-8_7 Crocco, 2016, Audio surveillance: a systematic review, ACM Comput Surv (CSUR), 48, 52, 10.1145/2871183 Atrey, 2006, Audio based event detection for multimedia surveillance, vol. 5 Rabaoui, 2008, Using one-class SVMS and wavelets for audio surveillance, IEEE Trans Inf Forensics Secur, 3, 763, 10.1109/TIFS.2008.2008216 Lozano H, Hernáez I, Picón A, Camarena J, Navas E. Audio classification techniques in home environments for elderly/dependant people. In: International conference on computers for handicapped persons; 2010. p. 320–3. Cheffena, 2016, Fall detection using smartphone audio features, IEEE J Biomed Health Inf, 20, 1073, 10.1109/JBHI.2015.2425932 Moeslund TB, Javed O, Jiang Y-G, Manmatha R. Special issue on multimedia event detection; 2014. Malik, 2013, Acoustic environment identification and its applications to audio forensics, IEEE Trans Inf Forensics Secur, 8, 1827, 10.1109/TIFS.2013.2280888 McLoughlin, 2015, Robust sound event classification using deep neural networks, IEEE/ACM Trans Audio Speech Langu Process, 23, 540, 10.1109/TASLP.2015.2389618 Beltrán, 2015, Scalable identification of mixed environmental sounds, recorded from heterogeneous sources, Pattern Recogn Lett, 68, 153, 10.1016/j.patrec.2015.08.027 Piczak KJ. ESC: dataset for environmental sound classification. In: Proceedings of the 23rd ACM international conference on multimedia; 2015. p. 1015–8. Mesaros A, Heittola T, Virtanen T. Tut database for acoustic scene classification and sound event detection. In: 2016 24th European signal processing conference (EUSIPCO); 2016. p. 1128–32. Chachada, 2013, Environmental sound recognition: a survey, 1 El Ayadi, 2011, Survey on speech emotion recognition: features, classification schemes, and databases, Pattern Recogn, 44, 572, 10.1016/j.patcog.2010.09.020 Stowell, 2015, Detection and classification of acoustic scenes and events, IEEE Trans Multimedia, 17, 1733, 10.1109/TMM.2015.2428998 Dufaux A, Besacier L, Ansorge M, Pellandini F. Automatic sound detection and recognition for noisy environment. In: 2000 10th European signal processing conference; 2000. p. 1–4. Ntalampiras, 2011, Probabilistic novelty detection for acoustic surveillance under real-world conditions, IEEE Trans Multimedia, 13, 713, 10.1109/TMM.2011.2122247 Mesaros A, Heittola T, Eronen A, Virtanen T. Acoustic event detection in real life recordings. In: 2010 18th European signal processing conference; 2010. p. 1267–71. Plinge A, Grzeszick R, Fink GA. A bag-of-features approach to acoustic event detection. In: IEEE international conference on acoustics, speech and signal processing (ICASSP); 2014. p. 3704–8. Gemmeke, 2013, An exemplar-based NMF approach to audio event detection, 1 Lu, 2014, Sparse representation based on a bag of spectral exemplars for acoustic event detection, 6255 Shuyang Z, Heittola T, Virtanen T. Active learning for sound event classification by clustering unlabeled data. In: 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP); 2017. p. 751–5. Hinton, 2012, Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups, IEEE Signal Process Mag, 29, 82, 10.1109/MSP.2012.2205597 Salamon J, Jacoby C, Bello JP. A dataset and taxonomy for urban sound research. In: Proceedings of the 22nd ACM international conference on multimedia; 2014. p. 1041–4. Eronen, 2006, Audio-based context recognition, IEEE Trans Audio Speech Lang Process, 14, 321, 10.1109/TSA.2005.854103 Chang, 2011, Libsvm: a library for support vector machines, ACM Trans Intell Syst Technol (TIST), 2, 27 Kuccukbay SE, Sert M. Audio-based event detection in office live environments using optimized MFCC-SVM approach. In: IEEE international conference on, semantic computing (ICSC); 2015. p. 475–80.