Feature-switching: Dynamic feature selection for an i-vector based speaker verification system

Speech Communication - Tập 93 - Trang 53-62 - 2017
Saranya M.S.1, Padmanabhan R.2, Hema A. Murthy1
1Indian Institute of Technology Madras, Chennai, Tamilnadu, India
2Indian Institute of Technology Mandi, Himachal Pradesh, India

Tài liệu tham khảo

Banno, 1998, Efficient representation of short-time phase based on group delay, Vol. 2, 861 Barras, 2003, Feature and score normalization for speaker verification of cellular data, Vol. 2 Battiti, 1994, Using mutual information for selecting features in supervised neural net learning, IEEE Trans. Neural Networks, 5, 537, 10.1109/72.298224 Brummer, N. URL https://sites.google.com/site/nikobrummer/focalbilinear. Brümmer, 2007 Burget, 2009, BUT system for NIST 2008 speaker recognition evaluation, 2335 Burget, 2011, Discriminatively trained probabilistic linear discriminant analysis for speaker verification, 4832 Campbell, 2006, SVM based speaker verification using a GMM supervector kernel and nap variability compensation, Vol. 1 Corporation, 2004 Dehak, 2011, Front-end factor analysis for speaker verification, IEEE Trans. Audio Speech Lang. Process., 19, 788, 10.1109/TASL.2010.2064307 Ellis, 2000, Using mutual information to design feature combinations, 79 Eriksson, 2005, An information-theoretic perspective on feature selection in speaker recognition, IEEE Signal Process. Lett., 12, 500, 10.1109/LSP.2005.849495 Ferrer, 2008, System combination using auxiliary information for speaker verification, 4853 Garcia-Romero, 2011, Analysis of i-vector length normalization in speaker recognition systems, 249 Garcia-Romero, 2011, Analysis of i-vector length normalization in speaker recognition systems, 249 Hatch, 2006, Within-class covariance normalization for SVM-based speaker recognition Hegde, 2007, Significance of the modified group delay feature in speech recognition, IEEE Trans. Audio Speech Lang. Process., 15, 190, 10.1109/TASL.2006.876858 Hegde, 2007, Significance of joint features derived from the modified group delay function in speech processing, EURASIP J. Audio Speech Music Process, 2007, 1, 10.1155/2007/79032 Henderson, 1997, Sammon mapping, Pattern Recognit. Lett., 18, 1307 Hershey, 2007, Approximating the Kullback–Leibler divergence between Gaussian mixture models Kenny, 2007, Joint factor analysis versus eigenchannels in speaker recognition, IEEE Trans. Audio Speech Lang. Process., 15, 1435, 10.1109/TASL.2006.881693 Kinnunen, 2004, Fusion of spectral feature sets for accurate speaker identification Matejka, 2011, Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification, 4828 Md. Jahangir Alam, 2015, Combining amplitude and phase-based features for speaker verification with short duration utterances Mermelstein, 1980, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Trans. Audio Speech Lang. Process., 28, 357, 10.1109/TASSP.1980.1163420 Murthy, 2003, The modified group delay function and its application to phoneme recognition, Vol. 1 Murthy, 2004, The modified group delay feature: a new spectral representation of speech Oppenheim, 2000 Padmanabhan, 2012 Padmanabhan, 2009, Dynamic selection of magnitude and phase based acoustic feature streams for speaker verification, 1244 Plchot, 2013, Developing a speaker identification system for the DARPA rats project, 6768 Rajan, 2010, Acoustic feature diversity and speaker verification Reynolds, 2000, Speaker verification using adapted Gaussian mixture models, Digit. Signal Process., 10, 19, 10.1006/dspr.1999.0361 Romero, 2011, Analysis of i-vector length normalization in speaker recognition systems, 249 Sammon, 1969, A nonlinear mapping for data structure analysis, IEEE Trans. Comput., 18, 401, 10.1109/T-C.1969.222678 The NIST Year, 2010 The NIST Year, 2015 Scheffer, 2011, The SRI NIST 2010 speaker recognition evaluation system, 5292