Neural networks for discrimination and modelization of speakers

Speech Communication - Volume 17 - Pages 159-175 - 1995
Younès Bennani (1), Patrick Gallinari (2)
(1) LIPN, CNRS-URA 1507, University of Paris XIII, Paris, France
(2) LAFORIA, CNRS-URA 1095, University of Paris VI, Paris, France

References

Artieres, 1991, Connectionist and conventional models for free-text talker identification tasks, Neuro-Nimes, France, 1991
Artieres, 1993, Neural models for extracting speaker characteristics in speech modelization systems
Artieres, 1995, Multi-state predictive neural networks for text-independent speaker recognition, Eurospeech 95, 10.21437/Eurospeech.1995-106
Bengio, 1992, Global optimization of a neural network hidden Markov model hybrid, IEEE Trans. Neural Networks, Vol. 3, 252, 10.1109/72.125866
Bengio, 1992, Learning the dynamic nature of speech with back propagation for sequences, Pattern Recognition Letters, Vol. 13, 375, 10.1016/0167-8655(92)90035-X
Bengio, 1991, Global optimization of a neural network-hidden Markov model hybrid, 10.1109/IJCNN.1991.155435
Bennani, 1992, Speaker identification through a modular connectionist architecture: Evaluation on the TIMIT database, 607
Bennani, 1992, Text-independent talker identification system combining connectionist and conventional models
Bennani, 1993, Probabilistic cooperation of connectionist expert modules: Validation on a speaker identification task
Bennani, 1994, Multi-modular and hybrid connectionist approach for pattern recognition, Internat. J. Neural Systems, 10.1142/S0129065794000220
Bennani, 1991, On the use of TDNN extracted features information in talker identification, 385
Bennani, 1992, Task decomposition through a modular connectionist architecture: A talker identification system, IEEE ICANN 1992
Bennani, 1990, A connectionist approach for speaker identification, 265
Bottou, 1992, Local learning algorithms, Neural Computation, Vol. 4, 888, 10.1162/neco.1992.4.6.888
Bourlard, 1990, How connectionist models could improve Markov models for speech recognition
Bourlard, 1992, CDNN: A context dependent neural network for continuous speech recognition, 349
Bridle, 1990, Alpha-nets: A recurrent ‘neural’ network architecture with a Hidden Markov Model interpretation, Speech Communication, Vol. 9, 83, 10.1016/0167-6393(90)90049-F
Carey, 1991, A speaker verification system using alpha-nets, 397
Devillers, 1992, Incorporating acoustic-phonetic knowledge in hybrid TDNN/Viterbi framework, 421
Driancourt, 1991, Multi layer perceptron, learning vector quantization and dynamic programming: Comparison and cooperation, IEEE-IJCNN 1991
Driancourt, 1990, TDNN-extracted features, Neuro-Nimes 1990
Driancourt, 1992, A speech recognizer optimally combining learning vector quantization, dynamic programming and multi-layer perceptron
Farrell, 1994, Speaker recognition using neural networks and conventional classifiers, IEEE Trans. Speech Audio Process., Vol. 2, 194, 10.1109/89.260362
Franzini, 1990, Connectionist Viterbi training: A new hybrid method for continuous speech recognition, 425
Fisher, 1987, An acoustic-phonetic data base, J. Acoust. Soc. Amer. Suppl.(A), Vol. 81, S92, 10.1121/1.2034854
Gallinari, 1992, Hybrid systems for speech processing, 19
Gallinari, 1991, On the relations between discriminant analysis and multi-layer perceptrons, Neural Networks, Vol. 4, 349, 10.1016/0893-6080(91)90071-C
Gish, 1990, A probabilistic approach to the understanding and training of neural network classifiers, 1361
Haffner, 1992, Multi-state time delay neural networks for continuous speech recognition, NIPS, 135
Haffner, 1991, Integrating time alignment and neural networks for high performance continuous speech recognition
Hampshire, 1990, The meta-pi network: Connectionist rapid adaptation for high-performance multi-speaker phoneme recognition, 165
Hampshire, 1989, Connectionist architectures for multi-speaker phoneme recognition
Hattori, 1992, Text-independent speaker recognition using neural networks, Vol. II, 153
Hertz, 1991
Hornik, 1989, Multilayer feedforward networks are universal approximators, Neural Networks, Vol. 2, 359, 10.1016/0893-6080(89)90020-8
Iso, 1990, Speaker independent word recognition using a neural prediction model
Iso, 1991, Large vocabulary speech recognition using neural prediction model, 57
Jacobs, 1993, Learning piecewise control strategies in a modular neural network architecture, IEEE Trans. Systems Man Cybernet, Vol. 23, 337, 10.1109/21.229447
Jacobs, 1991, Adaptive mixtures of local experts, Neural Computation, Vol. 3, 79, 10.1162/neco.1991.3.1.79
Kohonen, 1989
Lang, 1988, The development of the time delay neural network architecture for speech recognition
Levin, 1990, Word recognition using Hidden Control Neural Architecture, 433
Levin, 1990, Modelling time varying systems using hidden control neural architecture, NIPS, Vol. 3, 147
Mellouk, 1993, A discriminative neural prediction system for speech recognition
Morgan, 1990, Continuous speech recognition using multilayer perceptrons with hidden Markov models, 413
Niles, 1990, Combining hidden Markov models and neural network classifiers, 417
Naik, 1994, A hybrid HMM-MLP speaker verification algorithm for telephone speech, Vol. I, 153
Niranjan, 1990, Neural networks and radial basis functions in classifying static speech patterns, Computer Speech and Language, Vol. 4, 275, 10.1016/0885-2308(90)90009-U
Oglesby, 1990, Optimisation of neural models for speaker identification, 261
Oglesby, 1991, Radial basis function networks for speaker recognition, 393
Poggio, 1990, Regularisation algorithms for learning that are equivalent to multilayer networks, Science, Vol. 247, 978, 10.1126/science.247.4945.978
Renals, 1992, Connectionist probability estimation in the DECIPHER speech recognition system, 601
Robinson, 1992, A real time recurrent error back propagation network word recognition system, 617
Richard, 1991, Neural network classifiers estimate Bayesian a posteriori probabilities, Neural Computation, Vol. 3, 461, 10.1162/neco.1991.3.4.461
Rudasi, 1991, Text-independent talker identification with neural networks, 389
Rumelhart, 1986, Learning internal representations by error propagation, Vol. 1
Sorensen, 1993, Pi-sigma and hidden control based self-structuring models for text-independent speaker recognition, 537
Tebelskis, 1991, Continuous speech recognition using linked predictive neural networks, 61
Tsoi, 1994, Locally recurrent globally feedforward networks: A critical review of architectures, IEEE Trans. Neural Networks
Waibel, 1987, Phoneme recognition using time-delay neural networks
White, 1990, Learning in artificial neural networks: A statistical perspective, Neural Computation, Vol. 1, 425, 10.1162/neco.1989.1.4.425