Multi-band automatic speech recognition

Computer Speech & Language - Volume 15 - Pages 151-174 - 2001
Christophe Cerisara, Dominique Fohr
LORIA, UMR 7503, Campus Scientifique, BP 239, 54506 Vandoeuvre-les-Nancy Cedex, France
