Automatic segmentation of continuous speech using minimum phase group delay functions

Speech Communication - Volume 42 - Pages 429-446 - 2004
V Kamakshi Prasad1, T Nagarajan1, Hema A Murthy1
1Department of Computer Science and Engineering, Indian Institute of Technology, Madras, IIT Campus, Chennai, Tamil Nadu 600036, India

References

Berkhout, 1973, On the minimum length property of one-sided signals, Geophysics, 38, 657, 10.1190/1.1440365
Berkhout, 1974, Related properties of minimum phase and zero phase time functions, Geophys. Prospect., 683, 10.1111/j.1365-2478.1974.tb00111.x
Fisher, W.M., Doddington, G.R., Goudie-Marshal, K.M., 1986. The DARPA speech recognition research database: specifications and status. In: Proc. DARPA Workshop on Speech Recognition. pp. 93–99
Ganapathiraju, 2001, Syllable-based large vocabulary continuous speech recognition, IEEE Trans. Speech, Audio Process., 9, 358, 10.1109/89.917681
Greenberg, 1999, Speaking in short hand – a syllable-centric perspective for understanding pronunciation variation, Speech Comm., 29, 159, 10.1016/S0167-6393(99)00050-3
Hema A. Murthy, 1992. Algorithms for processing Fourier transform phase of signals. PhD dissertation, Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India
Hema A. Murthy, 1997. The real root cepstrum and its applications to speech processing. In: National Conf. on Communication. 180–183
Hema A. Murthy, 1991, Formant extraction from minimum phase group delay function, Speech Comm., 10, 209, 10.1016/0167-6393(91)90011-H
Leonard, R.G., 1984. A database for speaker independent digit recognition. In: Proc. IEEE Internat. Conf. on Acoust., Speech, and Signal Processing, Vol. 3. pp. 42–45
Mermelstein, 1975, Automatic segmentation of speech into syllabic units, J. Acoust. Soc. Amer., 58, 880, 10.1121/1.380738
Nagarajan, T., Kamakshi Prasad, V., Hema A. Murthy, 2001. Minimum phase signal derived from the magnitude spectrum and its application to speech segmentation. In: 6th Biennial Conf. Proc. on Signal Processing and Communications. IISc, Bangalore, India, pp. 95–101
Nagarajan, 2003, Minimum phase signal derived from root cepstrum, IEE Electron. Lett., 39, 941, 10.1049/el:20030616
Rabiner, 1982, A bootstrapping training technique for obtaining demisyllabic reference patterns, J. Acoust. Soc. Amer., 71, 1588, 10.1121/1.387813
Sargent, 1974, Syllabic detection in continuous speech, J. Acoust. Soc. Amer., 45, 880
van Hemert, 1991, Automatic segmentation of speech, IEEE Trans. Signal Process., 39, 1008, 10.1109/78.80941
Wilpon, J.G., Juang, B.H., Rabiner, L.R., 1987. An Investigation on the use of acoustic sub-word units for automatic speech recognition. In: Proc. of IEEE Internat. Conf. on Acoust., Speech, and Signal Processing. Dallas, TX, pp. 821–824
Yegnanarayana, 1992, Significance of group delay functions in spectrum estimation, IEEE Trans. Signal Process., 40, 2281, 10.1109/78.157227
Yegnanarayana, 1984, Significance of group delay functions in signal reconstruction from spectral magnitude or phase, IEEE Trans. Acoust., Speech, Signal Process., 32, 610, 10.1109/TASSP.1984.1164365