Automatic segmentation of continuous speech using minimum phase group delay functions
References
Berkhout, 1973, On the minimum length property of one-sided signals, Geophysics, 38, 657, 10.1190/1.1440365
Berkhout, 1974, Related properties of minimum phase and zero phase time functions, Geophys. Prospect., 683, 10.1111/j.1365-2478.1974.tb00111.x
Fisher, W.M., Doddington, G.R., Goudie-Marshall, K.M., 1986. The DARPA speech recognition research database: specifications and status. In: Proc. DARPA Workshop on Speech Recognition. pp. 93–99
Ganapathiraju, 2001, Syllable-based large vocabulary continuous speech recognition, IEEE Trans. Speech, Audio Process., 9, 358, 10.1109/89.917681
Greenberg, 1999, Speaking in shorthand – a syllable-centric perspective for understanding pronunciation variation, Speech Comm., 29, 159, 10.1016/S0167-6393(99)00050-3
Hema A. Murthy, 1992. Algorithms for processing Fourier transform phase of signals. PhD dissertation, Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India
Hema A. Murthy, 1997. The real root cepstrum and its applications to speech processing. In: National Conf. on Communication. pp. 180–183
Hema A. Murthy, 1991, Formant extraction from minimum phase group delay function, Speech Comm., 10, 209, 10.1016/0167-6393(91)90011-H
Leonard, R.G., 1984. A database for speaker independent digit recognition. In: Proc. IEEE Internat. Conf. on Acoust., Speech, and Signal Processing, Vol. 3. pp. 42–45
Mermelstein, 1975, Automatic segmentation of speech into syllabic units, J. Acoust. Soc. Amer., 58, 880, 10.1121/1.380738
Nagarajan, T., Kamakshi Prasad, V., Hema A. Murthy, 2001. Minimum phase signal derived from the magnitude spectrum and its application to speech segmentation. In: 6th Biennial Conf. Proc. on Signal Processing and Communications. IISc, Bangalore, India, pp. 95–101
Nagarajan, 2003, Minimum phase signal derived from root cepstrum, IEE Electron. Lett., 39, 941, 10.1049/el:20030616
Rabiner, 1982, A bootstrapping training technique for obtaining demisyllabic reference patterns, J. Acoust. Soc. Amer., 71, 1588, 10.1121/1.387813
Sargent, 1974, Syllabic detection in continuous speech, J. Acoust. Soc. Amer., 45, 880
van Hemert, 1991, Automatic segmentation of speech, IEEE Trans. Signal Process., 39, 1008, 10.1109/78.80941
Wilpon, J.G., Juang, B.H., Rabiner, L.R., 1987. An investigation on the use of acoustic sub-word units for automatic speech recognition. In: Proc. IEEE Internat. Conf. on Acoust., Speech, and Signal Processing. Dallas, TX, pp. 821–824
Yegnanarayana, 1992, Significance of group delay functions in spectrum estimation, IEEE Trans. Signal Process., 40, 2281, 10.1109/78.157227
Yegnanarayana, 1984, Significance of group delay functions in signal reconstruction from spectral magnitude or phase, IEEE Trans. Acoust., Speech, Signal Process., 32, 610, 10.1109/TASSP.1984.1164365