Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis

Speech Communication - Volume 46 - Pages 405-417 - 2005
Takeshi Saitou1, Masashi Unoki1, Masato Akagi1
1School of Information Science, Japan Advanced Institute of Science and Technology (JAIST), 1-1 Asahidai, Nomi, Ishikawa 923-1292, Japan

References

Akagi, M., Kitakaze, H., 2000. Perception of synthesized singing-voices with fine-fluctuations in their fundamental frequency fluctuations. In: Proc. ICSLP2000, vol. 3, pp. 458–461.
Akagi, M., Iwaki, M., Minakawa, T., 1998. Fundamental frequency fluctuation in continuous vowel utterance and its perception. In: Proc. ICSLP98, Sydney, vol. 4, pp. 1519–1522.
de Cheveigne, A., Kawahara, H., 2001. Comparative evaluation of F0 estimation algorithms. In: Proc. Eurospeech2001, pp. 2451–2454.
de Krom, G., Bloothooft, G., 1995. Timing and accuracy of fundamental frequency changes in singing. In: Proc. ICPhS’95, vol. 1, pp. 206–209.
Fujisaki, 1981, Analysis control in singing
Fujisaki, H., Ohno, S., Narusawa, S., 2000. Physiological mechanisms and biomechanical modeling of fundamental frequency control for the common Japanese and the standard Chinese. In: Proc. 5th Seminar on Speech Production 2000, pp. 145–148.
Hakes, 1987, Acoustic characteristics of vocal oscillations: vibrato, exaggerated vibrato, trill, and trillo, J. Voice, 1, 326, 10.1016/S0892-1997(88)80006-7
Horii, 1989, Acoustic analysis of vocal vibrato: a theoretical interpretation of data, J. Voice, 3, 151, 10.1016/S0892-1997(89)80120-1
Ishizaka, 1972, Synthesis of voiced sounds from two-mass model of the vocal cords, Bell System Tech. J., 51, 1233, 10.1002/j.1538-7305.1972.tb02651.x
Kawahara, H., Katayose, A., Patterson, R.D., de Cheveigne, A., 1999a. Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity. In: Proc. Eurospeech’99, pp. 2781–2784.
Kawahara, 1999, Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency based on F0 extraction: possible role of a repetitive structure in sounds, Speech Comm., 27, 187, 10.1016/S0167-6393(98)00085-5
Klatt, 1980, Software for a cascade/parallel formant synthesizer, J. Acoust. Soc. Amer., 67, 971, 10.1121/1.383940
Mori, H., Odagiri, W., Kasuya, H., 2004. Transition characteristics of fundamental frequency in singing. In: Proc. ICA2004, Mo 5. C1.3, pp. I499–I500.
Moriyama, T., Ogawa, H., Tenpaku, S., 1996. A new control model based on rising and falling fundamental frequency. In: Proc. ASA and ASJ Third Joint Meeting, pp. 1171–1176.
Myers, 1987, Vibrato and pitch transitions, J. Voice, 1, 157, 10.1016/S0892-1997(87)80039-5
Nakayama, I., 2004. Comparative studies on vocal expression in Japanese traditional and western classical-style singing, using a common verse. In: Proc. ICA2004, Mo4. C1.1, pp. I295–I296.
Nakayama, 1996, Singing voice: charm and troubles on voice quality, J. Acoust. Soc. Jpn., 52, 383
Press, 1988
Seashore, 1938
Sundberg, 1979, Maximum speed of pitch changes in singers and untrained subjects, J. Phonetics, 7, 71, 10.1016/S0095-4470(19)31040-X
Sundberg, 1987