Statistical learning of language pronunciation structure
Tóm tắt
This paper presents a new approach to rule based pronunciation generation. The system presented can automatically learn a new language pronunciation structure and use this knowledge for pronunciation generation for an arbitrary context sensitive language. Unlike conventional text-to-speech systems which are based on the cost expensive human expert knowledge about a specific language, this system can learn by using only a set of spellings and pronunciations. The pronunciations can be obtained either from a pronunciation dictionary or from a phonetically labeled database. The system's ability to learn the pronunciation structure for any context sensitive language makes it a valuable tool for development of multilingual speech recognition systems. We present experimental results on automatic generation of pronunciations for English, German, Spanish, French and Italian.
Từ khóa
#Statistical learning #Speech synthesis #Humans #Dictionaries #Databases #Natural languages #Speech recognition #Multimedia communication #Costs #Decision treesTài liệu tham khảo
korkmazskiy, 1999, Generating Alternative Pronunciations from a Dictionary, Proc Euro speech 99
10.1016/S0167-6393(98)00066-1
amdal, 2000, Data-Driven Pronunciation Modelling for Non-Native Speakers Using Association Strength Between Phones, Proc ISCA ITRW ASR2000, 85
korkmazskiy, 1999, Pronunciation Modeling by Genetic Algorithm, Proc of ASRU ', 58
amdal, 2000, Joint Pronunciation Modeling of Non-Native Speakers Using Data-Driven Methods, Proc ICSLP, 622
10.1007/978-1-4613-1367-0_12
gremelie, 1998, In search of pronunciation rules, Proc Modeling Pronunciation Variation for Automatic Speech, 23