Segment Inventories for Speech Synthesis

Language and Speech - Volume 4, Issue 1 - Pages 27-90 - 1961
Eva Sivertsen
University of Michigan and Norges Lærerhøgskole, Trondheim

Abstract

Speech synthesis may be based on a segmentation of the speech continuum either into simultaneous components or into successive time segments. The time segments may be of varying size and type: phonemes, phoneme dyads, syllable nuclei and margins, half-syllables, syllables, syllable dyads, and words. In order to obtain an estimate of the size of the segment inventory for each type of segment, a phonological study was made of the particular phoneme sequences which occur in English, particularly in relation to the immediate constituents of the syllable (nucleus and margin) and to the syllable. An estimate was also made of the number of prosodic conditions required for each type of phoneme sequence. It was found that in general there is a direct relationship between the length of the segment and the size of the inventory. However, when the borders of the proposed segments do not coincide with the borders of linguistic units, the inventory has to be relatively large. The value of using the various types of segment for speech synthesis is discussed, both for basic research on speech and for practical application to a communication system with high intelligibility.
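
The relationship between segment length and inventory size can be illustrated with a minimal sketch. It assumes a tiny made-up corpus and treats a phoneme dyad simply as an adjacent pair of phonemes within a word. The corpus, the phoneme symbols, and the function names are hypothetical and are not taken from the paper, and the prosodic conditions mentioned in the abstract are ignored.

```python
# Hypothetical toy corpus: each word is a list of phoneme symbols.
# Words and symbols are illustrative only, not data from the paper.
TOY_CORPUS = {
    "cat": ["k", "ae", "t"],
    "tack": ["t", "ae", "k"],
    "act": ["ae", "k", "t"],
    "at": ["ae", "t"],
}


def phoneme_inventory(words):
    """Return the set of distinct phonemes occurring in the words."""
    return {phoneme for word in words for phoneme in word}


def dyad_inventory(words):
    """Return the set of distinct adjacent phoneme pairs within words.

    Assumption: a dyad is modelled as an ordered pair of neighbouring
    phonemes; pairs spanning word boundaries are not counted.
    """
    return {(a, b) for word in words for a, b in zip(word, word[1:])}


phonemes = phoneme_inventory(TOY_CORPUS.values())
dyads = dyad_inventory(TOY_CORPUS.values())

print(len(phonemes), "phonemes:", sorted(phonemes))
print(len(dyads), "dyads:", sorted(dyads))
```

Even in this toy case the dyad inventory is larger than the phoneme inventory, which is the direction of the trend the abstract reports for longer segments. A realistic estimate would need a full phonological description of the permitted sequences, as in the study itself.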
