Automatic evaluation methods of a speech translation system's capability
Abstract
This paper proposes automatic schemes for the translation paired comparison method, which the authors previously proposed to evaluate a speech translation system's capability precisely. In that method, the outputs of the speech translation system are subjectively compared with the translation results of native Japanese speakers who have taken the Test of English for International Communication (TOEIC), whose score is used as a measure of a person's speech translation capability. Experiments are conducted on TDMT, a subsystem of the Japanese-to-English speech translation system ATR-MATRIX developed at ATR Interpreting Telecommunications Research Laboratories. The winning rate of TDMT correlates well with the examinees' TOEIC scores, and a regression analysis of the subjective results shows that the translation capability of TDMT matches that of a person scoring around 700 on the TOEIC. The automatic evaluation methods use a DP-based similarity, which is calculated from the DP distances between a translation output and multiple translation answers. The answers are collected in two ways: by paraphrasing and by querying a parallel corpus. With both collection methods, the similarity correlates with the examinees' TOEIC scores as well as the subjective winning rate does. Regression analysis using the similarity places the system's matched point at around 750. We also show the effects of the paraphrased data.
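As a rough illustration of the DP-based similarity mentioned in the abstract, the sketch below scores one translation output against several answers. The abstract does not give the exact formula, so the details here are assumptions: the DP distance is taken to be a word-level edit distance with unit costs, the similarity is that distance normalized by the answer length and clipped at zero, and the best score over all answers is kept. The function names `dp_distance` and `dp_similarity` are hypothetical.

```python
from typing import Sequence


def dp_distance(hyp: Sequence[str], ref: Sequence[str]) -> int:
    """Word-level edit distance computed by dynamic programming.

    Assumption: substitutions, insertions, and deletions each cost 1.
    """
    # prev[j] holds the distance between the hyp words seen so far and ref[:j].
    prev = list(range(len(ref) + 1))
    for i, h in enumerate(hyp, start=1):
        curr = [i]
        for j, r in enumerate(ref, start=1):
            cost = 0 if h == r else 1
            curr.append(min(prev[j] + 1,          # drop a hyp word
                            curr[j - 1] + 1,      # add a ref word
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def dp_similarity(output: str, answers: Sequence[str]) -> float:
    """Best normalized similarity between one output and multiple answers.

    Assumption: similarity = 1 - distance / len(answer), clipped at 0,
    maximized over the available answers.
    """
    hyp = output.lower().split()
    best = 0.0
    for answer in answers:
        ref = answer.lower().split()
        if not ref:
            continue  # skip empty answers to avoid dividing by zero
        sim = max(0.0, 1.0 - dp_distance(hyp, ref) / len(ref))
        best = max(best, sim)
    return best


if __name__ == "__main__":
    answers = ["where is the train station", "where can i find the station"]
    print(dp_similarity("where is the station", answers))
```

Taking the maximum over answers reflects the idea that an output should be credited if it is close to any acceptable translation, which is why collecting more answers (by paraphrasing or from a parallel corpus) can change the score.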
Keywords
#Speech analysis #Natural languages #System testing #Regression analysis #Humans #Laboratories #Performance evaluation #Costs #Text recognition