Automatic evaluation methods of a speech translation system's capability

F. Sugaya1, K. Yasuda2,1, T. Takezawa1, S. Yamamoto1
1ATR Spoken Language Translation Research Laboratories, Soraku-gun, Kyoto, Japan
2Graduate School of Engineering, Doshisha University, Kyotanabe, Kyoto, Japan

Tóm tắt

The main goal of the paper is to propose automatic schemes for the translation paired comparison method, which was proposed by the authors to evaluate precisely a speech translation system's capability. In the method, the outputs of the speech translation system are subjectively compared with the results of native Japanese taking the Test of English for International Communication (TOEIC), which is used as a measure of a person's speech translation capability. Experiments are conducted on TDMT, which is a subsystem of the Japanese-to-English speech translation system ATR-MATRIX developed at ATR Interpreting Telecommunications Research Laboratories. The winning rate of TDMT shows a good correlation with the TOEIC scores of the examinees. A regression analysis on the subjective results shows that the translation capability of TDMT matches a person scoring around 700 on the TOEIC. The automatic evaluation methods use DP-based similarity, which is calculated by DP distances between a translation output and multiple translation answers. The answers are collected by two methods: paraphrasing and query from a parallel corpus. In both types of collection, the similarity shows the same good correlation with the TOEIC scores of the examinees as the subjective winning rate. Regression analysis using similarity shows that the system's matched point is around 750. We also show effects of paraphrased data.

Từ khóa

#Speech analysis #Natural languages #System testing #Regression analysis #Humans #Laboratories #Performance evaluation #Costs #Text recognition

Tài liệu tham khảo

sumita, 1999, Solutions to Problems Inherent in Spoken Language Translation: The ATR-MATRIX Approach, Proc of MT Summit, 229 wahlster, 2000, Verbmobil Foundations of Speech-to-Speech Translation, 10.1007/978-3-662-04230-4 sugaya, 2000, In Proceedings of ICSLP'2000, Evaluation of the ATR-MATRIX speech translation system with a paired comparison method between the system and humans, iii, 1105 tomita, 1993, Evaluation of MT Systems by TOEFL, Proceedings of the 5th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI'93), 252 10.3115/992133.992137 takezawa, 1999, Building a bilingual travel conversation database for speech recognition research, Proc Oriental COCOSDA Workshop sugaya, 1999, End-to-end evaluation in ATR-MATRIX: speech translation system between English and Japanese, Proceedings of EUROSPEECH'99, 2431 takezawa, 1999, A New Evaluation Method for Speech Translation Systems and a Case Study on ATR-MATRIX from Japanese to English, Proc of MT Summit, 299 takezawa, 1998, A Japanese-to-English speech translation system: ATR-MATRIX, Proc ICSLP, 2779