A cluster-based representation for multi-system MT evaluation

Stroppa, Nicolas; Owczarzak, Karolina; Way, Andy

Stroppa, Nicolas, Owczarzak, Karolina and Way, Andy ORCID: 0000-0001-5736-5930 (2007) A cluster-based representation for multi-system MT evaluation. In: TMI-07 - Proceedings of The 11th Conference on Theoretical and Methodological Issues in Machine Translation, 7-9 September 2007, Skövde, Sweden.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

Automatic evaluation metrics are often used to compare the quality of different systems. However, a small difference between the scores of two systems does not necessary reflect a real difference between their performance. Because such a difference can be significant or only due to chance, it is inadvisable to use a hard ranking to represent the evaluation of multiple systems. In this paper, we propose a cluster-based representation for quality ranking of Machine Translation systems. A comparison of rankings produced by clustering based on automatic MT evaluation metrics with those based on human judgements shows that such interpretation of automatic metric scores provides dependable means of ordering MT systems with respect to their quality. We report experimental results comparing clusterings produced by BLEU, NIST, METEOR, and GTM with those derived from human judgement (of adequacy and fluency) on the IWSLT-2006 evaluation campaign data.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Uncontrolled Keywords:	machine translation evaluation;
Subjects:	Computer Science > Machine translating
DCU Faculties and Centres:	Research Institutes and Centres > National Centre for Language Technology (NCLT) DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:	http://www.computing.dcu.ie/~away/TMI-07/
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:	15227
Deposited On:	18 Feb 2010 13:34 by DORAS Administrator . Last Modified 16 Nov 2018 09:41

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
107kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

A cluster-based representation for multi-system MT evaluation

Downloads