Relations between comprehensibility and adequacy errors in machine translation output

Popović, Maja ORCID: 0000-0001-8234-8745 (2020) Relations between comprehensibility and adequacy errors in machine translation output. In: 24th Conference on Computational Natural Language Learning, 19-20 Nov 2020, Online.

[+][-]

Abstract

This work presents a detailed analysis of translation errors perceived by readers as comprehensibility and/or adequacy issues. The main finding is that good comprehensibility, similarly to good fluency, can mask a number of adequacy errors. Of all major adequacy errors, 30% were fully comprehensible, thus fully misleading the reader to accept the incorrect information. Another 25% of major adequacy errors were perceived as almost comprehensible, thus being potentially misleading. Also, a vast majority of omissions (about 70%) is hidden by comprehensibility. Further analysis of misleading translations revealed that the most frequent error types are ambiguity, mistranslation, noun phrase error, word-by-word translation, untranslated word, subject-verb agreement, and spelling error in the source text. However, none of these error types appears exclusively in misleading translations, but are also frequent in fully incorrect (incomprehensible inadequate) and discarded correct (incomprehensible adequate) translations. Deeper analysis is needed to potentially detect underlying phenomena specifically related to misleading translations.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Subjects:	Computer Science > Machine translating Humanities > Language
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > ADAPT
Published in:	Proceedings of the 24th Conference on Computational Natural Language Learning. . Association for Computational Linguistics (ACL).
Publisher:	Association for Computational Linguistics (ACL)
Official URL:	https://doi.org/10.18653/v1/2020.conll-1.19
Copyright Information:	© 2020 Association for Computational Linguistics
Funders:	European Association for Machine Translation (EAMT) under its programme “2019 Sponsorship of Activities” at the ADAPT Research Centre at Dublin City University., Science Foundation Ireland through the SFI Research Centres Programme Grant 13/RC/2106, European Regional Development Fund (ERDF)
ID Code:	28356
Deposited On:	23 May 2023 11:47 by Maja Popovic . Last Modified 23 May 2023 11:47

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0
208kB

Metrics

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Relations between comprehensibility and adequacy errors in machine translation output

Altmetric Badge

Dimensions Badge

Downloads