Relations between comprehensibility and adequacy errors in machine translation output
Popović, MajaORCID: 0000-0001-8234-8745
(2020)
Relations between comprehensibility and adequacy errors in machine translation output.
In: 24th Conference on Computational Natural Language Learning, 19-20 Nov 2020, Online.
This work presents a detailed analysis of translation errors perceived by readers as comprehensibility and/or adequacy issues. The main finding is that good comprehensibility, similarly to good fluency, can mask a number of adequacy errors. Of all major adequacy errors, 30% were fully comprehensible, thus fully misleading the reader to accept the incorrect information. Another 25% of major adequacy errors were perceived as almost comprehensible, thus being potentially misleading. Also, a vast majority of omissions (about 70%) is hidden by comprehensibility. Further analysis of misleading translations revealed that the most frequent error types are ambiguity, mistranslation, noun phrase error, word-by-word translation, untranslated word, subject-verb agreement, and spelling error in the source text. However, none of these error types appears exclusively in misleading translations, but are also frequent in fully incorrect (incomprehensible inadequate) and discarded correct (incomprehensible adequate) translations. Deeper analysis is needed to potentially detect underlying phenomena specifically related to misleading translations.
European Association for Machine Translation (EAMT) under its programme “2019 Sponsorship of Activities” at the ADAPT Research Centre at Dublin City University., Science Foundation Ireland through the SFI Research Centres Programme Grant 13/RC/2106, European Regional Development Fund (ERDF)
ID Code:
28356
Deposited On:
23 May 2023 11:47 by Maja Popovic. Last Modified 23 May 2023 11:47