Multi-engine machine translation by recursive sentence decomposition
Mellebeek, Bart, Owczarzak, Karolina, van Genabith, Josef and Way, AndyORCID: 0000-0001-5736-5930
(2006)
Multi-engine machine translation by recursive sentence decomposition.
In: AMTA 2006 - 7th Conference of the Association for Machine Translation of the Americas, 8-12 August 2006, Cambridge, Massachusetts, USA.
In this paper, we present a novel approach to combine the outputs of multiple MT engines into a consensus translation. In contrast to previous Multi-Engine Machine
Translation (MEMT) techniques, we do not rely on word alignments of output hypotheses, but prepare the input sentence for multi-engine processing. We do this by using a recursive decomposition algorithm that produces simple chunks as input to the MT engines. A consensus translation
is produced by combining the best chunk translations, selected through majority voting, a trigram language model
score and a confidence score assigned to each MT engine. We report statistically significant relative improvements
of up to 9% BLEU score in experiments (English→Spanish) carried out on an 800-sentence test set extracted from the Penn-II Treebank.