One major drawback of using Translation Memories (TMs) in phrase-based Machine
Translation (MT) is that only continuous phrases are considered. In contrast, syntax-based MT
allows phrasal discontinuity by learning translation rules containing non-terminals. In this paper,
we combine a TM with syntax-based MT via sparse features. These features are extracted during
decoding based on translation rules and their corresponding patterns in the TM. We have tested
this approach by carrying out experiments on real English–Spanish industrial data. Our results
show that these TM features significantly improve syntax-based MT. Our final system yields
improvements of up to +3.1 BLEU, +1.6 METEOR, and -2.6 TER when compared with a stateof-the-art phrase-based MT system.
This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:
People Programme (Marie Curie Actions) of the European Union’s Framework Programme (FP7/2007-2013) under REA grant agreement no 317471, The ADAPT Centre for Digital Content Technology is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund
ID Code:
23307
Deposited On:
16 May 2019 12:04 by
Thomas Murtagh
. Last Modified 16 May 2019 12:59