Exploiting parallel treebanks to improve phrase-based statistical machine translation
Tinsley, John, Hearne, Mary and Way, AndyORCID: 0000-0001-5736-5930
(2007)
Exploiting parallel treebanks to improve phrase-based statistical machine translation.
In: TLT 2007 - The 6th International Workshop on Treebanks and Linguistic Theories, 7-8 December, 2007, Bergen, Norway.
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. We then show that combining the data extracted from both the treebanks and the corpora into a single translation model can improve the translation quality in a baseline phrase-based statistical machine translation system.