MaTrEx: the DCU machine translation system for ICON 2008
Srivastava, Ankit Kumar, Haque, Rejwanul (ORCID: 0000-0003-1680-0099), Naskar, Sudip Kumar and Way, Andy (ORCID: 0000-0001-5736-5930)
(2008)
MaTrEx: the DCU machine translation system for ICON 2008.
In: NLP Tools Contest: Statistical Machine Translation (English to Hindi), 6th International Conference on Natural Language Processing, 19 December 2008, Pune, India.
In this paper, we describe the machine translation system developed at DCU for our participation in the NLP Tools Contest of the International Conference on Natural Language Processing (ICON 2008). This was our first attempt at working with an Indian language. We focus on various word and phrase alignment techniques to improve system quality. For the English-Hindi translation task, we exploit source-language reordering. We also carried out experiments combining in-domain and out-of-domain data to improve system performance and, as a post-processing step, we transliterate out-of-vocabulary items.