MaTrEx: the DCU machine translation system for ICON 2008

Srivastava, Ankit Kumar and Haque, Rejwanul and Naskar, Sudip Kumar and Way, Andy (2008) MaTrEx: the DCU machine translation system for ICON 2008. In: NLP Tools Contest: Statistical Machine Translation (English to Hindi), 6th International Conference on Natural Language Processing, 19 December 2008, Pune, India.


In this paper, we describe the machine translation system developed at DCU for our participation in the NLP Tools Contest of the International Conference on Natural Language Processing (ICON 2008). This was our first attempt at working on an Indian language. We focus on various techniques for word and phrase alignment to improve system quality. For the English-Hindi translation task, we exploit source-language reordering. We also carried out experiments combining in-domain and out-of-domain data to improve system performance and, as a post-processing step, we transliterate out-of-vocabulary items.

Item Type: Conference or Workshop Item (Paper)
Event Type: Other
Subjects: Computer Science > Machine translating
DCU Faculties and Centres: Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
Research Initiatives and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders: Science Foundation Ireland, SFI 07/CE/I1142
ID Code: 15200
Deposited On: 16 Feb 2010 16:45 by DORAS Administrator. Last Modified 16 Feb 2017 10:56