Browse DORAS
Browse Theses
Search
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Facilitating translation using source language paraphrase lattices

Du, Jinhua and Jiang, Jie and Way, Andy (2010) Facilitating translation using source language paraphrase lattices. In: the Conference on Empircal Methods in Natural Language Processing (EMNLP 2010), 9-11 Oct 2010, Cambridge, Massachusetts.

Full text available as:

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
698Kb

Abstract

For resource-limited language pairs, coverage of the test set by the parallel corpus is an important factor that affects translation quality in two respects: 1) out of vocabulary words; 2) the same information in an input sentence can be expressed in different ways, while current phrase-based SMT systems cannot automatically select an alternative way to transfer the same information. Therefore, given limited data, in order to facilitate translation from the input side, this paper proposes a novel method to reduce the translation difficulty using source-side lattice-based paraphrases. We utilise the original phrases from the input sentence and the corresponding paraphrases to build a lattice with estimated weights for each edge to improve translation quality. Compared to the baseline system, our method achieves relative improvements of 7.07%, 6.78% and 3.63% in terms of BLEU score on small, medium and largescale English-to-Chinese translation tasks respectively. The results show that the proposed method is effective not only for resourcelimited language pairs, but also for resource sufficient pairs to some extent.

Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:parallel corpus; language pairs
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in:Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. . Association for Computational Linguistics.
Publisher:Association for Computational Linguistics
Official URL:http://aclweb.org/anthology-new/D/D10/D10-1041.pdf
Copyright Information:© 2010 ACL.
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:16010
Deposited On:20 Jul 2011 14:30 by Shane Harper. Last Modified 20 Jul 2011 14:30

Download statistics

Archive Staff Only: edit this record