Du, Jinhua (ORCID: 0000-0002-3267-4881), Jiang, Jie and Way, Andy (ORCID: 0000-0001-5736-5930) (2010)
Facilitating translation using source language paraphrase lattices.
In: the Conference on Empirical Methods in Natural Language Processing (EMNLP 2010), 9-11 Oct 2010, Cambridge, Massachusetts.
Abstract
For resource-limited language pairs, coverage of the test set by the parallel corpus is an important factor that affects translation quality in two respects: 1) out-of-vocabulary words; 2) the same information in an input sentence can be expressed in different ways, while current phrase-based SMT systems cannot automatically select an alternative way to transfer the same information. Therefore, given limited data, in order to facilitate translation from the input side, this paper proposes a novel method to reduce the translation difficulty using source-side lattice-based paraphrases. We utilise the original phrases from the input sentence and the corresponding paraphrases to build a lattice with estimated weights for each edge to improve translation quality. Compared to the baseline system, our method achieves relative improvements of 7.07%, 6.78% and 3.63% in terms of BLEU score on small, medium and large-scale English-to-Chinese translation tasks respectively. The results show that the proposed method is effective not only for resource-limited language pairs, but also for resource-sufficient pairs to some extent.
Metadata
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Event Type: | Conference |
| Refereed: | Yes |
| Uncontrolled Keywords: | parallel corpus; language pairs |
| Subjects: | Computer Science > Machine translating |
| DCU Faculties and Centres: | Research Institutes and Centres > Centre for Next Generation Localisation (CNGL); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
| Published in: | Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. |
| Publisher: | Association for Computational Linguistics |
| Official URL: | http://aclweb.org/anthology-new/D/D10/D10-1041.pdf |
| Copyright Information: | © 2010 ACL. |
| Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
| ID Code: | 16010 |
| Deposited On: | 20 Jul 2011 13:30 by Shane Harper. Last Modified 09 Nov 2018 14:18 |
Documents
Full text available as: PDF (715kB)