Tinsley, John, Hearne, Mary and Way, Andy (ORCID: 0000-0001-5736-5930) (2007) Exploiting parallel treebanks to improve phrase-based statistical machine translation. In: TLT 2007 - The 6th International Workshop on Treebanks and Linguistic Theories, 7-8 December 2007, Bergen, Norway.
Abstract
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. We then show that combining the data extracted from both the treebanks and the corpora into a single translation model can improve the translation quality in a baseline phrase-based statistical machine translation system.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Uncontrolled Keywords: | parallel treebanks; statistical machine translation |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | Research Institutes and Centres > National Centre for Language Technology (NCLT) |
Official URL: | http://tlt07.uib.no/index.php?page=main |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
Funders: | Science Foundation Ireland, SFI 05/RF/CMS064 |
ID Code: | 15266 |
Deposited On: | 09 Mar 2010 16:48 by DORAS Administrator. Last Modified 16 Nov 2018 10:39 |
Documents
Full text available as: PDF (124kB)