Pecina, Pavel, Toral, Antonio ORCID: 0000-0003-2357-2960, Way, Andy ORCID: 0000-0001-5736-5930, Papavassiliou, Vassilis, Prokopidis, Prokopis and Giagkou, Maria (2011) Towards using web-crawled data for domain adaptation in statistical machine translation. In: The 15th conference of the European Association for Machine Translation (EAMT 2011), 30-31 May 2011, Leuven, Belgium.
Abstract
This paper reports on ongoing work on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused web crawling. We present a strategy for crawling monolingual and parallel data and for exploiting these data for testing, language modelling, and system tuning in a phrase-based machine translation framework. The proposed approach is evaluated on two domains, Natural Environment and Labour Legislation, and two language pairs: English–French and English–Greek.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
ID Code: | 16468 |
Deposited On: | 05 Aug 2011 12:49 by Shane Harper. Last Modified 08 Feb 2023 13:51 |
Documents
Full text available as: PDF (371kB)