
Towards using web-crawled data for domain adaptation in statistical machine translation

Pecina, Pavel and Toral, Antonio and Way, Andy and Papavassiliou, Vassilis and Prokopidis, Prokopis and Giagkou, Maria (2011) Towards using web-crawled data for domain adaptation in statistical machine translation. In: The 15th conference of the European Association for Machine Translation (EAMT 2011), 30-31 May 2011, Leuven, Belgium.


Abstract

This paper reports on ongoing work on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused web crawling. We present a strategy for crawling monolingual and parallel data and for exploiting them in testing, language modelling, and system tuning in a phrase-based machine translation framework. The proposed approach is evaluated on two domains, Natural Environment and Labour Legislation, and on two language pairs, English–French and English–Greek.

Item Type: Conference or Workshop Item (Paper)
Event Type: Conference
Refereed: Yes
Subjects: Computer Science > Machine translating
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 16468
Deposited On: 05 Aug 2011 13:49 by Shane Harper. Last Modified: 05 Aug 2011 13:49
