Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Hybrid rule-based - example-based MT: feeding apertium with sub-sentential translation units

Sánchez-Martínez, Felipe, Forcada, Mikel and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2009) Hybrid rule-based - example-based MT: feeding apertium with sub-sentential translation units. In: EBMT 2009 - 3rd Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin, Ireland.

Abstract
This paper describes a hybrid machine translation (MT) approach that consists of integrating bilingual chunks (sub-sentential translation units) obtained from parallel corpora into an MT system built using the Apertium free/open-source rule-based machine translation platform, which uses a shallow-transfer translation approach. In the integration of bilingual chunks, special care has been taken so as not to break the application of the existing Apertium structural transfer rules, since this would increase the number of ungrammatical translations. The method consists of (i) the application of a dynamic-programming algorithm to compute the best translation coverage of the input sentence given the collection of bilingual chunks available; (ii) the translation of the input sentence as usual by Apertium; and (iii) the application of a language model to choose one of the possible translations for each of the bilingual chunks detected. Results are reported for the translation from English-to-Spanish, and vice versa, when marker-based bilingual chunks automatically obtained from parallel corpora are used.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Refereed:Yes
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Institutes and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland, SFI 07/W.1/I1802, SFI 05/IN/1732, SFI 06/RF/CMS064
ID Code:15153
Deposited On:12 Feb 2010 16:08 by DORAS Administrator . Last Modified 14 Nov 2018 16:27
Documents

Full text available as:

[thumbnail of Sanchez-MartinezEtAl_ebmt3_09.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
154kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record