Hybrid example-based SMT: the best of both worlds?

Groves, Declan; Way, Andy

Groves, Declan and Way, Andy ORCID: 0000-0001-5736-5930 (2005) Hybrid example-based SMT: the best of both worlds? In: ACL 2005 Workshop on Building and Using Parallel Texts, 29-30 June 2005, Ann Arbor, MI, USA.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

(Way and Gough, 2005) provide an indepth comparison of their Example-Based Machine Translation (EBMT) system with a Statistical Machine Translation (SMT) system constructed from freely available tools. According to a wide variety of automatic evaluation metrics, they demonstrated that their EBMT system outperformed the SMT system by a factor of two to one. Nevertheless, they did not test their EBMT system against a phrase-based SMT system. Obtaining their training and test data for English–French, we carry out a number of experiments using the Pharaoh SMT Decoder. While better results are seen when Pharaoh is seeded with Giza++ word- and phrase-based data compared to EBMT sub-sentential alignments, in general better results are obtained when combinations of this 'hybrid' data is used to construct the translation and probability models. While for the most part the EBMT system of (Gough & Way, 2004b) outperforms any flavour of the phrasebased SMT systems constructed in our experiments, combining the data sets automatically induced by both Giza++ and their EBMT system leads to a hybrid system which improves on the EBMT system per se for French–English.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Workshop
Refereed:	Yes
Uncontrolled Keywords:	example-based machine translation; statistical machine translation;
Subjects:	Computer Science > Machine translating
DCU Faculties and Centres:	Research Institutes and Centres > National Centre for Language Technology (NCLT) DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Publisher:	Association for Computational Linguistics
Official URL:	http://www.aclweb.org/anthology-new/W/W05/
Copyright Information:	© Association for Computational Linguistics, 2005
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:	Irish Research Council for Science Engineering and Technology
ID Code:	15292
Deposited On:	12 Mar 2010 11:35 by DORAS Administrator . Last Modified 16 Nov 2018 11:56

Documents

Full text available as:

[thumbnail of GrovesWay_aclwkshop_05.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
49kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Hybrid example-based SMT: the best of both worlds?

Downloads