Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Seeding statistical machine translation with translation memory output through tree-based structural alignment

Zhechev, Ventsislav and van Genabith, Josef orcid logoORCID: 0000-0003-1322-7944 (2010) Seeding statistical machine translation with translation memory output through tree-based structural alignment. In: SSST-4 - 4th Workshop on Syntax and Structure in Statistical Translation, 28 August 2010, Beijing, China.

Abstract
With the steadily increasing demand for high-quality translation, the localisation industry is constantly searching for technologies that would increase translator throughput, with the current focus on the use of high-quality Statistical Machine Translation (SMT) as a supplement to the established Translation Memory (TM) technology. In this paper we present a novel modular approach that utilises state-of-the-art sub-tree alignment to pick out pre-translated segments from a TM match and seed with them an SMT system to produce a final translation. We show that the presented system can outperform pure SMT when a good TM match is found. It can also be used in a Computer-Aided Translation (CAT) environment to present almost perfect translations to the human user with markup highlighting the segments of the translation that need to be checked manually for correctness.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Refereed:Yes
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Institutes and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation. . Association for Computational Linguistics.
Publisher:Association for Computational Linguistics
Official URL:http://www.aclweb.org/anthology/W/W10/
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:European Framework Programme 7
ID Code:15992
Deposited On:08 Dec 2010 14:59 by Shane Harper . Last Modified 21 Jan 2022 16:28
Documents

Full text available as:

[thumbnail of Seeding_Statistical_Machine_Translation_with_Translation_Memory_Output_through_Tree-Based_Structural_Alignment.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
634kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record