Browse DORAS
Browse Theses
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Source-side syntactic reordering patterns with functional words for improved phrase-based SMT

Jiang, Jie and Du, Jinhua and Way, Andy (2010) Source-side syntactic reordering patterns with functional words for improved phrase-based SMT. In: SSST 2010 - 4th Workshop on Syntax and Structure in Statistical Translation, 28 August 2010, Beijing, China.

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


Inspired by previous source-side syntactic reordering methods for SMT, this paper focuses on using automatically learned syntactic reordering patterns with functional words which indicate structural reorderings between the source and target language. This approach takes advantage of phrase alignments and source-side parse trees for pattern extraction, and then filters out those patterns without functional words. Word lattices transformed by the generated patterns are fed into PBSMT systems to incorporate potential reorderings from the inputs. Experiments are carried out on a medium-sized corpus for a Chinese–English SMT task. The proposed method outperforms the baseline system by 1.38% relative on a randomly selected testset and 10.45% relative on the NIST 2008 testset in terms of BLEU score. Furthermore, a system with just 61.88% of the patterns filtered by functional words obtains a comparable performance with the unfiltered one on the randomly selected testset, and achieves 1.74% relative improvements on the NIST 2008 testset.

Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in:Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation. . Association for Computational Linguistics.
Publisher:Association for Computational Linguistics
Official URL:
Copyright Information:© 2010 Association for Computational Linguistics
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland
ID Code:15808
Deposited On:10 Nov 2010 16:14 by Shane Harper. Last Modified 10 Nov 2010 16:14

Download statistics

Archive Staff Only: edit this record