Jiang, Jie, Du, Jinhua ORCID: 0000-0002-3267-4881 and Way, Andy ORCID: 0000-0001-5736-5930 (2010) Source-side syntactic reordering patterns with functional words for improved phrase-based SMT. In: SSST 2010 - 4th Workshop on Syntax and Structure in Statistical Translation, 28 August 2010, Beijing, China.
Abstract
Inspired by previous source-side syntactic reordering methods for SMT, this paper focuses on using automatically learned syntactic reordering patterns with functional words which indicate structural reorderings between the source and target language. This approach takes advantage of phrase alignments and source-side parse trees for pattern extraction, and then filters out those patterns without functional words. Word lattices transformed by the generated patterns are fed into PBSMT systems to incorporate potential reorderings from the inputs. Experiments are carried out on a medium-sized corpus for a Chinese–English SMT task. The proposed method outperforms the baseline system by 1.38% relative on a randomly selected testset and 10.45% relative on the NIST 2008 testset in terms of BLEU score. Furthermore, a system with just 61.88% of the patterns filtered by functional words obtains a comparable performance with the unfiltered one on the randomly selected testset, and achieves 1.74% relative improvements on the NIST 2008 testset.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | Research Institutes and Centres > Centre for Next Generation Localisation (CNGL) DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Published in: | Proceedings of the 4th Workshop on Syntax and Structure in Statistical Translation. . Association for Computational Linguistics. |
Publisher: | Association for Computational Linguistics |
Official URL: | http://www.aclweb.org/anthology/W/W10/W10-3803.pdf |
Copyright Information: | © 2010 Association for Computational Linguistics |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Science Foundation Ireland |
ID Code: | 15808 |
Deposited On: | 10 Nov 2010 16:14 by Shane Harper . Last Modified 09 Nov 2018 15:55 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
135kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record