Liu, Chao-Hong ORCID: 0000-0002-1235-6026, Cruz Silva, Catarina, Wang, Longyue ORCID: 0000-0002-9062-6183 and Way, Andy ORCID: 0000-0001-5736-5930 (2019) Pivot machine translation using Chinese as pivot language. In: 14th China Workshop, CWMT 2018, 25-26 Oct 2018, Wuyishan, China.
Abstract
Pivoting through a popular language with more parallel corpora available (e.g. English and Chinese) is a common approach to build machine translation
(MT) systems for low-resource languages. For example, to build a Russian-to Spanish MT system, we could build one system using the Russian–Spanish corpus
directly. We could also build two systems, Russian-to-English and English-to Spanish, as the resources of the two language pairs are much larger than the
Russian–Spanish pair, and use them cascadingly to translate texts in Russian
into Spanish by pivoting through English. There are, however, some confusing
results on the Pivot MT approach in the literature. In this paper, we reviewed the
performance of Pivot MT with the United Nations Parallel Corpus v1.0 (UN6Way)
using both English and Chinese as pivot languages. We also report our system
performance on the CWMT 2018 Pivot MT shared task, where Japanese patent
sentences are translated into English using Chinese as the pivot language.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Uncontrolled Keywords: | Pivot MT; Pivot language; Patent MT |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | Research Institutes and Centres > ADAPT |
Published in: | Machine Translation. Communications in Computer and Information Science 954. Springer. |
Publisher: | Springer |
Official URL: | https://doi.org/10.1007/978-981-13-3083-4_7 |
Copyright Information: | © 2018 Springer |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | ADAPT Centre for Digital Content Technology is funded under the SFI Research Centres Programme (Grant No. 13/RC/2106) and is co-funded under the European Regional Development Fund., European Union’s Horizon 2020 Research and Innovation programme under the Marie Skłodowska-Curie Actions (Grant No. 734211; the EU INTERACT project). |
ID Code: | 23196 |
Deposited On: | 17 Apr 2019 14:01 by Thomas Murtagh . Last Modified 17 Apr 2019 14:01 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
102kB |
Metrics
Altmetric Badge
Dimensions Badge
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record