Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Using images to improve machine-translating E-commerce product listings

Calixto, Iacer, Stein, Daniel, Matusov, Evgeny, Lohar, Pintu orcid logoORCID: 0000-0002-5328-1585, Castilho, Sheila orcid logoORCID: 0000-0002-8416-6555 and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2017) Using images to improve machine-translating E-commerce product listings. In: 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), 3-7 April 2017, Valencia, Spain. ISBN 978-1-945626-35-7

Abstract
In this paper we study the impact of using images to machine-translate user-generated ecommerce product listings. We study how a multi-modal Neural Machine Translation (NMT) model compares to two text-only approaches: a conventional state-of-the-art attentional NMT and a Statistical Machine Translation (SMT) model. User-generated product listings often do not constitute grammatical or well-formed sentences. More often than not, they consist of the juxtaposition of short phrases or keywords. We train our models end-to-end as well as use text-only and multimodal NMT models for re-ranking n-best lists generated by an SMT model. We qualitatively evaluate our user-generated training data also analyse how adding synthetic data impacts the results. We evaluate our models quantitatively using BLEU and TER and find that (i) additional synthetic data has a general positive impact on text-only and multi-modal NMT models, and that (ii) using a multi-modal NMT model for re-ranking n-best lists improves TER significantly across different nbest list sizes.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > ADAPT
Published in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers. 2. Association for Computational Linguistics (ACL). ISBN 978-1-945626-35-7
Publisher:Association for Computational Linguistics (ACL)
Official URL:http://aclweb.org/anthology/E17-2101
Copyright Information:© 2017 ACL
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland Research Centres Programme (Grant 13/RC/2106) co-funded under the European Regional Development Fund.
ID Code:23066
Deposited On:08 Mar 2019 10:44 by Thomas Murtagh . Last Modified 05 May 2023 16:28
Documents

Full text available as:

[thumbnail of Using_Images_to_Improve_Machine_TranslatingE_Commerce_Product_Listings.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
747kB
Metrics

Altmetric Badge

Dimensions Badge

Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record