Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Building neural machine translation systems for multilingual participatory spaces

Lohar, Pintu orcid logoORCID: 0000-0002-5328-1585, Xie, Guodong orcid logoORCID: 0000-0003-0037-8495, Gallagher, Daniel and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2023) Building neural machine translation systems for multilingual participatory spaces. Analytics, 2 (2). pp. 393-409. ISSN 2813-2203

Abstract
This work presents the development of the translation component in a multistage, multilevel, multimode, multilingual and dynamic deliberative (M4D2) system, built to facilitate automated moderation and translation in the languages of five European countries: Italy, Ireland, Germany, France and Poland. Two main topics were to be addressed in the deliberation process: (i) the environment and climate change; and (ii) the economy and inequality. In this work, we describe the development of neural machine translation (NMT) models for these domains for six European languages: Italian, English (included as the second official language of Ireland), Irish, German, French and Polish. As a result, we generate 30 NMT models, initially baseline systems built using freely available online data, which are then adapted to the domains of interest in the project by (i) filtering the corpora, (ii) tuning the systems with automatically extracted in-domain development datasets and (iii) using corpus concatenation techniques to expand the amount of data available. We compare our results produced by the domain-adapted systems with those produced by Google Translate, and demonstrate that fast, high-quality systems can be produced that facilitate multilingual deliberation in a secure environment.
Metadata
Item Type:Article (Published)
Refereed:Yes
Uncontrolled Keywords:neural machine translation; domain adaptation; parallel data; deliberative democracy; citizens’ assemblies
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > ADAPT
Publisher:MDPI
Official URL:https://doi.org/10.3390/analytics2020022
Copyright Information:© 2023 The Authors.
Funders:European Commission under H2020-EU.3.6.—SOCIETAL CHALLENGES—Europe In A Changing World—Inclusive, Innovative And Reflective Societies, grant agreement ID: 959234, Science Foundation Ireland Grant Agreement No. 13/RC/2106_P2 at the ADAPT SFI Research Centre at Dublin City University
ID Code:28311
Deposited On:05 May 2023 16:23 by Pintu Lohar . Last Modified 05 May 2023 16:25
Documents

Full text available as:

[thumbnail of analytics-02-00022.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0
650kB
Metrics

Altmetric Badge

Dimensions Badge

Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record