Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

A critical examination of document-level machine translation systems

Nayak, Prashanth orcid logoORCID: 0000-0003-1962-9135 (2024) A critical examination of document-level machine translation systems. PhD thesis, Dublin City University.

Abstract
The need for accurate and effective translation cannot be overstated in an increasingly globalised world where communication is paramount. Bridging language barriers is important for promoting understanding and cooperation among diverse individuals and communities, making translation an indispensable tool. Over the past two decades, Machine Translation (MT) has undergone remarkable advancements, with significant progress attributed to the emergence of Neural Machine Translation (NMT), primarily the groundbreaking Transformer models. This rapid development in MT, which started with a focus on sentence-level translation, has not only bridged communication gaps but also brought MT systems close to delivering human-like performance on various translation tasks. While these advances are significant, focusing mainly on sentence-level modeling and evaluation, they miss the valuable contextual information around each sentence. Contextual information in document-level translation helps resolve language ambiguities and ensures consistency and coherence in the translated text, making the translation more accurate and readable. While considerable efforts have been made to incorporate context into NMT systems, the community has not reached a consensus on the most effective methods and the types of context to be integrated. In this thesis, my primary focus is on understanding document-level systems. Specifically, I explore how these systems incorporate context into their translation processes and investigate the span of context utilised by these systems. I also investigate the terminology translation mechanisms within these systems. Furthermore, with the emergence of modern-day powerful Large Language Models (LLMs), I examine their capabilities in terminology translation and propose new methodologies to improve terminology translation for these powerful models.
Metadata
Item Type:Thesis (PhD)
Date of Award:March 2024
Refereed:No
Supervisor(s):Way, Andy, Haque, Rejwanul and Kelleher, John
Uncontrolled Keywords:Terminology
Subjects:Computer Science > Artificial intelligence
Computer Science > Computational linguistics
Computer Science > Information technology
Computer Science > Machine learning
Computer Science > Machine translating
Humanities > Language
Humanities > Linguistics
Humanities > Translating and interpreting
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > ADAPT
Research Institutes and Centres > d-real
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License
Funders:Science Foundation Ireland, Microsoft Research
ID Code:29345
Deposited On:22 Mar 2024 13:42 by Andrew Way . Last Modified 22 Mar 2024 13:42
Documents

Full text available as:

[thumbnail of 19213962_final_thesis_upload.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial 4.0
1MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record