Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Community post-editing of machine-translated user-generated content

Mitchell, Linda (2015) Community post-editing of machine-translated user-generated content. PhD thesis, Dublin City University.

Abstract
With the constant growth of user-generated content (UGC) online, the demand for quick translations of large volumes of texts increases. This is often met with a combination of machine translation (MT) and post-editing (PE). Despite extensive research in post-editing with professional translators or translation students, there are few PE studies with lay post-editors, such as domain experts. This thesis explores lay post-editing as a feasible solution for UGC in a technology support forum, machine translated from English into German. This context of lay post-editing in an online community prompts for a redefinition of quality. We adopt a mixed-methods approach, investigating PE quality quantitatively with an error annotation, a domain specialist evaluation and an end-user evaluation. We further explore post-editing behaviour, i.e. specific edits performed, from a qualitative perspective. With the involvement of community members, the need for a PE competence model becomes even more pressing. We investigate whether Gopferich’s translation competence (TC) model (2009) may serve as a basis for lay post-editing. Our quantitative data proves with statistical significance that lay post-editing is a feasible concept, producing variable output, however. On a qualitative level, post-editing is successful for short segments requiring ~35% post-editing effort. No post-editing patterns were detected for segments requiring more PE effort. Lastly, our data suggests that PE quality is largely independent of the profile characteristics measured. This thesis constitutes an important advance in lay post-editing and benchmarking the evaluation of its output, uncovering difficulties in pinpointing reasons for variance in the resulting quality.
Metadata
Item Type:Thesis (PhD)
Date of Award:November 2015
Refereed:No
Supervisor(s):O'Brien, Sharon, Roturier, Johann and Hollowood, Fred
Uncontrolled Keywords:Post-editing; Community
Subjects:Computer Science > Machine translating
Humanities > Translating and interpreting
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Humanities and Social Science > School of Applied Language and Intercultural Studies
Research Institutes and Centres > Centre for Translation and Textual Studies (CTTS)
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License
Funders:Symantec
ID Code:20463
Deposited On:23 Nov 2015 14:11 by Sharon O'brien . Last Modified 19 Jul 2018 15:05
Documents

Full text available as:

[thumbnail of PhDThesis_LindaMitchell_Final.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record