How interesting and coherent are the stories generated by a large-scale neural language model? Comparing human and automatic evaluations of machine-generated text

Callan, Dominic; Foster, Jennifer; .

Home
Browse By

Author

DCU Faculties and Centres

Theses

Subject

Year

Publication Type

Year of Award

Supervisors
About / FAQ
Statistics
Login (DCU Staff Only)

How interesting and coherent are the stories generated by a large-scale neural language model? Comparing human and automatic evaluations of machine-generated text

Callan, Dominic ORCID: 0000-0002-9163-1777, Foster, Jennifer ORCID: 0000-0002-7789-4853 and . (2023) How interesting and coherent are the stories generated by a large-scale neural language model? Comparing human and automatic evaluations of machine-generated text. Expert Systems, 40 (6). ISSN 0266-4720

Abstract
Metadata
Downloads
Documents
Metrics

[+][-]

Abstract

Evaluation of the narrative text generated by machines has traditionally been a challenge, particularly when attempting to evaluate subjective elements such as interest or believability. Recent improvements in narrative machine text generation have been largely driven by the emergence of transformer-based language models, trained on massive quantities of data, resulting in higher quality text generation. In this study, a corpus of stories is generated using the pre-trained GPT-Neo transformer model, with human-written prompts as inputs upon which to base the narrative text. The stories generated through this process are subsequently evaluated through both human evaluation and two automated metrics: BERTScore and BERT Next Sentence Prediction, with the aim of determining whether there is a correlation between the automatic scores and the human judgements. The results show variation in human evaluation results in comparison to modern automated metrics, suggesting further work is required to train automated metrics to identify text that is defined as interesting by humans.

Metadata

Item Type:	Article (Published)
Refereed:	Yes
Additional Information:	Article number: e13292
Subjects:	Computer Science > Computational linguistics Computer Science > Machine learning
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Publisher:	Wiley
Official URL:	https://doi.org/10.1111/exsy.13292
Copyright Information:	© 2023 The Authors.
Funders:	Open access funding provided by IReL
ID Code:	29135
Deposited On:	20 Oct 2023 08:47 by Vidatum Academic . Last Modified 20 Oct 2023 08:47

Documents

Full text available as:

[thumbnail of ExpertSystems-2023-Callan-Howinterestingandcoherentarethestoriesgeneratedbyalargescaleneurallanguage.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial 4.0
2MB

Metrics

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

How interesting and coherent are the stories generated by a large-scale neural language model? Comparing human and automatic evaluations of machine-generated text

Altmetric Badge

Dimensions Badge

Downloads