Consultation checklists: standardising the human evaluation of medical note generation

Savkov, Aleksandar; Moramarco, Francesco; Korfiatis, Alex Papadopoulos; Perera, Mark; Belz, Anya; Reiter, Ehud

Savkov, Aleksandar ORCID: 0009-0009-6831-5563, Moramarco, Francesco, Korfiatis, Alex Papadopoulos, Perera, Mark, Belz, Anya ORCID: 0000-0002-0552-8096 and Reiter, Ehud ORCID: 0000-0002-7548-9504 (2022) Consultation checklists: standardising the human evaluation of medical note generation. In: EMNLP 2022 Industry Track, 9-11 Dec 2022, Abu Dhabi, UAE.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

valuating automatically generated text is generally hard due to the inherently subjective nature of many aspects of the output quality. This difficulty is compounded in automatic consultation note generation by differing opinions between medical experts both about which patient statements should be included in generated notes and about their respective importance in arriving at a diagnosis. Previous real-world evaluations of note-generation systems saw substantial disagreement between expert evaluators. In this paper we propose a protocol that aims to increase objectivity by grounding evaluations in Consultation Checklists, which are created in a preliminary step and then used as a common point of reference during quality assessment. We observed good levels of inter-annotator agreement in a first evaluation study using the protocol; further, using Consultation Checklists produced in the study as reference for automatic metrics such as ROUGE or BERTScore improves their correlation with human judgements compared to using the original human note.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Subjects:	Computer Science > Computational linguistics
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > ADAPT
Published in:	Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry (EMNLP 2022)(. . Association for Computational Linguistics (ACL).
Publisher:	Association for Computational Linguistics (ACL)
Official URL:	https://aclanthology.org/2022.emnlp-industry.10
Copyright Information:	© 2022 Association for Computational Linguistics
ID Code:	28655
Deposited On:	04 Jul 2023 15:33 by Anya Belz . Last Modified 04 Jul 2023 15:33

Documents

Full text available as:

[thumbnail of 2022.emnlp-industry.10.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0
1MB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Consultation checklists: standardising the human evaluation of medical note generation

Downloads