Multimedia retrieval in MultiMatch: The impact of speech transcript errors on search behaviour
Carmichael, James, Clough, Paul, Newman, Eamonn (ORCID: 0000-0002-0310-0539) and Jones, Gareth J.F. (ORCID: 0000-0003-2923-8365)
(2008)
Multimedia retrieval in MultiMatch: The impact of speech transcript errors on search behaviour.
In: The Workshop on Information Access to Cultural Heritage at the 12th European Conference on Research and Advanced Technologies for Digital Libraries, September 2008, Aarhus, Denmark.
This paper reports the findings of an evaluation of a multimedia, multimodal information access sub-system (MIAS) that incorporates automatic speech recognition (ASR) technology to transcribe the speech content of video soundtracks. The results indicate that users prefer an information-rich but minimalist graphical interface. Users also tend to have misplaced confidence in the accuracy of ASR-generated speech transcripts: if the query term does not appear in the transcript, they are not inclined to conduct the systematic auditory inspection of a video's soundtrack that is otherwise their usual search behaviour. To alert the user to the possibility that a search term may have been incorrectly recognised as some other word, a matching algorithm is proposed that searches for word sequences whose phonemic structure is similar to that of the query term.
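The abstract does not specify how the proposed matching algorithm works, so the following is only a minimal sketch of the general idea of searching a transcript for word sequences that sound like the query term. Everything in it is an assumption made for illustration: the Soundex-style consonant classes stand in for a real phonemic transcription, the similarity score comes from Python's `difflib.SequenceMatcher`, and the names `phonetic_key` and `similar_sequences`, the window size and the threshold are hypothetical rather than taken from the paper.

```python
import re
from difflib import SequenceMatcher

# Assumption: coarse consonant classes approximate phonemic structure.
# Vowels, h, w and y are ignored; a real system would use a pronunciation lexicon.
_CLASSES = {"p": "bfpv", "k": "cgjkqsxz", "t": "dt", "l": "l", "n": "mn", "r": "r"}
_CODE = {ch: code for code, chars in _CLASSES.items() for ch in chars}


def phonetic_key(text):
    """Map text to a string of consonant classes, collapsing adjacent repeats."""
    key = []
    for ch in text.lower():
        code = _CODE.get(ch)
        if code is not None and (not key or key[-1] != code):
            key.append(code)
    return "".join(key)


def similar_sequences(query, transcript, max_words=4, threshold=0.8):
    """Return (word sequence, score) pairs whose key resembles the query's key."""
    target = phonetic_key(query)
    if not target:
        return []
    words = re.findall(r"[A-Za-z']+", transcript)
    hits = []
    for start in range(len(words)):
        # Try every window of 1..max_words words beginning at this position.
        for length in range(1, max_words + 1):
            window = words[start:start + length]
            if len(window) < length:
                break  # ran past the end of the transcript
            candidate = phonetic_key(" ".join(window))
            if not candidate:
                continue
            score = SequenceMatcher(None, target, candidate).ratio()
            if score >= threshold:
                hits.append((" ".join(window), round(score, 2)))
    # Best matches first; ties keep transcript order.
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


if __name__ == "__main__":
    transcript = "it is hard to wreck a nice beach today"
    for phrase, score in similar_sequences("recognise speech", transcript):
        print(phrase, score)
```

With this toy transcript, the highest-scoring sequence for the query "recognise speech" is "wreck a nice beach", a well-known example of a recognition error, because both reduce to the same consonant-class key. Shorter fragments of that phrase also pass the illustrative threshold, so a practical interface would need to rank or filter such partial matches before flagging them to the user.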
Metadata
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Workshop
Refereed:
Yes
Uncontrolled Keywords:
automatic speech recognition; multimodal search; user evaluation