DORAS | DCU Research Repository

Multimedia retrieval in MultiMatch: The impact of speech transcript errors on search behaviour

Carmichael, James, Clough, Paul, Newman, Eamonn (ORCID: 0000-0002-0310-0539) and Jones, Gareth J.F. (ORCID: 0000-0003-2923-8365) (2008) Multimedia retrieval in MultiMatch: The impact of speech transcript errors on search behaviour. In: The Workshop on Information Access to Cultural Heritage at the 12th European Conference on Research and Advanced Technologies for Digital Libraries, September 2008, Aarhus, Denmark.

Abstract
This paper discusses the findings of an evaluation of a multimedia multimodal information access sub-system (MIAS) that incorporates automatic speech recognition (ASR) technology to transcribe the speech content of video soundtracks. The results indicate that users prefer an information-rich but minimalist graphical interface. It was also discovered that users tend to have a misplaced confidence in the accuracy of ASR-generated speech transcripts, and are therefore not inclined to conduct a systematic auditory inspection of a video’s soundtrack (their usual search behaviour) if the query term does not appear in the transcript. To alert the user to the possibility that a search term may have been incorrectly recognised as some other word, a matching algorithm is proposed that searches the transcript for word sequences whose phonemic structure is similar to that of the query term.
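The abstract does not specify the proposed matching algorithm, so the following is only a minimal sketch of the general idea: scanning a transcript for word sequences that sound like the query term. Everything in it is an assumption made for illustration. The names `sound_key` and `similar_spans`, the `max_words` and `threshold` parameters, and the letter-to-sound-class table are hypothetical, and the table is a crude spelling-based stand-in for a real phonemic transcription.

```python
from difflib import SequenceMatcher

# Hypothetical, crude letter-to-sound-class table (an assumption for this
# sketch; a real system would use a pronunciation lexicon or phoneme set).
_CLASSES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def sound_key(text):
    """Reduce text to a string of consonant classes.

    Vowels, spaces and unmapped letters are skipped, and a class is not
    repeated when it equals the previously emitted one.
    """
    key = []
    for ch in text.lower():
        code = _CLASSES.get(ch)
        if code and (not key or key[-1] != code):
            key.append(code)
    return "".join(key)


def similar_spans(query, transcript, max_words=3, threshold=0.8):
    """Return (span, score) pairs for transcript word sequences of up to
    max_words words whose sound key resembles that of the query."""
    target = sound_key(query)
    if not target:
        return []  # nothing to compare against
    words = transcript.split()
    hits = []
    for n in range(1, max_words + 1):
        for i in range(len(words) - n + 1):
            span = " ".join(words[i:i + n])
            score = SequenceMatcher(None, target, sound_key(span)).ratio()
            if score >= threshold:
                hits.append((span, round(score, 2)))
    # Best-scoring candidates first.
    return sorted(hits, key=lambda hit: -hit[1])


if __name__ == "__main__":
    transcript = "it is hard to wreck a nice beach today"
    for span, score in similar_spans("recognise speech", transcript, max_words=4):
        print(span, score)
```

In this toy example the query "recognise speech" and the transcript span "wreck a nice beach" reduce to the same sound key, so the span is returned as a candidate even though the query term never appears in the transcript. Candidates of this kind could be shown to the user as a hint that the term may have been misrecognised.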
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Refereed: Yes
Uncontrolled Keywords: automatic speech recognition; multimodal search; user evaluation
Subjects: Computer Science > Information retrieval
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 16189
Deposited On: 16 Jun 2011 13:42 by Shane Harper. Last Modified 25 Oct 2018 11:57