DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Using term clouds to represent segment-level semantic content of podcasts

Fuller, Marguerite, Tsagkias, Manos, Newman, Eamonn (ORCID: 0000-0002-0310-0539), Besser, Jana, Larson, Martha, Jones, Gareth J.F. (ORCID: 0000-0003-2923-8365) and de Rijke, Maarten (2008) Using term clouds to represent segment-level semantic content of podcasts. In: the Workshop on Searching Spontaneous Conversational Speech at Thirty-First Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), 24 July 2008, Singapore.

Abstract
Spoken audio, like any time-continuous medium, is notoriously difficult to browse or skim without the support of an interface that provides semantically annotated jump points signalling to the user where to listen in. Creation of time-aligned metadata by human annotators is prohibitively expensive, motivating the investigation of representations of segment-level semantic content based on transcripts generated by automatic speech recognition (ASR). This paper examines the feasibility of using term clouds to provide users with a structured representation of the semantic content of podcast episodes. Podcast episodes are visualized as a series of sub-episode segments, each represented by a term cloud derived from an ASR transcript. The quality of segment-level term clouds is measured quantitatively, and their utility is investigated in a small-scale user study based on human-labeled segment boundaries. Since the segment-level clouds generated from ASR transcripts prove useful, we examine an adaptation of text tiling techniques to speech in order to generate segments as part of a completely automated indexing and structuring system for browsing spoken audio. Results demonstrate that the generated segments are comparable with human-selected segment boundaries.
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Refereed: Yes
Uncontrolled Keywords: Speech browsing; term clouds; TextTiling
Subjects: Computer Science > Information technology
DCU Faculties and Centres: Research Institutes and Centres > Centre for Digital Video Processing (CDVP)
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 16190
Deposited On: 05 Aug 2011 12:52 by Shane Harper. Last Modified 25 Oct 2018 11:57
Documents

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
328kB