Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment

Larson, Martha; Newman, Eamonn; Jones, Gareth J.F.

Larson, Martha, Newman, Eamonn ORCID: 0000-0002-0310-0539 and Jones, Gareth J.F. ORCID: 0000-0003-2923-8365 (2009) Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment. In: CLEF 2009: Workshop on Cross-Language Information Retrieval and Evaluation,, 2009, Corfu, Greece.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

VideoCLEF 2009 offered three tasks related to enriching video content for improved multimedia access in a multilingual environment. For each task, video data (Dutch-language television, predominantly documentaries) accompanied by speech recognition transcripts were provided. The Subject Classification Task involved automatic tagging of videos with subject theme labels. The best performance was achieved by approaching subject tagging as an information retrieval task and using both speech recognition transcripts and archival metadata. Alternatively, classifiers were trained using either the training data provided or data collected from Wikipedia or via general Web search. The Affect Task involved detecting narrative peaks, defined as points where viewers perceive heightened dramatic tension. The task was carried out on the “Beeldenstorm” collection containing 45 short-form documentaries on the visual arts. The best runs exploited affective vocabulary and audience directed speech. Other approaches included using topic changes, elevated speaking pitch, increased speaking intensity and radical visual changes. The Linking Task, also called “Finding Related Resources Across Languages,” involved linking video to material on the same subject in a different language. Participants were provided with a list of multimedia anchors (short video segments) in the Dutch-language “Beeldenstorm” collection and were expected to return target pages drawn from English-language Wikipedia. The best performing methods used the transcript of the speech spoken during the multimedia anchor to build a query to search an index of the Dutch language Wikipedia. The Dutch Wikipedia pages returned were used to identify related English pages. Participants also experimented with pseudo-relevance feedback, query translation and methods that targeted proper names.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Workshop
Refereed:	Yes
Uncontrolled Keywords:	video retrieval; classification; affect; multimedia linking; semantic theme classification; speech recognition; Narrative peaks; documentaries
Subjects:	Computer Science > Information retrieval
DCU Faculties and Centres:	Research Institutes and Centres > Centre for Digital Video Processing (CDVP)
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:	16183
Deposited On:	09 Jun 2011 09:54 by Shane Harper . Last Modified 25 Oct 2018 11:14

Documents

Full text available as:

[thumbnail of Overview_of_VideoCLEF_2009_New_Perspectives_on_Speech-based_Multimedia_Content_Enrichment.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
164kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment

Downloads