Blip10000: a social video dataset containing SPUG content for tagging and retrieval
Schmiedeke, Sebastian, Xu, Peng, Ferrané, Isabelle, Eskevich, MariaORCID: 0000-0002-1242-0753, Kofler, Christoph, Larson, Martha, Estève, Yannick, Lamel, Lori, Jones, Gareth J.F.ORCID: 0000-0003-2923-8365 and Sikora, Thomas
(2013)
Blip10000: a social video dataset containing SPUG content for tagging and retrieval.
In: ACM Multimedia Systems Conference (MMSys 2013), 27 Feb - 1 Mar 2013, Oslo, Norway.
ISBN 978-1-4503-1894-5/13/02
The increasing amount of digital multimedia content available is inspiring potential new types of user interaction with video data. Users want to easilyfind the content by searching and browsing. For this reason, techniques are needed that allow automatic categorisation, searching the content and linking to related information.
In this work, we present a dataset that contains comprehensive semi-professional user generated (SPUG) content, including audiovisual content, user-contributed metadata, automatic speech recognition transcripts, automatic shot boundary les, and social information for multiple `social levels'. We describe the principal characteristics of this dataset and present results that have been achieved on different tasks.
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Conference
Refereed:
Yes
Uncontrolled Keywords:
Dataset; SPUG Content; Video Tagging; Speech Retrieval