Ye, Jiamin and Smeaton, Alan F. ORCID: 0000-0003-1028-8389 (2004) Aggregated Feature Retrieval for MPEG-7 via Clustering. In: SIGIR 2004 - the 27th Annual International ACM SIGIR Conference, 25-29 July 2004, Sheffield, UK.
Abstract
In this paper, we describe an approach to combining text and visual features from MPEG-7 descriptions of video. A video retrieval process is aligned to a text retrieval process based on the TF*IDF vector space model via clustering of low-level visual features. Our assumption is that shots within the same cluster are not only similar visually but also semantically, to a certain extent. Our experiments on the TRECVID2002 and TRECVID2003 collections show that adding extra meaning to a shot based on the shots from the same cluster is useful when each video in a collection contains a high proportion of similar shots, for example in documentaries.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | Video retrieval; MPEG-7; TRECVID; clustering; |
Subjects: | Computer Science > Digital video Computer Science > Information retrieval |
DCU Faculties and Centres: | Research Institutes and Centres > Centre for Digital Video Processing (CDVP) |
Publisher: | Association for Computing Machinery |
Official URL: | http://dx.doi.org/10.1145/1008992.1009097 |
Copyright Information: | © ACM, 2004. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. |
Funders: | Enterprise Ireland, EU IST-2000-32795 |
ID Code: | 374 |
Deposited On: | 28 Mar 2008 by DORAS Administrator . Last Modified 08 Nov 2018 11:09 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
43kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record