Browse DORAS
Browse Theses
Search
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Combining textual and visual information processing for interactive video retrieval: SCHEMA's participation in TRECVID 2004

Mezaris, Vasileios and Doulaverakis, Haralambos and Herrmann, Stephan and Lehane, Bart and O'Connor, Noel E. (2004) Combining textual and visual information processing for interactive video retrieval: SCHEMA's participation in TRECVID 2004. In: TRECVID 2004 - Text REtrieval Conference TRECVID Workshop, 15-16 November 2004, Gaithersburg, Maryland.

Full text available as:

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1057Kb

Abstract

In this paper, the two different applications based on the Schema Reference System that were developed by the SCHEMA NoE for participation to the search task of TRECVID 2004 are illustrated. The first application, named ”Schema-Text”, is an interactive retrieval application that employs only textual information while the second one, named ”Schema-XM”, is an extension of the former, employing algorithms and methods for combining textual, visual and higher level information. Two runs for each application were submitted, I A 2 SCHEMA-Text 3, I A 2 SCHEMA-Text 4 for Schema-Text and I A 2 SCHEMA-XM 1, I A 2 SCHEMA-XM 2 for Schema-XM. The comparison of these two applications in terms of retrieval efficiency revealed that the combination of information from different data sources can provide higher efficiency for retrieval systems. Experimental testing additionally revealed that initially performing a text-based query and subsequently proceeding with visual similarity search using one of the returned relevant keyframes as an example image is a good scheme for combining visual and textual information.

Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Refereed:Yes
Uncontrolled Keywords:image segmentation; region-based image and video indexing; MPEG-7 XM; high-level features; multi-modal queries; TRECVID.
Subjects:Computer Science > Digital video
Computer Science > Information retrieval
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Digital Video Processing (CDVP)
Publisher:NIST
Official URL:http://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.html#2004
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:401
Deposited On:02 Apr 2008 by DORAS Administrator. Last Modified 04 Feb 2009 16:01

Download statistics

Archive Staff Only: edit this record