Combining textual and visual information processing for interactive video retrieval: SCHEMA's participation in TRECVID 2004
Mezaris, Vasileios and Doulaverakis, Haralambos and Herrmann, Stephan and Lehane, Bart and O'Connor, Noel E. (2004) Combining textual and visual information processing for interactive video retrieval: SCHEMA's participation in TRECVID 2004. In: TRECVID 2004 - Text REtrieval Conference TRECVID Workshop, 15-16 November 2004, Gaithersburg, Maryland.
Full text available as:
In this paper, the two different applications based on the Schema Reference System that were developed by the SCHEMA NoE for participation to the search task of TRECVID 2004 are illustrated. The first application, named ”Schema-Text”, is an interactive retrieval application that employs only textual information while the second one, named ”Schema-XM”, is an extension of the former, employing algorithms and
methods for combining textual, visual and higher level information. Two runs for each application were submitted, I A 2 SCHEMA-Text 3, I A 2 SCHEMA-Text 4 for Schema-Text and I A 2 SCHEMA-XM 1, I A 2 SCHEMA-XM 2 for Schema-XM. The comparison of these two applications in terms of retrieval efficiency revealed that the combination of information from different data sources can provide higher efficiency for retrieval systems. Experimental testing additionally revealed that initially performing a text-based query and subsequently proceeding with visual similarity search using one of the returned relevant keyframes as an example image is a good scheme for combining visual and textual information.
Archive Staff Only: edit this record