A comparison of score, rank and probability-based fusion methods for video shot retrieval
McDonald, Kieran and Smeaton, Alan F. (2005) A comparison of score, rank and probability-based fusion methods for video shot retrieval. In: CIVR 2005 - International Conference on Image and Video Retrieval, 20-22 July 2005, Singapore. ISBN 978-3-540-27858-0
It is now accepted that the most effective video shot retrieval is based on indexing and retrieving clips using multiple, parallel modalities such as text-matching, image-matching and feature-matching, and then combining or fusing these parallel retrieval streams in some way. In this paper we investigate a range of fusion methods for combining results based on multiple visual features (colour, edge and texture), for combining results based on multiple visual examples in the query, and for combining multiple modalities (text and visual). Using three TRECVid collections and the TRECVid search task, we specifically compare fusion methods based on normalised score and rank that use either the average, weighted average or maximum of retrieval results from a discrete Jelinek-Mercer smoothed language model. We also compare these results with a simple probability-based combination of the language model results that assumes all features and visual examples are fully independent.
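The fusion families named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's exact formulas, so everything here is an assumption made for illustration: min-max scaling stands in for score normalisation, a linear rank-to-score mapping stands in for rank-based scoring, and the function names (`min_max_normalise`, `rank_scores`, `fuse`, `independent_product`) are hypothetical.

```python
from math import log


def min_max_normalise(scores):
    """Scale raw scores to [0, 1] (assumed normalisation; constant lists map to 0)."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {shot: 0.0 for shot in scores}
    return {shot: (s - lo) / (hi - lo) for shot, s in scores.items()}


def rank_scores(scores):
    """Replace scores by a rank-based value in (0, 1]; the top shot gets 1.0."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    n = len(ordered)
    return {shot: (n - i) / n for i, shot in enumerate(ordered)}


def fuse(result_lists, method="average", weights=None):
    """Fuse several {shot: value} dicts by average, weighted average or maximum.

    Shots missing from a list contribute 0 (an assumption of this sketch).
    """
    if method == "average":
        weights = [1.0 / len(result_lists)] * len(result_lists)
    elif method == "weighted":
        if weights is None or len(weights) != len(result_lists):
            raise ValueError("weighted fusion needs one weight per result list")
    elif method != "max":
        raise ValueError(f"unknown fusion method: {method}")

    fused = {}
    for shot in set().union(*result_lists):
        values = [results.get(shot, 0.0) for results in result_lists]
        if method == "max":
            fused[shot] = max(values)
        else:
            fused[shot] = sum(w * v for w, v in zip(weights, values))
    return fused


def independent_product(prob_lists):
    """Combine per-feature probabilities under a full-independence assumption.

    Sums log-probabilities (equivalent to multiplying the probabilities) over
    shots that appear in every list.
    """
    shots = set.intersection(*(set(p) for p in prob_lists))
    return {shot: sum(log(p[shot]) for p in prob_lists) for shot in shots}


# Toy scores for three shots from two hypothetical visual features.
colour = {"s1": 1.0, "s2": 2.0, "s3": 3.0}
edge = {"s1": 4.0, "s2": 0.0, "s3": 2.0}

normalised = [min_max_normalise(colour), min_max_normalise(edge)]
average_fused = fuse(normalised, method="average")
best_shot = max(average_fused, key=average_fused.get)
```

Rank-based fusion follows the same pattern: pass the output of `rank_scores` to `fuse` in place of the normalised scores. The log-sum in `independent_product` is only a numerically convenient way to rank by the product of probabilities.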