Browse DORAS
Browse Theses
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Document expansion for image retrieval

Min, Jinming and Leveling, Johannes and Zhou, Dong and Jones, Gareth J.F. (2010) Document expansion for image retrieval. In: RIAO 2010 - 9th RIAO Conference, 28-30 April, 2010, Paris, France.

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


Successful information retrieval requires eective matching between the user's search request and the contents of relevant documents. Often the request entered by a user may not use the same topic relevant terms as the authors' of the documents. One potential approach to address problems of query-document term mismatch is document expansion to include additional topically relevant indexing terms in a document which may encourage its retrieval when relevant to queries which do not match its original contents well. We propose and evaluate a new document expansion method using external resources. While results of previous research have been inconclusive in determining the impact of document expansion on retrieval eectiveness, our method is shown to work eectively for text-based image retrieval of short image annotation documents. Our approach uses the Okapi query expansion algorithm as a method for document expansion. We further show improved performance can be achieved by using a \document reduction" approach to include only the signicant terms in a document in the expansion process. Our experiments on the WikipediaMM task at ImageCLEF 2008 show an increase of 16.5% in mean average precision (MAP) compared to a variation of Okapi BM25 retrieval model. To compare document expansion with query expansion, we also test query expansion from an external resource which leads an improvement by 9.84% in MAP over our baseline. Our conclusion is that the document expansion with document reduction and in combination with query expansion produces the overall best retrieval results for shortlength document retrieval. For this image retrieval task, we also concluded that query expansion from external resource does not outperform the document expansion method.

Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Uncontrolled Keywords:document expansion; query expansion; pseudo-relevance feedback; Wikipedia;
Subjects:Computer Science > Information retrieval
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:
Copyright Information:Copyright © 2010 Centre de Hautes Etudes Internationales d'Informatique Documentaire (CID)
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:15892
Deposited On:25 Nov 2010 11:54 by DORAS Administrator. Last Modified 25 Nov 2010 11:54

Download statistics

Archive Staff Only: edit this record