Min, Jinming, Leveling, Johannes ORCID: 0000-0003-0603-4191, Zhou, Dong and Jones, Gareth J.F. ORCID: 0000-0003-2923-8365 (2010) Document expansion for image retrieval. In: CLEF labs 2010, Intellectual Property (CLEF-IP), 22-23 September 2010, Padua, Italy.
Abstract
Successful information retrieval requires effective matching
between the user's search request and the contents of relevant documents. Often the request entered by a user may
not use the same topic-relevant terms as the authors of the
documents. One potential approach to address problems of query-document term mismatch is document expansion to include additional topically relevant indexing terms in a
document which may encourage its retrieval when relevant to queries which do not match its original contents well. We
propose and evaluate a new document expansion method using external resources. While results of previous research
have been inconclusive in determining the impact of document
expansion on retrieval effectiveness, our method is shown to work effectively for text-based image retrieval of
short image annotation documents. Our approach uses the
Okapi query expansion algorithm as a method for document
expansion. We further show improved performance can be
achieved by using a "document reduction" approach to include
only the significant terms in a document in the expansion
process. Our experiments on the WikipediaMM task at ImageCLEF 2008 show an increase of 16.5% in mean average
precision (MAP) compared to a variation of the Okapi BM25 retrieval model. To compare document expansion with query
expansion, we also test query expansion from an external resource, which leads to an improvement of 9.84% in MAP over
our baseline. We conclude that document expansion
with document reduction, in combination with query expansion, produces the best overall retrieval results for short-length document retrieval. For this image retrieval task, we also conclude that query expansion from an external resource does not outperform the document expansion method.
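The pipeline outlined in the abstract (reduce a short annotation to its significant terms, use them as a query against an external resource, then append the best feedback terms) can be sketched roughly as follows. This is only an illustrative sketch under several assumptions not stated in the abstract: the reduction step is approximated by tf-idf ranking, the first-pass retrieval is a simple idf-overlap score rather than BM25, and feedback terms are ranked by the Robertson offer weight commonly associated with Okapi query expansion. All function names and parameter values are hypothetical.

```python
import math
from collections import Counter


def doc_freqs(collection):
    """Count, for each term, how many documents contain it."""
    df = Counter()
    for doc in collection:
        df.update(set(doc))
    return df


def idf(term, df, n_docs):
    """Smoothed inverse document frequency (always positive)."""
    return math.log((n_docs + 1) / (df.get(term, 0) + 0.5))


def reduce_document(tokens, df, n_docs, keep=3):
    """Assumed 'document reduction': keep the top tf-idf terms only."""
    tf = Counter(tokens)
    ranked = sorted(tf, key=lambda t: (-tf[t] * idf(t, df, n_docs), t))
    return ranked[:keep]


def top_feedback_docs(query_terms, collection, df, n_docs, r_docs=2):
    """Rank external documents by summed idf of matching query terms
    (a stand-in for a BM25 first-pass retrieval)."""
    scored = []
    for i, doc in enumerate(collection):
        present = set(doc)
        score = sum(idf(t, df, n_docs) for t in query_terms if t in present)
        if score > 0:
            scored.append((-score, i))
    scored.sort()
    return [collection[i] for _, i in scored[:r_docs]]


def offer_weight(r, n, R, N):
    """Robertson offer weight: r times the relevance weight, where r is the
    number of feedback docs containing the term, n its collection document
    frequency, R the number of feedback docs, N the collection size."""
    rw = math.log(((r + 0.5) * (N - n - R + r + 0.5))
                  / ((n - r + 0.5) * (R - r + 0.5)))
    return r * rw


def expand_document(tokens, collection, n_terms=2, keep=3, r_docs=2):
    """Append the n_terms best feedback terms to the original document."""
    N = len(collection)
    df = doc_freqs(collection)
    query = reduce_document(tokens, df, N, keep)
    feedback = top_feedback_docs(query, collection, df, N, r_docs)
    R = len(feedback)
    r_counts = doc_freqs(feedback)
    original = set(tokens)
    candidates = sorted(
        (-offer_weight(r, df[t], R, N), t)
        for t, r in r_counts.items() if t not in original
    )
    return list(tokens) + [t for _, t in candidates[:n_terms]]


# Toy external resource and a short image annotation (made-up data).
external = [
    ["eiffel", "tower", "paris", "france", "landmark"],
    ["eiffel", "tower", "paris", "iron", "lattice"],
    ["golden", "gate", "bridge", "san", "francisco"],
    ["london", "bridge", "thames", "england"],
    ["paris", "louvre", "museum", "france"],
]
annotation = ["eiffel", "tower", "night"]
print(expand_document(annotation, external, n_terms=2, keep=2))
```

The original terms are kept in place and expansion terms are only appended, so an index built over the expanded documents still matches queries that used the original wording.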
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Subjects: | Computer Science > Information retrieval |
DCU Faculties and Centres: | Research Institutes and Centres > Centre for Next Generation Localisation (CNGL); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Official URL: | http://clef2010.org/index.php?page=pages/proceedin... |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
ID Code: | 15833 |
Deposited On: | 22 Nov 2010 14:49 by Shane Harper. Last Modified 25 Oct 2018 10:41 |
Documents
Full text available as: PDF (296kB)