Replicating web structure in small-scale test
collections
Gurrin, CathalORCID: 0000-0003-4395-7702 and Smeaton, Alan F.ORCID: 0000-0003-1028-8389
(2004)
Replicating web structure in small-scale test
collections.
Information Retrieval, 7
(3).
pp. 239-263.
ISSN 1573-7659
Linkage analysis as an aid to web search has been assumed to be of significant benefit and we know that it is being implemented by many major Search Engines. Why then have few TREC participants been able to scientifically prove the benefits of linkage analysis in recent years? In this paper we put forward reasons why many disappointing results have been found in TREC experiments and we identify the linkage density requirements of a dataset to faithfully support experiments into linkage-based retrieval by examining the linkage structure of the WWW. Based on these requirements we report on methodologies for synthesising such a test collection.
Metadata
Item Type:
Article (Published)
Refereed:
Yes
Uncontrolled Keywords:
Linkage analysis; Search engine; Retrieval evaluation; Test collections