On the use of clustering and the MeSH controlled vocabulary to improve MEDLINE abstract search
Blott, Stephen and Camous, Fabrice and Gurrin, Cathal and Jones, Gareth J.F. (2005) On the use of clustering and the MeSH controlled vocabulary to improve MEDLINE abstract search. In: the Second CORIA (Conference en Recherche d'Informations et Applications), March 2005, Grenoble, France. Full text available as: AbstractDatabases of genomic documents contain substantial amounts of structured information in addition to the texts of titles and abstracts. Unstructured information retrieval techniques fail to take advantage of the structured information available. This paper describes a technique to
improve upon traditional retrieval methods by clustering the retrieval result set into two distinct clusters using additional structural information. Our hypothesis is that the relevant documents are to be found in the tightest cluster of the two, as suggested by van Rijsbergen's cluster
hypothesis. We present an experimental evaluation of these ideas based on the relevance judgments of the 2004 TREC workshop Genomics track, and the CLUTO software clustering
package. Download statistics

Archive Staff Only: edit this record
|