
On the use of clustering and the MeSH controlled vocabulary to improve MEDLINE abstract search

Blott, Stephen and Camous, Fabrice and Gurrin, Cathal and Jones, Gareth J.F. (2005) On the use of clustering and the MeSH controlled vocabulary to improve MEDLINE abstract search. In: the Second CORIA (Conference en Recherche d'Informations et Applications), March 2005, Grenoble, France.



Databases of genomic documents contain substantial amounts of structured information in addition to the text of titles and abstracts. Unstructured information retrieval techniques fail to take advantage of this structured information. This paper describes a technique that improves on traditional retrieval methods by partitioning the retrieval result set into two clusters using the additional structural information. Our hypothesis is that the relevant documents are to be found in the tighter of the two clusters, as suggested by van Rijsbergen's cluster hypothesis. We present an experimental evaluation of these ideas based on the relevance judgments of the 2004 TREC Genomics track and the CLUTO clustering software package.
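
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the paper uses the CLUTO package and, per the keywords, a tree similarity measure over MeSH, whereas this sketch substitutes Jaccard similarity over flat sets of MeSH descriptors and a simple 2-medoid loop. The function names (`jaccard`, `tightness`, `two_way_cluster`, `tighter_cluster`) and the document ids and descriptor assignments in `results` are hypothetical, chosen only for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two sets of descriptor strings."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def tightness(cluster, docs):
    """Average pairwise similarity in a cluster (0.0 for fewer than two members)."""
    pairs = list(combinations(cluster, 2))
    if not pairs:
        return 0.0
    return sum(jaccard(docs[i], docs[j]) for i, j in pairs) / len(pairs)


def two_way_cluster(docs, max_iter=20):
    """Split document ids into two clusters with a simple 2-medoid loop.

    Assumption: this stands in for the CLUTO clustering used in the paper.
    """
    ids = sorted(docs)
    if len(ids) < 2:
        return (ids, [])
    # Seed with the least similar pair of documents.
    medoids = min(combinations(ids, 2),
                  key=lambda p: jaccard(docs[p[0]], docs[p[1]]))
    clusters = ([], [])
    for _ in range(max_iter):
        clusters = ([], [])
        for d in ids:
            sims = [jaccard(docs[d], docs[m]) for m in medoids]
            clusters[0 if sims[0] >= sims[1] else 1].append(d)
        # New medoid: the member most similar to the rest of its cluster.
        # An empty cluster keeps its previous medoid.
        new_medoids = tuple(
            max(c, key=lambda x: sum(jaccard(docs[x], docs[y]) for y in c))
            if c else m
            for c, m in zip(clusters, medoids)
        )
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return clusters


def tighter_cluster(docs):
    """Return the cluster with the higher average pairwise similarity."""
    return max(two_way_cluster(docs), key=lambda c: tightness(c, docs))


# Hypothetical retrieval result set: document id -> set of MeSH descriptors.
results = {
    "d1": {"Genomics", "Mice", "Gene Expression"},
    "d2": {"Genomics", "Mice", "Gene Expression", "Mutation"},
    "d3": {"Genomics", "Mice", "Mutation"},
    "d4": {"Humans", "Neoplasms"},
    "d5": {"Humans", "Aged", "Hypertension"},
}

print(sorted(tighter_cluster(results)))
```

Under the cluster hypothesis as the abstract applies it, the documents in the returned cluster would be treated as the likely relevant ones and the other cluster would be demoted or discarded. Whether that holds in practice is the question the paper's evaluation addresses.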

Item Type: Conference or Workshop Item (Paper)
Event Type: Conference
Uncontrolled Keywords: Genomic information retrieval; clustering; ontology; tree similarity measure
Subjects: Computer Science > Information retrieval
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 16205
Deposited On: 09 Jun 2011 09:24 by Shane Harper. Last Modified: 13 Jan 2017 12:03
