DCU-TCD@LogCLEF 2010: re-ranking document collections and query performance estimation

Leveling, Johannes and Ghorab, M. Rami and Magdy, Walid and Jones, Gareth J.F. and Wade, Vincent (2010) DCU-TCD@LogCLEF 2010: re-ranking document collections and query performance estimation. In: CLEF 2010 LABs and Workshops, Logfile Analysis (LogCLEF), 22-23 September 2010, Padua, Italy.


This paper describes the collaborative participation of Dublin City University and Trinity College Dublin in LogCLEF 2010. Two sets of experiments were conducted. First, different aspects of the TEL query logs were analysed after extracting user sessions of consecutive queries on a topic. Within each session, the relation between a query's length (number of terms) and position (first query or a later reformulation) and query performance estimators such as query scope, IDF-based measures, simplified query clarity score, and average inverse document collection frequency was examined. Results of this analysis suggest that only some estimator values show a correlation with query length or position in the TEL logs (e.g. similarity score between collection and query). Second, the relation between three attributes was investigated: the user's country (detected from IP address), the query language, and the interface language. The investigation aimed to explore the influence of the three attributes on the user's collection selection. Moreover, the investigation involved assigning different weights to the three attributes in a scoring function that was used to re-rank the collections displayed to the user according to the language and country. The results of the collection re-ranking show a significant improvement in Mean Average Precision (MAP) over the original collection ranking of TEL. The results also indicate that the query language and interface language have more influence than the user's country on the collections selected by the users.
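
As a rough illustration of the re-ranking idea described above, the following Python sketch scores each collection by a weighted sum of three binary matches (user country, query language, interface language), sorts the collections by that score, and compares average precision before and after. The `Collection` fields, the weight values, the match-based form of the score, and the toy data are all assumptions made for illustration; this record does not give the paper's actual scoring function or weights.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Collection:
    # Hypothetical attributes; the real TEL collection metadata may differ.
    name: str
    language: str
    country: str


def score(collection, user_country, query_lang, interface_lang, weights):
    """Weighted sum of three binary matches (an assumed scoring form)."""
    w_country, w_query, w_interface = weights
    return (
        w_country * float(collection.country == user_country)
        + w_query * float(collection.language == query_lang)
        + w_interface * float(collection.language == interface_lang)
    )


def rerank(collections, user_country, query_lang, interface_lang,
           weights=(0.2, 0.4, 0.4)):
    """Return collections sorted by descending score.

    sorted() is stable, so collections with equal scores keep their
    original relative order. The default weights are illustrative only.
    """
    return sorted(
        collections,
        key=lambda c: score(c, user_country, query_lang, interface_lang, weights),
        reverse=True,
    )


def average_precision(ranked, relevant_names):
    """Average precision of a ranked list against a set of relevant names."""
    if not relevant_names:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, collection in enumerate(ranked, start=1):
        if collection.name in relevant_names:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant_names)


# Toy data: a German user querying in German through an English interface.
original = [
    Collection("A", language="en", country="GB"),
    Collection("B", language="de", country="DE"),
    Collection("C", language="fr", country="FR"),
]
reranked = rerank(original, user_country="DE", query_lang="de", interface_lang="en")
selected = {"B"}  # collections the user actually chose (assumed)

print([c.name for c in reranked])
print(average_precision(original, selected), average_precision(reranked, selected))
```

Mean Average Precision is then the mean of `average_precision` over all logged queries. Giving the two language weights more mass than the country weight mirrors the qualitative finding in the abstract, not any reported parameter setting.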

Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Subjects: Computer Science > Information storage and retrieval systems
DCU Faculties and Centres: Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders: Science Foundation Ireland
ID Code: 15835
Deposited On: 22 Nov 2010 14:37 by Shane Harper. Last Modified 22 Nov 2010 14:47
