
DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Exploring structured documents and query formulation techniques for patent retrieval

Magdy, Walid, Leveling, Johannes (ORCID: 0000-0003-0603-4191) and Jones, Gareth J.F. (ORCID: 0000-0003-2923-8365) (2010) Exploring structured documents and query formulation techniques for patent retrieval. In: CLEF 2009, 30 Sept - 2 Oct 2009, Corfu, Greece.

Abstract
This paper presents the experiments and results of DCU in CLEF-IP 2009. Our work applied standard information retrieval (IR) techniques to patent search. The experiments tested several methods for patent retrieval, including query formulation, structured indexing, weighted fields, document filtering, and blind relevance feedback. Some methods, such as blind relevance feedback, did not achieve the expected retrieval effectiveness, while others showed acceptable performance. Query formulation was the key to achieving better retrieval effectiveness, and this was performed by assigning higher weights to certain document fields. Further experiments showed that longer queries achieve better results, but at the expense of additional computation. Even for the best runs, retrieval effectiveness is still lower than in IR applications for other domains, illustrating the difficulty of patent search. The official results show that, among fifteen participants, we ranked seventh in terms of mean average precision (MAP) and fourth in terms of recall.
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Conference
Refereed: Yes
Uncontrolled Keywords: query formulation; retrieval effectiveness
Subjects: Computer Science > Information retrieval
DCU Faculties and Centres: Research Institutes and Centres > Centre for Next Generation Localisation (CNGL); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments. Lecture Notes in Computer Science 6241. Springer-Verlag.
Publisher: Springer-Verlag
Official URL: http://www.springerlink.com/content/w1x346822834r4...
Copyright Information: © 2010 Springer-Verlag. The original publication is available at www.springerlink.com
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 16423
Deposited On: 22 Jul 2011 08:59 by Shane Harper. Last Modified: 25 Oct 2018 10:51
Documents

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
198kB