Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Patent query reduction using pseudo relevance feedback

Ganguly, Debasis orcid logoORCID: 0000-0003-0050-7138, Leveling, Johannes orcid logoORCID: 0000-0003-0603-4191, Magdy, Walid and Jones, Gareth J.F. orcid logoORCID: 0000-0003-2923-8365 (2011) Patent query reduction using pseudo relevance feedback. In: 20th ACM Conference on Information and Knowledge Management (CIKM 2011), 24-28 Oct 2011, Glasgow, Scotland.

Abstract
Queries in patent prior art search, being full patent applications, are very much longer than standard ad hoc search and web search topics. Standard information retrieval (IR) techniques are not entirely effective for patent prior art search because of the presence of ambiguous terms in these massive queries. Reducing patent queries by extracting small numbers of key terms has been shown to be ineffective mainly because it is not clear what the focus of the query is. An optimal query reduction algorithm must thus seek to retain the useful terms for retrieval favouring recall of relevant patents, but remove terms which impair retrieval effectiveness. We propose a new query reduction technique decomposing a patent application into constituent text segments and computing the Language Modeling (LM) similarities by calculating the probability of generating each segment from the top ranked documents. We reduce a patent query by removing the least similar segments from the query, hypothesizing that removal of segments most dissimilar to the pseudo-relevant documents can increase the precision of retrieval by removing nonuseful context, while still retaining the useful context to achieve high recall as well. Experiments on the patent prior art search collection CLEF-IP 2010, show that the proposed method outperforms standard pseudo relevance feedback (PRF) and a naive method of query reduction based on removal of unit frequency terms (UFTs).
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Query Reduction; Patent Search; Pseudo-Relevance Feedback
Subjects:Computer Science > Information retrieval
DCU Faculties and Centres:Research Institutes and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:16514
Deposited On:27 Oct 2011 10:01 by Shane Harper . Last Modified 25 Oct 2018 10:26
Documents

Full text available as:

[thumbnail of Patent_Query_Reduction_using_Pseudo_Relevance_Feedback.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
275kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record