United we fall, divided we stand

Debasis Ganguly,Johannes Leveling,Gareth J.F Jones

doi:10.1145/2064975.2064981

Abstract

Previous research in patent search has shown that reducing queries by extracting a few key terms is ineffective primarily because of the vocabulary mismatch between patent applications used as queries and existing patent documents. This finding has led to the use of full patent applications as queries in patent prior art search. In addition, standard information retrieval (IR) techniques such as query expansion (QE) do not work effectively with patent queries, principally because of the presence of noise terms in the massive queries. In this study, we take a new approach to QE for patent search. Text segmentation is used to decompose a patent query into self coherent sub-topic blocks. Each of these much shorted sub-topic blocks which is representative of a specific aspect or facet of the invention, is then used as a query to retrieve documents. Documents retrieved using the different resulting sub-queries or query streams are interleaved to construct a final ranked list. This technique can exploit the potential benefit of QE since the segmented queries are generally more focused and less ambiguous than the full patent query. Experiments on the CLEF-2010 IP prior-art search task show that the proposed method outperforms the retrieval effectiveness achieved when using a single full patent application text as the query, and also demonstrates the potential benefits of QE to alleviate the vocabulary mismatch problem in patent search.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

United we fall, divided we stand

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Patent query reduction using pseudo relevance feedback
Debasis Ganguly ... Gareth J.F Jones
-
Debasis Ganguly, et. al.Debasis Ganguly ... Gareth J.F Jones
24 Oct 2011
24 Oct 2011

Simple vs. Sophisticated Approaches for Patent Prior-Art Search
Walid Magdy ... Gareth J F Jones
-
Walid Magdy, et. al.Walid Magdy ... Gareth J F Jones
01 Jan 2010
01 Jan 2010

Patent Query Formulation by Synthesizing Multiple Sources of Relevance Evidence
Parvaz Mahdabi ... Fabio Crestani
ACM Transactions on Information Systems | VOL. 32
Parvaz Mahdabi, et. al.Parvaz Mahdabi ... Fabio Crestani
28 Oct 2014
ACM Transactions on Information Systems | VOL. 32

Query Generation Techniques for Patent Prior-Art Search in Multiple Languages
Dong Zhou ... Jianxun Liu
-
Dong Zhou, et. al.Dong Zhou ... Jianxun Liu
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

United we fall, divided we stand

Abstract

Talk to us

Similar Papers