Topic based classification and pattern identification in patents

Subhashini Venugopalan,Varun Rai

doi:10.1016/j.techfore.2014.10.006

Abstract

Patent classification systems and citation networks are used extensively in innovation studies. However, non-unique mapping of classification codes onto specific products/markets and the difficulties in accurately capturing knowledge flows based just on citation linkages present limitations to these conventional patent analysis approaches. We present a natural language processing based hierarchical technique that enables the automatic identification and classification of patent datasets into technology areas and sub-areas. The key novelty of our technique is to use topic modeling to map patents to probability distributions over real world categories/topics. Accuracy and usefulness of our technique are tested on a dataset of 10,201 patents in solar photovoltaics filed in the United States Patent and Trademark Office (USPTO) between 2002 and 2013. We show that linguistic features from topic models can be used to effectively identify the main technology area that a patent's invention applies to. Our computational experiments support the view that the topic distribution of a patent offers a reduced-form representation of the knowledge content in a patent. Accordingly, we suggest that this hidden thematic structure in patents can be useful in studies of the policy–innovation–geography nexus. To that end, we also demonstrate an application of our technique for identifying patterns in technological convergence.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Topic based classification and pattern identification in patents

Abstract

Talk to us

Similar Papers

More From: Technological Forecasting and Social Change

Lead the way for us

Journal: Technological Forecasting and Social Change	Publication Date: Nov 7, 2014
Citations: 100

Similar Papers

Prioritization: Addressing the Patent Application Backlog at the United States Patent and Trademark Office

-

18 Feb 2014
18 Feb 2014

Patent citation network in nanotechnology (1976–2004)
Xin Li ... Hsinchun Chen
Journal of Nanoparticle Research | VOL. 9
Xin Li, et. al.Xin Li ... Hsinchun Chen
04 Jan 2007
Patent citation network in nanotechnology (1976–2004)
Xin Li ... Hsinchun Chen

Patent Applications and the Performance of the U.S. Patent and Trademark Office
Christopher Anthony Cotropia ... Ogden H Webster
SSRN Electronic Journal | VOL. 23
Christopher Anthony Cotropia, et. al.Christopher Anthony Cotropia ... Ogden H Webster
03 Mar 2013
SSRN Electronic Journal | VOL. 23

Searching bioremediation patents through Cooperative Patent Classification (CPC).
Rajendra Prasad
Reviews on environmental health | VOL. 31
Rajendra PrasadRajendra Prasad
19 Jan 2016
Reviews on environmental health | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Topic based classification and pattern identification in patents

Abstract

Talk to us

Similar Papers

More From: Technological Forecasting and Social Change