Drivers of the decrease of patent similarities from 1976 to 2021.

Edoardo Filippi-Mazzola,Federica Bianchi,Ernst C Wit,Sérgio Duarte Correia

doi:10.1371/journal.pone.0283247

Edoardo Filippi-Mazzola, Federica Bianchi + Show 2 more

Open Access

https://doi.org/10.1371/journal.pone.0283247

Copy DOI

Journal: PloS one	Publication Date: Mar 20, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Università della Svizzera italiana

Abstract

The citation network of patents citing prior art arises from the legal obligation of patent applicants to properly disclose their invention. One way to study the relationship between current patents and their antecedents is by analyzing the similarity between the textual elements of patents. Many patent similarity indicators have shown a constant decrease since the mid-70s. Although several explanations have been proposed, more comprehensive analyses of this phenomenon have been rare. In this paper, we use a computationally efficient measure of patent similarity scores that leverages state-of-the-art Natural Language Processing tools, to investigate potential drivers of this apparent similarity decrease. This is achieved by modeling patent similarity scores by means of generalized additive models. We found that non-linear modeling specifications are able to distinguish between distinct, temporally varying drivers of the patent similarity levels that explain more variation in the data (R2 ∼ 18%) compared to previous methods. Moreover, the model reveals an underlying trend in similarity scores that is fundamentally different from the one presented previously.

Full Text