Evaluating generative patent language models

Jieh-Sheng Lee

doi:10.1016/j.wpi.2023.102173

Open Access

Abstract

Generative language models are promising tools for assisting human writing in various domains. This manuscript aims to build generative language models in the patent domain and to evaluate their performance from a human-centric perspective. The evaluation measures the ratio of keystrokes that autocompletion based on generative patent language models can save. A higher ratio indicates a more effective model, one that saves more keystrokes, so the metric can be used to benchmark model performance. Because it is keystroke-based, the metric differs from conventional machine-centric metrics, which are token-based. In terms of model size, the largest model built in this manuscript is PatentGPT-J-6B, which is state-of-the-art in the patent domain. Measured by the human-centric metric, however, the largest model is not necessarily the best. This finding suggests that continuing to increase model size in the patent domain might be unnecessary if the purpose is to assist human writing with autocompletion. Several patent language models are pre-trained from scratch in this research and are released for future researchers, along with several visualization tools. Building a generative language model in the patent domain matters because of its potential to facilitate creativity and innovation in the future.
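To make the keystroke-saving ratio concrete, here is a minimal Python sketch. The abstract does not give the exact counting rules, so everything here is an assumption for illustration only: accepting a suggestion costs one keystroke, a suggestion is accepted only when it exactly matches the text that follows, and `make_prefix_suggester` is a hypothetical dictionary-based stand-in for a generative patent language model.

```python
from typing import Callable, Iterable


def keystrokes_saved_ratio(text: str, suggest: Callable[[str], str]) -> float:
    """Simulate typing `text` with autocompletion and return the share of keystrokes saved.

    Assumed rules (not taken from the paper): typing one character costs one
    keystroke, and accepting a suggestion also costs one keystroke.
    """
    if not text:
        return 0.0
    pos = 0
    keystrokes = 0
    while pos < len(text):
        suggestion = suggest(text[:pos])
        # Accept only if the suggestion matches the upcoming text and saves effort.
        if len(suggestion) > 1 and text.startswith(suggestion, pos):
            pos += len(suggestion)
        else:
            pos += 1
        keystrokes += 1
    return 1.0 - keystrokes / len(text)


def make_prefix_suggester(vocabulary: Iterable[str]) -> Callable[[str], str]:
    """Hypothetical stand-in for a language model: completes the current word."""
    words = list(vocabulary)

    def suggest(prefix: str) -> str:
        current = prefix.rsplit(" ", 1)[-1]
        if not current:
            return ""
        for word in words:
            if word.startswith(current) and len(word) > len(current):
                return word[len(current):]
        return ""

    return suggest


suggest = make_prefix_suggester(["method", "comprising"])
ratio = keystrokes_saved_ratio("a method comprising", suggest)
print(f"{ratio:.3f}")  # 7 keystrokes for 19 characters
```

Under these assumptions, swapping in a different `suggest` function and comparing the resulting ratios on the same text is one way to benchmark models against each other, and a larger model helps only if its suggestions match what the writer actually types.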

Journal: World Patent Information
Publication Date: Jan 30, 2023
Citations: 3
License type: CC BY-NC-ND

Similar Papers

Constructing Chinese taxonomy trees from understanding and generative pretrained language models.
Jianyu Guo ... Haitao Jia
PeerJ. Computer science | VOL. 10
01 Jan 2024

Investigating strategies for lexical complexity prediction in a multilingual setting using generative language models and supervised approaches
Abdelhak Kelious ... Christophe Coeur
15 Oct 2024

Language Models for Topic Tracking
Wessel Kraaij ... Martijn Spitters
01 Jan 2003

Aspects of creating a corporate question-and-answer system using generative pre-trained language models
Aleksei Golikov ... Sergei Trashchenkov
Litera
01 Dec 2023
