Abstract

Background: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We take the first step towards the development of BioVerbNet: a VerbNet specifically aimed at describing verbs in the area of biomedicine. Because VerbNet-style classification is extremely time-consuming, we start from a small manual classification of biomedical verbs and apply a state-of-the-art neural representation model, specifically developed for class-based optimization, to expand the classification with new verbs, using all the PubMed abstracts and the full articles in the PubMed Central Open Access subset as data.

Results: Direct evaluation of the resulting classification against BioSimVerb (verb similarity judgement data in biomedicine) shows promising results when representation learning is performed using verb class-based contexts. Human validation by linguists and biologists reveals that the automatically expanded classification is highly accurate. Including novel, valid member verbs and classes, our method can be used to facilitate cost-effective development of BioVerbNet.

Conclusion: This work constitutes the first effort to apply a state-of-the-art architecture for neural representation learning to biomedical verb classification. While we discuss future optimization of the method, our promising results suggest that the automatic classification released with this article can be used to readily support application tasks in biomedicine.

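To make the expansion step concrete, the following is a minimal, illustrative sketch (not the authors' implementation) of how a small seed classification can be grown: each candidate verb is assigned to the seed class whose centroid is closest in embedding space, subject to a similarity threshold. The verbs, class names, vectors and threshold below are hypothetical placeholders.

# Sketch: expanding a seed verb classification via nearest class centroid.
# All data here is toy data for illustration only.
import numpy as np

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def expand_classification(seed_classes, embeddings, candidates, threshold=0.5):
    """seed_classes: {class_name: [verbs]}; embeddings: {verb: np.ndarray}."""
    centroids = {
        name: np.mean([embeddings[v] for v in verbs if v in embeddings], axis=0)
        for name, verbs in seed_classes.items()
    }
    assignments = {}
    for verb in candidates:
        if verb not in embeddings:
            continue
        best_class, best_sim = max(
            ((name, cosine(embeddings[verb], c)) for name, c in centroids.items()),
            key=lambda pair: pair[1],
        )
        if best_sim >= threshold:  # leave low-confidence verbs unassigned
            assignments[verb] = (best_class, best_sim)
    return assignments

# Toy usage with made-up 3-dimensional vectors:
emb = {
    "phosphorylate": np.array([0.9, 0.1, 0.0]),
    "acetylate":     np.array([0.8, 0.2, 0.1]),
    "inhibit":       np.array([0.1, 0.9, 0.2]),
    "suppress":      np.array([0.2, 0.8, 0.1]),
    "methylate":     np.array([0.85, 0.15, 0.05]),
}
seeds = {"MODIFY": ["phosphorylate", "acetylate"], "BLOCK": ["inhibit", "suppress"]}
print(expand_classification(seeds, emb, ["methylate"]))  # -> assigns "methylate" to MODIFY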
Highlights

  • VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs

  • We discuss further optimization of the method for real-life computational lexicography, but our promising results suggest that the automatic classification released with this article can be used to readily support Natural Language Processing (NLP) application tasks in biomedicine

  • The baselines we used are a skip-gram with negative sampling (SGNS) model trained with all dependency contexts in the corpus (DEP-ALL), an SGNS model trained only with the seven verb-related contexts (POOL-ALL) identified in the “Configuration search” section, and a standard SGNS model trained with bag-of-words contexts (BOW) using the word2vec tool [46] (a gensim-based sketch of the BOW baseline follows this list)

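As a point of reference for the BOW baseline above, here is a rough sketch using gensim (>= 4.0) rather than the original word2vec tool used in the paper; the corpus filename and hyperparameter values are assumptions for illustration, not the reported settings. The DEP-ALL and POOL-ALL baselines would additionally require dependency-parsed contexts, which are omitted here.

# Sketch: SGNS with bag-of-words contexts (BOW baseline), gensim approximation.
from gensim.models import Word2Vec
from gensim.models.word2vec import LineSentence

corpus = LineSentence("pubmed_sentences.txt")  # one tokenised sentence per line (hypothetical file)
model = Word2Vec(
    sentences=corpus,
    vector_size=300,  # embedding dimensionality (assumed)
    window=5,         # bag-of-words context window (assumed)
    sg=1,             # 1 = skip-gram
    negative=15,      # negative sampling
    min_count=5,      # discard rare tokens
    workers=4,
)

# Nearest neighbours of a biomedical verb under the BOW baseline:
print(model.wv.most_similar("inhibit", topn=10))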

