Semantic similarity and machine learning with ontologies.

Maxat Kulmanov,Xin Gao,Robert Hoehndorf,Fatima Zohra Smaili

doi:10.1093/bib/bbaa199

Abstract

Ontologies have long been employed in the life sciences to formally represent and reason over domain knowledge and they are employed in almost every major biological database. Recently, ontologies are increasingly being used to provide background knowledge in similarity-based analysis and machine learning models. The methods employed to combine ontologies and machine learning are still novel and actively being developed. We provide an overview over the methods that use ontologies to compute similarity and incorporate them in machine learning methods; in particular, we outline how semantic similarity measures and ontology embeddings can exploit the background knowledge in ontologies and how ontologies can provide constraints that improve machine learning models. The methods and experiments we describe are available as a set of executable notebooks, and we also provide a set of slides and additional resources at https://github.com/bio-ontology-research-group/machine-learning-with-ontologies.

Highlights

Machine learning methods are applied widely across life sciences to develop predictive models [1]
Ontologies have long been employed in the life sciences to formally represent and reason over domain knowledge and they are employed in almost every major biological database
We provide an overview over the methods that use ontologies to compute similarity and incorporate them in machine learning methods; in particular, we outline how semantic similarity measures and ontology embeddings can exploit the background knowledge in ontologies and how ontologies can provide constraints that improve machine learning models

Summary

Introduction

Machine learning methods are applied widely across life sciences to develop predictive models [1]. While the vocabulary of O may be large and consist of thousands of class, relation and individual symbols, fe usually embeds these entities in a space of relatively small size (depending on the chosen parameter n); the embedding preserves certain structural characteristics of the ontology O similar to a ‘module’ [83] in the ontology, thereby making this local information available to an optimization algorithm that finds c; and embeddings in Rn allow gradient descent methods to be applied directly which are used in many modern machine learning methods. Traditional semantic similarity measures, in particular Resnik’s measure [53], perform well across many evaluations, in particular in recall at the first ranks, and often has better performance than

Method

Limitations and future work

Findings

Key Points

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Briefings in Bioinformatics	Publication Date: Oct 13, 2020
Citations: 86	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Semantic similarity and machine learning with ontologies.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Briefings in Bioinformatics

Lead the way for us

Similar Papers

Seismic fragility analysis of steel moment frames using machine learning models
Hoang D Nguyen ... Myoungsu Shin
Engineering Applications of Artificial Intelligence | VOL. 126
Hoang D Nguyen, et. al.Hoang D Nguyen ... Myoungsu Shin
15 Aug 2023
Engineering Applications of Artificial Intelligence | VOL. 126

Explainable empirical risk minimization
Linli Zhang ... Alex Jung
Neural Computing and Applications | VOL. 36
Linli Zhang, et. al.Linli Zhang ... Alex Jung
08 Dec 2023
Neural Computing and Applications | VOL. 36

P125. Development of a novel ensemble machine learning algorithm for prediction of complications and readmission after anterior cervical spinal fusion
Akash A Shah ... Nelson Soohoo
The Spine Journal | VOL. 21
Akash A Shah, et. al.Akash A Shah ... Nelson Soohoo
10 Aug 2021
The Spine Journal | VOL. 21

P126. Development of a novel ensemble machine learning algorithm for prediction of complications and readmission after posterior cervical spinal fusion
Akash A Shah ... Nelson Soohoo
The Spine Journal | VOL. 21
Akash A Shah, et. al.Akash A Shah ... Nelson Soohoo
10 Aug 2021
The Spine Journal | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semantic similarity and machine learning with ontologies.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Briefings in Bioinformatics