Unsupervised entity and relation extraction from clinical records in Italian

Anita Alicante, Anna Corazza, Francesco Isgrò, Stefano Silvestri

doi:10.1016/j.compbiomed.2016.01.014

Abstract

This paper proposes and discusses the use of text mining techniques to extract information from clinical records written in Italian. Because annotated material is difficult and expensive to obtain for languages other than English, we consider only unsupervised approaches, which require no annotated training set. We therefore propose a complete system structured in two steps. In the first step, domain entities are extracted from the clinical records by means of a metathesaurus and standard natural language processing tools. The second step attempts to discover relations between the entity pairs extracted from the whole set of clinical records. For this second step we investigate the performance of unsupervised methods such as clustering in the space of entity pairs, each represented by an ad hoc feature vector. The resulting clusters are then automatically labelled using their most significant features. The system has been tested on a fairly large data set of clinical records in Italian, and we investigate how its performance varies when different similarity measures are adopted in the feature space. The results of our experiments show that the proposed unsupervised approach is promising and well suited to semi-automatic labelling of the extracted relations.
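
The second step described in the abstract (clustering entity pairs in a feature space, then labelling each cluster by its most significant features) can be illustrated with a small sketch. This is not the authors' implementation: the bag-of-lemmas features, the cosine similarity, the greedy single-pass clustering, the 0.4 threshold, and the toy Italian contexts are all assumptions chosen only to keep the example short and dependency-free.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def cluster_pairs(vectors, threshold=0.4):
    """Greedy single-pass clustering (an assumed stand-in for the paper's method).

    Each vector joins the first cluster whose summed centroid is at least
    `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # each cluster: {"members": [indices], "centroid": Counter}
    for i, vec in enumerate(vectors):
        for c in clusters:
            if cosine(vec, c["centroid"]) >= threshold:
                c["members"].append(i)
                c["centroid"].update(vec)  # add counts; cosine ignores scale
                break
        else:
            clusters.append({"members": [i], "centroid": Counter(vec)})
    return clusters


def label_cluster(cluster, top_n=2):
    """Label a cluster with its most frequent features (a simple proxy for 'most significant')."""
    return [feat for feat, _ in cluster["centroid"].most_common(top_n)]


# Hypothetical entity pairs with the lemmas found between them in a sentence.
pairs = [
    (("infezione", "antibiotico"), ["trattare", "con"]),
    (("febbre", "paracetamolo"), ["trattare", "con", "successo"]),
    (("tosse", "polmonite"), ["causare", "da"]),
    (("dolore", "frattura"), ["causare", "da"]),
]
vectors = [Counter(context) for _, context in pairs]

for c in cluster_pairs(vectors):
    members = [pairs[i][0] for i in c["members"]]
    print(label_cluster(c), members)
```

Swapping `cosine` for another similarity function is the natural place to compare similarity measures, as the abstract describes; the proposed labels would then be reviewed by a person, in line with the semi-automatic labelling the authors suggest.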

Journal: Computers in Biology and Medicine
Publication Date: Jan 23, 2016
Citations: 85

Similar Papers

Clustering-based Unsupervised Generative Relation Extraction
Chenhan Yuan ... Ryan A Rossi
17 Dec 2022

High-Performance Unsupervised Relation Extraction from Large Corpora
Binjamin Rozenfeld ... Ronen Feldman
01 Dec 2006

Unsupervised Relation Extraction with Sentence level Distributional Semantics
Manzoor Ali ... Muhammad Saleem
01 Feb 2023

A novel clustering algorithm for Unsupervised Relation Extraction
Jing Wang ... Yue Teng
01 Aug 2012
