Could spatial features help the matching of textual data?

Jacques Fize,Maguelonne Teisseire,Mathieu Roche

doi:10.3233/ida-194749

Abstract

Textual data is available to an increasing extent through different media (social networks, companies data, data catalogues, etc.). New information extraction methods are needed since these new resources are highly heterogeneous. In this article, we propose a text matching process based on spatial features and assessed through heterogeneous textual data. Besides being compatible with heterogeneous data, it comprises two contributions: first, spatial information is extracted for comparison purposes and subsequently stored in a dedicated spatial textual representation (STR); and then two transformations are applied on STR to improve the spatial similarity estimation. This article outlines the proposed approach with new contributions: (i) a new geocoding methods using general co-occurrences between entities, and (ii) a thorough evaluation followed by (iii) an in-depth discussion. The results obtained on two corpora demonstrate that good spatial matches (≈ 80% precision on major criteria) can be obtained between the most similar STRs with further enhancement achieved via STR transformation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Could spatial features help the matching of textual data?

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis

Lead the way for us

Journal: Intelligent Data Analysis	Publication Date: Sep 30, 2020
Citations: 3

Similar Papers

Matching Heterogeneous Textual Data Using Spatial Features
Jacques Fize ... Mathieu Roche
-
Jacques Fize, et. al.Jacques Fize ... Mathieu Roche
01 Nov 2018
01 Nov 2018

Heterogeneous Educational Data Classification at the Course Level
Nguyen Hua Gia Phuc ... Vo Thi Ngoc Chau
Vietnam Journal of Computer Science | VOL. 08
Nguyen Hua Gia Phuc, et. al.Nguyen Hua Gia Phuc ... Vo Thi Ngoc Chau
05 Nov 2020
Vietnam Journal of Computer Science | VOL. 08

Bibliography, catalogs, pixel data: Management of heterogeneous Big Data at CDS by the documentalists
M Buga ... P Fernique
EPJ Web of Conferences | VOL. 186
M Buga, et. al.M Buga ... P Fernique
01 Jan 2018
EPJ Web of Conferences | VOL. 186

Multi-scale deep coupling convolutional neural network with heterogeneous sensor data for intelligent fault diagnosis
Jinghui Tian ... Peiming Shi
Journal of Intelligent & Fuzzy Systems | VOL. 41
Jinghui Tian, et. al.Jinghui Tian ... Peiming Shi
11 Aug 2021
Journal of Intelligent & Fuzzy Systems | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Could spatial features help the matching of textual data?

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis