Deviation detection in text using conceptual graph interchange format and error tolerance dissimilarity function

Siti Sakira Kamaruddin,Abdul Razak Hamdan,Fauzias Mat Nor,Azuraliza Abu Bakar

doi:10.3233/ida-2012-0535

Siti Sakira Kamaruddin, Abdul Razak Hamdan + Show 2 more

Open Access

https://doi.org/10.3233/ida-2012-0535

Copy DOI

Abstract

The rapid increase in the amount of textual data has brought forward a growing research interest towards mining text to detect deviations. Specialized methods for specific domains have emerged to satisfy various needs in discovering rare patterns in text. This paper focuses on a graph-based approach for text representation and presents a novel error tolerance dissimilarity algorithm for deviation detection. We resolve two non-trivial problems, i.e. semantic representation of text and the complexity of graph matching. We employ conceptual graphs interchange format CGIF --a knowledge representation formalism to capture the structure and semantics of sentences. We propose a novel error tolerance dissimilarity algorithm to detect deviations in the CGIFs. We evaluate our method in the context of analyzing real world financial statements for identifying deviating performance indicators. We show that our method performs better when compared with two related text based graph similarity measuring methods. Our proposed method has managed to identify deviating sentences and it strongly correlates with expert judgments. Furthermore, it offers error tolerance matching of CGIFs and retains a linear complexity with the increasing number of CGIFs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Intelligent Data Analysis	Publication Date: May 4, 2012
Citations: 49	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Deviation detection in text using conceptual graph interchange format and error tolerance dissimilarity function

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis

Lead the way for us

Similar Papers

Solving ambiguities in the semantic representation of texts
Marie-Claude Landau
-
Marie-Claude LandauMarie-Claude Landau
01 Jan 1990
01 Jan 1990

Lexical-Semantic Representation of the Lexicon for Word Sense Disambiguation and Text Understanding
Yukiko Sasaki Alam
-
Yukiko Sasaki AlamYukiko Sasaki Alam
01 Sep 2009
01 Sep 2009

Global Semantics with Boundary Constraint Knowledge Graph for Chinese Financial Event Detection
Yin Wang ... Xiangfeng Luo
-
Yin Wang, et. al.Yin Wang ... Xiangfeng Luo
01 Dec 2021
01 Dec 2021

Assistive Text on Hand Held Objects for Blind People
Samruddhi Deshpande ... Revati Shriram
-
Samruddhi Deshpande, et. al.Samruddhi Deshpande ... Revati Shriram
17 Nov 2017
17 Nov 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deviation detection in text using conceptual graph interchange format and error tolerance dissimilarity function

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis