Visualizing text similarities from a graph-based SOM

Khalid Kahloot,Mohammad A Mikki,Akram A Elkhatib

doi:10.24297/ijct.v14i7.1889

Abstract

Text in articles is based on expert opinion of a large number of people including the views of authors. These views are based on cultural or community aspects, which make extracting information from text very difficult. This paper introduced how to utilize the capabilities of a modified graph-based Self-Organizing Map (SOM) in showing text similarities. Text similarities are extracted from an article using Google's PageRank algorithm. Sentences from an input article are represented as graph model instead of vector space model. The resulted graph can be shown in a visual animation for eight famous graph algorithms execution with animation speed control.The resulted graph is used as an input to SOM. SOM clustering algorithm is used to construct knowledge from text data. We used a visual animation for eight famous graph methods with animation speed control and according to similarity measure; an adjustable number of most similar sentences are arranged in visual form. In addition, this paper presents a wide variety of text searching. We had compared our project with famous clustering and visualization project in term of purity, entropy and F measure. Our project showed accepted results and mostly superiority over other projects.

Highlights

A context can be composed with variant sets of vocabularies and still express the same meaning
The aim of this paper is to build a graph-based unsupervised clustering based on Self-Organizing Map (SOM) for context extracted from text for semantics representation
We randomly selected 1000 sentences from the one million sentences, computed their exact 1–10 nearest neighbors in the whole article and used SOM methods with different parameter settings to measure the impact on the clustering quality, comparing it to a manual exact clustering

Summary

Introduction

A context can be composed with variant sets of vocabularies and still express the same meaning. Sets of vocabularies in a text documented in an article or a webpage is subjected to opinion of a large number of people including the views of authors It has different cultural or community aspects, which make extracting information from it very difficult. Text analysis is based on the descriptive function at a high level of the context, like the matrix structure presented in [10] or graphical representation based on the text of the attributes of the metadata described in [20]. This descriptive content can be clearly found in encyclopedias website such as Wikipedia, Encyclopedia.com, and Webopedia etc

Objectives

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY	Publication Date: May 12, 2015
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Visualizing text similarities from a graph-based SOM

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY

Lead the way for us

Similar Papers

Urban Flood Risk Assessment in Zhengzhou, China, Based on a D-Number-Improved Analytic Hierarchy Process and a Self-Organizing Map Algorithm
Zening Wu ... Wenchao Qi
Remote sensing | VOL. 14
Zening Wu, et. al.Zening Wu ... Wenchao Qi
24 Sep 2022
Remote sensing | VOL. 14

The N-Grams Based Text Similarity Detection Approach Using Self-Organizing Maps and Similarity Measures
Pavel Stefanovič ... Rokas Štrimaitis
Applied sciences | VOL. 9
Pavel Stefanovič, et. al.Pavel Stefanovič ... Rokas Štrimaitis
07 May 2019
Applied sciences | VOL. 9

Artificial neural networks as potential classification tools for dinoflagellate cyst images: A case using the self-organizing map clustering algorithm
Andrew F Weller ... J Andrew Ware
Review of palaeobotany and palynology | VOL. 141
Andrew F Weller, et. al.Andrew F Weller ... J Andrew Ware
09 Aug 2006
Review of palaeobotany and palynology | VOL. 141

Preparing Text Reports from Web Pages Employing Similarity Tests
J Guadalupe Ramos ... Nicolas Jasso
-
J Guadalupe Ramos, et. al.J Guadalupe Ramos ... Nicolas Jasso
01 Oct 2013
01 Oct 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Visualizing text similarities from a graph-based SOM

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: INTERNATIONAL JOURNAL OF COMPUTERS &amp; TECHNOLOGY

More From: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY