Creation of Text Document Matrices and Visualization by Self-Organizing Map

P Stefanovič, O Kurasova

doi:10.5755/j01.itc.43.1.4299

Abstract

This paper investigates text mining and visualization using a self-organizing map (SOM). Textual information must first be converted into numerical form, and the results of text mining and visualization depend on this conversion. The paper therefore investigates how two control factors applied when the document dictionary is created, the common word list and the use of a stemming algorithm, influence the text mining results. A self-organizing map is used for text clustering and graphical representation (visualization). A comparative analysis is carried out on a dataset of scientific papers about optimization based on Pareto, simplex, and genetic algorithms. Two new measures are also proposed to estimate SOM quality when classified data are analyzed: the distances between SOM cells corresponding to data items assigned to the same class, and the distance between the centers of SOM cells corresponding to different classes. The quantization error is also measured to estimate SOM quality.
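The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the common word list `COMMON_WORDS`, the toy suffix stripper `crude_stem`, the map size, and the training schedule are all assumptions made for illustration, and the three sample documents are invented. The sketch builds a document-term matrix with and without the two control factors, trains a small SOM on it, and computes the quantization error in its usual form (the mean distance from each data vector to the weight vector of its best-matching cell). The two class-based measures proposed in the paper are not reproduced, since the abstract does not define them precisely.

```python
import math
import random
import re

# Hypothetical common-word list; the list used in the paper is not given here.
COMMON_WORDS = {"the", "a", "of", "and", "is", "in", "for", "by", "to", "are"}


def crude_stem(word):
    """Toy suffix stripper standing in for a real stemming algorithm."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_matrix(docs, use_common_words=True, use_stemming=True):
    """Return (dictionary, matrix); matrix[i][j] counts term j in document i."""
    tokenized = []
    for doc in docs:
        words = re.findall(r"[a-z]+", doc.lower())
        if use_common_words:
            words = [w for w in words if w not in COMMON_WORDS]
        if use_stemming:
            words = [crude_stem(w) for w in words]
        tokenized.append(words)
    dictionary = sorted({w for words in tokenized for w in words})
    index = {w: j for j, w in enumerate(dictionary)}
    matrix = [[0.0] * len(dictionary) for _ in docs]
    for i, words in enumerate(tokenized):
        for w in words:
            matrix[i][index[w]] += 1.0
    return dictionary, matrix


def dist(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def train_som(data, rows=2, cols=2, epochs=50, seed=0):
    """Train a rectangular SOM; returns a dict mapping (row, col) to weights."""
    rng = random.Random(seed)
    dim = len(data[0])
    cells = [(r, c) for r in range(rows) for c in range(cols)]
    weights = {cell: [rng.random() for _ in range(dim)] for cell in cells}
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs          # decays from 1 towards 0
        alpha = 0.5 * frac                   # assumed learning-rate schedule
        radius = max(rows, cols) * frac      # assumed neighbourhood radius
        for x in data:
            # Best-matching unit: the cell whose weights are closest to x.
            bmu = min(cells, key=lambda cell: dist(weights[cell], x))
            for cell in cells:
                grid_d = math.hypot(cell[0] - bmu[0], cell[1] - bmu[1])
                if grid_d <= radius:
                    h = math.exp(-(grid_d ** 2) / (2 * radius ** 2))
                    w = weights[cell]
                    for k in range(dim):
                        w[k] += alpha * h * (x[k] - w[k])
    return weights


def quantization_error(data, weights):
    """Mean distance from each data vector to its best-matching cell."""
    total = sum(min(dist(w, x) for w in weights.values()) for x in data)
    return total / len(data)


if __name__ == "__main__":
    docs = [
        "Pareto optimization of multiple objectives",
        "Genetic algorithms for optimization",
        "The simplex algorithm is used in linear programming",
    ]
    raw_dict, _ = build_matrix(docs, use_common_words=False, use_stemming=False)
    dictionary, matrix = build_matrix(docs)
    print("dictionary size without / with control factors:",
          len(raw_dict), "/", len(dictionary))
    som = train_som(matrix)
    print("quantization error:", quantization_error(matrix, som))
```

Running the script shows how the two control factors shrink the dictionary for the same documents, which in turn changes the vectors the SOM is trained on. In practice a real stemmer and a curated common word list would replace the toy versions used here.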

Journal: Information Technology And Control	Publication Date: Mar 12, 2014
Citations: 4

Similar Papers

Cluster and visualize data using 3D self-organizing maps
Zalhan Mohd Zin
01 Nov 2014

On Self-Organizing Feature Map (SOFM) Formation by Direct Optimization Through a Genetic Algorithm
Jose Everardo B Maia ... Andre L.V Coelho
01 Sep 2008

Clustering Using Genetic Algorithm-Based Self-Organising Map
Azmi Hassan ... Muhammad Ridwan Andi Purnomo
Advanced Materials Research | VOL. 1115
01 Jul 2015

A self organizing map-harmony search hybrid algorithm for clustering biological data
Abin John George ... Meeta Pradhan
01 Feb 2015
