An Efficient Ontology Based Concept Indexing and Clustering for Biomedical Documents

K Premalatha ,S Logeswari

doi:10.15866/irecos.v9i6.862

Abstract

Conventional Document clustering techniques aim to group the documents into different semantic classes based on the cluster hypothesis. Most of the existing techniques are based on either single term keyword with its frequency analysis or phrase based approach using n-gram techniques of the document. Accurate clustering is infeasible in document clustering because of the curse of dimensionality due to the high dimensionality space of it. For the successful clustering of text documents, a two step process is proposed in this paper. This proposed method involves with concept based indexing with the domain ontology as background knowledge for concept extraction and clustering of documents. The results of the proposed method is compared with the traditional indexing technique, Latent Semantic Indexing (LSI). In order to prove the efficiency of the proposed technique, biomedical domain is chosen with MeSH ontology. The experimental results show that the proposed method outperforms traditional term-base method and LSI.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient Ontology Based Concept Indexing and Clustering for Biomedical Documents

Abstract

Talk to us

Similar Papers

More From: International Review on Computers and Software

Lead the way for us

Similar Papers

A Case Study on Sepsis Using PubMed and Deep Learning for Ontology Learning.
Julio Des Diz ... Maria Jesus Fernandez Prieto
Studies in health technology and informatics | VOL. 235
Julio Des Diz, et. al.Julio Des Diz ... Maria Jesus Fernandez Prieto
18 Apr 2017
Studies in health technology and informatics | VOL. 235

Using Latent Semantic Indexing to Improve the Accuracy of Document Clustering
Jiaming Zhan ... Han Tong Loh
Journal of Information & Knowledge Management | VOL. 06
Jiaming Zhan, et. al.Jiaming Zhan ... Han Tong Loh
01 Sep 2007
Journal of Information & Knowledge Management | VOL. 06

High performance in minimizing of term-document matrix representation for document clustering
L Muflikhah ... B Baharudin
-
L Muflikhah, et. al.L Muflikhah ... B Baharudin
01 Jul 2009
01 Jul 2009

Improving Learning and Teaching at Universities: The Potential of Applying Automatic Essay Scoring with Latent Semantic Analysis

-

01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient Ontology Based Concept Indexing and Clustering for Biomedical Documents

Abstract

Talk to us

Similar Papers

More From: International Review on Computers and Software