A genetic algorithm-based clustering approach for database partitioning

Chun-Hung Cheng Chun-Hung Cheng,Wing-Kin Lee Wing-Kin Lee,Kam-Fai Wong Kam-Fai Wong

doi:10.1109/tsmcc.2002.804444

Abstract

In a typical distributed/parallel database system, a request mostly accesses a subset of the entire database. It is, therefore, natural to organize commonly accessed data together and to place them on nearby, preferably the same, machine(s)/site(s). For this reason, data partitioning and data allocation are performance critical issues in distributed database application design. We are dealing with data partitioning. Data partitioning requires the use of clustering. Although many clustering algorithms have been proposed, their performance has not been extensively studied. Moreover, the special problem structure in clustering is rarely exploited. We explore the use of a genetic search-based clustering algorithm for data partitioning to achieve high database retrieval performance. By formulating the underlying problem as a traveling salesman problem (TSP), we can take advantage of this particular structure. Three new operators for GAs are also proposed and experimental results indicate that they outperform other operators in solving the TSP. The proposed GA is applied to solve the data-partitioning problem. Our computational study shows that our GA performs well for this application.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A genetic algorithm-based clustering approach for database partitioning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)

Lead the way for us

Journal: IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)	Publication Date: Aug 1, 2002
Citations: 126

Similar Papers

A K-medoids based clustering scheme with an application to document clustering
Aytug Onan
-
Aytug OnanAytug Onan
01 Oct 2017
01 Oct 2017

A data clustering algorithm for stratified data partitioning in artificial neural network
Ajit K Sahoo ... M.K Tiwari
Expert Systems With Applications | VOL. 39
Ajit K Sahoo, et. al.Ajit K Sahoo ... M.K Tiwari
18 Jan 2012
Expert Systems With Applications | VOL. 39

Evaluation of hierarchical clustering algorithms for document datasets
Ying Zhao ... George Karypis
-
Ying Zhao, et. al.Ying Zhao ... George Karypis
04 Nov 2002
04 Nov 2002

Don't Look Back, Look into the Future
Yu-Shan Lin ... Ching Tsai
-
Yu-Shan Lin, et. al.Yu-Shan Lin ... Ching Tsai
09 Jun 2021
09 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A genetic algorithm-based clustering approach for database partitioning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)