Clustering: Applied to Data Structuring and Retrieval

Ogechukwu ,Charles C

doi:10.14569/ijacsa.2011.021116

Abstract

Clustering is a very useful scheme for data structuring and retrieval behuhcause it can handle large volumes of multi-dimensional data and employs a very fast algorithm. Other forms of data structuring techniques include hashing and binary tree structures. However, clustering has the advantage of employing little computational storage requirements and a fast speed algorithm. In this paper, clustering, k-means clustering and the approaches to effective clustering are extensively discussed. Clustering was employed as a data grouping and retrieval strategy in the filtering of fingerprints in the Fingerprint Verification Competition 2000 database 4(a). An average penetration of 7.41% obtained from the experiment shows clearly that the clustering scheme is an effective retrieval strategy for the filtering of fingerprints.

Highlights

A collection of datasets may be too large to handle and work on may be better grouped according to some data structure
Clustering is a useful and efficient data structuring technique because it can handle datasets that are very large and at the same time n-dimensional and similar datasets are assigned to the same clusters [9]
Clustering is a process of organizing a collection of data into groups whose members are similar in some way [9, 10, 11, 12] According to Jain et al [13] “Cluster analysis is the organization of a collection of patterns into clusters based on similarity”

Summary

INTRODUCTION

A collection of datasets may be too large to handle and work on may be better grouped according to some data structure. Clustering is a useful and efficient data structuring technique because it can handle datasets that are very large and at the same time n-dimensional (more than 2 dimensions) and similar datasets are assigned to the same clusters [9]. Clustering is a process of organizing a collection of data into groups whose members are similar in some way [9, 10, 11, 12] According to Jain et al [13] “Cluster analysis is the organization of a collection of patterns (usually represented as a vector of measurements, or a point in a multidimensional space) into clusters based on similarity”. A similarity measure is used for the assignment of patterns or features to clusters

CLUSTER SIMILARITY MEASURES

Manhattan distance

Chebyshev distance

Hamming distance

CLASSIFICATION OF CLUSTERING ALGORITHMS

Hierarchical clustering

APPROACHES TO EFFECTIVE CLUSTER ANALYSIS

CLUSTERING USED AS A FINGERPRINT INDEXING RETRIEVAL STRATEGY

VIII. COMPARISON WITH OTHER DATA STRUCTURING TECHNIQUES

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2011
Citations: 22	License type: cc-by

R Discovery Prime

R Discovery Prime

Clustering: Applied to Data Structuring and Retrieval

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

The combined effects of user schemas and degree of cognitive fit on data retrieval performance
Cheryl L Dunn ... Severin V Grabski
International Journal of Accounting Information Systems | VOL. 26
Cheryl L Dunn, et. al.Cheryl L Dunn ... Severin V Grabski
01 Aug 2017
International Journal of Accounting Information Systems | VOL. 26

Research on Target Recognition Method Based on Laser Point Cloud Data
Fan Yu ... Yanxi Wei
-
Fan Yu, et. al.Fan Yu ... Yanxi Wei
25 Apr 2019
25 Apr 2019

A fast instance reduction algorithm for intrusion detection scenarios
Vitali Herrera-Semenets ... Jan Van Den Berg
Computers and Electrical Engineering | VOL. 101
Vitali Herrera-Semenets, et. al.Vitali Herrera-Semenets ... Jan Van Den Berg
21 Apr 2022
Computers and Electrical Engineering | VOL. 101

A Fast Heuristic Search Algorithm for Finding the Longest Common Subsequence of Multiple Strings
Qingguo Wang ... Mian Pan
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 24
Qingguo Wang, et. al.Qingguo Wang ... Mian Pan
04 Jul 2010
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clustering: Applied to Data Structuring and Retrieval

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications