Abstract
Clustering methods divide the dataset into groups of similar objects called as clusters. Two objects in different clusters are dissimilar and objects in the same cluster are similar. Evaluation of clustering results is known as cluster validation. Cluster validation can be of different types. Internal cluster validation indices measure the quality of the clusters based on the intrinsic properties of the data. External cluster validation is based on external information about the data. The advantage of internal validation is that external information is not required. But using small amount of external information can make unsupervised clustering technique using internal cluster validation for finding optimal clustering solution achieve better results. The advantage with supervised clustering technique using external validation is that clusters confirming to class distribution are obtained. But using intrinsic information present in the data can prevent over fitting of data by supervised learning technique using external validation. In this paper we propose various hybrid cluster validation indices using internal and external cluster validation indices. The advantage with hybrid indices is that validation is done using both intrinsic information of data and available external information.In this work we focus on hybrid cluster validation indices for semi-supervised clustering.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.