Abstract

The paper focuses on the problem of comparing clusterings with the same number of clusters obtained from different clustering algorithms. It proposes a method for evaluating the agreement of clusterings based on a combination of Cohen's kappa statistic and normalized mutual information. The main contributions of the proposed approach are: (i) reliable use in practice with a small, fixed number of clusters, (ii) suitability for comparing clusterings with a higher number of clusters, in contrast to the original statistics, and (iii) independence of the size of the data set and the shape of the clusters. Results of an experimental validation of the proposed statistic using both simulations and real data sets, as well as a comparison with its theoretical counterparts, are presented.
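
A minimal sketch of the two ingredients named in the abstract, not the paper's actual statistic: it computes Cohen's kappa (after matching the arbitrary cluster labels of one clustering to the other with the Hungarian algorithm, an assumption on my part) and normalized mutual information for two clusterings of the same data using scikit-learn. The exact way the paper combines the two measures is not given in the abstract, so the averaging shown at the end is only an illustrative placeholder.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from sklearn.metrics import cohen_kappa_score, normalized_mutual_info_score
from sklearn.metrics.cluster import contingency_matrix


def kappa_nmi_agreement(labels_a, labels_b):
    """Return (kappa, nmi) for two clusterings with the same number of clusters.

    Cluster labels are arbitrary, so clusters of `labels_b` are first matched
    to those of `labels_a` via the Hungarian algorithm before computing kappa.
    """
    classes_a = np.unique(labels_a)
    classes_b = np.unique(labels_b)
    cont = contingency_matrix(labels_a, labels_b)

    # Maximize the diagonal of the contingency table (optimal label matching).
    row_ind, col_ind = linear_sum_assignment(-cont)
    mapping = {classes_b[c]: classes_a[r] for r, c in zip(row_ind, col_ind)}
    matched_b = np.array([mapping[b] for b in labels_b])

    kappa = cohen_kappa_score(labels_a, matched_b)
    nmi = normalized_mutual_info_score(labels_a, labels_b)
    return kappa, nmi


if __name__ == "__main__":
    from sklearn.cluster import AgglomerativeClustering, KMeans
    from sklearn.datasets import make_blobs

    X, _ = make_blobs(n_samples=300, centers=3, random_state=0)
    a = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
    b = AgglomerativeClustering(n_clusters=3).fit_predict(X)

    kappa, nmi = kappa_nmi_agreement(a, b)
    # The paper's combined statistic is not specified in the abstract;
    # the mean below is only a placeholder for illustration.
    print(f"kappa={kappa:.3f}  NMI={nmi:.3f}  mean={(kappa + nmi) / 2:.3f}")
```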
