An Ordinal Data Clustering Algorithm with Automated Distance Learning

Yiqun Zhang,Yiu-Ming Cheung

doi:10.1609/aaai.v34i04.6168

Abstract

Clustering ordinal data is a common task in data mining and machine learning fields. As a major type of categorical data, ordinal data is composed of attributes with naturally ordered possible values (also called categories interchangeably in this paper). However, due to the lack of dedicated distance metric, ordinal categories are usually treated as nominal ones, or coded as consecutive integers and treated as numerical ones. Both these two common ways will roughly define the distances between ordinal categories because the former way ignores the order relationship and the latter way simply assigns identical distances to different pairs of adjacent categories that may have intrinsically unequal distances. As a result, they may produce unsatisfactory ordinal data clustering results. This paper, therefore, proposes a novel ordinal data clustering algorithm, which iteratively learns: 1) The partition of ordinal dataset, and 2) the inter-category distances. To the best of our knowledge, this is the first attempt to dynamically adjust inter-category distances during the clustering process to search for a better partition of ordinal data. The proposed algorithm features superior clustering accuracy, low time complexity, fast convergence, and is parameter-free. Extensive experiments show its efficacy.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Ordinal Data Clustering Algorithm with Automated Distance Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 6

Similar Papers

Practical Approximation of Optimal Multivariate Discretization
Tapio Elomaa ... Juho Rousu
-
Tapio Elomaa, et. al.Tapio Elomaa ... Juho Rousu
01 Jan 2006
01 Jan 2006

A Further Study on Inverse Frequent Set Mining
Xia Chen ... Maria Orlowska
-
Xia Chen, et. al.Xia Chen ... Maria Orlowska
01 Jan 2004
01 Jan 2004

Incremental Frequent Itemsets Mining with MapReduce
Kirill Kandalov ... Ehud Gudes
-
Kirill Kandalov, et. al.Kirill Kandalov ... Ehud Gudes
01 Jan 2017
01 Jan 2017

An Accurate MDS-Based Algorithm for the Visualization of Large Multidimensional Datasets
Antoine Naud
-
Antoine NaudAntoine Naud
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Ordinal Data Clustering Algorithm with Automated Distance Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence