Comparing Clustering with Pairwise and Relative Constraints

Yuanli Pei,Teresa Vania Tjahja,Xiaoli Z Fern,Rómer Rosales

doi:10.1145/2996467

Abstract

Clustering can be improved with the help of side information about the similarity relationships among instances. Such information has been commonly represented by two types of constraints: pairwise constraints and relative constraints, regarding similarities about instance pairs and triplets, respectively. Prior work has mostly considered these two types of constraints separately and developed individual algorithms to learn from each type. In practice, however, it is critical to understand/compare the usefulness of the two types of constraints as well as the cost of acquiring them, which has not been studied before. This paper provides an extensive comparison of clustering with these two types of constraints. Specifically, we compare their impacts both on human users that provide such constraints and on the learning system that incorporates such constraints into clustering. In addition, to ensure that the comparison of clustering is performed on equal ground (without the potential bias introduced by different learning algorithms), we propose a probabilistic semi-supervised clustering framework that can learn from either type of constraints. Our experiments demonstrate that the proposed semi-supervised clustering framework is highly effective at utilizing both types of constraints to aid clustering. Our user study provides valuable insights regarding the impact of the constraints on human users, and our experiments on clustering with the human-labeled constraints reveal that relative constraint is often more efficient at improving clustering.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparing Clustering with Pairwise and Relative Constraints

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Knowledge Discovery from Data

Lead the way for us

Journal: ACM Transactions on Knowledge Discovery from Data	Publication Date: Dec 3, 2016
Citations: 10

Similar Papers

A Feature Space Learning Model Based on Semi-Supervised Clustering
Renchu Guan ... Xu Wang
-
Renchu Guan, et. al.Renchu Guan ... Xu Wang
01 Jul 2017
01 Jul 2017

Semi-supervised clustering with discriminative random fields
Chin-Chun Chang ... Hsin-Yi Chen
Pattern Recognition | VOL. 45
Chin-Chun Chang, et. al.Chin-Chun Chang ... Hsin-Yi Chen
12 Jun 2012
Pattern Recognition | VOL. 45

Accuracy vs. Speed: Scalable Entity Coreference on the Semantic Web with On-the-Fly Pruning
Dezhao Song ... Jeff Heflin
-
Dezhao Song, et. al.Dezhao Song ... Jeff Heflin
01 Dec 2012
01 Dec 2012

Accuracy vs. Speed: Scalable Entity Coreference on the Semantic Web with On-the-Fly Pruning
...
Web intelligence | VOL. 1
, et. al. ...
04 Dec 2012
Web intelligence | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparing Clustering with Pairwise and Relative Constraints

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Knowledge Discovery from Data