Abstract

A new variant of Hierarchical Cluster Analysis is gaining interest in the field of Machine Learning, called Object Cluster Hierarchy. Being still at an early stage of development, the lack of tools for systematic analysis of Object Cluster Hierarchies inhibits further improvement of this concept. In this paper we address this issue by proposing a generator of synthetic hierarchical data that can be used for benchmarking Object Cluster Hierarchy generation methods. The article presents a thorough empirical and theoretical analysis of the generator and provides guidance on how to control its parameters. The conducted experiments show the usefulness of the data generator capable of producing a wide range of differently structured data. Furthermore, datasets that represent the most common types of hierarchies are generated and made available to the public for benchmarking, along with the developed generator (http://kio.pwr.edu.pl/?page_id=396).

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call