Abstract

Evolutionary multi-objective clustering (EMOC) is a modern clustering technique in which the general concepts of evolutionary multi-objective optimization are applied to the clustering problem. The design and definition of clustering are difficult problems, and the choice of the objective functions and parameter setting of the algorithms are still challenges. In this paper, we propose the adaptive evolutionary multi-objective clustering approach based on data properties (AEMOC). AEMOC considers a new metric to evaluate the base partitions (candidate solutions used in the initial population) generated by minimum spanning tree clustering. This metric is applied to: (1) determine properties of the data, such as separation and overlap; and (2) define an offline selection of objective functions and parameter settings of the multi-objective algorithm. The information regarding the data properties allows AEMOC to avoid unnecessary data processing in datasets in which the initialization provides optimal solutions and optimization is not required. AEMOC presented promising results considering a diverse set of artificial and real-life datasets, based on two aspects: (1) it succeeded in the determination of the data properties of the base partitions and verifying the potential clustering quality, presenting a correlation of 0.8 with a reference metric, and (2) it provided better clustering results than reference EMOC approaches, achieving a general gain in the clustering performance of 3% in real-life datasets and 7% in artificial datasets.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.