Abstract

In order to address the problem of instability of existing principal curve algorithms and their difficulties to deal with complex distribution data, in this study, invoking the ideas of information granulation, we propose a local-to-global multi-granularity principal curves extraction approach based on improved spectral clustering. Firstly, we propose an improved spectral clustering algorithm based on inflection point estimation to granulate complex distribution data into several granular data, and develop techniques of an automatic selection of a parameter, which determines the number of clusters (granular data). Secondly, PL (polygonal line) principal curves algorithm proposed by Kégl is utilized to extract the local principal curves of each granular data. Finally, with the use of the shortest Hamiltonian path algorithm and noise variance, the local principal curves are gradually connected together to form a global curve. A number of numeric studies completed for synthetic and publicly available data sets provide a useful quantifiable insight into the effectiveness of the proposed algorithm.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call