Abstract

In high-dimensional data, clusters often exist in the form of complex hierarchical relationships. In order to explore these relationships, there is a need to integrate dimensionality reduction techniques with data mining approaches and graph theory. The correlations in data points emerge more clearly if this integration is flawless. We propose an approach called Local Graph Based Correlation Clustering (LGBACC). This approach merges hierarchical clustering, with PCA to uncover complex hierarchical relationships, and uses graph models to visualize the results. We propose a framework of this approach that is divided into four phases. Each phase is flawlessly integrated with the next phase. Visualization of data after each phase is an important output and is knitted into the fabric of the framework. The focus of this technique remains on obtaining high quality clusters. The quality of the final clusters obtained is measured using standard indices. It is found that LGBACC is better than the existing hierarchical clustering approaches. We have used real-world data sets to validate our framework. These datasets test the approach on low as well as high-dimensional data. It is found that LGBACC produces high-quality clusters across a wide spectrum of dimensionality. Scalability test on synthetically produced high-dimensional and large datasets show that the proposed approach runs efficiently. Hence, LGBACC is an efficient and scalable approach that produces high-quality clusters in high-dimensional and large data spaces.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.