A New Functional Clustering Method with Combined Dissimilarity Sources and Graphical Interpretation

Wenlin Dai,Tomáš Mrkvička,Stavros Athanasiadis

doi:10.5772/intechopen.100124

Abstract

Clustering is an essential task in functional data analysis. In this study, we propose a framework for a clustering procedure based on functional rankings or depth. Our methods naturally combine various types of between-cluster variation equally, which caters to various discriminative sources of functional data; for example, they combine raw data with transformed data or various components of multivariate functional data with their covariance. Our methods also enhance the clustering results with a visualization tool that allows intrinsic graphical interpretation. Finally, our methods are model-free and nonparametric and hence are robust to heavy-tailed distribution or potential outliers. The implementation and performance of the proposed methods are illustrated with a simulation study and applied to three real-world applications.

Highlights

Cluster analysis is a critical step in exploratory data analysis intended to identify homogeneous subgroups among observations
The filtering-based methods involve the approximation of the curves with linear combinations of finite basis functions, such as splines and functional principal components, and the cluster analysis is conducted based on the coefficients or scores of finite dimensions [5–7]
We introduce a new class of functional cluster analysis methods based on functional orderings

Summary

Introduction

Cluster analysis is a critical step in exploratory data analysis intended to identify homogeneous subgroups among observations. By “standardization”, we mean that the marginal empirical distributions are standardized so that they have zero mean and unit variance This approach is used in the simulation study in order to compare the performance of existing methods with the proposed methods. Since the proposed procedure applies functional ordering, such that every part of the function is treated the different sources of variation are combined in an equal manner. For univariate cases, it may combine the raw curves and the derivatives to measure the magnitude and shape variation simultaneously. The proposed method provides a reasonable graphical interpretation of the clustering result It inherits the robustness of functional orderings and can stably recover the clusters when abnormal observations contaminate the data. The proposed methods will be available soon in the R package GET

Dissimilarity matrix

Combined functional ordering

Functional ordering with intrinsic graphical interpretation

Extreme rank length ordering

Global continuous rank ordering

Global area rank ordering

Studentized maximum ordering

Dissimilarity matrix based on the combined ordering

Simulation study

Clustering of insurance penetration

Clustering of population growth data

Findings

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A New Functional Clustering Method with Combined Dissimilarity Sources and Graphical Interpretation

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Apr 6, 2022
Citations: 4	License type: CC BY 3.0

Similar Papers

Functional Data Clustering Via Functional Mahalanobis Distance
Yangxinzi Zao
Highlights in Science, Engineering and Technology | VOL. 70
Yangxinzi ZaoYangxinzi Zao
15 Nov 2023
Highlights in Science, Engineering and Technology | VOL. 70

Preprocessing of centred logratio transformed density functions using smoothing splines
J Machalová ... G.S Monti
Journal of Applied Statistics | VOL. 43
J Machalová, et. al.J Machalová ... G.S Monti
22 Dec 2015
Journal of Applied Statistics | VOL. 43

Principal component analysis of hybrid functional and vector data.
Jeong Hoon Jang
Statistics in Medicine | VOL. 40
Jeong Hoon JangJeong Hoon Jang
23 Jun 2021
Statistics in Medicine | VOL. 40

Functional exploratory data analysis for high-resolution measurements of urban particulate matter.
M Giovanna Ranalli ... Silvia Castellini
Biometrical journal. Biometrische Zeitschrift | VOL. 58
M Giovanna Ranalli, et. al.M Giovanna Ranalli ... Silvia Castellini
13 Apr 2016
Biometrical journal. Biometrische Zeitschrift | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A New Functional Clustering Method with Combined Dissimilarity Sources and Graphical Interpretation

Abstract

Highlights

Summary

Talk to us

Similar Papers