Feature Clustering of Noisy Data and Application in the Currency Market

Mohammad Seidpisheh,Salman Babayi,Adel Mohammadpour

doi:10.1142/s0219477522500584

Abstract

With the increase in high-dimensional data, researchers pay more attention to dimensionality reduction techniques because there are many noisy, redundant and irrelevant features in high-dimensional data. The existence of noisy features leads to decrease performance when analyzing high-dimensional data. Also, unsupervised dimensionality reduction techniques are widely used due to the lack of available labels. Feature clustering is an unsupervised dimensionality reduction technique to partition features into clusters in which features are strongly related. In addition, the Pearson correlation coefficient is widely used as a similarity tool for feature clustering. However, the Pearson correlation coefficient is easily influenced by outliers and noises, thus leading to misleading results. This paper focuses on the influence of dissimilarity measures on the clustering of noisy features. Heavy-tailed distributions are used for modeling data with outliers and noises. Therefore, we introduce a new dissimilarity measure based on a new dependence coefficient of heavy-tailed distributions. The performance of feature clustering using the proposed dissimilarity is evaluated using ARI and internal criteria on artificial and real currency market datasets. Experiment results have demonstrated the effectiveness of the proposed feature clustering method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Feature Clustering of Noisy Data and Application in the Currency Market

Abstract

Talk to us

Similar Papers

More From: Fluctuation and Noise Letters

Lead the way for us

Similar Papers

Evaluation of effect of unsupervised dimensionality reduction techniques on automated arrhythmia classification
Rekha Rajagopal ... Vidhyapriya Ranganathan
Biomedical Signal Processing and Control | VOL. 34
Rekha Rajagopal, et. al.Rekha Rajagopal ... Vidhyapriya Ranganathan
10 Jan 2017
Biomedical Signal Processing and Control | VOL. 34

A binary Krill Herd approach based feature selection for high dimensional data
V Preeja ... A H Shahana
-
V Preeja, et. al.V Preeja ... A H Shahana
01 Aug 2016
01 Aug 2016

The dimensionality reductions of environmental variables have a significant effect on the performance of species distribution models.
Hao-Tian Zhang ... Wen-Ting Wang
Ecology and evolution | VOL. 13
Hao-Tian Zhang, et. al.Hao-Tian Zhang ... Wen-Ting Wang
01 Nov 2023
Ecology and evolution | VOL. 13

Unsupervised damage clustering in complex aeronautical composite structures monitored by Lamb waves: An inductive approach
Amirhossein Rahbari ... Stephane Canu
Engineering Applications of Artificial Intelligence | VOL. 97
Amirhossein Rahbari, et. al.Amirhossein Rahbari ... Stephane Canu
16 Nov 2020
Engineering Applications of Artificial Intelligence | VOL. 97

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature Clustering of Noisy Data and Application in the Currency Market

Abstract

Talk to us

Similar Papers

More From: Fluctuation and Noise Letters