A multi-scale seriation algorithm for clustering sparse imbalanced data: application to spike sorting

V Vigneron, H Chen

doi:10.1007/s10044-015-0458-2

Abstract

Seriation is a useful statistical method for visualizing clusters in a dataset. However, when the data are noisy or unbalanced, visualizing the data structure becomes challenging. To alleviate this limitation, we introduce a novel metric based on common neighborhood to evaluate the degree of sparsity in a dataset. A stack of matrices is derived for different levels of sparsity, and the matrices are permuted by a branch-and-bound algorithm. The matrix with the best block-diagonal form is then selected by a compactness criterion. The selected matrix reveals the intrinsic structure of the data by excluding noisy data or outliers. This seriation algorithm is applicable even if the number of clusters is unknown or if the clusters are imbalanced. However, if the metric introduces too much sparsity in the data, the sub-sampled groups of data could be excluded. To resolve this problem, a multi-scale approach combining different levels of sparsity is proposed. The capability of the proposed seriation method is examined both on toy problems and in the context of spike sorting.
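The pipeline outlined in the abstract (a common-neighborhood measure, one matrix per sparsity level, a permutation, and a compactness score) can be illustrated with a minimal sketch. Everything specific here is an assumption and not the authors' formulation: the similarity is a plain count of shared k-nearest neighbors, the ordering is a simple greedy chain standing in for branch-and-bound, and the compactness score is the mean distance of nonzero entries from the diagonal. The function names and the toy points are hypothetical.

```python
from math import dist

def knn_sets(points, k):
    """Return, for each point, the set of indices of its k nearest other points."""
    n = len(points)
    return [
        set(sorted((j for j in range(n) if j != i),
                   key=lambda j: (dist(points[i], points[j]), j))[:k])
        for i in range(n)
    ]

def shared_neighbor_matrix(points, k):
    """Entry (i, j) counts the k-nearest neighbors that points i and j share."""
    nbrs = knn_sets(points, k)
    n = len(points)
    return [[len(nbrs[i] & nbrs[j]) for j in range(n)] for i in range(n)]

def threshold(S, level):
    """Binary matrix keeping only pairs with at least `level` shared neighbors."""
    return [[1 if s >= level else 0 for s in row] for row in S]

def greedy_order(points, S):
    """Greedy chain ordering (an assumed stand-in, NOT branch-and-bound):
    repeatedly append the unplaced point sharing the most neighbors with the
    last placed one, breaking ties by Euclidean distance, then by index."""
    order = [0]
    remaining = set(range(1, len(points)))
    while remaining:
        last = order[-1]
        nxt = max(remaining,
                  key=lambda j: (S[last][j], -dist(points[last], points[j]), -j))
        order.append(nxt)
        remaining.remove(nxt)
    return order

def band_spread(A, order):
    """Mean distance from the diagonal of the nonzero off-diagonal entries
    after permuting rows and columns by `order`; lower means more compact."""
    gaps = [abs(p - q)
            for p, i in enumerate(order)
            for q, j in enumerate(order)
            if p != q and A[i][j]]
    return sum(gaps) / len(gaps) if gaps else float("inf")

# Hypothetical toy data: an imbalanced pair of clusters (6 vs. 3 points), interleaved.
points = [(0, 0), (10, 10), (0, 1), (1, 0), (10, 11),
          (1, 1), (0.5, 0.5), (11, 10), (0.2, 0.8)]
S = shared_neighbor_matrix(points, k=2)
order = greedy_order(points, S)
for level in (1, 2):  # one binary matrix per assumed sparsity level
    print(level, band_spread(threshold(S, level), order))
```

Raising `level` makes the matrix sparser, which is where a small group can lose all of its entries; comparing scores across several levels is one way to picture the multi-scale idea, although how the paper actually combines the levels is not stated in the abstract.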

Journal: Pattern Analysis and Applications
Publication Date: Mar 4, 2015
Citations: 5
