Functional clustering algorithm for high-dimensional proteomics data.

Halima Bensmail,O John Semmes,Buddana Aruna,Abdelali Haoudi

doi:10.1155/jbb.2005.80

Abstract

Clustering proteomics data is a challenging problem for any traditional clustering algorithm. Usually, the number of samples is largely smaller than the number of protein peaks. The use of a clustering algorithm which does not take into consideration the number of features of variables (here the number of peaks) is needed. An innovative hierarchical clustering algorithm may be a good approach. We propose here a new dissimilarity measure for the hierarchical clustering combined with a functional data analysis. We present a specific application of functional data analysis (FDA) to a high-throughput proteomics study. The high performance of the proposed algorithm is compared to two popular dissimilarity measures in the clustering of normal and human T-cell leukemia virus type 1 (HTLV-1)-infected patients samples.

Highlights

A variety of mass spectrometry-based platforms are currently available for providing information on both protein patterns and protein identity [1, 2]
Depending upon the range of masses the investigator wishes to study, there are a variety of possible slide surfaces; for example, the strong anion exchange (SAX) or the weak cation exchange (WCX) surface
We propose to implement a hierarchical clustering algorithm for proteomics data using functional data analysis (FDA)

Summary

INTRODUCTION

A variety of mass spectrometry-based platforms are currently available for providing information on both protein patterns and protein identity [1, 2]. A flexible dissimilarity measure is the one that may combine the characteristic of both measures δHZ and δC This means that a potential dissimilarity measure should use the collected estimated points of the original curve obtained from FDA so that no information is lost and should work on different type of smoothed curves without using the monotonicity restriction. In this sense, we propose a functional-based dissimilarity δB measure which uses the rank of the curve proposed by Heckman and Zamar and generalizes Cerioli et al dissimilarity measure as follows:.

RESULTS

Findings

DISCUSSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BioMed Research International	Publication Date: Jan 1, 2000
Citations: 23	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

Functional clustering algorithm for high-dimensional proteomics data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BioMed Research International

Lead the way for us

Similar Papers

Application of functional data analysis to investigate seasonal progression with interannual variability in plankton abundance in the Bay of Fundy, Canada
Takayoshi Ikeda ... Jennifer L Martin
Estuarine, Coastal and Shelf Science | VOL. 78
Takayoshi Ikeda, et. al.Takayoshi Ikeda ... Jennifer L Martin
11 Jan 2008
Estuarine, Coastal and Shelf Science | VOL. 78

Application of functional data analysis to explore movements: walking, running and jumping - A systematic review
Julia Dannenmaier ... Gert Krischak
Gait & Posture | VOL. 77
Julia Dannenmaier, et. al.Julia Dannenmaier ... Gert Krischak
03 Feb 2020
Gait & Posture | VOL. 77

Application of Functional Data Analysis to Identify Patterns of Malaria Incidence, to Guide Targeted Control Strategies.
Sokhna Dieng ... Jean Gaudart
International Journal of Environmental Research and Public Health | VOL. 17
Sokhna Dieng, et. al.Sokhna Dieng ... Jean Gaudart
01 Jun 2020
International Journal of Environmental Research and Public Health | VOL. 17

Data on the application of Functional Data Analysis in food fermentations
M.A Ruiz-Bellido ... A Garrido-Fernández
Data in Brief | VOL. 9
M.A Ruiz-Bellido, et. al.M.A Ruiz-Bellido ... A Garrido-Fernández
15 Sep 2016
Data in Brief | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Functional clustering algorithm for high-dimensional proteomics data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BioMed Research International