Comparative study of classification algorithms for immunosignaturing data

Muskan Kukreja, Stephen Albert Johnston, Phillip Stafford

doi:10.1186/1471-2105-13-139

Open Access

https://doi.org/10.1186/1471-2105-13-139

Abstract

Background: High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and other conditions. Typically one trains a classification system by gathering large amounts of probe-level data and selecting informative features, and then classifies test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways, where co-regulation of transcriptional units may make many genes appear to be completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable for other technologies with different binding characteristics. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides: each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear which classification algorithm is optimal for analyzing this new type of data.

Results: We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy.

Conclusions: The ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties.
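The workflow described in the abstract (gather probe-level data, select a small number of informative features, then classify test samples) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the data are synthetic Gaussian intensities, the feature ranking uses a simple t-like score chosen here for illustration, and the classifier is a hand-written Gaussian Naïve Bayes. All function names, the peptide indices, and the parameter values are hypothetical.

```python
import math
import random
from statistics import mean, pvariance


def make_samples(n_per_class, n_peptides, informative, shift, rng):
    """Synthetic intensities: class 1 is shifted upward on the informative peptides."""
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_peptides)]
            if label == 1:
                for j in informative:
                    row[j] += shift
            X.append(row)
            y.append(label)
    return X, y


def select_features(X, y, k):
    """Rank peptides by an absolute t-like score between the two classes; keep the top k."""
    classes = sorted(set(y))
    a = [row for row, lab in zip(X, y) if lab == classes[0]]
    b = [row for row, lab in zip(X, y) if lab == classes[1]]
    scores = []
    for j in range(len(X[0])):
        col_a = [row[j] for row in a]
        col_b = [row[j] for row in b]
        se = math.sqrt(pvariance(col_a) / len(col_a) + pvariance(col_b) / len(col_b)) + 1e-9
        scores.append((abs(mean(col_a) - mean(col_b)) / se, j))
    return [j for _, j in sorted(scores, reverse=True)[:k]]


def fit_gaussian_nb(X, y, features):
    """Per class: log prior plus an independent (mean, variance) pair for each selected feature."""
    model = {}
    for label in set(y):
        rows = [row for row, lab in zip(X, y) if lab == label]
        stats = []
        for j in features:
            col = [row[j] for row in rows]
            stats.append((mean(col), pvariance(col) + 1e-6))  # small floor avoids zero variance
        model[label] = (math.log(len(rows) / len(y)), stats)
    return model


def predict(model, row, features):
    """Return the class with the highest log posterior under the feature-independence assumption."""
    best_label, best_score = None, -math.inf
    for label, (log_prior, stats) in model.items():
        score = log_prior
        for j, (mu, var) in zip(features, stats):
            score += -0.5 * math.log(2 * math.pi * var) - (row[j] - mu) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


rng = random.Random(0)
informative = [3, 17, 42]  # hypothetical informative peptide indices
X_train, y_train = make_samples(20, 200, informative, 4.0, rng)
X_test, y_test = make_samples(10, 200, informative, 4.0, rng)

# Features are selected on the training set only, so the test set stays unseen.
features = select_features(X_train, y_train, k=3)
model = fit_gaussian_nb(X_train, y_train, features)
predictions = [predict(model, row, features) for row in X_test]
accuracy = sum(p == t for p, t in zip(predictions, y_test)) / len(y_test)
print(sorted(features), accuracy)
```

The per-feature sum inside `predict` is where the independence assumption enters: each selected peptide contributes its own log-likelihood term, with no covariance between peptides. Real immunosignaturing intensities are unlikely to be independent given the many-to-many binding, so this sketch shows the mechanics only and says nothing about how the classifier behaves on such data.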

Highlights

High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and other conditions.
To address the question of classifier suitability, we examine a basic classification algorithm, Linear Discriminant Analysis (LDA).
Immunosignaturing is a novel approach that aims to detect complex patterns of antibodies produced in acute or chronic disease. This complex pattern is obtained using random peptide microarrays where 10,000 random peptides are exposed to antibodies in sera, plasma or saliva.

Summary

Introduction

High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and other conditions. In immunosignaturing, each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. It is not clear which classification algorithm is optimal for analyzing this new type of data. Serological diagnostics using antibodies have the potential to reduce medical costs and may be one of the few methods that allow for true presymptomatic detection of disease. For this reason, our group has pursued immunosignaturing for its ability to detect diseases early and with a low false positive rate. Immunosignaturing microarrays may require that we change our underlying assumptions as we determine the suitability of a particular classifier.

Journal: BMC Bioinformatics	Publication Date: Jun 21, 2012
Citations: 68	License type: cc-by
