Single-channel Microarray Research Articles

Background

One essential step in the massive analysis of transcriptomic profiles is the calculation of the correlation coefficient, a value used to select pairs of genes with similar or inverse transcriptional profiles across a large fraction of the biological conditions examined. Until now, the choice between the two available methods for calculating the coefficient has been dictated mainly by technological considerations. Specifically, in analyses based on double-channel techniques, researchers have been required to use covariation correlation, i.e. the correlation between gene expression changes measured between several pairs of biological conditions, expressed for example as fold-change. In contrast, in analyses of single-channel techniques scientists have been restricted to the use of coexpression correlation, i.e. correlation between gene expression levels. To our knowledge, nobody has ever examined the possible benefits of using covariation instead of coexpression in massive analyses of single channel microarray results.

Results

We describe here how single-channel techniques can be treated like double-channel techniques and used to generate both gene expression changes and covariation measures. We also present a new method that allows the calculation of both positive and negative correlation coefficients between genes. First, we perform systematic comparisons between two given biological conditions and classify, for each comparison, genes as increased (I), decreased (D), or not changed (N). As a result, the original series of n gene expression level measures assigned to each gene is replaced by an ordered string of n(n-1)/2 symbols, e.g. IDDNNIDID....DNNNNNNID, with the length of the string corresponding to the number of comparisons. In a second step, positive and negative covariation matrices (CVM) are constructed by calculating statistically significant positive or negative correlation scores for any pair of genes by comparing their strings of symbols.

Conclusion

This new method, applied to four different large data sets, has allowed us to construct distinct covariation matrices with similar properties. We have also developed a technique to translate these covariation networks into graphical 3D representations and found that the local assignation of the probe sets was conserved across the four chip set models used which encompass three different species (humans, mice, and rats). The application of adapted clustering methods succeeded in delineating six conserved functional regions that we characterized using Gene Ontology information.
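The encoding described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the fixed log2 difference cutoff used to call a gene increased or decreased, and the plain agreement counts used in place of statistically significant correlation scores are all assumptions made for illustration.

```python
from itertools import combinations


def change_string(log2_levels, threshold=1.0):
    """Encode every pairwise comparison between conditions as I, D, or N.

    log2_levels: one log2 expression value per biological condition.
    threshold: hypothetical log2 difference cutoff (an assumption; the
    abstract does not say how a change is called).
    """
    symbols = []
    # combinations() yields the n(n-1)/2 ordered pairs (a, b) with a < b.
    for a, b in combinations(range(len(log2_levels)), 2):
        diff = log2_levels[b] - log2_levels[a]
        if diff >= threshold:
            symbols.append("I")   # increased in condition b versus a
        elif diff <= -threshold:
            symbols.append("D")   # decreased in condition b versus a
        else:
            symbols.append("N")   # not changed
    return "".join(symbols)


def covariation_counts(s1, s2):
    """Count positions where two genes change together or in opposition.

    Returns (same, opposite): raw counts only, standing in for the
    significance-tested scores mentioned in the abstract.
    """
    same = sum(1 for x, y in zip(s1, s2) if x == y and x != "N")
    opposite = sum(1 for x, y in zip(s1, s2) if {x, y} == {"I", "D"})
    return same, opposite


gene_a = change_string([6.0, 7.0, 9.0, 9.0])
gene_b = change_string([9.0, 8.0, 6.0, 6.5])
print(gene_a, gene_b, covariation_counts(gene_a, gene_b))
```

With four conditions each gene receives a string of 4(4-1)/2 = 6 symbols. Entries of a positive covariation matrix would be derived from the first count and entries of a negative one from the second.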


Background

Researchers using RNA expression microarrays in experimental designs with more than two treatment groups often identify statistically significant genes with ANOVA approaches. However, the ANOVA test does not discriminate which of the multiple treatment groups differ from one another. Thus, post hoc tests, such as linear contrasts, template correlations, and pairwise comparisons, are used. Linear contrasts and template correlations work extremely well, especially when the researcher has a priori information pointing to a particular pattern/template among the different treatment groups. Further, all pairwise comparisons can be used to identify particular, treatment group-dependent patterns of gene expression. However, these approaches are biased by the researcher's assumptions, and some treatment-based patterns may fail to be detected using these approaches. Finally, different patterns may have different probabilities of occurring by chance, importantly influencing researchers' conclusions about a pattern and its constituent genes.

Results

We developed a four-step, post hoc pattern matching (PPM) algorithm to automate single channel gene expression pattern identification/significance. First, 1-Way Analysis of Variance (ANOVA), coupled with post hoc 'all pairwise' comparisons, is calculated for all genes. Second, for each ANOVA-significant gene, all pairwise contrast results are encoded to create unique pattern ID numbers. The number of genes found in each pattern in the data is identified as that pattern's 'actual' frequency. Third, using Monte Carlo simulations, those patterns' frequencies are estimated in random data ('random' gene pattern frequency). Fourth, a Z-score for overrepresentation of the pattern is calculated ('actual' against 'random' gene pattern frequencies). We wrote a Visual Basic program (StatiGen) that automates the PPM procedure, constructs an Excel workbook with standardized graphs of overrepresented patterns, and lists the genes comprising each pattern. The Visual Basic code, installation files for StatiGen, and sample data are available as supplementary material.

Conclusion

The PPM procedure is designed to augment current microarray analysis procedures by allowing researchers to incorporate all of the information from post hoc tests to establish unique, overarching gene expression patterns in which there is no overlap in gene membership. In our hands, PPM works well for studies using from three to six treatment groups in which the researcher is interested in treatment-related patterns of gene expression. Hardware/software limitations and the extreme number of theoretical expression patterns limit utility for larger numbers of treatment groups. Applied to a published microarray experiment, the StatiGen program successfully flagged patterns that had been manually assigned in prior work, and further identified other gene expression patterns that may be of interest. Thus, over a moderate range of treatment groups, PPM appears to work well. It allows researchers to assign statistical probabilities to patterns of gene expression that fit a priori expectations/hypotheses, it preserves the data's ability to show the researcher interesting yet unanticipated gene expression patterns, and it assigns the majority of ANOVA-significant genes to non-overlapping patterns.
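The pattern encoding and Z-score steps of PPM can be sketched as follows. This is an illustration under stated assumptions, not the StatiGen code: the base-3 encoding of pairwise contrast outcomes, the function names, and the use of the sample standard deviation of the simulated counts are all hypothetical choices. The ANOVA and Monte Carlo steps are taken as given, so the simulated counts are supplied as plain input.

```python
from statistics import mean, stdev


def pattern_id(contrasts):
    """Encode pairwise contrast results as a single integer pattern ID.

    contrasts: one value per pairwise comparison, in a fixed order:
    -1 (first group lower), 0 (no significant difference), +1 (higher).
    The base-3 scheme is an assumption; the abstract only states that
    contrast results are encoded into unique pattern ID numbers.
    """
    pid = 0
    for c in contrasts:
        pid = pid * 3 + (c + 1)   # map -1/0/+1 to the base-3 digits 0/1/2
    return pid


def overrepresentation_z(actual, random_counts):
    """Z-score of a pattern's 'actual' frequency against 'random' ones.

    random_counts: frequencies of the same pattern in each simulated
    random data set (at least two simulations are required).
    """
    mu = mean(random_counts)
    sigma = stdev(random_counts)
    if sigma == 0:
        raise ValueError("random frequencies have zero variance")
    return (actual - mu) / sigma


# Three treatment groups give three pairwise contrasts per gene.
print(pattern_id([1, 0, -1]))
print(overrepresentation_z(10, [2, 4, 6]))
```

Because each gene maps to exactly one ID, patterns defined this way cannot overlap in gene membership. The number of possible IDs is 3 raised to the number of pairwise contrasts, which grows quickly as treatment groups are added.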



Articles published on Single-channel Microarray

Identification of biomarker genes for resistance to a pathogen by a novel method for meta-analysis of single-channel microarray datasets.

PTH-100 A comparative gene expression study between individuals with apparent resistance, spontaneous clearance, or chronic infection from HCV

A centrosome clustering protein, HSET, as a potential biomarker for ovarian adenocarcinomas.

RNA-Seq vs dual- and single-channel microarray data: sensitivity analysis for differential expression and clustering.

A single-sample microarray normalization method to facilitate personalized-medicine workflows

High-throughput processing and normalization of one-color microarrays for transcriptional meta-analyses

Gene Network Landscape of the Ciliate Tetrahymena thermophila

Abstract 1142: A specific miRNA signature characterizes metastasis in renal cell carcinoma

ANAIS: Analysis of NimbleGen Arrays Interface

Construction and use of gene expression covariation matrix

OneChannelGUI: a graphical interface to Bioconductor tools, designed for life scientists who are not familiar with R language

Post hoc pattern matching: assigning significance to statistically defined expression patterns in single channel microarray data

Simulation of microarray data with realistic characteristics.

Effects on gene expression and viral load of a medicinal extract from Agaricus blazei in patients with chronic hepatitis C infection

AffylmGUI: a graphical user interface for linear modeling of single channel microarray data

Can Zipf's law be adapted to normalize microarrays?

