Matching of array CGH and gene expression microarray features for the purpose of integrative genomic analyses

Wessel N Van Wieringen, Gwenaël Gr Leday, Bauke Ylstra, Oscar Krijgsman, Renée X De Menezes, Mark A Van De Wiel, Kristian Unger

doi:10.1186/1471-2105-13-80

Open Access

https://doi.org/10.1186/1471-2105-13-80

Abstract

Background: An increasing number of genomic studies interrogating more than one molecular level is published. Bioinformatics follows biological practice, and recent years have seen a surge in methodology for the integrative analysis of genomic data. Often such analyses require knowledge of which elements of one platform link to those of another. Although this matching is important, many integrative analyses do not detail it, or detail it insufficiently.

Results: We describe, illustrate and discuss six matching procedures. They are implemented in the R-package sigaR (available from Bioconductor). The principles underlying the presented matching procedures are generic, and can be combined to form new matching approaches or be applied to the matching of other platforms. Illustration of the matching procedures on a variety of data sets reveals how the procedures differ in their use of the available data, and that they may even lead to different results for individual genes.

Conclusions: Matching of data from multiple genomics platforms is an important preprocessing step for many integrative bioinformatic analyses, for which we present six generic procedures, both old and new. They have been implemented in the R-package sigaR, available from Bioconductor.

Highlights

An increasing number of genomic studies interrogating more than one molecular level is published
They are implemented in the R-package sigaR
The Cancer Genome Atlas (TCGA) I and II data sets differ in their gene expression data, which have been generated on different platforms

Summary

Results

Five data sets have been downloaded to compare the matching procedures. Data set 1 is referred to as the Chin data set. The even worse ‘performance’ (1.8% matched gene expression features) of the distanceAny procedure with a smaller window on the Chin data set may be attributed to the size of the DNA copy number features (BACs). They are rather long compared to the gene expression features, resulting in large distances between the midpoints. For the Chin data set the distance procedure finds the most significant genes, followed by distanceAny (< 100 k), overlapPlus and the other overlap methods. This order is concordant with the matching result: the more matched genes, the more discoveries. This could be interpreted as the matched genes being assigned an unrelated DNA copy number signature. This comparison of downstream analyses suggests that, for data generated on a high-resolution DNA copy number platform, the overlapAny procedure may be preferred.
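The effect of long BAC features on midpoint-based matching can be illustrated with a small sketch. It is not the sigaR implementation and does not reproduce the paper's exact procedure definitions: the function names (`match_by_distance`, `match_by_overlap`), the tuple layout and all coordinates are hypothetical, chosen only to show why a gene lying inside a long BAC can still fail a midpoint-distance window while an overlap criterion matches it.

```python
from typing import List, Optional, Tuple

# Hypothetical feature layout: (name, start, end) on one chromosome, in base pairs.
Feature = Tuple[str, int, int]


def midpoint(feature: Feature) -> float:
    """Midpoint of a feature's genomic interval."""
    return (feature[1] + feature[2]) / 2


def match_by_distance(gene: Feature, cn_features: List[Feature],
                      window: Optional[int] = None) -> Optional[str]:
    """Match the gene to the copy number feature with the closest midpoint.

    If a window is given, the match is rejected (None) when the closest
    midpoint lies farther away than the window.
    """
    if not cn_features:
        return None
    best = min(cn_features, key=lambda f: abs(midpoint(f) - midpoint(gene)))
    if window is not None and abs(midpoint(best) - midpoint(gene)) > window:
        return None
    return best[0]


def match_by_overlap(gene: Feature, cn_features: List[Feature]) -> List[str]:
    """Return the names of all copy number features whose interval overlaps the gene."""
    return [f[0] for f in cn_features if f[1] <= gene[2] and gene[1] <= f[2]]


# Made-up coordinates: a 200 kb BAC, a second distant BAC, and a 5 kb gene
# located near the end of the first BAC.
bacs = [("BAC1", 1_000_000, 1_200_000), ("BAC2", 3_000_000, 3_150_000)]
gene = ("geneA", 1_190_000, 1_195_000)

print(match_by_distance(gene, bacs))                 # no window
print(match_by_distance(gene, bacs, window=50_000))  # narrow window
print(match_by_overlap(gene, bacs))                  # overlap criterion
```

In this toy example the gene lies entirely within BAC1, yet its midpoint is 92.5 kb from the BAC midpoint, so the 50 kb window leaves it unmatched, whereas the unrestricted distance match and the overlap match both return BAC1.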


Journal: BMC Bioinformatics	Publication Date: May 4, 2012
Citations: 46	License type: CC BY 2.0
