Abstract

BackgroundMiR arrays distinguish themselves from gene expression arrays by their more limited number of probes, and the shorter and less flexible sequence in probe design. Robust data processing and analysis methods tailored to the unique characteristics of miR arrays are greatly needed. Assumptions underlying commonly used normalization methods for gene expression microarrays containing tens of thousands or more probes may not hold for miR microarrays. Findings from previous studies have sometimes been inconclusive or contradictory. Further studies to determine optimal normalization methods for miR microarrays are needed.MethodsWe evaluated many different normalization methods for data generated with a custom-made two channel miR microarray using two data sets that have technical replicates from several different cell lines. The impact of each normalization method was examined on both within miR error variance (between replicate arrays) and between miR variance to determine which normalization methods minimized differences between replicate samples while preserving differences between biologically distinct miRs.ResultsLowess normalization generally did not perform as well as the other methods, and quantile normalization based on an invariant set showed the best performance in many cases unless restricted to a very small invariant set. Global median and global mean methods performed reasonably well in both data sets and have the advantage of computational simplicity.ConclusionsResearchers need to consider carefully which assumptions underlying the different normalization methods appear most reasonable for their experimental setting and possibly consider more than one normalization approach to determine the sensitivity of their results to normalization method used.

Highlights

  • MiR arrays distinguish themselves from gene expression arrays by their more limited number of probes, and the shorter and less flexible sequence in probe design

  • The intra-class correlation (ICC) for different normalization methods using the ten lung cancer cell lines ranged from -0.30 to 0.87

  • The quantile normalization methods based on invariant sets were observed to produce the highest mean ICCs across the ten lung cancer cell lines

Read more

Summary

Introduction

MiR arrays distinguish themselves from gene expression arrays by their more limited number of probes, and the shorter and less flexible sequence in probe design. Further studies to determine optimal normalization methods for miR microarrays are needed. Technologies utilized for relative quantification of miR expression include Northern blot, real time PCR, in situ hybridization, sequence analysis and array-based profiling [8]. Due to the limited throughput of other technologies, microarray-based miR profiling has become a popular method for interrogation of miRs, especially when the contributions of specific miRs to a given condition or process remain elusive. Robust data processing and analysis methods tailored to the unique characteristics of miR arrays are greatly needed

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.