Abstract

With the availability of high-throughput microarray technologies, investigators can simultaneously measure the expression levels of many thousands of genes in a short period. Although there are rich statistical methods for analyzing microarray data in the literature, limited work has been done in mapping expression quantitative trait loci (eQTL) that influence the variation in levels of gene expression. Most existing eQTL mapping methods assume that the expression phenotypes follow a normal distribution and violation of the normality assumption may lead to inflated type I error and reduced power. QTL analysis of expression data involves the mapping of many expression phenotypes at thousands or hundreds of thousands of marker loci across the whole genome. An appropriate procedure to adjust for multiple testing is essential for guarding against an abundance of false positive results. In this study, we applied a semiparametric quantitative trait loci (SQTL) mapping method to human gene expression data. The SQTL mapping method is rank-based and therefore robust to non-normality and outliers. Furthermore, we apply an efficient Monte Carlo procedure to account for multiple testing and assess the genome-wide significance level. Particularly, we apply the SQTL mapping method and the Monte-Carlo approach to the gene expression data provided by Genetic Analysis Workshop 15.

Highlights

  • With the availability of high-throughput microarray technologies to measure the expression levels of many thousands of genes simultaneously, investigators have developed a vast amount of statistical and computational methods for analyzing microarray data in the last decade

  • Because of the abundance of singlenucleotide polymorphisms (SNPs) as well as the modern genotyping technologies, tremendous efforts have been focused on the genetic mapping of complex human diseases, many of which are associated with quantitative traits

  • Limited work has been done in combining gene expression data and marker genotype data and detecting expression quantitative trait loci that influence the variation in levels of gene expression

Read more

Summary

Introduction

With the availability of high-throughput microarray technologies to measure the expression levels of many thousands of genes simultaneously, investigators have developed a vast amount of statistical and computational methods for analyzing microarray data in the last decade. Because of the abundance of singlenucleotide polymorphisms (SNPs) as well as the modern genotyping technologies, tremendous efforts have been focused on the genetic mapping of complex human diseases, many of which are associated with quantitative traits. Many expression phenotypes may be non-normally distributed even after proper normalization procedures, as is evident by the gene expression data provided by Genetic Analysis Workshop 15 (GAW15). Used eQTL mapping methods such as the standard variance-component (VC) approach implemented in programs SOLAR [1] and Merlin [2] assume that the expression phenotypes follow a normal distribution. The other challenge is the multiple testing introduced in the mapping of many expression phenotypes at thousands or hundreds of thousands of marker loci across the whole genome. A proper procedure to adjust for multiple testing is essential for guarding against an abundance of false-positive results

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.