Abstract
BackgroundStructural variations such as copy number variants (CNV) influence the expression of different phenotypic traits. Algorithms to identify CNVs through SNP-array platforms are available. The ability to evaluate well-characterized CNVs such as GSTM1 (1p13.3) deletion provides an important opportunity to assess their performance.Results773 cases and 759 controls from the SBC/EPICURO Study were genotyped in the GSTM1 region using TaqMan, Multiplex Ligation-dependent Probe Amplification (MLPA), and Illumina Infinium 1 M SNP-array platforms. CNV callings provided by TaqMan and MLPA were highly concordant and replicated the association between GSTM1 and bladder cancer. This was not the case when CNVs were called using Illumina 1 M data through available algorithms since no deletion was detected across the study samples. In contrast, when the Log R Ratio (LRR) was used as a continuous measure for the 5 probes contained in this locus, we were able to detect their association with bladder cancer using simple regression models or more sophisticated methods such as the ones implemented in the CNVtools package.ConclusionsThis study highlights an important limitation in the CNV calling from SNP-array data in regions of common aberrations and suggests that there may be added advantage for using LRR as a continuous measure in association tests rather than relying on calling algorithms.
Highlights
Structural variations such as copy number variants (CNV) influence the expression of different phenotypic traits
When we conducted an analysis with PennCNV on the Illumina 1M SNP-array data, no deletion was detected among the 759 controls and 773 cases
For the individuals with two copies according to both TaqMan and Multiplex Ligation-dependent Probe Amplification (MLPA), the Log R Ratio (LRR) for the five probes was 0.17 on average, higher than 0, the value expected when 2 copies of the gene are present
Summary
Structural variations such as copy number variants (CNV) influence the expression of different phenotypic traits. Algorithms to identify CNVs through SNP-array platforms are available. A common copy number variant (CNV) has been well characterized and reported on the basis of the frequency of homozygous deletions of the entire. Due to the established role of GSTM1 in detoxification and the high frequency of homozygous deletions in the population, GSTM1 has been extensively investigated in association with many chronic diseases, in particular, with asthma [5] and different types of cancers [4,6], including bladder cancer [7]. High-throughput SNP-array platforms offer the possibility to explore CNVs at a genome-wide scale. Illumina Infinium 1 M provides intensity data of both alleles at each SNP allowing the detection of CNV breakpoints and the estimation of the associated number of copies. This platform contains 1,071,820 probes, among them 206,665 are located in reported CNV regions and 17,202 are monomorphic probes specially designed for CNV purpose
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.