NGS allele counts versus called genotypes for testing genetic association

Rosa González Silos, Christine Fischer, Justo Lorenzo Bermejo

doi:10.1016/j.csbj.2022.07.016

Open Access

https://doi.org/10.1016/j.csbj.2022.07.016

Abstract

RNA sequence data are commonly summarized as read counts. By contrast, so far there is no alternative to genotype calling for investigating the relationship between genetic variants determined by next-generation sequencing (NGS) and a phenotype of interest. Here we propose and evaluate the direct analysis of allele counts for genetic association tests. Specifically, we assess the potential advantage of the ratio of alternative allele counts to the total number of reads aligned at a specific position of the genome (coverage) over called genotypes. We simulated association studies based on NGS data from HapMap individuals. Genotype quality scores and allele counts were simulated using NGS data from the Personal Genome Project. Real data from the 1000 Genomes Project were also used to compare the two competing approaches. The average proportions of probability values lower than or equal to 0.05 amounted to 0.0496 for called genotypes and 0.0485 for the ratio of alternative allele counts to coverage in the null scenario, and to 0.69 for called genotypes and 0.75 for the ratio of alternative allele counts to coverage in the alternative scenario (9% power increase). The advantage in statistical power of the novel approach increased with decreasing coverage, with decreasing genotype quality and with decreasing allele frequency, reaching a 124% power increase for variants with a minor allele frequency lower than 0.05. We provide computer code in R to implement the novel approach, which does not preclude the use of complementary data quality filters before or after identification of the most promising association signals.

Author summary

Genetic association tests usually rely on called genotypes. We postulate here that the direct analysis of allele counts from sequence data improves the quality of statistical inference. To evaluate this hypothesis, we investigate simulated and real data using distinct statistical approaches. We demonstrate that association tests based on allele counts rather than called genotypes achieve higher statistical power with controlled type I error rates.
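
To make the two competing predictors concrete, the following Python sketch computes the ratio of alternative allele counts to coverage for each individual and contrasts it with a called genotype. It is only an illustration under stated assumptions: the authors' R code is not reproduced here, the threshold-based caller `naive_genotype`, its cut-offs (0.2 and 0.8), the toy read counts, the quantitative toy phenotype and the use of a simple linear regression are all hypothetical choices made for this example, not the method evaluated in the paper.

```python
import math


def allele_ratio(alt_count, coverage):
    """Ratio of alternative allele reads to all reads aligned at a position."""
    if coverage <= 0:
        return None  # no aligned reads: the predictor is undefined
    return alt_count / coverage


def naive_genotype(alt_count, coverage, low=0.2, high=0.8):
    """Hypothetical threshold caller returning 0, 1 or 2 alternative alleles.

    Real genotype callers use likelihood models and quality scores; these
    cut-offs are assumptions made only for this illustration.
    """
    ratio = allele_ratio(alt_count, coverage)
    if ratio is None:
        return None
    if ratio < low:
        return 0
    if ratio > high:
        return 2
    return 1


def slope_t_statistic(x, y):
    """t statistic for the slope in a simple linear regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    standard_error = math.sqrt(rss / (n - 2) / sxx)
    return slope / standard_error


# Toy data (assumed): (alternative allele reads, coverage) per individual
# and a quantitative phenotype for the same individuals.
reads = [(0, 10), (1, 8), (3, 9), (5, 10), (4, 7), (9, 10), (8, 8), (2, 12)]
phenotype = [1.0, 1.2, 1.9, 2.4, 2.6, 3.1, 3.3, 1.4]

ratios = [allele_ratio(alt, cov) for alt, cov in reads]
genotypes = [naive_genotype(alt, cov) for alt, cov in reads]

t_ratio = slope_t_statistic(ratios, phenotype)
t_genotype = slope_t_statistic(genotypes, phenotype)

print("t statistic, allele-count ratio:", round(t_ratio, 2))
print("t statistic, called genotype:  ", round(t_genotype, 2))
```

The ratio keeps the read-level information that a hard genotype call discards, which is the intuition behind the reported power gain at low coverage; the toy numbers above are not meant to reproduce any result from the study.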

Journal: Computational and Structural Biotechnology Journal	Publication Date: Jan 1, 2022
Citations: 1	License type: cc-by

