RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method.

Kosuke Hamazaki,Hiroyoshi Iwata,Mihaela Pertea

doi:10.1371/journal.pcbi.1007663

Kosuke Hamazaki, Hiroyoshi Iwata + Show 1 more

Open Access

https://doi.org/10.1371/journal.pcbi.1007663

Copy DOI

Journal: PLoS computational biology	Publication Date: Feb 14, 2020
Citations: 49	License type: CC BY 4.0

Affiliation: The University of Tokyo

Abstract

Difficulty in detecting rare variants is one of the problems in conventional genome-wide association studies (GWAS). The problem is closely related to the complex gene compositions comprising multiple alleles, such as haplotypes. Several single nucleotide polymorphism (SNP) set approaches have been proposed to solve this problem. These methods, however, have been rarely discussed in connection with haplotypes. In this study, we developed a novel SNP-set method named "RAINBOW" and applied the method to haplotype-based GWAS by regarding a haplotype block as a SNP-set. Combining haplotype block estimation and SNP-set GWAS, haplotype-based GWAS can be conducted without prior information of haplotypes. We prepared 100 datasets of simulated phenotypic data and real marker genotype data of Oryza sativa subsp. indica, and performed GWAS of the datasets. We compared the power of our method, the conventional single-SNP GWAS, the conventional haplotype-based GWAS, and the conventional SNP-set GWAS. Our proposed method was shown to be superior to these in three aspects: (1) controlling false positives; (2) in detecting causal variants without relying on the linkage disequilibrium if causal variants were genotyped in the dataset; and (3) it showed greater power than the other methods, i.e., it was able to detect causal variants that were not detected by the others, primarily when the causal variants were located very close to each other, and the directions of their effects were opposite. By using the SNP-set approach as in this study, we expect that detecting not only rare variants but also genes with complex mechanisms, such as genes with multiple causal variants, can be realized. RAINBOW was implemented as an R package named "RAINBOWR" and is available from CRAN (https://cran.r-project.org/web/packages/RAINBOWR/index.html) and GitHub (https://github.com/KosukeHamazaki/RAINBOWR).

Highlights

With the decreasing cost and increasing throughput of next-generation sequencing, the number of accessions that can be used for genome-wide association study (GWAS) is increasing [1,2,3]
The datasets and scripts generated and analyzed during the current study are available from the “KosukeHamazaki/ HGRAINBOW‘repository in the GitHub,https:// github.com/KosukeHamazaki/HGRAINBOW
One problem caused by rare variants is that the noncausal markers that have a strong linkage disequilibrium (LD) with one causal rare variant indicate a higher detection power than the true causal rare variant, which may interfere with the detection of the true causal variant

Summary

Introduction

With the decreasing cost and increasing throughput of next-generation sequencing, the number of accessions that can be used for genome-wide association study (GWAS) is increasing [1,2,3] Using such large sequencing data, GWAS is widely used in human and in plant and animal genetics and breeding, and has identified novel genes related to important agronomic traits [4,5,6]. One problem caused by rare variants is that the noncausal markers that have a strong linkage disequilibrium (LD) with one causal rare variant indicate a higher detection power than the true causal rare variant, which may interfere with the detection of the true causal variant This phenomenon is known as “synthetic association”, and often happens when the minor allele frequency (MAF) of the non-causal marker is higher than that of the true rare variant [13]. This problem is closely related to the complex gene compositions comprising multiple alleles such as haplotypes because genes related to important agronomic traits often consist of multiple rare alleles, and this is why haplotypes are hard to detect using GWAS [14]

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS computational biology

Lead the way for us

Similar Papers

RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method
Mihaela Pertea ... Hiroyoshi Iwata
-
Mihaela Pertea, et. al.Mihaela Pertea ... Hiroyoshi Iwata
14 Feb 2020
14 Feb 2020

SNP- and haplotype-based genome-wide association studies for growth, carcass, and meat quality traits in a Duroc multigenerational population.
Shuji Sato ... Yoshinobu Uemoto
BMC genetics | VOL. 17
Shuji Sato, et. al.Shuji Sato ... Yoshinobu Uemoto
19 Apr 2016
BMC genetics | VOL. 17

Detection of quantitative trait loci in Bos indicus and Bos taurus cattle using genome-wide association studies
Sunduimijid Bolormaa ... Jennie E Pryce
Genetics, selection, evolution : GSE | VOL. 45
Sunduimijid Bolormaa, et. al.Sunduimijid Bolormaa ... Jennie E Pryce
29 Oct 2013
Genetics, selection, evolution : GSE | VOL. 45

Association of Copy Number Variation at Intron 3 of HMGA2 With Navel Length in Bos indicus.
Tamíris Sayuri Aguiar ... Yuri Tani Utsunomiya
Frontiers in Genetics | VOL. 9
Tamíris Sayuri Aguiar, et. al.Tamíris Sayuri Aguiar ... Yuri Tani Utsunomiya
07 Dec 2018
Frontiers in Genetics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS computational biology