Coverage‐based consensus calling (CbCC) of short sequence reads and comparison of CbCC results to identify SNPs in chickpea (Cicer arietinum; Fabaceae), a crop species without a reference genome

Sarwar Azam,Rajeev K Varshney,David J Studholme,Jayashree Balaji,Andrew D Farmer,Jonathan D G Jones,Vivek Thakur,Bhanuprakash Amindala,Trushar Shah,Pradeep Ruperao,Gregory D May,David Edwards

doi:10.3732/ajb.1100419

Abstract

Next-generation sequencing (NGS) technologies are frequently used for resequencing and for mining single nucleotide polymorphisms (SNPs) by comparison to a reference genome. In crop species such as chickpea (Cicer arietinum) that lack a reference genome sequence, NGS-based SNP discovery is a challenge. Therefore, instead of probability-based statistical consensus calling followed by comparison with a reference sequence, a coverage-based consensus calling (CbCC) approach was applied, and the consensus calls of two genotypes were compared to identify SNPs. The CbCC approach was used in this study with four commonly used short-read alignment tools (Maq, Bowtie, Novoalign, and SOAP2) and 15.7 and 22.1 million Illumina reads for chickpea genotypes ICC4958 and ICC1882, respectively, together with the chickpea transcriptome assembly (CaTA). A nonredundant set of 4543 SNPs was identified between the two chickpea genotypes. Experimental validation of 224 randomly selected SNPs showed that Maq was the most accurate individual tool: 50.0% of the SNPs predicted by Maq were true SNPs. For combinations of two tools, the greatest accuracy (55.7%) was reported for Maq and Bowtie, while the combination of Bowtie, Maq, and Novoalign identified 61.5% true SNPs. SNP prediction accuracy generally increased with increasing read depth. This study provides a benchmark comparison of four commonly used alignment tools, and of read depths, for NGS SNP discovery in a crop species without a reference genome sequence. In addition, a large number of SNPs have been identified in chickpea that would be useful for molecular breeding.
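
The general idea of coverage-based consensus calling and genotype comparison can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The function names, the in-memory pileup format, and the thresholds `min_depth` and `min_fraction` are all assumptions made for illustration, since the abstract does not state the cutoffs used. The last function shows one possible way to keep only SNPs predicted by several alignment tools, in the spirit of the tool combinations evaluated above.

```python
from collections import Counter


def call_consensus(pileup, min_depth=3, min_fraction=0.8):
    """Call a consensus base per site from read coverage alone.

    pileup: dict mapping (contig, position) -> list of bases observed in
            the reads aligned at that site (hypothetical input format).
    min_depth, min_fraction: illustrative thresholds, not values from the paper.
    """
    consensus = {}
    for site, bases in pileup.items():
        depth = len(bases)
        if depth < min_depth:
            continue  # too few reads to make a call at this site
        base, count = Counter(bases).most_common(1)[0]
        if count / depth >= min_fraction:
            consensus[site] = base  # a clear majority base is accepted
    return consensus


def find_snps(consensus_a, consensus_b):
    """Report sites called in both genotypes where the consensus bases differ."""
    shared_sites = consensus_a.keys() & consensus_b.keys()
    return {
        site: (consensus_a[site], consensus_b[site])
        for site in shared_sites
        if consensus_a[site] != consensus_b[site]
    }


def snps_supported_by_all(*snp_calls):
    """Keep only the sites predicted as SNPs by every tool passed in."""
    return set.intersection(*(set(calls) for calls in snp_calls))


# Toy data standing in for reads of two genotypes aligned to the same contigs.
pileup_a = {("c1", 10): list("AAAA"), ("c1", 20): list("GGG"), ("c1", 30): list("AC")}
pileup_b = {("c1", 10): list("AAAA"), ("c1", 20): list("TTTT"), ("c1", 30): list("CC")}

snps = find_snps(call_consensus(pileup_a), call_consensus(pileup_b))
print(snps)
```

In this toy input, the site at position 30 is skipped because its depth falls below the assumed minimum, so only position 20 is reported as a candidate SNP. Raising `min_depth` trades sensitivity for accuracy, which is consistent with the reported trend of accuracy increasing with read depth.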


Journal: American Journal of Botany	Publication Date: Feb 1, 2012
Citations: 35
