Abstract

Guar (Cyamopsis tetragonoloba (L.) Taub.) is becoming a popular industrial crop in response to industry demand for the guar gum extracted from seeds’ endosperm. Breeding efforts of new guar varieties would greatly benefit from genomic resources developed for marker assisted selection (MAS) purposes. We have undertaken the first steps to establish a whole-genome assembly of the guar ‘Vaviloskij 130’ accession, bred at VIR. Using a combination of second (Illumina short reads) and third generation (Oxford Nanopore long reads) sequencing methods, a dataset of approx. 5X of genome coverage was obtained. We tested assemblers for short reads, namely SOAPdenovo, AbySS and SGA, based on different algorithms. For short reads (Illumina MiSeq and HiSeq data), the better result in terms of total number of scaffolds and total assembly length were obtained with SGA (String Graph Assembler). For Oxford Nanopore dataset, we used the combination of minimap + miniASM assembly, then corrected the assembly with raw Illumina and Nanopore data. The current preliminary de novo assembly of the guar genome covers 1.2 Gb, corresponding to 50% of the genome. The data confirm the phylogeny position of C. tetragoloba as being highly related to the genus Vigna, Abrus, Glycine and Lupinus genomes. This preliminary reference genome paves the way to further detailed diversity and genetic analyses into that important agro-industrial crop.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call