Abstract

Summary Brassica juncea (AABB), commonly referred to as mustard, is a natural allopolyploid of two diploid species—B. rapa (AA) and B. nigra (BB). We report a highly contiguous genome assembly of an oleiferous type of B. juncea variety Varuna, an archetypical Indian gene pool line of mustard, with ~100× PacBio single‐molecule real‐time (SMRT) long reads providing contigs with an N50 value of >5 Mb. Contigs were corrected for the misassemblies and scaffolded with BioNano optical mapping. We also assembled a draft genome of B. nigra (BB) variety Sangam using Illumina short‐read sequencing and Oxford Nanopore long reads and used it to validate the assembly of the B genome of B. juncea. Two different linkage maps of B. juncea, containing a large number of genotyping‐by‐sequencing markers, were developed and used to anchor scaffolds/contigs to the 18 linkage groups of the species. The resulting chromosome‐scale assembly of B. juncea Varuna is a significant improvement over the previous draft assembly of B. juncea Tumida, a vegetable type of mustard. The assembled genome was characterized for transposons, centromeric repeats, gene content and gene block associations. In comparison to the A genome, the B genome contains a significantly higher content of LTR/Gypsy retrotransposons, distinct centromeric repeats and a large number of B. nigra specific gene clusters that break the gene collinearity between the A and the B genomes. The B. juncea Varuna assembly will be of major value to the breeding work on oleiferous types of mustard that are grown extensively in south Asia and elsewhere.

Highlights

  • Genus Brassica contains some of the most important oilseed and vegetable crops that are grown worldwide

  • High molecular weight DNA isolated from B. juncea variety Varuna was subjected to singlemolecule real-time (SMRT) sequencing on the PacBio RSII platform

  • We have reported a highly contiguous genome assembly of B. juncea variety Varuna, an oleiferous type belonging to the Indian gene pool of mustard, using long-read SMRT sequencing and optical mapping

Read more

Summary

Introduction

Genus Brassica contains some of the most important oilseed and vegetable crops that are grown worldwide. Nagaharu, based on cytogenetic studies, proposed a model on the relationship of six key species of the genus[1]. Draft genomes of B. rapa (estimated genome size ~ 485 Mb), B. oleracea (~ 630 Mb) and B. napus (~1130 Mb) were assembled with short-read, high-throughput Illumina technology with some limited Sanger sequencing[2,3,4]. A draft sequence of a vegetable type of B. juncea variety Tumida (genome size ~922 Mb) was assembled[5] using ~176x Illumina reads, gap filling with 12x PacBio singlemolecule real-time (SMRT) sequences and long-range scaffolding with BioNano optical mapping

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.