Abstract

DNA barcoding has been established as an efficient, sensitive and reliable methodology for plant identification. However, in spite of efforts to find a universal DNA plant barcode, some taxa are not sufficiently resolved by typical plant barcoding genes like matK or rbcL. We have used a technique known as genome skimming, which relies on the empiric low coverage sequencing of a full plant genome, resulting in high coverage of the high copy genome fractions such as chloroplast and rDNA. Phylogenetic studies show that these regions are a reservoir of variability which could be further exploited for DNA barcode discovery. We sequenced eight amaranth species using Illumina Next Generation Sequencing technology to test the feasibility of this technique. Amaranths were chosen due to their increasing impact as invasive species bearing multiple herbicide resistance mechanisms. Our results showed that complete chloroplast genomes could be assembled for all of the eight species tested. We obtained an average of 47 million reads for each one of the amaranth nuclear genomes, which range in size between 400-700Mb approximately. These reads provide an average theoretical coverage of 10-15X for each nuclear genome, but resulted in an average chloroplast genome coverage in the range of 500-8000X due to multiple chloroplast genome copies per cell. Alignment of the eight chloroplast genomes shows variability in the single copy regions (Fig. 1), especially on intergenic sections (Fig. 2). Additional preliminary analyses also show variation among different populations of the same species, demonstrating the importance of studying both inter and intraspecific diversity to design reliable and accurate DNA barcodes that can be used in species identification.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call