Analysing complex Triticeae genomes – concepts and strategies

Manuel Spannagl,Thomas Nussbaumer,Klaus Fx Mayer,Matthias Pfeifer,Mihaela M Martis

doi:10.1186/1746-4811-9-35

Abstract

The genomic sequences of many important Triticeae crop species are hard to assemble and analyse because of their large size, high repeat content and, in some species, polyploidy. Recently, draft genomes of barley and bread wheat were reported thanks to fast, cost-efficient NGS technologies. The barley genome is estimated at 5 Gb, whereas the allo-hexaploid bread wheat genome amounts to 17 Gb. The high repeat content hampers direct assembly of the sequence reads and access to the gene content. As a consequence, novel strategies and data analysis concepts had to be developed to provide much-needed whole-genome sequence surveys and access to the gene repertoires. Here we describe analytical strategies that make it possible to structure the massive NGS data sets generated and that pave the way towards ordered sequence data and gene order. Specifically, we report on the GenomeZipper, a synteny-driven approach to order and structure NGS survey sequences of grass genomes that lack a physical map. In addition, to access and analyse the gene repertoire of allo-hexaploid bread wheat from the raw sequence reads, a reference-guided approach was developed that uses representative genes from rice, Brachypodium distachyon, sorghum and barley. Stringent sub-assembly on the reference genes prevented homeologous wheat genes from collapsing and made it possible to estimate gene retention rates and determine gene family sizes. Using genomic sequences from the wheat sub-genome progenitors, machine learning algorithms assigned a large number of sub-assemblies to the wheat A, B or D sub-genome. Many of the concepts outlined here can readily be applied to other complex plant and non-plant genomes.
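The core idea behind synteny-driven ordering can be illustrated with a minimal Python sketch. It is not the GenomeZipper itself, which is considerably more involved; it only assumes that each survey contig has a single best hit to one gene in an already ordered reference genome. The function name `order_by_synteny` and all contig and gene identifiers are hypothetical.

```python
def order_by_synteny(contig_hits: dict[str, str], reference_order: list[str]) -> list[str]:
    """Order contigs by the position of their matching reference gene.

    contig_hits maps a contig ID to the ID of the one reference gene it matches
    (assumption: a single best hit per contig). Contigs whose gene does not
    appear in reference_order cannot be anchored and are left out.
    """
    rank = {gene: i for i, gene in enumerate(reference_order)}
    anchored = [contig for contig, gene in contig_hits.items() if gene in rank]
    # Ties on the same reference gene are broken by contig ID for a stable result.
    return sorted(anchored, key=lambda contig: (rank[contig_hits[contig]], contig))


# Hypothetical gene order along a reference chromosome.
reference_order = ["g1", "g2", "g3", "g4"]

# Hypothetical survey contigs and the reference gene each one matches.
contig_hits = {"ctgA": "g3", "ctgB": "g1", "ctgC": "g9", "ctgD": "g2"}

print(order_by_synteny(contig_hits, reference_order))  # ['ctgB', 'ctgD', 'ctgA']
```

Here `ctgC` is dropped because its gene `g9` is absent from the reference order. A real pipeline would have to decide what to do with such unanchored sequences and with conflicting hits.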

Highlights

The Triticeae tribe comprises some of the most economically important crops, including bread wheat, barley and rye.
With an estimated size of ~5 Gb, the barley genome is considerably larger than the human genome, and the bread wheat genome is larger still at ~17 Gb.
It has been speculated that the bread wheat genome originated from hybridization between cultivated tetraploid emmer wheat (AABB) and diploid goat grass (DD) about 8000 years ago [5].

Summary

Introduction

The Triticeae tribe comprises some of the most economically important crops, including bread wheat, barley and rye.

454-like shotgun reads were simulated (5× genome coverage), re-mapped against their corresponding orthologous group (OG) representatives and sub-assembled with varying minimum overlap identity (97% mi, 99% mi and 100% mi), and the gene copy number was predicted.

Wheat sub-assemblies were generated by a stringent assembly of reads mapped to representative genes (for orthologous groups defined by OrthoMCL [16]) from the reference organisms Brachypodium distachyon [13], Hordeum vulgare, Oryza sativa [15] and Sorghum bicolor [14], as well as the genome sequences of the D genome donor species Ae. tauschii [24] and the A genome relative Triticum monococcum (NCBI archive SRP004490.3), and cDNA sequence assemblies from Ae. speltoides (Trick & Bancroft, unpublished data), a member of the Sitopsis section to which the putative B genome donor belongs.

The linearly ordered gene maps provide a valuable resource for a variety of applications: (i) for marker development and to assist positional cloning [37], (ii) for comparative analyses of the conserved gene space [4], and (iii) to resolve the structure of a genome or chromosome and to establish the colinearity between grass genomes [34,35].
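Why the minimum overlap identity matters for copy number prediction can be shown with a small Python sketch. It is a toy model, not the assembler used in the study: sequences are assumed to be pre-aligned and of equal length, whole gene copies stand in for reads, and clustering is a simple greedy pass. The names `percent_identity`, `count_copies` and `mutate`, and the 100 bp test sequences, are all hypothetical.

```python
# Deterministic substitution table so that a "mutation" always changes the base.
SWAP = {"A": "C", "C": "G", "G": "T", "T": "A"}


def mutate(seq: str, positions: list[int]) -> str:
    """Return a copy of seq with the bases at the given positions substituted."""
    bases = list(seq)
    for p in positions:
        bases[p] = SWAP[bases[p]]
    return "".join(bases)


def percent_identity(a: str, b: str) -> float:
    """Percentage of matching positions between two pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def count_copies(sequences: list[str], min_identity: float) -> int:
    """Greedy clustering: a sequence joins the first cluster whose seed it
    matches at >= min_identity, otherwise it seeds a new cluster.
    The number of clusters is the predicted copy number."""
    seeds: list[str] = []
    for seq in sequences:
        if not any(percent_identity(seq, seed) >= min_identity for seed in seeds):
            seeds.append(seq)
    return len(seeds)


# Three hypothetical homeologous copies of a 100 bp gene fragment.
copy_a = "ACGT" * 25
copy_b = mutate(copy_a, [10, 60])  # 98% identical to copy_a
copy_c = mutate(copy_a, [30])      # 99% identical to copy_a

for mi in (97.0, 99.0, 100.0):
    print(mi, count_copies([copy_a, copy_b, copy_c], mi))
# 97.0 -> 1 copy, 99.0 -> 2 copies, 100.0 -> 3 copies
```

In this toy case a permissive threshold collapses all three copies into one, while a stringent threshold keeps them apart. With real reads, sequencing errors push in the opposite direction, so the choice of threshold is a trade-off that this sketch does not model.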

Conclusions

13. Initiative IB

Findings

15. Project IRGS

Journal: Plant Methods	Publication Date: Jan 1, 2013
Citations: 50	License type: cc-by
