Abstract
Although shotgun sequences of the genomic DNA of common wheat and its ancestors are available, gene discovery in common wheat is primarily based on proof sequencing of expressed full-length (FL) cDNAs. Use of expressed sequence tag (EST) databases including FLcDNA has been recognized as an important method for gene annotation in common wheat. In the large repetitive genome of common wheat, a transcriptome approach is complementary to whole genome sequencing. We have initiated a wheat EST project in Japan and constructed cDNA libraries from various tissues and strains of wheat, including biotic and abiotic stress treatments. We have also generated a high quality full-length cDNA resource for common wheat, an essential element necessary for the ongoing curation and annotation of the wheat genome. After several rounds of screening of CAP-trapped cDNA libraries, 21,408 FLcDNAs have been fully sequenced. The origins of these FLcDNAs were estimated through examination of the RNAseq data of three ancestral diploids, namely, Triticum urartu, Aegilops speltoides, and Aegilops tauschii. In addition, 51 cDNA libraries were constructed with an accumulation of 0.9 million ESTs. The ESTs, including the FLcDNA data, were assembled into contigs with stringent bioinformatic tool parameters. In total, 41,003 gene clusters were classified, in which 27,943 (68.1 %) had homology with other cereal genes. The digital monitoring system was utilized to identify characteristic gene expression patterns among various tissues and stress treatments in common wheat. These transcriptome data comprise a substantial reference for wheat genome sequencing.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have