Sequenced Plant Species Research Articles

The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575–1584)] to classify all of these species’ protein-coding genes into putative gene families, called tribes, using three clustering stringencies (low, medium and high). For all tribes, we have generated protein and DNA alignments and maximum-likelihood phylogenetic trees. A parallel database of microarray experimental results is linked to the genes, which lets researchers identify groups of related genes and their expression patterns. Unified nomenclatures were developed, and tribes can be related to traditional gene families and conserved domain identifiers. SuperTribes, constructed through a second iteration of MCL clustering, connect distant, but potentially related gene clusters. The global classification of nearly 200 000 plant proteins was used as a scaffold for sorting ∼4 million additional cDNA sequences from over 200 plant species. All data and analyses are accessible through a flexible interface allowing users to explore the classification, to place query sequences within the classification, and to download results for further study.

Read full abstract

The recent release of the first tree genome (Populus trichocarpa) has allowed a comparison to be made of the multigenic glutaredoxin (Grx) and glutathione reductase (GR) families of this tree with those of other sequenced organisms and especially of the two other fully sequenced plant species, Arabidopsis thaliana and Oryza sativa. Grxs are small proteins involved in disulphide bridge or protein-glutathione adduct reduction, and they are maintained in a reduced form using glutathione and an NADPH-dependent GR. While the P. trichocarpa and O. sativa genomes are nearly five times larger than that of A. thaliana, they contain approximately 45 000 and 37 500 genes compared with the 25 500 genes of A. thaliana. On the one hand, the GR gene composition varies little between species and the gene structures are relatively conserved. On the other hand, the Grx gene family can be divided into three subgroups and the gene content is larger in P. trichocarpa (36 genes) compared with A. thaliana and O. sativa (31 and 27 genes, respectively). This could be partly explained by the occurrence of more duplication events, and this is especially true for one of the three identified Grx subgroups (subgroup III). The expression of most of these genes was confirmed by analysing expressed sequence tags present in various databases. In addition, the expression of Grx of subgroups I and II was examined by RT-PCR in various poplar organs. A complete classification based essentially on gene structure and sequence identity is proposed.

Read full abstract

Sequenced Plant Species Research Articles

Related Topics

Articles published on Sequenced Plant Species

PlantTribes: a gene and gene family resource for comparative genomics in plants.

Structure of two melon regions reveals high microsynteny with sequenced plant species

Genome-wide analysis of plant glutaredoxin systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Sequenced Plant Species Research Articles

Related Topics

Articles published on Sequenced Plant Species

PlantTribes: a gene and gene family resource for comparative genomics in plants.

Structure of two melon regions reveals high microsynteny with sequenced plant species

Genome-wide analysis of plant glutaredoxin systems