Abstract

The eukaryotic genomes expand their sizes via several different pathways, one of which is the duplication of existing gene(s). According to the classical model by Ohno (1970), one purpose of gene duplication is to allow for evolutionary gain of new function(s). Indeed, gene duplications were well documented in the 1980’s through extensive molecular cloning and sequencing of genes and pieces of genomic DNAs from different species. An excellent representative of the eukaryotic gene families identified from these studies encodes the oxygen-carrying globin chains in the vertebrate red cells. This family apparently expanded its size via a series events of tandem duplications and interchromosomal transpositions (Lewin 2000). Following duplications, the accumulated mutations in the regulatory regions of the different globin family members have led to the developmental regulation of their expression in the erythroid cells (Fraser and Grosveld 1998). At the same time, mutations in the coding regions of the individual globin genes also allowed for their differential functioning under different erythroid environments (Bunn and Forget 1986). More recently, efforts on the genome initiatives have pushed forward a new wave of interest in gene duplication. Refined models to explain the evolutionary significance of gene duplication have also been formulated. One such is the duplication–degenerationcomplementation, or DDC, model (Force et al. 1999). It hypothesized that after duplications, each gene paralog acquired specific loss-of-function mutation(s) in its regulatory region. This then resulted in the expression of different gene family members in different tissues or at different developmental stages. The combined spatial-temporal manner of expression of the whole family would be similar to that of the single ancestral gene. The partitioning of the sites of expression for the different members has thus allowed for their preservation, or survival, during evolution. As a test of the DDCmodel, Serluca et al. reports in this issue a detailed phylogenetic analysis of the gene family encoding the subunit of the vertebrate Na(+), K(+)-ATPase or sodium pump (Serluca et al. 2001). It has been known that four different subunit genes ( 1, 2, 3, and 4) exist in mammals, each of which exhibit a tissue-specific expression pattern (Lingrel and Kuntzweiler 1994). Serluca et al. set out to characterize the subunit genes in the zebrafish. Interestingly, through combined efforts of molecular cloning, database mining, chromosomal mapping, and expression analysis, at least eight different subunit-encoding genes were identified. These include five 1 paralogs, two 3 paralogs, and one 2 gene. In particular, the five 1 paralogs appear to result from both genomic duplication and additional tandem duplications in the teleost lineage. Two of the five 1 paralogs, atp 1A1 and atp 1B1, were further subjected to detailed expression analysis of different zebrafish tissues during development. As predicted by the DDC model, the two genes together exhibit a major portion of the tissue-specificities of expression of the mammalian 1 subunit. Future expression analysis of the other three 1 paralogs would complete the test of whether the preservation of the five zebrafish 1 paralogs is because they together share, in zebrafish, the essential cellular and developmental functions carried out by the single mammalian 1 gene. Other evidence for the DDC model already existed prior to its formulation. These include the globins, as mentioned previously, and the more classical examples of isozyme gene families such as the alcohol dehydrogenases (Edenberg 2000). Molecular genetic analysis of three developmental regulators in zebrafish—engrailed (Force et al. 1999), sox11 (de Martino et al. 2000), and the Hox gene family (McClintock et al. 2001)—also lend further support to the DDC model. Come to think of it, however, the existence of these and other multi-gene families, including the immunoglobulins and the T cell receptors (Lewin 2000) in vertebrates, are all likely a result of positive selection for both gain-offunction of each member (the Ohno model) and partition-of-function among the members (the DDC model). Yet, has the selection for gain-of-function or that for degenerationcomplementation been more frequent? Are the two scenarios independent? Also, in the DDC model, partitioning of the functions and sites of expression among the paralogs are associated with mutations in the regulatory as well as in the coding regions. Did partitioning of the sites of expression occur first, or the partitioning of the function? Were critical mutations in the regulatory regions fixed prior to those in the coding regions? With the fast-expanding databases of the eukaryotic genomes and transcriptomes, it is expected that many more eukaryotic gene families will be identified and their molecular genetics understood in detail. Some of the above questions regarding molecular evolutionary processes and consequences of gene duplications undoubtedly will be soon answered in well-defined terms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.