Prokaryotes possess relatively small genomes consisting predominantly oflow-copy numberDNA sequences. The genome sizes ofdifferent species vary by less than an order of magnitude (Kingsbury, 1969). In contrast, eukaryotic genomes are generally much larger than their prokaryotic counterparts, and a far greater proportion of this DNA (about 30-40%) is repeated (Britten & Kohne, 1968; Laird, 1971). This repetitive component takes on several guises, and for the purpose of previous discussions it has often been useful to classify these sequences according to their various characteristic properties: structure, distribution and reiteration frequency (Jelinek & Schmid, 1982). Recent applications ofDNA cloning methods and more powerful analytical techniques to study specific sequence families has provided a different perspective on the apparent diversity of structure and organization of eukaryotic repetitive DNA. Does this new information provide us with clues as to what the functions of these sequences might be? An unresolved, and possibly related, question centres on the considerable variation seen in the haploid nuclear DNA contents, or 'C-values', of eukaryotes (Britten & Davidson, 1969). These differences are found in many different phyla encompassing amphibians (Straus, 1971), plants (Rothfels et al., 1966; Rees & Jones, 1967), insects (Keyl, 1965) and rodents (Mazrimas & Hatch, 1972). C-value variation can, on occasion, be especially dramatic; it is sometimes observed in organisms of the same genus that have virtually identical morphology and karyotype. This is the essential element of the so-called 'C-value paradox'; nearly identical species must express about the same number of genes despite significant differences in their C-values. What is the nature of all this extra non-coding DNA? It does not, as one might have anticipated, all correspond to additional repetitive elements; it also includes a considerable increase in the amount of 'single-copy' DNA. Such considerations led some to conceive radical theories of genetic organization in eukaryotes to provide a function for this extra DNA (Callan, 1967; Britten & Davidson, 1969; Thomas, 1971) which, in their simplest forms, have proved to be incorrect. A more recent theory, based on the involvement of dispersed repetitive sequences, envisages that some of these elements may have a nuclear role in selecting which mRNA molecules enter the cell cytoplasm (Davidson & Britten, 1979). A recent study may offer some support for such an idea (Sutcliffe et al., 1984), but others disagree with the interpretation of these data (Owens et al., 1985). In any event, such proposals may provide a function for a minority of the non-coding sequences in eukaryotic genomes, but what of the remainder? Attention has periodically focused on -the persuasively presented alternative argument that the bulk of the 'extra' DNA in eukaryotes might be 'selfish' and have no function (Doolittle & Sapienza, 1980; Orgel & Crick, 1980). This Review will re-examine the idea in the light of recent studies of the structure of specific eukaryotic repetitive DNA elements.
Read full abstract