Abstract

Multicopy genes, like ribosomal RNA genes (rDNA), are widely used to describe and distinguish individuals. Despite concerted evolution that homogenizes a large number of rDNA gene copies, the presence of different gene variants within a genome has been reported. Characterization of an organism by defining every single variant of tens to thousands of rDNA repeat units present in a eukaryotic genome would be quite unreasonable. Here we provide an alternative approach for the characterization of a set of internal transcribed spacer sequences found within every rDNA repeat unit by implementing direct sequencing methodology. The prominent allelic variants and their relative amounts characterizing an individual can be described by a single sequencing electropherogram of the mixed amplicon containing the variants present within the genome. We propose a method for rational analysis of heterogeneity of multicopy genes by compiling a profile based on quantification of different sequence variants of the internal transcribed spacers of the freshwater sponge Ephydatia fluviatilis as an example. In addition to using conventional substitution analysis, we have developed a mathematical method, the proportion model method, to quantify the relative amounts of allelic variants of different length using data from direct sequencing of the heterogeneous amplicon. This method is based on determining the expected signal intensity values (corresponding to peak heights from the sequencing electropherogram) by sequencing clones from the same or highly similar amplicon and comparing hypothesized combinations against the values obtained by direct sequencing of the heterogeneous amplicon. This method allowed to differentiate between all specimens analysed.

Highlights

  • Ribosomal RNA genes have been widely used in taxonomy, biogeographic and phylogenetic analyses

  • Elaboration of the method The amplicons obtained from all 13 Ephydatia fluviatilis specimens analysed contained the internal transcribed sequences (ITS) region, including the 39 end of the 18S gene, the full length ITS1, 5.8S gene, ITS2, and the 59 end of the 28S gene

  • The direct sequencing electropherograms obtained for a single individual exhibited significant similarity, whereas the similarity decreased noticeably in the case of different individuals

Read more

Summary

Introduction

Ribosomal RNA genes (rDNA) have been widely used in taxonomy, biogeographic and phylogenetic analyses. It is crucial to distinguish between real informative allelic variants and the misleading single mutated copies, to achieve reproducible findings For this reason, a small number of cloned sequences may not suffice to describe the whole complement of the genome and there is no guarantee that these few sequences are the most prominent and true descriptors for the individual studied. A small number of cloned sequences may not suffice to describe the whole complement of the genome and there is no guarantee that these few sequences are the most prominent and true descriptors for the individual studied To overcome this problem, the allelic variants obtained should be quantified (either in the absolute or relative quantity they possess within the genome) in order to assess their importance (representability)

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call