Abstract

Theoretical predictions are given for the progress expected, when mapping DNA by identifying clones containing specific unique sequences. Progress is measured in three ways; however, all results depend on (dimensionless counterparts of) the number of clones and the number of unique sequences used. Furthermore, the effects of clone length dispersion are included in the theoretical predictions. Both the clones in the library and the unique sequences are assumed to be generated randomly, with uniform probability of originating at any base in the region to be mapped. The first measure of progress is the expected length fraction of the region to be mapped covered by at least one clone, when clones containing at least one unique sequence are included in the map. The second measure of progress is the expected length fraction of the region to be mapped in “covered intervals”, an interval being the region between adjacent unique sequences. Alternative definitions for clones covering an interval are analyzed. The third measure of progress is the expected number of clone islands generated; an island covers successive intervals. Finally, using these measures of progress, we compare the efficiency of this new mapping strategy with conventional clone mapping strategies.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call