Some mathematical aspects of mapping DNA cosmids
Tóm tắt
A number of experimental and mathematical problems must be solved before high resolution physical maps of mammalian chromosomes can be reliably determined. Such a map might consist of an ordered set of nonsequenced, overlapping DNA fragments 20,000-40,000 bases long, produced by digestion of a chromosome, using two restriction enzymes. Map construction requires assigning a signature to each fragment that differentiates it unambiguously from every other fragment, and then devising a computationally efficient algorithm that will provide a unique ordering of the fragments. In the first part of this paper we present a polynomial time algorithm that yields a unique map, and is largely independent of the method for assigning signatures. In the next section we analyze the distribution of lengths of restriction digest fragments and discuss the implications for the algorithm, including the expected number of map gaps. Finally, we discuss a specific method for assigning signatures proposed by Hans Lehrach, based on which of a panel of probes binds to a given fragment. In particular we examine the effects of fragment length heterogeneity on the theoretical optimum length and number of probes, and the extent to which false signatures might be obtained by nonspecific binding. We conclude that the Lehrach strategy is effective provided the number of probes is >-150, but that each fragment will need testing with at most 25 probes.
Tài liệu tham khảo
Pearson, W. R. (1982),Nuc. Acids Res. 10, 217.
Polner, G., László, D., and László, O. (1984),Nuc. Acids Res. 12, 227.
Petola, H., Söderlund, H., and Ukkonen, E. (1984),Nuc. Acids Res. 12, 307.
Grimes, R. A., Travers, P., and Engelberg, A. (1986),Nuc. Acids Res. 14, 87.
Staden, R. (1986),Nuc. Acids Res. 14, 217.
Zehetner, G., and Lehrach, H. (1986),Nuc. Acids Res. 14, 335.
Goldstein, L., and Waterman, M. S. (1987), Mapping DNA by stochastic relaxation,Advances in Applied Mathematics 8, 194.
Bitensky, M. (1986), Genome sequencing workshop, March 3 and 4, 1986, Santa Fe, New Mexico. Internal Publication, Los Alamos National Laboratory, Los Alamos, NM.
Poutska, A., Pohl, T., Barlow, D. P., Zehetner, G., Craig, A., Michiels, F. Ehrich, E, Frischauf, A. M., and Lehrach, H. (1986),Cold Spring Harbor Symposia on Quantitative Biology 51, Part I, 131.
Coulson, A., Sulston, J., Brenner, S, and Karn, J. (1986),Proc. Nat. Acad. Sci. USA 83, 7821.
Feller, W. (1950),Probability Theory and its Applications, Wiley, New York.
Waterman, M. S. (1983),Nuc. Acids Res. 11, 8951.
Breen, S., Waterman, M. S., and Zhang, N. (1985),J. Appl. Prob.,22, 228.
Poland, D., and Scheraga, H. A. (1970)Theory of Helix-Coil Transitions in Biological Macromolecules, Academic, New York.
Breslauer, K. J., Frank, R., Blöcker, H., and Marky, L. (1986),Proc. Nat. Acad. Sci. USA,83, 3746.
Burke, D. T., Carle, G. F., and Olson, M. W. (1987),Science 236, 806.
