Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes

Journal of Molecular Evolution - Tập 20 - Trang 313-321 - 1984
Susumu Ohno1
1Beckman Research Institute of the City of Hope, Duarte, USA

Tóm tắt

Three outstanding properties uniquely qualify repeats of base oligomers as the primordial coding sequences of all polypeptide chains. First, when compared with randomly generated base sequences in general, they are more likely to have long open reading frames. Second, periodical polypeptide chains specified by such repeats are more likely to assume either α-helical or β-sheet secondary structures than are polypeptide chains of random sequence. Third, provided that the number of bases in the oligomeric unit is not a multiple of 3, these internally repetitious coding sequences are impervious to randomly sustained base substitutions, deletions, and insertions. This is because the recurring periodicity of their polypeptide chains is given by three consecutive copies of the oligomeric unit translated in three different reading frames. Accordingly, when one reading frame is open, the other two are automatically open as well, all three being capable of coding for polypeptide chains of identical periodicity. Under this circumstance, a frame shift due to the deletion or insertion of a number of bases that is not a multiple of 3 fails to alter the downstream amino acid sequence, and even a base change causing premature chain-termination can silence only one of the three potential coding units. Newly arisen coding sequences in modern organisms are oligomeric repeats, and most of the older genes retain various vestiges of their original internal repetitions. Some of the genes (e.g., oncogenes) have even inherited the property of being impervious to randomly sustained base changes.

Tài liệu tham khảo

Alexander F, Young PR, Tilghman SM (1984) Evolution of the albumin:α-fetoprotein ancestral gene from the amplification of a 27 nucleotide sequence. J Mol Biol 173:159–174 Bibb MJ, Van Etten RA, Wright CT, Walberg MW, Clayton DA (1981) Sequence and gene organization of mouse mitochondrial DNA. Cell 26:167–180. Blake C (1983) Exons—present from the beginning? Nature 306:535–537 Bridson PK, Orgel LE (1980) Catalysis of accurate poly (C) directed synthesis of 3′–5′-linked oligoguanytes by Zn+2. J Mol Biol 144:567–577 Crewther WG, Inglis AS, McKern NM (1978) Amino acid sequences of α-helical segments fromS-carboxymethylkerateine-A. Complete sequence of a type-II segment. Biochem J 173:365–371 Dayhoff MO (ed) (1972) Atlas of protein sequence and structure. National Biomedical Research Foundation, Silver Spring, Maryland DeVries AL (1982) Biological antifreeze agents in cold water fishes. Comp Biochem Physiol [A] 73:627–640 Douthart RJ, Norris FH (1982) Events in the evolution of preproinsulin. Science 217:729–732 Downward J, Yarden Y, Mayes E, Scrace G, Totty N, Stockwell P, Ullrich A, Schlessinger J, Waterfield MD (1984) Close similarity of epidermal growth factor receptor and v-erb-B oncogene protein sequences. Nature 307:521–527 Eigen M, Schuster P (1977) The hypercycle. A principle of natural self-organization. Part A: Emergence of the hypercycle. Naturwissenschaften 64:541–565 Ferris SD, Whitt GS (1977) Loss of duplicate gene expression after polyploidisation. Nature 265:258–260 Geisler N, Weber K (1981) Comparison of the proteins of two immunologically distinct intermediate sized filaments by amino acid sequence analysis: desmin and vimentin. Proc Natl Acad Sci USA 78:4120–4123 Gō M (1983) Modular structural units, exons and functions in chicken lysozyme. Proc Natl Acad Sci USA 80:1964–1968 Hotta Y, Benzer S (1976)Drosophila mosaics and sex-specific foci for sequential behavior pattern. Proc Natl Acad Sci USA 73:4154–4158 Hoyle F (1977) Ten faces of the universe. Freeman Press, London Jukes TH (1966) Molecules and evolution. Columbia University Press, New York, p 69 Jukes TH (1983) Mitochondrial codes and evolution. Nature 301:19–20 Kornberg A (1982) DNA Replication, 1982 Suppl. W. H. Freeman, San Francisco Mak AS, Smillie LB, Steward GR (1980) A comparison of the amino acid sequences of rabbit skeletal muscle α- and β-tropomyosins. J Biol Chem 255:3647–3655 Mardon G, Varmus HE (1983) Frameshift and intragenic suppressor mutations in a Rous sarcoma provirus suggestSRC encodes two proteins. Cell 32:871–879 McLaren A (1976) Mammalian chimaeras, development and cell biology 4. Cambridge University Press, Cambridge London New York Melbourne Sydney Miller SL, Orgel LE (1974) The origin of life on the Earth. Prentice-Hall, New York Ohno S (1970) Evolution by gene duplication. Springer-Verlag, Heidelberg Berlin New York Ohno S (1979) Major sex determining genes. Springer-Verlag, Heidelberg Berlin New York (Endocrinology, monograph series, vol 11) Ohno S (1981) Original domain for the serum albumin family arose from repeated sequences. Proc Natl Acad Sci USA 78: 7657–7661 Ohno S (1984) The birth of a new enzyme from an alternate reading frame of the preexisted, internally repetitious coding sequence. Proc Natl Acad Sci USA 81:2421–2425 Ohno S, Yazaki A (1983) Simple construction of humanc-myc gene implicated in B-cell neoplasmas and its relationship with avianv-myc and human lymphokins. Scand J Immunol 18: 373–388 Ohno S, Matsunaga T, Epplen JT, Itakura K, Wallace RB (1982) Identification of the 45-base-long primordial building block of the entire class I major histocompatibility complex antigen gene. Proc Natl Acad Sci USA 79:6342–6346 Orgel LE (1968) Evolution of the genetic apparatus. J Mol Biol 38:381–393 Ozaki LS, Svec P, Nussenzweig RS, Nussenzweig V, Godson GN (1983) Structure of thePlasmodium knowlesi gene coding for the circumsporozoite protein. Cell 34:815–822 Taniguchi T, Matsui H, Fujita T, Takaoka C, Kashima N, Yoshimoto R, Hamuro J (1983) Structure and expression of a cloned cDNA for human interleukin-2. Nature 302:305–310 Watt R, Stanton LW, Marcu KB, Gallo RC, Croce CM, Rovera G (1983) Nucleotide sequence of cloned cDNA of humanc-myc oncogene. Nature 303:725–728 Yazaki A, Ohno S (1983) The recurrence of 49 base decamers, nonomers and octamers within mouse Ig CμH genes and its primordial building block. Proc Natl Acad Sci USA 80:2338–2340