Abstract
Protein sequences of the Dayhoff databank of 1984 have been analyzed to evaluate the occurrences of the 400 dipeptides and 8000 tripeptides. Expected values and standard deviations for the di- and tripeptides were determined by Monte Carlo and binomial approximation. A condensed format containing this information, labeled a uniqueness diagram, is presented and made available in the form of a microfiche.
Similar content being viewed by others
Literature
Barker, W. C., L. T. Hunt, B. C. Orcutt, D. G. George, L. S. Yeh, H. R. Chen, M. C. Blomquist, G. C. Johnson and M. O. Dayhoff. 1984. “Protein Sequence Database, January 1984 Release” (On tape). Washington, D.C.: National Biomedical Research Foundation.
Saroff, H. A. and Pretorius. 1983. “The Uniqueness of Protein Sequences.o-Uniqueness and Infrequent Peptides.”Bull. math. Biol. 45, 117–138.
Saroff, H. A. 1984. Submitted for publication.
Schlossman, S. F. and H. Levine. 1967. “Immunochemical Studies on Delayed and Arthus-Type Hypersensitivity Reactions”J. Immunol. 98, 211–219.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Saroff, H.A. The uniqueness of protein sequences. Uniqueness diagrams for the Dayhoff file—1984. Bltn Mathcal Biology 46, 661–672 (1984). https://doi.org/10.1007/BF02459509
Issue Date:
DOI: https://doi.org/10.1007/BF02459509