ISSN:
0219-3094
Keywords:
62E20
;
62E25
;
92C40
;
protein threading
;
extreme value statistics
;
Poisson clumping heuristic
;
probabilities related to uniform distributions
Source:
Springer Online Journal Archives 1860-2000
Topics:
Mathematics
Notes:
Abstract In protein threading, one is given a protein sequence, together with a database of protein core structures that may contain the natural structure of the sequence. The object of protein threading is to correctly identify the structure(s) corresponding to the sequence. Since the core structures are already associated with specific biological functions, threading has the potential to provide biologists with useful insights about the function of a newly discovered protein sequence. Statistical tests for threading results based on the theory of extreme values suggest several combinatorial problems. For example, what is the number of waysm′=# t {L i 〉x i } i =0n of choosing a sequence {X i } i =1n from the set {1, 2, ...,t}, subject to the difference constraints {L i =X i+1−X i 〉x i } i =0n , whereX 0=0,X n+1=t+1, and {x i } i =0n is an arbitrary sequence of integers? The quantitym′ has many attractive combinatorial interpretations and reduces in special continuous limits to a probabilistic formula discovered by the Finetti. Just as many important probabilities can be derived from de Finetti's formula, many interesting combinatorial quantities can be derived fromm′. Empirical results presented here show that the combinatorial approach to threading statistics appears promising, but that structural periodicities in proteins and energetically unimportant structure elements probably introduce statistical correlations that must be better understood.
Type of Medium:
Electronic Resource
URL:
http://dx.doi.org/10.1007/BF01609878
Permalink