ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Hits per page

hit 1 - 1 | 1 hit

Select All Export

Electronic Resource

Amino acid composition and hydrophobicity patterns of protein domains correlate with their structures (1985)

Sheridan, Robert P. ; Dixon, J. Scott ; Venkataraghavan, R. ; [et al.]

New York : Wiley-Blackwell

Biopolymers 24 (1985), S. 1995-2023

add to mindlist on the mindlist

Details

ISSN: 0006-3525

Keywords: Chemistry ; Polymer and Materials Science

Source: Wiley InterScience Backfile Collection 1832-2000

Topics: Chemistry and Pharmacology

Notes: We examine the correlation between the sequence and tertiary structure for 212 domains from globular proteins and polypeptides. The sequence of each domain is described as a set of 25 features: the mole percent of 20 amino acids, the number of residues in the domain, and the abundance of four simple patterns in the hydrophobicity profile of the sequence. Each domain, then, is described as a location in 25-dimensional sequence-feature space. We use pattern-recognition methods to find the two axes through the 25-dimensional sequence-feature space that best discriminate, respectively, predominantly α-helix domains from predominantly β-strand domains (the “secondary structure vector,” SV) and parallel α/β domains from other domains (the “parallel vector,” PV). When we divide the domains into two categories based on whether the cysteine content is above (CYS-RICH) or below (NORMAL) 4.5%, we find the secondary structure vector for the subset of CYS-RICH domains points in a significantly different direction than the equivalent vector for the NORMAL domains. Thus, CYS-RICH and NORMAL, domains are best treated separately. The secondary structure vector and the parallel vector for NORMAL domains describes statistically meaningful information, but the secondary structure vector for CYS-RICH domains may not be as reliable. We show how the secondary structure content of a NORMAL domain can be predicted by projecting the domain in the feature space onto the secondary structure vector. We subdivide the domains into five structural classes based on whether there is a parallel or mixed β-sheet in the domain and whether there are more helix or strand residues: NORMAL ALPHA, NORMAL BETA, NORMAL PARALLEL, CYS-RICH ALPHA, and CYS-RICH BETA. When we project the NORMAL domains onto the plane containing the origin of the feature space and SV and PV, we see that ALPHA, BETA, and PARALLEL, domains cluster in the plane, with the BETA cluster partially overlapping the PARALLEL cluster. The separations between the clusters are such that, by looking at the location of any given NORMAL domain in the plane, we can correctly predict its structural class with 83% accuracy. CYS-RICH ALPHA and BETA domains cluster when projected onto the CYS-RICH SV vector, and the classes can be preducted with 83% accuracy, but this accuracy for CYS-RICH domains may not be statistically meaningful.

Additional Material: 2 Ill.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1002/bip.360241011

	Location	Call Number	Expected	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

hit 1 - 1 | 1 hit