Professional Documents
Culture Documents
Synopsis
For a successful analysis of the relation between amino acid sequence and protein structure,
an unambiguous and physically meaningful definition of secondary structure is essential. We
have developed a set of simple and physically motivated criteria for secondary structure,
programmed as a pattern-recognition process of hydrogen-bonded and geometrical features
extracted from x-ray coordinates. Cooperative secondary structure is recognized as repeats
of the elementary hydrogen-bonding patterns turn and bridge. Repeating turns are
helices, repeating bridges are ladders, connected ladders are sheets. Geometric
structure is defined in terms of the concepts torsion and curvature of differential geometry.
Local chain chirality is the torsional handedness of four consecutive Ca positions and is
positive for right-handed helices and negative for ideal twisted @-sheets. Curved pieces are
defined as bends. Solvent exposure is given as the number of water molecules in possible
contact with a residue. The end result is a compilation of the primary structure, including
SS bonds, secondary structure, and solvent exposure of 62 different globular proteins. Th e
presentation is in linear form: strip graphs for an overall view and strip tables for the details
of each of 10,925 residues. The dictionary is also available in computer-readable form for
protein structure prediction work.
INTRODUCTION
Background
a-Helices and pleated @-sheetswere predicted in 1951 by Linus Pauling
and Robert Corey on the basis of hydrogen-bonding and cooperativity
criteria. They were seen later, and beautifully, in the first structures shown
in atomic detail by x-ray crystallography. Since then, the number of known
protein structures has risen to over 100 and comprehensive analysis of
secondary structure requires a computerized compilation of structure as-
signments, especially in the context of structure prediction methods.
Existing compilations have various shortcomings. The crystallographers
assignments of secondary structure in the Brookhaven Protein Data Bank2
are often subjective and sometimes incomplete. Objective algorithms exist,
e.g., for defining turn^^-^ (reviewed in Refs. 7, 8), P-sheets? and solvent
accessibility,1 but only Levitt and Greerll have published an extensive
compilation of automatic assignments of helices and sheets. Their ap-
DEFINITIONS
The definitions of H-bonded features form a hierarchy: first H bonds
are defined; based on them, turns and bridges; and, based on them, a-helices
and @-ladders,including common imperfections such as helical kinks and
@-bulges. Features defined geometrically are bends, chirality, SS bonds,
and solvent exposure. Each structural feature is defined independently
of the others and structural overlaps are resolved by defining a secondary
structure summary that assigns a single state to each residue. For brevity
we express the pattern definitions in the form of equations. For example,
DICTIONARY OF PROTEIN SECONDARY STRUCTURE 2579
I
:40 -
120 -
0-c
hF;\
100 -
e
40
20 \\ \
\ \ \
\ I \ \
0
2.5 3.0 3.5 4.0 4.5 5.0 5.5
Fig. 1. H bond between peptide units is described here by the dominant electrostatic part
E (see text) of the H-bond energy, drawn in contours of constant E a t 0.5 kcal/mol intervals
as a function of the distance, d , and the alignment angle 0. Dotted lines, E positive or zero;
broken lines, E negative. An ideal H bond has d = 2.9 A, 19 = 0, and E = -3.0 kcal/mol. We
assume an H bond for E up to - 0.5 kcal/mol (solid line). Thus, misalignment of up to 63
is allowed a t the ideal length; an N - 0 distance of up to d = 5.2 8, is allowed for perfect align-
ment. This definition of H bonds is particularly simple and physically meaningful. It is more
general than the historical definition of hydrogen bond and could be called polar interac-
tion.
Hydrogen-Bonded Structure
Hydrogen Bonds
Hydrogen bonds in proteins have little wave-function overlap and are
well described by an electrostatic m0de1.l~ We calculate the electrostatic
interaction energy between two H-bonding groups by placing partial
charges on the C,O (+q1, - q1) and N,H (-q2, q 2 ) atoms, i.e., +
E = qlq2(1/r(0N) + l/r(CH) - l/r(OH) - l/r(CN))*f
with q1 = 0.42e and q 2 = 0.20e, e being the unit electron charge and r(AB)
the interatomic distance from A to B. In chemical units, r is in angstroms,
the dimensional factor f = 332, and E is in kcal/mol. A good H bond has
about -3 kcal/mol binding energy. We choose a generous cutoff to allow
for bifurcated H bonds and errors in coordinates and assign an H bond
between C=O of residue i and N-H of residue j if E is less than the cutoff,
i.e., Hbond(ij)=: [E < -0.5kcal/mole].
Figure 1illustrates the relation of this one-parameter definition to the
2580 KABSCH AND SANDER
3-turn
> 3 3 < no t a t i o n
residues
H-bond
4-turn
notation
residues
H-bond
5-turn
notation
residues
H-bond
p a r a l l e l bridge
not a t ion
residues
H-bonds
( \ and / , or .)
residues
notation
a n t i p a r a l l e l bridge
notation
residues
H-bonds
( ! o r .)
residues
notation
bend
-
notation
re s f due s
d i r e c t i o n change
more t h a n 70
degrees
chirality
f o u r C(a1pha) atoms
define t h e dihedral
angle alpha
(b)
Fig. 3. Stereoviews of secondary structure: (a) 3-helix (3lo-helix) and (b) 5-helix (*-helix).
(a) 3-Helix Gly232-Lys237 from triose phosphate isomerase (ITIM). In Table AIII, it appears
as the H-bond pattern
S-TURN
SUMMARY
SEQUENCE
3-Helices are not uncommon, but have only two or three weak H bonds with E about -1
kcal/mol and the C=O direction tilted away from the helix axis typically by 30". (b) 5-Helix
Gly181-Lys188 from alcohol dehydrogenase (4ADH), a t the C-terminal end of a 4-helix. In
Table AIII, it appears as the H-bond pattern
.!-TURN
SEQUENCE
5-Helices are extremely rare; the longest one, shown here, has three H bonds. All stereoviews
are by PLUTO (Sam Motherwell, unpublished). In Figs. 3 and 5, the larger atoms are backbone
atoms with 1/4 their hard-sphere radius (C-, 0.47; C of CO, 0.44; 0,0.35; N, 0.41 A) and in Fig.
4 with twice these values; side-chain atoms are small, with 0.20-A radius.
2584 KABSCH AND SANDER
(b)
Fig. 4. Stereoviews of secondary structure: (a) antiparallel and parallel P-sheets with two
ladders (three strands) each. (a) Two connected antiparallel P-ladders from trypsin (IPTN).
The three participating strands are Va116(31)-Ser20(37), Ile46(63)-Gly51(69), and
Glue62(80)-Ala67(85), where the first number is the sequential residue number from Table
A111 and the number in parentheses the authors residue identifier. The corresponding H-
bond notation (Table AIII) is
SHEET.. ................. CCC.. .............. C C C C ........... . C C C C . . ..
BRIDGEZ.. ................................... NNNN... ..................
BRIDGE1 ................. KKK ............... K K K . . ............NNNN.. ..
SEQUENCE .............. VSLNS.. ............ IQVRLG.. ........ E Q F I S A ..
The middle strand participates in two ladders. Both ladders belong to sheet C. (b) Two
connected parallel P-ladders, Arg172(189)-Gly177(194), Thrl96(213)-Ile200(217),
Asp266(283)-Ala271(288) from glutathione reductase (BGRS). The corresponding H-bond
notation (Table AIII) is
SHEET .................EEEE ................ E E E ............. E E E E ....
BRIDGEZ..... ............ 1 1 I ...........................................
BRIDGE1 ................ k k k k ................ I 1 I ............. k k k k .....
SEQUENCE.. .......... . R S V I V G . . ............ T S L M I . . ......... DCLLWA ...
The first strand has two ladder partners. The three strands are part of sheet E.
DICTIONARY OF PROTEIN SECONDARY STRUCTURE 2585
are formed. This possibility is implicit in the above helix definition, e.g.,
two overlapping minimal helices offset by two or three residues are joined
into one helix:
))44( ( ))44((
+ ))44(( + ) )44((
= ) ) 4 )x ( 4( ( irregular = ) ) 4 4 ~ x 44( ( irregular
= HHHHHHH = HHHHHHHH
)) ) )x( ( ( ( perfect > > ) > X X ( ( ( ( perfect
even though the third and/or fourth H bond is missing, compared to a
perfect seven- or eight-residue helix. Such imperfections are often asso-
ciated with a kink in the helix, e.g., due to a proline residue.
For 6-structure, we define explicitly: a bulge-linked ladder consists of
two (perfect) ladders or bridges of the same type connected by at most one
extra residue on one strand and at most four extra residues on the other
strand. This definition follows Richardson's* observation of 6-bulges, a
frequent lattice fault in ,&sheets, but. includes more general bulges than
her main types. In naming ladders, a bulge-linked ladder is treated as one
ladder (lines BRIDGE). In line SUMMARY, all residues in bulge-linked
ladders are marked "E," including the extra residues.
Geometrical Structure
Bend
Bends are regions with high curvature. We quantify chain curvature
at the central residue i of five residues as the angle between the backbone
direction of the first three and the last three residues. This definition of
curvature is identical to that of Rose and Seltzer5 but slightly different from
that of Rackovsky and Scheraga.I5 For a bend at i, we require a curvature
of a t least 70". The cutoff value was chosen by visual inspection of three-
dimensional traces. With C" the position vector of C", we define
Bend(i) =: [angle ( ( C W - Ca(i - 2)),(C"(i + 2) - C"(i))) > 70"]
and assign "S" for a bend at residue i.
Chirality
We define chirality at each residue (except at the ends of the chain) as
(Fig. 2)
a(i) = dihedral angle(C"(i - l),C"(i),C"(i + l),C"(i + 2))
but report only the sign of a in Table AIII: "+"
if 0" < a < 180" and "-"
if -180" < a < 0". Note that most helices have positive, most twisted
,&ladders negative, chirality. We have found only one left-handed helix,
in thermolysin. This rare specimen is shown in Fig. 5.
2586 KABSCH AND SANDER
CHIRALITY _--
4-TURN ))44( (
SUMMARY HHHH
SEQUENCE QDNGGV
SS Bonds
SS bonds, i.e., covalent links between the S Y atoms of two Cys residues,
are taken directly from the Data Bank SSBOND records, as they can be
considered part of the amino acid sequence (primary structure). For the
coordinate data sets used here, an S-S distance of less than 3.0 8, can also
serve as a definition. The SS bonds are given names a,b,c . . . , and the
participating residues noted by this name in the line SEQUENCE in Table
AIII. Thus, Cys appears in the amino sequence either as C or as a lower-
case letter.
Chain Breaks
Chain breaks are assumed if the peptide bond length (distance C-N)
exceeds 2.5 A. They are labeled ! and counted as a break residue. Thus,
! may reflect the absence of a chemical peptide bond, missing density in
the crystallography map, or coordinate errors. The residues for which there
are coordinates in the data set are numbered sequentially, including break
residues. The resulting residue numbers often agree with the authors
except for proteins numbered according to sequence homology or those with
missing density or chain breaks. In any case, inspection of the amino acid
sequence in Table A111 always allows unambiguous identification of a
residue.
DICTIONARY OF PROTEIN SECONDARY STRUCTURE 2587
Structure Summary
To make contact with the usual notation of secondary structure and to
facilitate comparison with intuitive assignments, we summarize secondary
structure in a single line (SUMMARY in Table AIII). Structural overlaps
are eliminated in this line by giving priority to H,B,E,G,I,T,S in this order,
i.e., when several symbols coincide, the first one in this list is written. For
example, a helix is also a series of bends, but the state helix is given higher
priority. Pieces of 3- or 5-helix, reduced to less than minimal size due to
overlaps, are labeled T. A blank, by implication, means a piece of low
curvature not in H-bonded structure.
all side-chain atoms,17 and 1.40 for a water molecule following observed
water-protein distances (Ref. 18 as cited in Ref. 19).
In Table AIII, we report the average number, W ,of water molecules in
contact with each residue. W can be estimated from the surface area by
Area
W= =-Area
V(water molecule)2/3 10
since the surface is proportional to the volume of the monolayer, which,
in turn, is proportional to the average number of molecules in the mono-
layer. For a water molecule volume of 30 A3 and area in A2, the conversion
factor is 9.65 N 10. Note that solvent exposure differs for a monomer and
a dimer: here, it is calculated in the presence of all monomers in the data
set (Table AI) but omitting HETATOMs (substrates, ligands, heme, etc.).
The sum over all residues is the total solvent exposure of the protein.
Choice of Proteins
Of the more than 100 coordinate data sets in the Protein Data Bank,2
about 75 have complete backbone coordinates and a known amino acid
sequence. When two protein data sets had more than a 50% sequence
homology, i.e., identical amino acids in equivalent positions, the one with
higher resolution, better refinement, or more secondary structure was
chosen as representative, e.g., the first one was chosen of these pairs: serine
proteinase lSGA=lSGB by 61%;lactate dehydrogenases 4LHD=lLDX
by 63%; carbonic anhydrase lCAC=lCAB by 60%; chymotrypsin
2GCH=2CHA by 98%. Both were chosen of the following pairs: sulfhy-
dry1 proteinases actinidin/papain 2ACT=8PAP by 47%; immunoglobulins
lFAB=lREI by 47%; cytochrome c550/c2 155C=lC2C by 43%; chymo-
trypsin/trypsin 2GHA=lPTN by 42%; elastase/trypsin lEST=lPTN by
38%; acid protease/penicillopepsin lAPR=lAPP by 43%;a l p subunit of
hemoglobin 2MHB(a)=2MHB(P) by 44%. The final 62 data sets thus
cover essentially all known different protein structures, except those not
deposited with the protein data bank (Table AI).
H-Bonded Structure
Backbone-backbone H bonds can be simply classified by the number
+
of residues they bracket or, in our notation, by n of (i,i n ) = (CO(i),NH(i
+ n ) ) . Let us discuss the structural role of H bonds for each n.
H bonds n = 0 and n = 1are sterically disallowed. A hydrogen bond (i,i
+ 2) can be formed between two consecutive peptide units for certain 4,$
+
values of residue i 1. This local conformation is known as C7 and leads
to an extended strand roughly similar to a P-strand if it repeats. When
it occurs as part of a tight turn, that turn is sometimes called a y-turn.
DICTIONARY OF PROTEIN SECONDARY STRUCTURE 2589
30K
30
ia 5 10
Length of &leltx
15 20 25 10 5 10 15
30 lb)
5 10 15 20 25
Length of p-Ladder lparollell
Fig. 6. The common feature of the size distribution of secondary structure segments is the
gradual fall-off: larger sizes are less probable than smaller ones. Note that we give (b,c) the
length of 8-ladders (strand pairs) rather than the length of 6-strands. A strand is often longer
than the ladders in which it participates, since sheets tend to be trapezoidal rather than rec-
tangular in shape. The number of bulge-linked ladders per sheet (d) is given as an indication
of the width of the sheet. The width of a ladder is about 5 p\. In an ideal sheet, center strands
take part in two ladders, edge strands in one: the number of ladders is equal to the number
of strands minus one. In general, however, one strand can participate in more than one ladder
on each side and the width of the sheet less than the number of ladders times 5 A. Note:
sheets consisting of a single bridge are not included in the histogram of ladders per sheet. (e)
+
Number of H-bonds of type (CO(i),NH(i n ) ) . Due to the nature of L-amino acids, positive
n are heavily favored. The dominant peak a t n = 4 represents a-helices and 4-turns. We
+ +
find that H bonds (i,i 2) and (i,i 3) are surprisingly common, though generally weak.
Using our H bond definition, we find that many P-strands have, in addition
+
to the main interstrand H bonds, minor (i,i 2) intrastrand H bonds [see
peak in Fig. 6(e)]. These reflect part of the electrostatic stabilization of
extended conformations due to the polar interaction of the C - 0 and N-H
groups of adjacent peptide units, first shown by Florys group20to be es-
sential in stabilizing the C7 conformation in solution. We speculate that
P-strands originate as extended C7 strands as the protein folds up. Outside
of @strands, we typically find one or two weak ( E < -1.0 kcal/mol) (i,i +
2) H bonds per 100 residues, but most of them are neither repeating nor
part of a tight turn.
2590 KABSCH AND SANDER
TABLE I
Comparison of Secondary Structure Assignments for Three Proteins of Higher, Medium,
and Lower Resolution
Original Levitt
Authors & Greer This Work
Structurea (AU) (LG) (KS)
3PTI
G1 -h 2-7 ?-6 Clearly 310; LG have N
El 16-25 14-25 18-24
E2 28-36 28-37 29-35
E3 -b 43-46 45-45 @-Bridge,2 H bonds
H1 47-56 47-55 48-55
155C
H1 6-11 4-16'' 6-12 4-Turn 13-16
G1 11-13 - 11-13 Overlaps with H1
El - 17-23h 19-20 AU have 2 H bonds; KS, 4
E2 - 26-3 1 - Discontinuity a t Asp28-He29
E3 - 33-39 35-37 AU have 2 H bonds; KS have 4
H1 - 40-44h - KS have 3-Turn
H2 56-63 55-65 56-64
H3 73-79 7 1-80 73-80
H4 - 81-9Ob - Pro a t 82,84; possible helix
H5 107-118 106-118 107-117
2ADK
H1 1-8 1-7 2-7
El 10-14 8-15 10-14
H2 23-30 21-31 23-31
E2 35-38 34-38 35-38
H3 41-48 39-49 39-48
H4 53-62 52-61 52-62
H5 69-84 68-83 69-83
E3 90-94 88-95 90-93
H6 100- 107 100-109 101-108
E4 114-1 I8 113-120 114-118
Hi I 23- 133 12 1- 136" 122-132 <?-Helixends in 3-turn
H8 144-158 141-1<57'] 143-157 N o (i,i + 4) H bond a t Asp 141
H9 160-164 159-166 160-167h Two weak H bonds a t 167,168
E.5 169-1 73 169-175 170-1 7 3
HI0 179-194 179-192 179-193
a H = n-helix, G = 31,)-helix,E = @-strand. 3PTI = pancreatic trypsin inhibitor, 1.5-A
resolution, Diamond real-space refinement (Re!. 25). 155C = cytochrome c550,2.5-A reso-
lution, Diamond model building to guide coordinates, assignments derived from the H-bonding
diagram of Ref'. 27. 2ADK = adenylate kinase, :LO-.& resolution, unrefined (Ref. 28).
Serious discrepancy (segment missing or boundary different by three or more resi-
dues).
2) in approximate y-turns a t Glu2, Gly40, Lys 53, and Lys88; two are (i,i
+ 4) H bonds at the end of a-helices; two are (i,i - 3) and (i,i - 6) in the
loop region Gln22-Asp28; and one is involved in forming the heme pocket
by a tertiary contact between Thr80(CO) at a helix end and MetlOS(NH)
in an extended strand. All of these have a meaningful structural inter-
pretation. The resulting secondary structure assignments are consistent
with the authors H-bond list, except for the additional short parallel bulged
&strand pair, 19-20135-37, which is due to two additional weak H bonds.
Levitt and Greer assign considerably more secondary structure (Table
I), including a much longer parallel 0-sheet 17-23/33-39 (probably too
long), a 0-strand 26-31 (roughly antiparallel to 17-23), a helix 40-44 (we
assign a 3-turn), and a longer helix 81-90 (which has only two of the seven
possible H bonds but looks very much like a helix in a C* chain tracing and
therefore may be seen to be a helix a t higher resolution).
For the unrefined, lower-resolution structure of adenylate kinase
( 2ADK28),all secondary structure assignments (ours, the original authors,28
and Levitt and Greersl) are similar. Other lower-resolution coordinate
data sets show more discrepancies, depending on the quality of the H
bonds.
This detailed comparison shows that our H-bond energy cutoff, chosen
out of necessity to allow for coordinate errors in lower-resolution data,
typically leads to 50% more H bonds than conservative assignments in
higher-resolution data (example, 3PTI). All these have a physical meaning
in terms of electrostatic interaction energy and nearly all have an inter-
pretation in terms of canonical secondary structure; and, most importantly,
the increased number of H bonds does not give rise to spurious secondary
structure assignments.
H-bond assignments become less certain for some lower-resolution data.
For example, in the data sets lAPR, 3PGM, and lABP, Richardson8 sees
a number of 0-strands, which, in Table AIII, do appear as uncurved (non-
S) strands but with relatively few H-bonded bridges between them. A t
least for lAPR, only partially refined at 2.5-A resolution with tentative
amino acid sequence, one may expect that more H bonds will form in the
,f3-sheetson further refinement.
We conclude that our criteria for H-bonded secondary structure are
relatively strict, in spite of a generous cutoff in the H-bonding energy. For
higher-resolution data sets, our assignments are more accurate than those
of Levitt and Greer,l and for lower-resolution data, they are conservative
compared with both Levitt and Greers program and Richardons8 visual
processing.
OUTLOOK
The structure of influenza virus h e m a g g l ~ t i n i nwith
, ~ ~ its 50-residue
helix, shows that our data base certainly does not exhaust all possible
variations in protein architecture. In spite of this limitation, this compi-
lation will be used in the ongoing development of protein structure pre-
diction methods.
Notes to Table A1
Proteins are ordered by function and can be found in the strip tables (Table AIII) and strip
maps (Fig. AI) by their running number. Yo a-helix, YOP-antiparallel, YO¶llel = number
of H bonds per 100 residues of type 4-turn, parallel and antiparallel bridge; these percentages
can be compared with results from spectroscopy (CD, Raman, ir). Exposure = estimated
number of water molecules in contact with protein surface (first hydration shell); it can also
be read as the static exposed surface area in units of 10 A?. Exposure is calculated for the
entire data set and then divided by the multiplicity of sequence-unique molecules, e.g., the
data set lINS has two copies each of the insulin A- and B-chain (multiplicity 2). Exposure
given is that of the A- and B-chain in the tetramer. Number of residues is also for the se-
quence-unique molecule. Crystallographic resolution (A) and refinement give some indication
of the quality of the coordinates; both are taken from the Data Bank without further checking.
In case of doubt, consult the original papers. Refinement code: D1 = Diamond model
building to guide coordinates (Ref. 30); D2 = Diamond real-space refinement (Ref. 31); HK
= Hendrickson-Konnert (Ref. 32); DO = Dodson, Isaacs, and Rollett (Ref. 33); J L = Jack
and Levitt (Ref. 34); DS = Deisenhofer and Steigemann (Ref. 25); DF = difference Fourier;
DC = difference Fourier with constraints; FD = difference Fourier and D1; LS = least squares;
R L = restrained least squares; CL = constrained least squares; S D = steepest descent; LL
= energy minimization of Levitt and Lifson (Ref. 35); H H = D2 and Hermans REFINE2 and
HK; DD = DS and D2; DL = D F and LS; DJ = D2 and JL; AD = Agarwal least squares (Ref.
36) and DO; DH = D2 and HK; DE = D2 and LL; MD = energy minimization of McQueen
and D O CS = constrained difference Fourier of Chambers and Stroud (Ref. 37); RE = real
space and energy minimization; CC = constrained crystallographic refinement; CD = D2 and
CORELS (Ref. 38).
2594 KABSCH AND SANDER
TABLE A1
List of 62 Different Globular Proteins
........... I A L P H A H E L I C A L AND 4-TURN HYDROGEN BONDS
I B E T A ANTIPARALLEL HYDROGEN BONDS
. . % BETA PARALLEL HYDROGEN BONDS
.. .. . . . . ... . . . . . . . . . . . . . .
WATER E X P O S U R E
.. . X U L T I P L I C I T Y OF D A T A S E T
. .. .................
.. . . . ... . . .... . . ..
N U 3 B E R O F RESIDUES
. . RESOLUTION
.. .. .. . .. . . .. . .. . .. REFINEHENT
ZAH XBA %BP EXPO M LEN RES RF . PROTEIN I D E N T I F I E R , N A X E
hlnding proceins
38 4 0 610 I08 1.9 D F I ) lCPV CALCIUM-BINDING P4RV4LBUXIN B
31 1 6 1423 306 2.4 - - 2) 1ABP L-ARABIUOSE-BINDING P R O T E I X
eleccio" transfer
7 I3 2 690 85 2.0 DF ?) I'iIP O X I D I Z E D H I G H P O T E N T I A L I R O N PROTEIN (HIPIPI.
election Lrd"SP0rt
19 15 7 566 85 CY T V C H R O N E
57 0 0 665 103 CYTOCHROXE
34 2 0 620 2 I03 CYTOCHROlE
23 2 2 642 112 CYTOCHROXE
23 0 3 781 I34 CYTOCHRUXE
38 2 0 482 a2 CYTOCHROXE
9 9 0 316 54 FERREDOXIN (PEPTOCOCCUS AEROGENES)
0 0 0 623 98 FERREDOXIN (SPIRULINA PLATENSIS)
30 I 18 715 I38 F L A V O D O X I N IOKIDILED)
I7 0 376 54 RUBREDOXIN
I7 7 645 125 A211
~. R IN
~
21 10 513 99 PLASTOCYANI
hormones
44 0 3430 36 1.4 RL 16) IPPT AVIAN P A N C R E A T I C P O L Y P E P T I D E
38 0 3540 29 3.0 R E 17) ICCN G L U C A G O N ( P H 6-7)
29 0 12
301 2 51 1.5 D L 18) L l N S INSULIN (A AND B CHAIN)
hydrolase, phospharlde acyl
37 7 0 712 123 1.7 AD 19) IBPZ P H O S P H O L I P A S E A2
hydrolases, 0-glycolsyl
38 7 0 918 164 2.6 C L 2d) I L Z M L Y S O Z Y X E (BACTERIOPHAGE T4)
24 8 2 665 129 2.5 CD 21) 7 L Y Z L Y S O Z Y M E (HEN EGG W H I C E , TRICLINIC)
hydrolaser, phosphoric d i e i t e r
I9 20 3 842 142 (6 - - 22) IS35 STAPHYLOCOCCAL NUCLEASE (COMPLEX)
I4 28 2 709 124 2.0 S D 23) I R X S RIBONUCLE4SE-S
hydrolJses, protelnases
23 5 9 1209 303 2.0 -- 24) LCPA C A R B O X Y P E P T I D A S E A
3 I3 3 1333 324 2.5 HK 2 5 ) LAPR ACID P R O T E A S E (RHIZOPUS CHINENSIS)
7 32 6 1272 323 2.8 D2 2 6 ) I A P P ACID P R O T E I N A S E (PENICILLOPEPSIN. FUNGUS)
27 9 6 1266 316 2.3 D2 27) 2 T L N T I I E R X O L Y S I N
6 31 I 1093 236 1.9 HH 28) 2GCH G A M X A CHYXOTRYPSIN A
3 37 3 821 198 2.8 DL 29) M L P ALPHA L Y T I C P R J T E A S E
7 31 I 929 223 1.5 02 30) lPTN BETA-TRYPSIN ( N A T I V E AT P H 8 )
6 34 2 145 131 2.8 DI 311 I S G A P R O T E I N A S E A FROM S T R E P T O M Y C E S G K I S E U S (SGPAI
23 5 I2 I 0 5 8 275 2.5 FU 3 2 ) iiaT SUHTILISIH BPN'
4 35 0 LO89 240 2.5 -- 3 3 ) i E s r TOSYLkELAST4SE
21 19 1 923 218 1.7 LS 34) 2 A C T ACTINIDIN
19 I5 I 968 212 2.8 D1 35) 8 Y A P Y A P A I N
immunoglobulins
I 34 2 2101 428 2.0 - - 3 6 ) IFAR L A X B D A IXMUHOGLOBULIN F A B
1 37 5 492 2 107 2.0 C C ?7) I R E 1 BENCE-JONES IXMUNOGLORVLIN (VARIABLE P O R T I O N )
IEOlerdSeS
23 2 5 12211 2 3 1) 2 . 8 HK 3 8 ) 3PGY PdOSPl{OCLYCERATE H U T A S E ( D E - P I l O S P ! l O )
35 I 15 1326 2 246 2.5 DO 39) I T I M T R l O i E P H O S P H A T E I S U X E K A S E
Iecrio (agglutinin)
2 15 n 1125 237 2.4 Y D 40) 3CNA CONC.\UAVALIX A
Lyasc, carbon-oxygen
4 23 5 1271 256
oxidoreduc~a~es
15 I2 13 937 162 2.5 IDFR
22 I1 10 1505 333 2.9 DI IGPU
I7 I1 9 I639 374 2.4 DJ 4ADli
27 6 7 1753 329 2.0 DZ 6LDH
22 I2 7 2356 661 2.0 _- 2GRS
I
33 I 686 IS1 2.0 HK 2S0D
oxygen r r o r r g e
65 0 0 842 I53 2.0 0 2 48) INBV MYOCLOBIN (FERRIC IRON - METXYOCLOBIN)
TABLE A11
Structure Notation Used in Table AIII
First line: running number 1-62, data set identifier (3PTI,4LDH . . .), protein name,
[function], {sourcel
SHEET.. . One-character name of P-sheet (A, B, C . . .) in which residue i
participates.
H H I D G E ~. . . One-character name of P-ladders in which residue i participates,
HRIDGE1 . . . A, B, C . . . = antiparallel,
a, b, c . . . = parallel.
Ladders are named sequentially from N- to C-terminus.
A 8-strand can be part of two ladders, one to each side, so there are
two lines for the possible ladder partners. Each ladder name appears
twice, once for each participating strand. Partner strands can thus be
easily identified by identical letters. The sheet topology can he
reconstructed by starting from a P-strand and tracing all partners and
their partners.
CHIRALITY + or -
Chirality a t residue i is the sign of the dihedral angle defined by CC1
+
i - 1 to i 2. Thus, a right-handed a-helix has +, an ideal twisted
P-strand -.
.
HEN11 . . S = five-residue bend centered a t residue i.
5-TURN.. , Hydrogen-bonding pattern for turns and helices:
4-IURN ) = backbone CO ofthisresidue makes H bond (i,i + n )
:j-rURN ( = backbone NH of this residue makes H bond ( i - n,i)
X = both CO and N H make H bond
3, 4, 5 = residues bracketed by H bond
SUMMARY. .. Structure summary:
H = +helix (a-helix)
B = residue in isolated P-bridge
E - extended strand. participates in 8-ladder
G = %helix (3lo-heIix)
I = 5-helix (?r-helix)
T = H-bonded turn
S = bend
In case of structural overlaps. priority is given to the structure first in
this list.
EXPOSURE. .. Solvent exposure is the estimated number of water molecules in
contact with residue i. The scale is 0-9; I * = more than 9 water
molecules, Exposure can he read as solvated surface area in units of
10 A?.
SEQUENCE Amino acid sequence in one letter code:
SHEET.. C DDD D D D
BRIDGE?. ee
...
BRIDGE1.. C dd4 e e d
CHIRALITY ++---+---+ ++++++++++ +++++++-+- -+++----++ -+++-+++++ ++++++++++ +-+--+++-- ----+-+-++ ++++++++++ +++++++---
BEND..... SS s ssssssssss ssssssss sss sssssss ssssssssss sss sss ssssss ssssssssss ss s s
5-TURN.. >55>5<55< > 5 5 55< >5555<
4-TURN.. > >>>xxxxxxx xxxxx<<<< >4>>x >xxxxxxxx< <<< > > >>xxxxxxx< <<<
3-TURN.. >3><3< >33< >>3<< >>3<< > 33< >>3<<
...
SWMARY.. SS B S HHHHHHHHHH HHHHHHHHT S R EEE SSGGGHH HHHHHHHHHH HHS SSS E E SSSSSH HHHHHHHHHH HH S S E
EXPOSURE. * 9 2 1 1 0 1 0 0 0 7*108*017~3 08*21**963 8 6 * 4 0 1 0 1 # @ 119*3610*4 1 0 4 0 0 1 8 2 1 7 89216**33* *292*45695 1 0 8 * 1 0 6 6 1 2 **2**18400
1 0 1 SEQUENCE.. DTVPLVMMM TKIGERCGOE LYKEMQKRCW DVKESAWAI TANELDTARR RTTCSMDALK AAGPPEKQIY QVPTKSNDIP GAFDAANSML VQHPEVKHWL
nsua a d H i a x I u YSIZLIY~,LSH M ~ B W O L V ~D D V D ~ M M 3~3~d H a a i m ~ ~ M Y A H ' I I ~ M S Y S N N H X ~ IOB-JWAXAV ' ~ s ~ a n ~K a s
L..91 *L9091bLE+ .28r.E..9S L f r L E L r Z l S ZKZS.9ZL.8 E9rE.rIL.b KS109.8200 ZBE.BL~E+I 09rESbr.8* '3MflSOdXa
3333 DD a a a a a s ~XUHHHHH LLHHHHHH a IL.+LHHHHHS LL D ~ W U a a a a v n a ~3 3 a u 3 3 LL HHHH 38 --mvuYns
>>EXX E < < >>> x<<< >>E<< >E>XE<XEE< >EEX>C<XEE < >EE< > E E < > > E << '' ' N W J - E
>P bP< >>>><<<< >>>><<<< >>>b<<< >bP><PP < > >bP<< ' * 'NWL-P
sss ss ss ssssssss
> SSSS<
sssssss ss sssssss ss ssssss sss sss ss ssss
.
.....
* 'NlmL-S
aN3e
+++ ++-----+++ ++++++++-- --++++++++ -+++++++++ --+-++++++ ++---+---- ++--+-++++ ++++----- ALX1VMIH3
ww 5 3 eeee 888 ea3 a 3 We "KaMIME
aa a aaa 53 'zaxme
vw v Y Y W V Y VVY vee el3 vv . * * * La3HS
3s ez . . . . . . . . . . . . . . . . . . . . . . . . . . I . . . . . .
{snMnvJ soe :MMI+I ILMWSNYML N O M L J ~ ~ (I a a z r a r x o ) se 3 ~ 1 1 ~ 3 0 3~s e3z (P
3W1& S Y 3 M 3 N A N I 1 Y C d d l d 3 3 Y Ma(LLVDVVaV O W & X V 3 H O 3 3 d d l N M V Y VAM3SYLVaO N A M 1 V I V L Y N (IYVAYNVdVS '33NanOaS K
66.65 0bbZ006ZK0 BL8.19.20E 1 6 r 9 V 9 6 E r L 9C99KbPKSB ..ESE9r6SS 662*2+.098 915E8.09EE .61KZ9rESr 'aMnSOdX3
e LL e ~ 1 3 3 a .+LsLaaa 3ss usu B ~ U E D D ~ DDD sss H HHHSSDDDSS e LHHHHHH LL BJL *-AMVUUIIS
>EE< > t E < >EEXEE< > E t XE t < > E C X>E <<> > E < < >>E<< > E E < >EE< "'NWL-E
>> P b < < >>>><<<< * * 'NMRL-C
>5ss5< * * 'NWL-S
s ss sssss ss sss s ss sss sss sss s ss sssssss ssssss ss sss .***. aNae
+-- -++---++-- --++-++--- +++--++-++ --++-+++-+ +++---+--- ++++-+++-+ --+-*++++-
ALI1VMIH3 ++---++--
3 a w ee 88 3 a w * 'K3MIME
3 e
33 3
w v
33 3
vw v w 3 E V
"23MIME
u a ~ s ....
dIHK"" (YnSONIA U I I I L V W M H 3 ) (NIP&OMd Yfld'lllS-NOMI 'M3dSNVMJ NOML33131 (dIdIH)
N 1 3 U U d NOMI 1 V I L N 3 J O d H D I H a B Z I a I X 0 d I H 1 (E
Y331DY
.8EZSS
LL LL
>EE< ' * 'NWL-E
>P
>s5 ...NWJ-S
'* 'NWL-b
.....
ss ON3 e
+++- ALI1VMIHJ
* * K 3WIL1E
....
"Z8MIME
L33HS
~ 3 1 3 JNCY
3 ~ M A l L n S S Y A 3 H h C d S d l 7 S D A d 3 J V O W S 7 3 S A V M D N I 3 1 3 I I O Y W J 31)33LVW33 lALSUNY9AI '33N31103S K0Z
w6ZL.rZV.6 V89029K0SS 0 0 S Z 0 0 0 0 0 0 0L0025.6.8 0 1 6 b 0 2 2 0 1 0 00000SEE1Z t.LL0K8000 2 0 1 2 0 1 1 0 0 0 '3MnSOdXa
13333JLJSSS HHHWHHHHHH H L L U 33 S S S S S HHHHLLLSSO 3 U S SSSHHHHHHH H L U S S S 33 * 'AMVWiUlS
>>E<XEE< >>E<X>E< < > E E < >>E <XE E < >EE< > E E < > E EXE E < ' 'NMnL-t
>DXD<C< >xxxx<xx<> < X P < b < > >bbX<bb< >>>>xx<< X < t b < ' ' 'NMflL-b
55<
ssssssssss
S<
ssssssssss ssss s ssss ssssssssss sss ssssssssss sssssss .....
* ' 'NYnJ-S
aN3e
++++++*+ti ++++++++++ ++++++---- --++---+-+ ++++++++-+ ---+++++-- +--++t++++ +++++-+-+- ALI'lVMIH3
33 3 3 PP * 'K3XIIME
D
33
TABLE AIII (continued)
5) 1 5 6 8 CYTOCHRWE 8 5 6 2 ( O X I D I Z E D ) [ELECTRON TRANSPORT1 {ESCHERICIA C O L I ) . . . . . . . . . . . . . . . . . . . . . ........................ 156B
SHEET.. ..
BRIDGE2..
BRIDGE1..
CHIRALITY ttttttttt tt+tttttt- t-tttttttt ttt+tttttt t+---ttttt ttttttttt+ tttttttttt tttttt+-tt tttttttt+t ttt+tttttt
BEND..... SSSSSSSS SSSSSSSSS SSSSSSSSS S S S S S S S S S S S S S SSSSS S S S S S S SSSSSSSSSS S S S S S S S S S S S S S S S S S S S S S S S S S S S S S
5-TURN.. . >55>5<5 5< >5555<
4-TURN... >>>>XXXXX XX<<X<44< >4>>X>X XXXXXX<<<< >444< > > > > x xxxxxxxxxx xxxxx<<<< > > > > x x x x < xx x > < < x x 4 4 <
3-TURN.. . >>><<< >33x>3<< >33< >>> <<< > 3 3 < >33< >> 3<< >33< >33< > > 3 x x 3<< >33<
SUMMARY.. HHHHHHHH HMHHHTTTS STTTTHHHH HHHHHHHHHG GCS TTTTT SSTTHHHH H H H H H H H H H H HHHHHHHHT SHHHHHHHHH H H H H H H H H H H
EXPOSURE. *64'716'43 *'13*718*2 '698605608 8'19618'60 4*270'*2'* 77*'2'61*8 0 2 9 4 2 2 ' 0 6 6 603'31**59 4 * * 0 3 6 3 2 8 * 1782'564'.
1 SEQUENCE. ADLEDDMQTL NDNLKVIEKA BBZKANDAAL VKMRAAALNA QKATPPKLED NSQPMKDFRH GFDILVEGID DALKLANEGK VKEAQAAAEQ LKTTRNAYHQ
SHEET.. , .
BRIDGE2..
BRIDGE1..
CHIRALITY t
BEND..... S
5-TURN.. .
4-TURN... <
)-TURN.. .
SUMMARY.. S
EXPOSURE. 93.
1 0 1 SEQUENCE. KYR
6) 1CYT CYTCCHROME C ( O X I D I Z E D ) [ELECTRON TRANSWRTI (ALBACORE TUNA HEART: THUNNUS ALALUNGA] ... . . .... ..... ,... ........lCYT
SHEET.. .. A A
BRIDGE2..
BRIDGEl.. A A
CHIRALITY -ttt++ttt +ttt+t+--t -tt--t-t-- -tttt++--t tt-+t----t +ttt-t--+- ttttt++tt- tttt-++-t- t----t-ttt +tttt+ttt+
BEND..... SSSSSSSS SSSSSSS SSS SS SS SSS S S SS S SSSSS S SSSSSSSSS SSSSSSS sss ssssssssss
5-TURN.. . > 5555<
4-TURN... >>>>XXXXX (<<X444< >> >4<<< > > > > x x x < < < x> 4 4 < < >>>> x x x x x x x x x ~
3-TURN. .. >33< >33< >33x33<>3 3<>33< >> 3<< >33<
SUMMARY.. H H H H H H H H HHHTTTT STT SS TT TTSBT T TT H HHHHS 8 S H H H H H H H H H HHHHSTT HHH HHHHHHHHHH
EXPOSURE. 8'65'0**21 8'923'5321 * * 8 2 * 8 9 9 1 2 2 3 9 3 2 4 6 5 ' 8 0*0*84'43* 329.924927 8.21682298 0 * * 3 0 9 7 2 ' 5 * 5 * 1 1 * * * 4 5 3 9 2 0 1 1 3 1 7 9
1 SEQUENCE. GDVAKGKKTF VQKCAWHTV ENGGKHKVGP NLWGLFGRKT GQAEGYSYTD ANKSKGIWN NDTLMEYLEN PKKYIPGTKH IFAGIKKKGE RQDLVAYLKS
SHEET.. ..
BRIDGEI..
BRIDGE1..
CHIRALITY +
BEND..... S
5-TURN.. .
4-TURN.. , <<<
3-TURN.. .
SUMMARY.. HH
EXPOSURE. 20'
1 0 1 SEQUENCE. ATS
SUMMARY.. .., ...HIALPHA-HELIX.. . E=BETA-STRAND.. ..B=BETA-BRIDGE.. ..GD3-HELIX.. ..1-5-HELIX.. ..T=3-, 4-, OR 5-TURN.. ..S-BEND.. ..
7) 1C2C FERRICYTOCHRQYE C2 [ELECTRON TRANSPORT] (BACTERIAL: RHOWSPIRILLUM RUBRUM). ................................... lC2C
SHEET.... A A B B
BRIDGE2..
BRIDCEI.. a a B B
CHIRALITY -++++++++ ,++++++--+ -++--+-t-- -++++-+--+ -+-++-t--+ +++++++--- -+-+++++++ ++-+*+++++ +--------+ ++----++++
BEND.. SSSSSSSS sssssss sss sss s s ss s s sss s sss ssss sssssss ssssssssss ss ss s ssss
5-TURN.. >5555< >5
....
4-TURN... >>>>XXX<< x<4><44< > 4 44< > > > > < x << 4 < > 4 4 4 < >>>>X
3-TURN... >> ><<x33< >33< >33< >33x33<>3 3<>33< > 33< >33<>3 3<
SUMMARY.. THHHHHHHH HTTTBTTB STT TTS TT TT BT T TTS T TTS SSSS B SHHHHHH TTSSSTTTTT TS SS S HHHH
EXPOSURE. '6'23286'1 2* * 2 9 6 6 2 1 2 **7364**12 1 2 4 2 1 3 ' 6 4 1 1 1 6 * * 6 8 4 2 * 1 3 6 4 3 * 6 * 5 2 5 2 5 9 8 2 1 3 5 3 2 * 9 0 8 4 1 1 6 * 6 2 5 8 8 * 2 * 1 * 568'2''921
1 SEQUENCE. ECDAAACEKV SKKCLACHTF DWGANKVCP NLFCWENTA AHKDNYAYSE SYTEMKAKCL WTEANIAAY VKNPKAFVLE KSCDPKAKSK MWKLTKDDE P
SHEET.. .. z
BRIDGE2..
BRIDCE1..
CHIRALITY +++++++-++
BEND..... SSSSSSSSSS
)-TURN.. . 55><555<
4-TURN.. XXXX<XX<4< <
3-TURN. ... >33<>33<
SUMMARY.. HHHHHHHHHH
EXPOSURE. 95184073'6 '3
101 SEQUENCE. IENVIAYLKT LK
SHEET.. ..
BRIDGE2..
BRIDGE1..
CHIRALITY --++++++++ +++++++-++ -------+-+ ++
BEND.. s ssss ssssssss s ss s ss s
5-TURN..
....
4-TURN.. . >>>>X xxxx<<<<
3-TURN.. . >33x33< > 3 3<
SUMMARY.. S HHHH HHHHHHHS ST T S SS S
EXPOSURE. 9.27'44222 02112*31*' 6795678478 4146
101 SEQUENCE. FKHCKNQADV VAFLAQDDPD AXXXXXXXXX XXXX
.... CIN3E=S""NWL-S YO ' - P ' - E ~ L ~ ~ ~ ~ X I 1 3 H ~ ~ ~ I ~ ~ ~ ~ ~ X I l 3 H - E ~ 3 " 3 3 a I ~ ~ ~ " L 3 E ~ ~ ' ' ' ~ ~ N Y l L L S - V J 3 E - 3 ' ~ ' ' X I ~ ~ ~ -wwwns
YHdlV~H~~~~''''
a d a u w w 3 a D M D A S D ~ V A XX ~ S I X L S I1dadas33'1
~ ~ ~ h a a ~ w s 3 3 - 1n I a 3 N i i a a I t u a s A N I m I\arDsaIrDr V I ' I ~ V W ~ N OJDSMMIXW *a3~3nb~ I s
LBLrIBrBrZ rZS0000006 V S 6 0 r 6 E B r r 0199086rZI SPr9LZ1000 0 0 0 r E ~ 9 1 r r EBVr9TEPPr Trd9r600r 101L00rLBD ZeSZEZ00LV '3YflSOdX3
HHHHHHHS s ~ ~ 3 3 3 3 3 3 3 3LL LLLHHH HHHHHLLL s 9~~33
~ J L E 333 3 3 3 s LLLU 3 3 LLHHHHH HHHHHHHHHH sss 33333 **Iuvwwns
>EEXE ><E < >EE< >EE< >>E<< >EE< * ' 'NYflL-f
xxxx<<<< >>>> x x < < < < >PtV< >PPP < >>>>xx XXXXXXX<<< < * * 'NYflJ-P
ssssssss s ss s ss sss ssssssss sss
>5sss<
s sssss s sss
>SSSS<
sssssss ssssssssss sss .....
* * 'NWL-5
aN3E
++++++++-- ++-+-++--- -+++++++++ ++++++++-- -+--++--+- ---+-+++++ +--+++---- -+-+-+++++ ++++++++++ +-+-+-+-- ALIlVYIH3
33333 d d qqq qq meee epee "I33aIYE
aaaa pp qWqq "233aIYE
33
. . . " . . . . . . . . . . . . . . . . . . . . . . . . . . . E. . . . .E. . .QVV
N ~ d E " . . . . .VVVVVWY
333
, . . .VQV
(dW WflIaIlLLSO'I3)
VVVV
[LYOdSNVYL N O Y L 3 3 1 3 1
WWV -*.-
L3PHS
( a 3 Z I a I X O ) N I X O a O A Y l d NXdE (7.7
x i 3 3 a b ~x I~L 3 a s w . v A ~ J M A D V ~ Ibaaaidsoas b a I m s u m v a s 3 ~ 3 w 3s ; i d i a i D w a V v a ' I I A L a a o X I L ~ N I ~ a~ aVi immv m ~ n n b a s1:
L S E E r r 6 E rrrS9L9260 Z Z E B T E Z B r r 6r0tr9SV69 Z r V r V L 5 5 1 0 TBYZESI9.L 6 1 V 8 S r P L r r 9EISrrSrSr rPV2669TS2 L99rrr6rI1: '3MnSOdX3
sss ss LL LLS s s sss s ss LJSSSSS L L S SS S S S ' L LS U S L 5 S "AYVWWIS
>EE< > EE< >E E < >fE< >E E < > E E < >E E < "'NWL-E
* * 'NWL-P
' ' 'NMflL-5
sss ss ss sss s s sss s ss sssssss s s s ss ss s s ss sss s s s .'.** a N 3 E
---+-+ ++--+++--- +++--++++- +++++++--+ ---+-++--+ ++++-++-+- +++-++-+-+ +--+-+-+++ +-++-+-+-+ +++++--+- AJI'IYHIH3
....
* '1 3 1 M I Y E
"133aIYE
3XdI ......................................................... (SIS N I J V l d Y N I l f l Y I d S 1 [ L Y O d SNQYZ N O Y J 3 3 ' 1 3 1 NIXO(13YLIBd
L33HS
JXdI (17
a 3 d N d Q W d X S V 3 S3XI3SaW I V A I S S O I I N A d 3 3 d X 3 V 3 5 V I J S a N I A A Q '33NBflbPS T
r r 9 8 IBP96EYrPT ZZ96629rPL ZSS6eSrBTe L6Ze69IPZL SrE9rSZ9EL '3YflSOdX3
S3 3 S S HHHH H ALL 0 SSS E LL 33 LL 333LL "AYVWWIIS
>E E < >>EE< >EE< tEX>EX<E< * ' 'NYflL-E
>>>P< << >PIP< >PVP< * "WL-P
x s
+- +++-+-++++
ss ssss
t+-+--+++-
sss sss
----+-++++
ss ssss
-+-++++++- +--++----
S
.....
* ' 'NWL-S
aNae
ALIlVMIH3
Q Q 0 a vv **I~~CIIIE
"Z33aIYE
xadI...."........".,.............. .......................V V a
(S3N33013V S n 3 3 J 3 0 J d 3 d )
E VV * ' . .L33HS
[ L Y O d S N V Y L N O Y L 3 3 1 3 1 NIXOC13MM3d XalT (01
X b S l A M X Y l J d Y 3 C K l S A V N d d Y d I d 3 M S O S 3 N X I Y O Y 1 3 V 3 V3Vb3V91YV A C U A V d 3 A W U I I Q H 3 V A 3 3 X N X d ' I A 3 d 0 3 '33N3nO3S I
-9 8 E I Z 8 0 0 Z 9 0 2 ~ 6 1 1 6 P - 6 5 ; 8erC699rEZ 6928r02r56 IPrr8lrZrPB IBrZ02188r BrSZZS89ZE LrrZ08rILr '3WSOdX3
LHHHHHHHHH HHH s E ssssss EHHHHHHHHHH HLJ LLHHHH HHHH sss sss LDWJ JHHHHHHH **Aiivwwns
> >E<< >EEXEE< >>E<X CE< >EE< "'NYflL-E
>>>>xxxxxx <<<< > >>>xxxxx<<< < >>>>x x<<<< >PPP< >>>>X<<<< "'NMflL-0
>s s s s < * ' *NYflL-S
ssssssssss sss s ssssss ssssssssss ss s ssss ssss sss sss sss s ssssssss *.*.* a N 3E
++++++++++ +++--+---- --+--++++- -+++++++++ +++-+-++++ ++++---+-+ -++--+++++ -+++++++- ALIlVUIH3
V Q "133cIIME
Q V
3 1 5 ~ . . " . . . , . . " . . . . . . . . . . . . . . . . . . . . . . . . . .( V S O N 1 3 n Y 3 V
....
"za3aIYa
L33HS
SVNOWOan3Sd) [LYOdSNYML N O Y L 3 3 1 3 1 (03ZIaIXO) I S 5 3 3WoYH3JLA3 3TSZ (6
8
co
c.l
SHEET.. .. AA AAM
BRIDGE2..
BRIDGE1.. dd eeee
Y
CHIRALITY +++++-+--- +---+-'--+- ++++++++++ ++++++
BEND..... SSSSSSS S SSS SSSSSSSSS SSSSSS
5-TURN.. >5555<
4-TURN.. XX<<<< > > > > x x x xxx<<<<
..
3-TURN.. . >>3<< >>3<< >>3<<
SUMMARY.. HHHHHTT EE S E E E E S S GGGHHHHHH HHHHHHT
EXPOSURE. * 6 0 8 7 6 3 1 7 3 43*2261*87 09*29*'018 186.01.9
101 SEQUENCE. ERMNGYGCW VETPLIVQNE PDEAEQDCIE F G K K I A N I
SUMMARY.. ..... .HIALPHA-HELIX.. ..E=BETA-STRAND.. ..BIBETA-BRIDGE.. ..G-3-HELIX.. .. 1 - 5 - H E L I X . . ..T-3-, 4-, OR 5-TURN.. .SEEEND..
. ..
TABLE A111 (continued) E3
Q)
0
l P C Y PLASTOCYANIN [ELECTRON TRANSPORT, C O P P E R B I N D I N G ] ( P O P L A R LEAVES: POPULUS NIGRA VARIANT I T A L I C A ) .............. lPCY Lo
SHEET,. .. AAAA AA E B B B AAAAA A C B B C AA AAA BBB B B B BBBBEB
BRIDGEZ.. BB ccccc F F F FFF FFFFFF
BRIDGEl.. aaaa BB ddd d aaa a G E E G CC CCC EE dddd
CHIRALITY ---t--t-+ -+--+t+t-t --+t------ +-+-++--I- --ttt--tt- -tt++---++ --tt-tt--- t--tt-t--- t---+t++t- t+t----
BEND.. ss ss s ssss ss ss ssss ss sssss ss sss s s SSSSSS s
5-TURN.. > 5 5 5 5<
....
4-TURN.. >>44<<
3-TURN.. >33< >33< >33< >33< >>3<< >33 < >33< >>3<x33 <
..
SUMMARY.. EEEES TT S E E S S E E E E T T E E E E E E S S B E E TTSS T T HHHHS T T B S T T E E EEE S EEE E E E GGGTTT T E E E E E E
EXPOSURE. 6 9 0 5 0 0 2 * * 4 6 6 3 0 6 5 5 ' 2 7 0 6 6 5 8 9 0 3 0 6 0 3 2 4 7 5 0 0 0 1 1 5 * * 5 2 2 * 9 3 829'1224'' 871*5'4765 *14089*3*0 3050474997 48715038'
SEQUENCE. IDVLLGADDG S L A F V P S F S S I S P G E K I M K N N A G F P H N I V F D E D S I P S G V D A S K I S M S E E D L L N A K G E T F EVALSNKGEY S F Y C S P H Q G A GMVGKVTVN
SHEET.. .. A
BRIDGE2..
BRIDGEl.. B or
CHIRALITY ++++++++-- --++++---+ +
BEND..... SSSSSSSSS SSSSS S S
)-TURN.. .
4-TURN... XXXX<<<<
3-TURN.. . >>3<< > > 3 x < 3 < > >3 < <
SUMMARY.. HHHHHHHTS GGGBT G GG
EXPOSURE. 8 0 8 4 4 1 4 ' 4 8 9 5 4 + 4 6 * 3 9 7 **6
1 0 1 SEQUENCE. NAAIdFSKVP YNKEHKNLDK KNb
SHEET.. ..
BRIDGE2..
BRIDGE1..
CHIRALITY +++++--+++ ++++++++++ ++-+++++++ +++++-++++ ++++++++++ ++++-+-+++ +-
BEND..... SSSSSS SSS S S S SSSSSS SSS SSSS SSSSSSSSSS SSSSSSSSSS SSSSSSS S SS
5-TURN.. . > 5555<
4-TURN... XXX<<<X>44 << >>>4XX< 4<< >444< > > > 4 < < x > > > x x < x xx > < < < <
)-TURN... 3<< >>3< X33< >>3 X<3< > 3 3 < >33< >3><3< >33<
SUMMARY.. HHHHHH H H H HTT HHHHHH HHTT TTTS SSSTTSHHHH HSHHHHHHHH HHHHHSS S SS
EXPOSURE. 00136636*4 01639881'4 19'**5'801 8583'3.17. '37.209581 508.725542 1*.*
1 0 1 SEQUENCE. NMVFWGFTC VAGFTNSLRM LQQKRWDEAA VNLAKSRWYN OTPNRAKRVL TTFRXTWDA YKNL t
9
a3
SUMMARY.. ..... .H=ALPHA-HELIX.. ..E=BETA-STRAND.. ..B=BETA-BRIDGE.. ..Ga3-HELIX.. ..I-5-HELIX.. ..T=3-, 4-, OR 5-TURN.. .. S-BEND.. .. 0
w
TABLE A111 (continued)
7LYZ LYSOZYME (TRICLINIC CRYSTAL FORM) [O-GLYCOLSYL HYDROLASE] (HEN EGG WHITE: GALLUS GALLUS)......................7LYZ
SHEET.... A B B A CC CC CC DD DD
BRIDGE2.. DD
BRIDGE1.. A B B A CC CC DD ee ee
CHIRALITY t+++t+ ++++-tt-t- -t--+ttt+t tttt---+-t +-----t-t- ----t----t
--- ttt-t-t+-+ ++-++-t--+ +tt+-t-ttt +++++t+t+t
BEND.. ssssss ssssss sss ss ssssss ssssssss s s sss sss s sss ss s s ss s sssssss ss ssssssssss
5-TURN.. >5555< >5555< >5555<>5 555<
4-TURN.. > > > 4 x x x ><<<< >>>>xx xx<<<< >444< > 4 44< >>> >xxxxxx<<<
...
3-TURN.. . >33<>3>x 3<<>>3<< > 3 3< > 3 3< > 3 3 < > 3 3 < >> > x < < < >3 3< >33
SUMMARY.. B H H H H H H HHHHTT TTB TTB THHHHH HHHHHTSSBT T EE SSS EETTTTEET TTTEE SS T T TT EEG GGGGSSS HH HHHHHHHHHH
EXPOSURE. 9'1473.803 108'6506'7 ' 6 5 4 0 0 2 0 0 0 0 0 5 6 5 2 9 3 5 0 *287'7*720 8 2 2 0 0 0 1 0 3 0 6'38746'19 65'672*470 8304669152 1 0 9 0 0 5 '1 1 3
SEQUENCE. KVFGRaELAA AMKRHGLDNY RGYSLGNWVb AAWESNFNT QATNRNTDGS TDYGILQINS RWWcNDGRTP GSRNLdNIPc SALLSSDITA SVNdAKKIVS
SHEET.. ..
BRIDGE2..
BRIDCE1..
CHIRALITY -t--ttt-+t ttttt-t-tt tttttt-
BEND.. sss s ssss ssssssss ssssss
5-TURN.. >5555<
....
4-TURN.. < > > > ><<<<
3-TURN.. < >>><<< > 3 3 < > >3 < < > 3 3 <
..
SUMMARY.. SSSGGGGSHH HHHHTTTS G GGSSTT
EXPOSURE. * 7 * 1 0 5 5 1 7 1 2***0*97*1 '364'72.9
SEQUENCE, DGDGMNAWVA WRNRbKGTDV Q A W I R G a R L
SHEET.. .. 8 8
BRIDGE2..
BRIDGE1.. H H
CHIRALITY ttttt-tt-- tt---tt+tt ttttt+tt+t +ttt---+++
BEND.. ssssssss ss s ssssssssss sssss sss
5-TURN.. >5555< >5555<
....
4-TURN.. >X<<<< >>>>xxxxxx x < < < <
3-TURN.. >33x33< >33<>3 3< > 3 3 < > > > < <<
..
SUMMARY.. HHHHHTTS E E TT T THHHHHHHHH HHHHTT GGG G
EXPOSURE. 4 2 0 8 8 * 0 1 8 9 01'3**4146 34'989'125 787'9'4444 5'
SEQUENCE. EALVRWLAK VAYWKPNNT HEPHLRKSEA QAKKEKLNIW SE
SUMMARY........H=ALPHA-HELIX....E~BETA-STRAND....B-BETA-BRIDGE....G~3-HELIX....I~S-HELIX....T~3 - , 4 -, OR 5-TURN....S=BENDe...
23) lRNS RIBONUCLEASE-S [PHOSPHORIC DIESTER ( R N A ) HYDROLASE] ( C W PANCREAS: 80s TAURUS). .lRNS
AAAAA BBB B B BBB A AAAAAAA C
SHEET.. .. A
.............................. C AM
BRIDGEZ.. BBBBB EEE C CCCCCCC
BRIDGE1.. a a DDD D D DDD BBBBB G G CCC
CHIRALITY --+++++++ +++--+-+ ---+++++ +++-+-++-- ---+----+- -+++++++++ --+---+-+- ++----+--- +---+---++ +-+++----+
BEND..... SSSSSSS SS SS sssss ssssssssss s s sssssssss ss sss ss ss ss ss
5-TURN.. >5555<
4-TURN.. > > > > X X X < <<< > > > > x x <<<< >>>4<<< >444<
..
3-TURN.. . >33< > > > x < << >33< >33 <>33<
SUMMARY.. HHHHHHH HHB SS H H H H H HHHTTSSSSS SEEEEE S HHHHHGGGG SEEEEETTTE E E E E E SS E EEEEEEE TT BTTB EEE
EXPOSURE. '584348688 3114667*6* kl"45801.3 8876580*9* 2 4 9 5 2 1 8 8 2 5 376987588' 4*9470**69 '48456'761 31138*6889 2*'7'241*5
1 SEQUENCE. KETAAAWER QHMDSSTSAA 1 SSSNYaNW MKSRNLTKDR bKPVNTPVHE SLADVQAVCS QKNVAdKNGQ TMYQSYSM SITDaRETGS SKYPNbAYKT
24) ICPA CARBOXYPEPTIDASE A (C-TERMINAL A M I N O ACID HYDRCLASE] ( C W PANCREAS: BOS TAURUS) SICPA
..............................
SHEET.. AAAAA AA AAAAAA A A AAAA B B
BRIDGE.?. BB B d dddd
...
BRIDGEl.. AAAAA AA AAMAA A c cccc h h
CHIRALITY -+++-+++- ---+++++++ ++++++++++ ++---+---- +-+------- +--+------ -++++++-++ ++++++++++ ++++++++++ +-++++++++
BEND..... SSS SS SSSSSSS SSSSSSSSSS S sss s ss ssss ssssssss ssssssssss s ssssssss
5-TURN.. .
4-TLBN..
. .-......- >>>>YY!iY Y Y X < < < 0 4 4 A<
......................... >>>>xxxxx x x < x x < 4 x < 4 4 x > > > x < x < <
3-TURN.. >33<
. > 33< >33 < > 33< > 3 3 x33< >33< >33x33
SUMMARY.. STT SS HHHHHHH HHHHHHSSTT TEEEEEEEE TTS E E E E E E E S SS E EEEE SSTT THHHHHHHH HHHHHHHHBT TBHHHHHHHT
EXPOSURE. *30**38355 4351*803*0 695837748' 3076**22*1 * * 2 8 3 0 4 8 0 5 0 2 5 3 5 f 8 5 4 0 R000880003 0 4 8 0 0 1 0 0 1 4 8014018286 '594035067
1 SEQUENCE. RSTNTFNYAT YHTLDEIYDF MDLLVAQHPE LVSKLQIGRS YEGRPIYVLK FSTGGSNRPA IWIDLGIHSR WITQATGVW FAKKFTENYG QNPSFTAILD
SHEET.. .. AAAAA C C eeeee
A A AAAAA A
G
BRIDGES.. CCCCC
BRIDGE1.. EBB i i d d ddd f
CHIRALITY +------+++ +++++++++- ++++++--+- ++-+-+-+-- -++++-+--+ +-++--+-++ -++---+-++ +-++++++++ +++++++++- ---+-++-+-
BEND..... SS S S S SSSSSSSS S SS S ss s ssssssss sssss s ss sss ssss sssssssss sssss s ss
5-TURN.. . > 5 5 5 5<
4-TURN...
. . . . . . .A< >>>d<YYdA<
........... < >h44< > > 4 > x x > x x xx<<<<
3-TURN... < >33<>33< >33< >>3<< > 3 3< >33 x33< > 3 3 <>33x33< > 33<
SUMMARY.. TSEEEEES S SHHHHHHHHH T TT S ss s GGGSSSSST TSSSSBS TT STTB SSTT SHHHHHHHH HHHHH EEE EEEEE SS E
EXPOSURE. 6108000080 0 0 0 0 1 2 4 0 7 9 * 6 5 9 3 3 8 0 5 1 7*8*3*5381 0 0 8 3 8 0 8 0 1 6 5'61339737 374220**22 2160071807 41**36518R 0 0 0 0 8 0 7 3 2 2
1 0 1 SEQUENCE. SMDIFLEIVT NPNGFAFTNS ENRLWRKTRS VTSSSLaVGV OANRNWDAGF GKAGASSSPa SETYHGKYAN SEVEVRSIW FVKNHGNFM FLSIHSYSQL N
Q,
0
SUMMARY........H~ALPHA-HELIX....E-BETA-STRAND....B-BETA-BRIDGE....G~3-HELIX....I~5-HELIX....T-3-,4-, OR S-TURN....S-BEND.... UI
tQ
TABLE A111 (continued) 0,
0
SHEET.... AAA AAA A AAAAA Q,
BRIDGEZ.. GG GGG
BRIDGEl.. fff fff f eeeee
CHIRALITY t--++++-+- --++++++++ ++++++++++ +-------+- -++++----+ +-+++++++- +++++----- +--+-+-++- -+++++++++ ++++++++++
BEND..... S SS ss SSSSS ssssssssss sss S SSSSS s ssssssss s s s ssssss sssssssss ssssssssss
5-TURN.. > 5 5 5 5<
4-TURN.. >>>>xxx xxxxx<<<< >>44<< > > > > x x < < << >>>>xxx xxxxxxxxxx
..
3-TURN.. . >33< >>3< < >33< >>><< < >33< >>3<< >33<
SUMMARY., EEES SS TTHHHHHH HHHHHHHHTT SSS EEE EHHHHS S H H H H H H H H T S EEEEE S SSSSTT GGGHHHHHH HHHHHHHHHH
EXPOSURE. 00102044** 61948'5869 00*502630* 9 8 * 4 1 8 2 * 9 0 285857'610 0 0 0 0 0 2 0 2 7 8 6 1 6 0 0 0 0 0 0 0 31'5*+14*1 43*4053005 0024002200
2 0 1 SEQUENCE. LLYPYGYTTQ SIPDKTELNQ VAKSAVAALK SLYGTSYKYG S I I T T I Y Q A S GGSIWSYNQ CIKYSFWEL RDTGRYGFLL PASQIIFTAQ EIWLGVLTIM
SHEET....
BRIDGE2..
BRIDGE1..
CHIRALITY ++++
BEND.. SSSS
5-TURN..
....
4-TURN.. X<<<<
3-TURN.. >>3<<
..
SUMMARY.. HHHHT
EXPOSURE. 7604**042
3 0 1 SEQUENCE. EHTVNNIGY
25) 1APR ACID PROTEASE [HYDROLASE: PROTEINASE] (RHIZOPUS CHINENSIS). ................................................... lAPR
SHEET..
.. A AB C C CC BBB B DD E E F GGF H H
BRIDGEZ.. ccc
BRIDGE1.. A AB E E B D ff g 9 H IIH J J
CHIRALITY -+------- --++----+- --+--+-+--EE ----++-+-+ ----++--++ +++-+----- +++-+++--- +--+--+-+- ---+--++-+ +----+---+
BEND..... S ssss sss ss s ss ss ss sss ss ss sss
5-TURN..
4-TURN.. >444<
..
3-TURN... >33< > 33< >33<
SUMMARY.. TT B TTSS B B E E SSS EE EEESS B EE BS SS SS SS B TTT SS B SS EEB BSSS B
EXPOSURE. ''0489273' 5'04512090 4 0 1 2 ' 5 6 7 1 5 0 1 1 1 0 2 2 0 1 1 1 1 0 8 6 8 1 ' 5 5 0 0 * 8 6 2 8 7 1 5 4'80*66*75 9**175584* 2 6 1 5 1 5 0 1 1 4 1 0 6 1 2 6 1 7 0 5
1 SEQUENCE. GVGTVPMTDY GNDVEYYGQV TIGTFGKSFN LNFMGSSNL WVGSVQaQAS GaKGGRDKFN PSDGSWKAT GYDASIGYGD GSASGVLGYD TVQVGGIDVT
SHEET.. .. BBBB B BB B B B C DD C
BRIDGEZ.. hh h I 1
BRIDGEl.. ffff I1 h h h J KK J
CHIRALITY ---++---+- t-t---t+-- ------tt-- --ttt-tt++ +++ttt+t++ t-----++-t t+ttt++++t ++++tt+t+- t--+-+--tt +t--++-+++
BEND., s sss sss sss sss ssss ssssssssss sss ssss ssssssssss ssssssssss ss s ss sss ss ss
5-TURN.. > 5 555< >555 5<
....
4-TURN.. >444< > > > > x xxxxxxxx<< << >>> >xxxxxxxxx xxxxxxx<<< <
3-TURN.. >>3<< > 33<>33<
..
SVPIMARY.. EEEEEE'ITT E E SSSE E E E SSSB GGG H H H H HHHHHHHHHH HTT SSHH HHHHHHHHHH H H H H H H H H H H TSS SEES SSB SS SS
EXPOSURE. * 8 8 0 1 3 4 * 3 6 5 7 8 6 3 7 3 8 1 0 0883195'61 4 2 8 8 0 8 8 0 1 0 0 1 1 0 0 1 0 0 0 7 *4121998*6 1 8 8 8 0 8 8 8 8 0 1 8 8 0 1 8 8 3 9 4 * 4 * 4 3 3 3 8 0 3 587277**64
1 0 1 SEQUENCE. RSSVHYSQGY NNAFWNGSEH WGDGDGQTF IPLSGGIDW AHELTWVTD YTAGLIYCHE SGAINEAISD IPGTLVEFYA NKNPDWEIGE DWTPCISGD
SWMARY.. ..... .H-ALPHA-HELIX.. ..EsBETA-STRAND.. ..B=BETA-BRIDGE.. ..G-3-HELIX.. ..1-5-HELIX.. ..T=3-, 4-, OR 5-TURN.. ..SIBEND.. ..
SHEET.. .. DD EE EE
BRIDGE2..
BRIDGE1.. RK LL LL
CHIRALITY ----++++++ -+--+-+++- ++-++---++ ++++++++++ +++++-4-+- -+-------+ ++++++++++ ++-+--++-- ++++++++++ +++++--++-
BEND.. ... s ss sss ss ssss ssssssss ssssssssss ssssss s ss s ssssssssss ssss ss ssssssssss ssssss ss
5-TURN.. a > 5 5 55< >5555< > 5555< >5555<
4-TURN.. > > 4 4 x < 4 4 < > > > > x x x xx<<<< > 44>X4>XX4< << > >>>xxxxxxx x<<<< >
3-TURN.. .. >>3< < >>3<< >>><<< >> 3 < < > 33x33< >33< >33< >33<
SUWUARY.. SEESS GGG TT SGGG SSSHHHHT TTTTHHHHHH HHHHHT EES SS EE S TTTHHHHHHH HTTT TT HHHHHHHHHH HHHHTT TT
EXPOSURE. 0 5 3 0 0 4 8 0 5 * 77222369'' 7'34.59108 5 4 1 0 0 0 0 0 2 0 86001813*2 '758174239 '688908370 04'9167'14 0 5 9 0 3 4 1 8 4 9 80456839*1
201 SEQUENCE. SLRSUSDPAK YGDPDHYSKR YTGTQDNGGV HINSGIINKA AYLISQGGTH YGVSWGIGR DKLGKIFYRA LTQYLTPTSN FSQLRAAAVQ SATDLYGSTS
SHEET.. ..
BRIDGE2..
BRlDGE1..
CHIRALITY ++++++++++ ++-+
BEND..
5-TURN..
.... ssssssssss sss
> 5555<
or
4-TURN.. >>>xxxxxxx <<<<
3-TURN.. >33< > 3 ><3<
..
SWUARY.. HHHHHHHHHH HHHT
EXPOSURE. '204809508 80020'
301 SEQUENCE. QEVASVKQAP DAVGVK
28) 2GCH G M U A CHYUOTRYPSIN A [HYDROLASE: SERINE PRoTEINASEl ( C W PANCREAS: BOS TAURUS) ............... , ..2GCH
SHEET.. A 88 CCCC D CC
.. ccccc c cccc c c cccc c ccc ccccc
............... D
BRIDGEZ.. KKKK L L L unun N NNN 00
BRIDGEI.. A 88 JJJJJ J JJJJ LLL K KKK NNNNO 0000 P P nn
CHIRALITY +--+----- +++-----++ +++++----- +-+-++---- ---++--+-- ++----++-- --++++-++- ++-+-----+ +-----++-- ++-++-++--
BEND.. ss sss ss ssss ss ssss sss sss s s ss sss ss sss s
5-TURN.. > 5555<
....
(-TURN.. > 444<
3-TURN.. >33< > 3 3 <>33< > 33< >33< > >3<< >33< >33< >33<
..
SUUMARY.. SS SET E E TT SSTTEEEEE TT E E E E E E EEETTEEEE GGG TTSE EEES S TT SSS EEEEE E E E E E TT B TTTTBS EE
EXPOSURE. 48.854'29' 32*28*0472 8 1 0 0 1 0 0 0 2 4 *'5*141000 0 2 4 9 6 i 0 0 0 0 1 6 2 5 2 7 9 7 2 3 0002223464 '*5'64*0*: 5'44'0*'57 8'55338010
1 SEQUENCE. aGVPAIQPVL SVNGEEAVPG SWFUQVSLQD KTGFHFbGGS LINENWVVTA AHBGVTTSDV WAGEPDOCS SSEKIOKLKI AKWKNSRYN SLTINNDITL
2 9) lALP ALPHA LYTIC PROTEASE [HYDROLASE: SERINE PROTEINASE] (MYXOBACTER495: LYSOBACTER ENZYMOGENES) ........., , ,..... . .lALP
SHEET.. .. A BBB B BBB BBBB BBB BB BBBS BB BBB BBB C BBSBBB AAA A AAAA D
BRIDGE2.. DD D FF G GG HHHH I11 111 111111 BBB B
BRIDGE1.. a CCC CCC EEEE E E E E DDD H H HH j GGG a BBBB q
CHIRALITY -+-++---- --++-+---- -+----+-t- -+--+++--+ +----+-+-+ -----+--++ --++-..---- ++----+--+ --+----+++ +----++-+-
BEND.. ss sssss sss ssss s s ssss s s sss ss ss
5-TURN..
...
4-TURN..
3-TURN.. >33< >33< > 3 3< >33< >33< >33<
..
SUMMARY.. BT EEE ESSSSEEE EEEETTEEE EE SSSS T T EEEETTEE EEEEEEEE S S EEEEEE SSS EEE ETTEEEE B TT
EXPOSURE. 6'02817702 1 8 * 7 8 8 0 0 0 0 0105'89640 0 0 0 0 0 4 2 0 9 6 ' 0 5 8 6 1 8 8 6 8 0 0 6 1 4 3 9 6 2 4 3 4 0 4 0 2 0 3 2 6 993'443.03 58'7'140.1 3973675370
1 SEQUENCE. ANIVGGIEYS INNASLaSVG FSVTRGATKG FVTAGHaGTV NATARIGGAV VGTFAARVFP GNDRAWVSLT SAaTLLPRVA NGSSFVTVRG STEAAVGAAV
30) IPTN BETA-TRYPSIN (NATIVE AT PH 8) [HYDROLASE: SERINE PROTEINASE] ( C W PANCREAS: 80s TAURUS) ............ lPTN
SHEET.... A BB ccccc c ccccc c cccc cccc ccccc cc cc
...........ccccc
BRIDGE2.. KKK LL L MMMM NNNN 00000
BRIDGEl.. A BB JJJJJ J JJJJ LLL KKK N N N N O 00 00 MMMM
CHIRALITY ++------+ +-++----+- +-++-+--+- -++--+--++ ++-++----+ +++-++-+-+ +--+---+-- ---++--++- +++++----- +----+-+++
BEND...,. S SS SSSS S SS sss ss s ss s sssss s S ss sss s ss sss
5-TURN..
4-TURN.. >444 <
..
3-TURN.. . >33 0 3 3 < >33< > > 3 << >33< >33< >33<
SUMMARY.. BS EE TT SSTTEEEEES SSEEEEEEEE ETTEEEE GG G SS EEEE S SSTTS S EEEEEEEE EE TT TTT TT EEEEE SS SSS
EXPOSURE. 0153.82364 9132100023 7 * 4 3 1 0 0 0 0 2 5 4 7 1 0 0 0 0 0 7 0 * 5 * 6 1 5 0 5 0 1 1 2 3 3 * 7 * * 5 5 3 3 8 2 4 2 7 6 4 452'7678'7 3420000040 *851887*70
1 SEQUENCE. IVGGYTaGAN TVPYQVSLNS GYHFbGGSLI NSWWSAAH bYRSGIQVRL GEDNINVVEG NEQFISASKS IVHPSYNSNT LNNDIMLIKL KSAASLNSRV
SHEET.. .. D BBBBB
BRIDGEZ.. 1111
BRIffiE1.. P GGGG
CHIRALITY ++------++ ++++++++++ t or
BEND..... S ss ssssssssss s
5-TURN..
4-TURN.. > > > > x x x x <<<<
..
)-TURN.., 3< >>> <<< > 3 3<
SUMMARY.. TB EEEEEGG GGHHHHHHHH H H
EXPOSURE . 870 0 0006 0 2 7 1 3950**14 8 89
2 0 1 SEQUENCE. NKPGWTKVC NYVSWIKQTI ASN
31) lSGA PROTEINASE A ISGPA) . . [HYDROLASE: SERINE PROTEINASEl ISTREPTWYCES GRISEUS) , ... ................................. lSGA
SHEET.. .. AA AA BB BB BBBBBB i 'BBB B c BBBB B - D DD DDD EEEEE
BRIDGE2.. C C DDDD FFF F FFF F NNN
BRIDGE1.. AA AA BB BB BBBB E E a DDDD L LL LLL MMMMM
CHIRALITY -++-+++-- +-+-----+- ---+-+--+- -+++++--++ -+----++++ +++-+----- -+-+++--+- ---+-+---- +-+----++ -----+-++-
BEND..... SSS SS S sss ssssss s ss sss sssss ss sss ss sss
5-TURN.. .
4-TURN.. >>44<<
3-TURN.. .. >33< >33< >33< >33 <
SUMMARY.. SSSEE SS S EE EE EETTEEEEEE HHHHSS S BTTEEEEE SBS EEEE ESSTTS SE EE S S S EEE TT EEEEE S S S
EXPOSURE. 4 0 0 0 5 5 0 4 5 7 9 5 * 1 0 0 0 1 1 1 42'6422008 0040068349 1 6 2 1 7 7 9 1 5 3 478202000* 18'9832528 0447'7776' 08724*2*96 2817000251
1 SEQUENCE. IAGGEAITTG GSRCSLGFNV SVNGVAHALT AGHCTNISAS WSIGTRTGTS FPNNDYGIIR HSNPAAANGR WLYNGSYQD ITTAGNAFVG QAVQRSGSTT
35) EPAP PAPAIN (HYDROLASE: SULFHYDRYL PROTEINASE] (PAPAYA FRUIT LATEX: CARICA PAPAYA]. ............................ .BPAP
SHEET.. AA B C B
BRIDGE?.
...
BRIDGEl.. AA H I H
CHIRALITY -+-+-+++- ++-------- +-++++++++ ++++++++++ +-+-----++ +++++++++- -++++--+++ ++++++---- --+++--+-+ -+--+++++-
BEND..... S SSSS S ss SSSSSS ssssssssss ss s SSSSSS sss sss sss ssssssss ssss 5 s ssss
5-TURN... > 5 5 5 5< >555 5<
4-TURN.. > > > > x x x xxxxxxxxx< <<< >> > > x < < < < >>>> xxxx<<<< >>44<
3-TURN.. >33 < >33< >33< >33< >33< >33
..
SUMMARY.. S EESTTT T SS HHHHHH HHHHHHHHHH HH B H HHHHHH TTT TSTT HHH HHHHHHHS B BSSSS S S HHHH
EXPOSURE. *7**134*** 3022932714 *030210100 0100000155 *85*636000 0000031**0 '04'3281'8 005202**00 194'606491 86**150*'9
1 SEQUENCE. IPEYVDWRQK GAVTPVKNQS SaGSCWAFSA W T I E G I I K I RTGNLNQYSE O E L L W D R R S YGaNGGYRJS ALQLVAQYGI HYRNTYPYEG VQRYbRSREK
SHEET.. .. C A AAA A AAAA AA AA AAA AA A AAAA AAAAA
BRIDGE2.. D FF F G GGGG GGGGG
BRIDGEl.. I B BBB c cccc ee CC CCC AA F FF ee
CHIRALITY +--++----- ----++-+++ ++++++---- +-+-+++-++ ++++-+--+- -+--+-++-+ --+----++- -----+--++ ----+----- ++-+-+--+-
BEND..... S S S ss sss ssssssss s ss sss sss s s ss ss ss sssss ss ss ss
5-TURN.. .
4-TURN... < >>>> x x x x < < < < > 444< >444<
)-TURN... < >33< > 33< >>3
SUMMARY.. S SB S E EEE S S HHH HHHHHHHS E EEEE S SS TTT SSSEE S S EE EEEEEE SSE EEEE SS SS STTTSEEEEE SS S S G G
EXPOSURE. 3'951'8925 '*1*5*7552 12'1028210 670202871'7 0**2'3372' 2'42'*5921 0000010565 0102016398 216'028807 45'777'018
101 SEQUENCE, GPYAAKTDGV RQVQPYNQSA LLYSIANQPV SVVLQAAGKD FQLYRGGIFV GPcGNKVDHA VAAVGYGPNY I LIKNSWGTG WGENGYIRIK RGTGNSYGVC
SHEET.. .. AAAA
BRIDGE2.. D
BRIDGEl.. BBBB
CHIRALITY +++-+-+---
BEND..... S SS
5-TURN..
4-TURN..
..
3-TURN... X<3<
SUMMARY.. GTTS EEEE
EXPDSURE. 0013412403 '5
201 SEQUENCE. GLYTSSFYPV KN
SUMMARY........H=ALPHA-HELIX....E=BETA-STRAND....B=BETA-BRIDGE....~=3-HELIX....I=5-HELIX....T~3-,4-, OR 5-TURN....S=BEND....
361 lFAB LAMBDA IMMUNOGLOBULIN FAB (NEW1 lHUMAN).......................................................................lFAB
SHEET.. .. A BB BB AA AAA BBBBBB BB AAAA AA AAAAAA BB BBBBBBB B BBBBB BBB
BRIDGE2.. BB EBB FF cccccc GG GGGGGG G GGGGG GG
BRIDGE1.. A dd dd A EEEEEE FF CCCC CC BBBBB E EEEEE ddd
CHIRALITY -+-+--+-- +--++----- +-+--+-+++ -+------+- -++------- +-+-+++--- +--+ ---- +- ---- ++----
BEND..,.. S S ss ss
sss sssss ss ssss
___ss+++---- +-+-++++--
s ss s sss
5-TURN.. >5555<
4-TURN..
..
3-TURN.. . >33< >33x33 c >33< >33< >33<
SUMMARY.. S B S E E E E TTS EE EEE TTTTT SS EEEEEE SSSS E E SS SSEEEE EETTEEEEEE S STT EE EEEEEEETTE E E E E E E E E E E
EXPOSURE. 9 3 * 1 * 2 3 * 7 2 7 0 2 7 7 8 ' 1 7 8 5 0 5 2 5 ' 6 0 8 3 5 6 7 6 1 5 0 0 5 1 886908'43. 6*+5**2676 * 8 6 5 4 0 2 0 1 0 538'4'1211 0101004'41 1010420'84
1 SEQUENCE. XSVLTQPPSV SGAPGQRVTI SaTGSSSNIG AGNHVKWYQQ LPCTAPKLLI FHNNARFSVS KSGSSATLAI TGLQAEDEAD YYaQSYDRSL RVFGGGTKLT
SHEET.. .. BBBBB
BRIDGE2.. HH
BRIDGEl.. dddd
CHIRALITY
BEND..
5-TURN..
.... -----
4-TURN..
3-TURN..
..
SUMMARY.. EEEEE
EXPOSURE. 0090*5*
1 0 1 SEQUENCE. GTKLQIT
38) 3PGM PHOSPHOCLYCERATE MUTASE (DE-PHOSPHO) [ISWERASE] (YEAST: SACCHARWYCES C E R E V I S I A E I ........................... .3PGM
SHEET.... AA B AAAAA AAA
BRIDGE2.. BB ddd
BRIDGE1.. a f ccc dd d
CHIRALITY +--+----- -+++++-+++ -+-+----++ ++++++++++ ++++-+---+ -----+-+++ ++++++++++ -++++----+ -+++------ ++++--++++
BEND.. sssssss ss ss ssssssssss ssssss s s sss ssssssssss ss s sss s s SSSS
5-TURN.. > 5555< > >555<< > 5 5 55< >
....
4-TURN.. >>44<< >>> >xxxxxxxxx xx<<<< >>>> XXXXXXKX<< << >>>
3-TURN.. >33< >33<
..
SUMMARY.. EE B SSHHHHS TT HH HHHHHHHHHH HHHHHS S E E E E E S HHH HHHHHHHHHH HT S EE ESSS S S SSHH
EXPOSURE. 5400014104 07525'8552 1 1 3 4 3 9 1 2 5 * 0'*601900* 1181.36986 1110062521 3 5 0 0 8 2 1 2 * * 1'5**4957* 1371140041 8321845391
1 SEQUENCE. PRLVLVRHGQ SEWNERNLFT G W W V K L S A K GQQEAARAGE LLKEKGVNVL W Y T S K L S R A I Q T A N I A L E R ADRLWIPVNR SWRLNERHYG DLWKDKAQT
SHEET.. .. AAA
BRIffiE2.. ccc
BRIDGE1.. a
CHIRALITY +++++-++++ +++--++t-- ----++-++- +-+-+++++- -+++--+--- ++t+++++++ ++++++++++ +-+---+-+- ++++++++++ -+--++++++
BEND., , ssss sssss sssssss s sssss sssssss ss s ss ssssssssss sssssssss ss s ssssssssss ss ssssss
5-TURN.. 5555< > 5 5 5 5< >>555<<
...
4-TURN.. 4 < < < > > > 4 < << > 44>x>4xxx> x<<<x>44<< >>>>xxx<<<<
)-TURN.. >33< >33< >3><3< >>3<< >>3<< >3 3<
..
SSSSS SSTTTTS _.
-..... ~ SS
S S S ~. TTTHHHHHHH HHHHTHHHH SS EEE T THHHHHHHHH SS SSSSSS
SUMMARY.. HHHT SHHHH HSTTSSS S
EXPOSURE. ~ * + * i * * * s * 1 6 6 9 3 * + 6 4 a 3739;;;;93 3*39*71**2 *9*413*211 2 4 8 1 5 9 2 2 6 4 33**412*51 8 9 3 0 8 1 0 0 2 1 1 1 0 3 0 1 2 4 ' 1 9537.7625'
101 SEQUENCE. LKWGEEWN TYRRSFDVPP P P I 3 A S S P F S QWCDEKYKYV DPNVLPETES LALVIDRLLP YWQDVIAKLV GKTSMIAAHG NSLRGLVKHL EGlSDADIAK
SHEET.. .. B A A A
z4
BRIDGE2.. E
BRIDGE*. . f 88 E
CHIRALITY +---++---- -+--+-+-++ ----- t t +
BEND.. sss ss s s s
5-TURN..
....
(-TURN.. . >444<
3-TURN.. . >33< >33<
SUMMARY.. BSS EE TTT S B S T T
EXPOSURE. **24301011 1416**4*47 *969725**6
2 0 1 SEQUENCE. LN I PPGT I LV F ELDENLKPS KPS Y Y L DPEA
39) l T I M TRIOSE PHOSPHATE ISOMERASE [D-GLYCERALDEHYDE-3-PHOSPHATE-KETOL-ISOMASE] (CHICKEN BREAST MUSCLE: CALLUS GALLUS) l T I M
SHEET.. .. AAAAA B AAAA A AAA M A C AA Ah
BRIDGE2.. bbbb ccc , ee
BRIDGE1.. aaaaa I aaaa a c c c ddd J dd d 0
CHIRALITY --------- --++--++++ +++++++++- +++-t----- +--+++++++ +++--++--+ --+---+--+ -+-++---++ ++++-+-+-- +++-++++++
BEND.. s ssss sssssssss ss sssssss ssss sss s ssss s ss ss ssssss ssssss 0
5-TURN.. >5555< >555
....
4-TURN. >>>>X xxxxxx<<<< > 4 > > x > x x <<<< >>> >x<<<< >>>>X<<
3-TURN.. >33< >33< >33 < >33<
...
SUMMARY. . EEEEE B HHHH HHHHHHHHH
.................. S
.S. EEEE
~ _ _ _E TTHHHHH HHHS TTEEE EEE S S S S BS SS H H HHHHHT EE EE H H H H H H
EXPOSURE. * 9 9 9 3 0 0 0 0 1 13042'8'41 1*20*71362 * 9 6 8 * 3 7 1 1 0 0 0 0 2 1 0 0 1 3 0 5 * 8 1 6 9 9 0 2 0 0 1 1 2 0 1 6 5 7 * 2 5 5 0 1 2 1 0 0 4 01'7231630 10001213'6
1 SEQUENCE. APRKFFVGGN WKMNGKRKSL GELIHTLDGA KLSADTEW C GAPS I YLDF A RQKLDAKIGV AAQNC YKVPK GAFTGE IS PA M I KD I GAAWV ILGHSERRHV
SUMMARY........H=ALPHA-HELIX....E=BETA-STRAND....B=BETA-BRIDGE....G~3-HELIX....I=5-HELIX....T=3-~4-, OR 5-TURN....S=BEND....
TABLE A111 (continued)
SHEET.. .. AAAA AAAA
BRIDGEZ.. h h
BRIDGE1.. qgqq bbbb
CHIRALITY tt-----t-- t-tttttttt t-ttt----t ttttt-+ttt ttttt
BEND..... SSS S SSSSSSSS SSSS ssssssssss ssss
5-TURN..
4-TURN.. <<< > 4 > > x > < < << > 4 4 4 < >>>> <<<<
..
3-TURN.. . > 3 3x33< >>><<x33< >33<
SUMMARY.. HHSEEEE S TTHHHHHH TSTT EEEE SGGGGSTHHH HHHT
EXPOSURE. 8 8 0 2 0 0 1 1 2 8 045451'701 62'5100028 26207.7502 *0151**
2 0 1 SEQUENCE. VQSRIIYGGS VTGGNCKELA SQHDMGFLV GGASLKPEFV D I I N A K H
40) 3CNA CONCANAvALIN A [LECTIN, AGGLUTININ] (JACK BEAN: CANAVALIA ENSIFORMIS] . .... ,... . ... ...CCA .... ,.3CNA
....AAAAAA
B
SHEET.... AAAAAAA AAAAAA AAA BBBB BBBBB
..BBBBBB
........, , ...BBBBBB
BRIDGEZ.. BBBBBB ccc H H HHH I 11111 EEEEE x
BRIDGEl.. AAAAAA BBBBBB ccc GGGG GGGGG HHHHH 111111 OOD DD 3.
CHIRALITY --+-+----+ +-ttt+-t-- t-+-+----+ -+--+----- ---++----- -tt--t+-+- ++--+--++- ---t---t-- tttt--tt-- t-t---ttt- w
BEND., s sss ss ss sss ss ss sss sss ssss ss sss cn
5-TURN.. >5555<
....
4-TURN.. >444< >444< 2
3-TURN.. >33< >33< >33<
..
SUMMARY.. EEEEEEE S TTTT S S EEEEEES SSS SSEEE TT EEEE EEEEETTT E E E E E E E TTS EEEEEE TTTS SEEE EEEEEE S S S 5,
EXPOSURE. * 5 4 1 0 0 1 0 0 0 6 5 4 1 9 ' 2 5 1 3 9 8 4 0 1 0 0 0 2 ' 21'29'62'1 * 5 7 7 3 * 8 0 5 0 807174'*9* 12261518.4 '729138'27 2**4169*04 10000002*8
1 SEQUENCE. ADTIVAVELD TYPNTDIGDP SYPHICIDIK SVRSKKTAKW NMQDGKXTA HIIYNSVDKA LSAWSYPNA DATSVSYDVD LNDVLPEWVR VGLSASTGLY
SHEET.. .. A AAAAA
BRIDGE?. . DDD
BRIDGE1.. A AAAAA
CHIRALITY -+-+-----+ -+-+--++-- --+++-++-- tt--t
BEND.. ssss ss ss sss sss s
5-TURN..
....
4-TURN.. >4 44<
3-TURN..
..
SUMMARY.. SSSS E EEEEE S S SS S S S T TTS S
EXPOSURE. 1*5**10100 0 0 0 2 0 2 8 * 2 9 41'4858930 0 1 0 7 9 6 7
2 0 1 SEQUENCE. SPDS HPADGI AFF I S N I D S S I PSGSTCRLL GLF PDAN
SUMMARY........H~ALPHA-HELIX....E=BETA-STRAND....B=BETA-BRIDGEI...G=~-HELIX....I=~-HELIX....T~~-,~-, OR 5-TURN....SeBEND....
41) lCAC CARBONIC ANHYDRASE FORM C [CARBON-OXYGEN LYASE, CARBONATE DEHYDRATASE] [HUMAN ERYTHRDCYTESl...................lCAC
SHEET., .. AA BBBB BBBB BBBB BBBB BBB BB BBBB
BRIDGEZ.. EEE FFF GG H H H HH HHHH
BRIDGEl.. aa cccc DDD E EE cccc GG F F F
CHIRALITY +-+-+++-+ ++++-+++++ +-++-+---+ -+++--++++ ------++++ -----+-+-+ +-------++ -+++-++-++ -+----+--+ --+--+-+++
BEND..... SSSSSSS SS SSS SS S SS SS SSSS S S ss s s s ssss ss ss ssss
5-TURN.. .
4-TURN.. , >444<
3-TURN., . > 3 3 < > > 3 < x 3 3 < > > > < << >33 < >33 < >33 < >33<
SUMMARY.. SSTTSSG GG TTS GGG G SS SS EE TTTS TT S E E E E TT EEEE S S EEEE S SSSSEEEETT SS EEEEEE EEEE SSTT
EXPOSURE. * 5 1 4 3 * * 4 0 3 * 5 1 * * * 7 9 7 1 * 3 ' * 0 0 0 1 5 1 85*917'3** 4.734561'' 2666.03171 5 0 0 3 0 5 0 7 4 9 '68111'222 2.53051431 2 1 1 1 0 4 * 8 6 5
1 SEQUENCE. HWGYGKHDGP EWHKDFPIA KGERQSPWI DTHTAKYDPS LKPLSVSYDQ ATSLRILNDG HAFNVEFDDS EDKAVLKGGP LDGTYRLIQF HFHWGSLDGE
42) 1DFR DIHYDROFOLATE REDUCTASE [OXIDOREDUCTASE: NADPH/DONR, DIHYDROFOLATE/ACCPTRl (BACTERIAL: LACTOBACILLUS CASE11 , lDFR ...
SHEET.... AAAAAAA A B AAA AAA A AAA
BRIDGE2.. bbbbbbb eee f eee
BRIDGE1.. aa C C I aaa ddd f aa
CHIRALITY --+------ +-+---+--- -+++++++++ ++++++---+ --++++++-+ -+--++---- -+-+-++-+- ++--+-+-++ ++++++++-+ ++---++--+
BEND.. ss s s ssss s s sssssss sssssss ssssssss sss sss ss sss s ss sssssssss s s s
5-TURN..
....
4-TURN.. >>>>XXKX X<<<< >>4><<4< >>> > x < < < < >>
3-TURN.. >33 < > > 3 << > 3 3 < >>3<< > 33< >33< >>
..
SUMMARY.. EEEEEEETT B SBSSSS S S H H H H H H H HHHHTTSEEE HHHHTTSS SSS SSSEEE SS TTB S HH HHHHHTTSS S E E E S H
EXPOSURE. 32082121" 2 1 3 1 5 * 5 9 8 0 2 ' 0 6 5 2 9 * 4 4 ' 6 5 0 4 5 2 0 0 2 Bl''30755f **435*4400 041*9**3*1 '524548956 327581**3* '7'2002821
1 SEQUENCE. TAFLWAONRN GLIGKDGHLP WHLPDDLHYF RAOTVGKIMV VGRRTYESFP KRPLPERTNV VLTHQEDYQA CGAWVHDVA AVFAYAKQHL DQELVIAGGA
SUMMARY. . ......H=ALPHA-HELIX.. .,E=BETA-STRAND.. ..B=BETA-BRIDGE.. ..G=3-HELIX.. ..I = 5 - H E L I X a . . .T=3-, 4-, OR 5-TURN.. ..S=BEND.. ..
TABLE A111 (continued) tQ
6,
p\3
SHEET.. .. AAAAAAA B A A AAAA A AAA A AA 0
BRIDCE2.. GGGGGGG HHHHHH
BRIDGE1.. bbbbbbb I H H HHHH GGGGGGG
CHIRALITY ++++++++++ +-------+- --+-++---+ -++-+--+++ +-----+-++ ++--+-----
BEND..... SSSSSSSSS S ss s s s s ssss s
5-TURN.. .
4-TURN... >>XX<<<< > 4 4 4<
?-TURN...
. .. . .. 1<< >??<
__ >
.??
-- <.
SUMMARY.. HHHHHHHTS SFEEEEEESS SB S S EEE EEEE SSTT T EEEEEEE
EXPOSURE. * 2 0 8 5 2 8 * 8 1 6 3 1 2 0 0 6 0 7 1 68'387.22' 2R9.929986 7**3*5*6*3 2 2 0 2 2 2 3 4 * * **
1 0 1 SEQUENCE. QIFTAFKDDV DTLLVTRLAG SFEGDTKMIP LNWDDFTKVS SRTVEDTNPA LTHTYEVWQK KA
SUMMARY.. ......H=ALPHA-HELIX, ...E=BETA-STRAND.. ..B=BETA-BRIDGE.. ..G-3-HELIX.. ..I=S-HELIX.. ..T=3-, 4-, OR 5-TURN.. . .S-BEND.. ..
SHEET.. , . DDDDD
BRIDGE2.. 000
BRIDGE1.. NNNNN
CHIRALITY +++-t-+--- ++++++++++ ++++++++++ +
BEND.. ss ssssssss ssssssssss s
5-TURN.. >55 55<
....
4-TURN.. >>>><xx<>xx>xxxxxx<< <<
3-TURN.. 33< >>3xx3<< >33<
..
SUMMARY.. TTEEEEE HHHHHHHHH H H H H H H H H H H HT
EXPOSURE. 85'1711831 0123120310 8 3 1 1 9 6 0 3 ' 3 28.
3 0 1 SEQUENCE. KTF VKWSWY DNEF GYS QRV I DLLKHMQKV DSA
44) 4ADH APO-LIVER ALCOHOL DEHYDRWENASE IOXIDOREDUCTASE: CHOH/DONR, NAD/ACCPTR] (HORSE LIVER: EQUUS CABALLUS). ...... ..4ADH
SHEET.. .. AAAA AAAB AAAAAAA CCCCC DDD B D CC CC CEE E
BRIDGEZ., B cc HHHH K f I
BRIDGE1.. AAAA AAAE AAAAAAA GGGGG JJJ E K H H HH IMM M
CHIRALITY -++---+-+ ---++-++-- --+------- --++-----+ ------++++ +++-++-+-- ++--++++-+ --++---+++ ++--++-+-- --+--+-++-
BEND.. sss ssss ss ssss sssssss ss s ss s ss ss sss
5-TURN.. >5555<
....
4-TURN.. > > > > X <<<< >
3-TURN.. >33< >33< >>3<< >33< >33<
..
SUMMARY.. SSS EEEE EEEB STTS EEEEEEE TTEEEEE EEE H H H H HHHTTSS SSB B E E E E E TT S TT B E E E S S SSS
EXPOSURE. *'54*81'09 00008*'**9 1 4 7 5 9 0 5 1 3 5 1'4'100060 4 0 0 0 0 1 ' 5 0 1 3015244.14 5 2 0 1 0 0 0 0 0 0 0 6 1 3 2 3 2 * 5 1 * 6 0 * 7 6 2 5 0 0 0 0 2 0 1 2 4 6 * 2
1 SEQUENCE. STAGKVIKCK AAVLWEEKKP FSIEEVEVAP PKAHEVRIKM VATGICRSDD HWSGTLVTP LPVIAGHEAA GIVESIGEGV TTVRPGDKVI PLFTPWGKC
45) 4LDH LACTATE DEHYDROGENASE, APO ENZYME M4 [OXIDDREDUCTASE: CHOH/DONA, NAD/ACCPTRl [DOGFISH MUSCLE: SQUALUS ACANTHIUSILDH
SHEET.. .. AAAAA AAAA A AAAAA AAAA
BRIDGEZ.. bbbb cccc c dd
BRIDGEl.. aaaaa aaaa a ccccc bbbb
CHIRALITY -++++++-- -+++--+--+ +----+-+++ ++++++++++ +-++-+---- +-+-++++++ ++++++++++ +-t-+++--- +-++++++-+ ---+++----
BEND..... SSSSSS S s s ss ssssssssss ss ss s ssssss ssssssss s ss s ssssss s s
5-TURN.. >5555< >5 555<
4-TURN.. >>44<< > > >>xxxxxx<< << > > > > x x <x x x 4 < < <
..
3-TURN.. >>3<<
. >33< > 3 3x33< >37< > > 3 < < > 1 > x 7 << >>7<<
SUMMARY.. THHHHS TT s S E E E E E S H H H H H H H H H H H HTT S E E E E E s HHHHHH HHHHHHTTGCGS S E E E E E SSSGCG s s E E E E
EXPOSURE. * 8 * * 9 * * 9 6 * '*9*****2' 6 1 0 0 0 0 0 1 6 3 3 8 2 1 0 0 8 1 0 1 * ' 7 1 0 6 1 0 8 1 0 2 * 9 9 + 9 1 ' 4 43'*37*25' *4'1882421 *927303417 00001442*6
1 SEQUENCE. ATLKDKLIGH LATSQEPRSY NKITWGCDA VGMADAISVL MKDLADEVAL MVMEDKLKG EMMDLQHGSL F LHTAKIVSG KDYSVSAGSK LVVITAGARQ
SHEET.. .. AA A BB C BB
BRIDGEZ.. e
BRIDGE 1. . dd e FF C FF
CHIRALITY -++--+++++ ++++++++++ ++++++++++ ----++-+++ ++++++++++ -+--++++-- ++++++++++ +++++++-+- -+++-+-+++ +-++++----
BEND..... SS S S S S S SSSSSSSSSS S S S S S S S S S ss s ssssssssss s sss ssssssss sssssssss SSSS ssssss
5-TURN.. >5555< >5555<
4-TURN.. > > > 4 x x x>xx<xx<4x x>>x<<<< >> >>xxxxxx<< << >4>>X4X< < > < 4 > < 4 4 < > 4 4 4 < >
..
3-TURN.. . > 33< >33x33 < >33< >33< >33< > > 3 << >33<
SUMMARY.. SS H H H H H H H H H H H H H H H H H H H H H H TT EE S S H HHHHHHHHHH H TTS B TTTTHHHHH TTTTTTTTT TTTS E E BSSSTT E E
EXPOSURE. + * 5 8 4 * 6 * 1 1 3 9 0 0 8 7 8 7 ' 1 06583'387' 0800822'31 0 2 0 0 3 0 0 3 9 2 2 6 1 7 ' 7 4 0 0 8 1 0 0 2 2 1 1 4 7 0 7*310"*75 '386030580 0262'40118
1 0 1 SEQUENCE. QEGESRLNLV QRNVNIFKFI IPNIVKHSPD C I I L W S N P V DVLTYVAWKL SGLPMHRIIG SGCNLDSARF RYLMGERLGV HSCSGVGWVI GQHGDSVPSV
46) 2GRS GLUTATHIONE REDUCTASE IOXIWREDUCTASE: GSSG/ACCPTR, NADPH/DONR, FLAVOENZYMEI (HUMAN ERYTHRCCYTE)..............ZGRS
SHEET.. .. AA B AA AAA B
BRIDGEZ.. b cc ccc
BRIDGE1.. aa f aa f
CHIRALITY +--++--+- +-++++++++ ++++-+---- -+-+--+--+ ++++++++++ ++++++++++ ++-++++-+- --+----+++ ++++++++++ ++++++++++
BEND.. s s sssssssss sssss ss ss s ssssssssss ssssssssss ssss ss ss ssssssssss ssssssssss
5-TURN.. >5555<
....
4-TURN.. > 4 > > < > x x >x < < < < > > > > < < < x > > > x xxxxxxxxxx< <<< >>> >xxxxxxxxx xxxxxxxxxx
3-TURN.. >>3<< >3><3 < > 3 3< >>3<< >33<
..
SUMMARY.. S EES BTTnHHHHH HHHHTT E E EEESS TBHH HHHHSHHHHH H H H H H H H H H H HHSS GGGS H H HHHHHHHHHH H H H H H H H H H H
EXPOSURE. * 5 5 3 7 0 0 0 0 1 0 1 3 1 0 1 1 0 0 5 702*960*80 0836**2826 2 8 2 4 2 5 3 0 3 7 4028304.23 *5*6045*7* **8*5*7*1* 732**05391 7'23698'86
1 SEQUENCE. VASYDYLVIG GGSGGLASAR RAAELGARAA WSHKLGGT aVNVCaVPKK WNTAVHSE FMHDHAWGF PSCEGFFNWR V I K E K R D A Y V SRLNAIYQ"
SHEET.. .. AAAAA C C C M A DD EE EEEE EEE
BRIDGE2.. H ddd 111 mrn
BRIDGEl.. ccccc G G H b 11 __ ii kkkk 111 m
CHIRALITY +++-+----+ --+---+-+- +--+--+--- +-+---++++ +------+++ -+++++---+ +++++++--- ++---+-+-+ ++++++++++ +-+-----+- 2
BEND.. SSSS s ssss s sss ss sss sss ss s sssss ss ss ss ssssssssss sss
5-TURN.. >5555< > 5 555<
....
4-TURN.. <<<< >> 44<< >> >>xxxxxx<< <<
3-TURN.. .. >33< >33< >33 < >33< >3 3x>3<< >33<
SUMMARY.. HHHTTEEEEE S B S S S S S B STT B SSEEE EE STT SSS TT E E H HHHTT S S SSEEEE SH H H H H H H H H H H HTT EEE
EXPOSURE. 09'391961' 2*12117*9* 51081.6854 5885000150 183392**9* 4 7 1 0 9 7 2 6 3 1 4282*5**61 3200021248 5 0 8 l l l l B R 3 1 2 4 1 9 0 0 1 0 0
1 0 1 SEQUENCE. LTKSHIEIIR GHAAFTSDPK PTIEVSGKKY TAPHILIATG GMPSTPHESQ IPGASLGITS DGFFQLEELP GRSVIVGAGY IAVEMAGILS ALGSKTSLMI
SHEET.. .. D D BBBBB B B B B B AA
BRIDGEZ.. 11111 I I111
BRIDGEl.. k k GGGG E BB
CHIRALITY ++-t-+++-+ ++------+- ----+++--+ ++++--+++- +++-----+
BEND.. s sssss sss ss ss ss sssss s
5-TURN..
....
4-TURN.. > 4 44<
3-TURN.. > 3 3 < > 33< >33<
..
SUMMARY. . S BSSSTTB TTSEEEEESS SS S T TTTTS S EEEEEE EE
EXPOSURE. 683293'820 10310000** 71571*649* *047404037 6301110021
1 0 1 SEQUENCE. L I S L S G E Y S I I G R W I W H E R PDDLGRGGNE E S T R T G N A G S R L A a G V I G I A K
48) l M B N MYOGLOBIN [OXYGEN STORAGE] (FERRIC I R O N METWYOGLOBIN) (SPERM WHALE)
- ......................................... lMBN
SHEET.. ..
BRIDGEZ..
BRIffiEl.,
CHIRALITY --+++++++ ++++++++++ ++++++++++ +++++-++++ ++-+++++++ -++++++-++ ++++++++++ +++++++--+ ++++++++++ ++++++-+--
BEND..... SSSSSSS SSSSSSSSSS ssssssssss sssss ssss ss sssss sssssss ss ssssssssss ssssssssss sssssss sssssss
5-TURN.. . >5555< >>555<<
4-TURN... > > > > X X X X X X X X < < < < > >>>xxxxxxx x x < < < x > > 4 << < > > 4 4 < < > > > > < < < x >>xxxxxxxxx
> xxx<<<< >>4>xx>xx xx<<<< >
3-TURN.. . >3>x3<< > > 3 < x 33X33X33< >33< >>3x<3 <>33< >
SUMMARY.. H H H H H H H HHHHHHHTTS H H H H H H H H H H H H H H H H H H H HT HHHHT SHHHHHH HH H H H H H H H H H H HHHHHHTTTT H H H H H H H H HHHHHTT
EXPOSURE. *15*551*50 5'81650997 3 2 1 0 0 2 6 1 1 1 * 1 1 * 5 5 8 8 1 2 * 8 6 + * 8 8 + 1 + 68791*83*6 0 7 * 3 0 5 * 4 0 8 418310'882 '8'781'748 74508'9'25
1 SEQUENCE. VLSEGEWQLV LHWAKVEAD V A G H G Q D I L I RLFKSHPETL EWDRFKHLK TEAEMKASED LKKHGVTVLT A L G A I L K K K G HHEAELKPLA QSHATKHKIP
SHEET.. ..
BRIDGE2..
BRIDGE1..
CHIRALITY ++++++++++ ++++++++++ +++-++++++ ++++++++++ ++++++++-+ +
BEND..... SSSSSSSSSS SSSSSSSS S ss sssssss ssssssssss ssssssssss
5-TURN.. . >5555 <
4-TURN... >>>XXXXXXX XXXXX<<<X4 4 4 < > > > > x x x xxxxxxxxxx xxxxxx<<<<
3-TURN... 33< > 3 3 X 3 3< >33< >33< >33<
SUMMARY.. HHHHHHHHHH H H H H H H H H T TTTSHHHHHH H H H H H H H H H H HHHHHHHHHT
EXPOSURE. 4'42634091 119189892' 78848'1730 0 6 9 0 0 + 2 2 3 * 6 0 1 6 2 1 * * 6 7 '8'
1 0 1 SEQUENCE. IKYLEFISEA IIHVLHSRHP GNFGADACGA MNKALELFRK D I A A K Y K E L G YW
49) l E C D HEMOGLOBIN (ERYTHROCRUORIN, DEOXY) [OXYGEN TRANSPORT] (CHIRONOMOUS THUMMI THUMMI) ............................. lECD I/,
SHEET.. , .
BRIDGE2..
M
BRIDGE1..
CHIRALITY -++++++++ ++++++++++ ++++++++++ -++++++-++ +++--+++++ +-++++++++ ++++++++++ +-++++++++ +++++++++- t--+++++++
BEND.. ssssssss ssssssss s ssssssssss ssssss ss ssss sssss ssssssssss ssssssssss ss s ssss ssssssssss sssssss
5-TURN.. >555 5<
....
(-TURN.. >>>>xxxxx x < x < < 4 < > > >>xxxxx<<<x > 4 4 < < > 4 4 4< > > 4 4 < < >>>>xxxxxxxxxxxxx<< << > > > > xxxxx<<<< >>>>xxxx
3-TURN.. >33< >>3<x33< > 3 3 x > 3 < x 3 3 x33<>33<>3 3< >> 3x<3< >>3< <
..
SUMMARY.. H H H H H H H H HHHHTTTT H H H H H H H H H H H HHHHTT TT TTTS HHHHT TSHHHHHHHH HHHHHHHHHH HTTT HHHH HHHHHHHGGG T HHHHHHH
EXPOSURE. 768'526507 611'+1*861 31114100'5 295226.5'9 859'82'818 8 6 7 5 0 * 7 5 0 8 '514724'11 5428'8'620 8832755'" 4367'21754
1 SEQUENCE. LSADQISTVQ ASFDKVKGDP WILYAVFKA DPSIMAWTQ FAGWLESIK GTAPFETHAN RIVGFFSKII GELPNIEADV NTFVASHKPR GVTHDQLNNP
SHEET.. ..
BRIffiE2..
BRIDGE1..
CHIRALITY ++++++++++ +-++++++++ ++++++++++ ++++
BEND.. ssssssssss ss ssssss ssssssssss ssss
5-TURN..
....
4-TURN.. xxxxxxxx<< << > 4 4 > x > > xxxxxxxxxx xx<<<<
)-TURN.. >>><<< >33<
..
SUMMARY.. H H H H H H H H H H HS GGGGHHH H H H H H H H H H H H H H H H
EXPOSURE. 65003710'8 *3'1571981 02201'6214 4 0 6 9 5 5
1 0 1 SEQUENCE. RAGFVSYMKA HTDFAGAEAA WGATLDTFFG MIFSRCI to
Q,
SUMMARY.. .... .H~ALPHA-HELIX....E=BETA-STRAND....B~BETA-BRIDGE....G~3-HELIX....I=5-HELIX....T-3-,4-, OR 5-TURN....S=BEND.... to
v1
TABLE A111 (continued) N
03
N
50) 2MHB H E M O G L O B I N ( A Q U O MET) [OXYGEN TRANSPORT] (HORSE: EQUUS CABALLUS) .2MHB
............................................. Q)
SHEET.. ..
BRIDGEZ..
BRIDGE1..
CHIRALITY --+++++++ +++++++-++ ++++++++++ +++++-++++ +++++--+-+ t - + + + + + + + + ++++++++++ ++++++++++ ++++++++++ ----++++++
BEND..... SSSSSSS SSSSSSS S SSSSSSSSSS SSSSS SSSS SS SSS SS S SSSSSSSS SSSSSSSSSS SSSS SSSSS SSSSSSSSSS S SSSSSS
5-111111..
.......... . > ~
. > > 5 5 < <. <
4-TURN... >>>>XXXX XXXX<<<< > >>>XXXXXXX XX<<<< > > > > x x x x x xxxxxxx<x< < 4 < > > 4 4 < x >>>xxx<<<< >>>>xx
3-TURN.. . >33<>>3<< >>>xx <<x33< > 3 3< >3 3x33< > 3 3 < >>><<<
SUMMARY.. H H H H H H H HHHHHHHGGG H H H H H H H H H H H H H H H GCGG GG TTS ST T HHHHHHHH HHHHHHHHHH TTTT HHHHT HHHHHHHHHT S THHHHH
EXPOSURE. * 3 6 8 6 3 5 9 5 0 *4 1 2 9 * 0 4 8 8 1 1 * 1 0 1 4 0 0 6 0116138*0* *73**4855* 71871'6609 *309316702 6 8 2 '91 8 4 2 1 7*419750** 7.186822.2
1 SEQUENCE. VLSAADKTNV XAAWSXIICGH AGEYGAEALE RMFLGFPTTK TYFPHFDLSH CSAQVKAHGK XVCDALTLAV GHLDDLPCAL SDLSNLHAHK LRVDPVNF K L
SHEET..
BRIDGE?.
...
BRIDGEl..
CHIRALITY ++++++++++ +++++++-++ ++++++++++ +++++++++ ---++++ +++++++++- -+++++++++ ++++++++++ ++++++-+++ +-++++++-+
BEND..... SSSSSSSSSS SS SSS SS SSSSSSSSSS SSSSSSS S ssss sssssssss sssssssss sssssss s s sss ss ssssssss s
5-TURN.. .
4-TURN... XXXXXXXXX< < < X 4 4 4 < > > >> X X X X X X X X X XXX<<<X444 < >>>>X XXXX<<<< >>>>XXXXXX XXX<<<< >>>4<<<>>
3-TURN. >33< >33< >3><3< >>3<< >33< >>>x xx<xx3<< >33<>33<
SUMMARY.. HHHHHHHHHH H H TTT HH HHHHHHHHHH HHHHHHTTTT HHHH HHHHHHHTT HHHHHHHHH HHHHHHSGGG GGGGGGG SSHHHHHT H
EXPOSURE. 4220110010 1 63 4 ' 6 1 5 2 4 2 1 1 0 1 2 7 0 1 6 2227221679 *09*255*45 5406521**2 74**001200 00102036*1 5*74*'3591 '8 6 5 3 8 5 5 2 "
1 0 1 SEQUENCE. LSHCLLSTLA VHLPNDFTPA VHASLDWLS SVSTVLTSKY RIVQLSGEEK AAVLALWDKV NEEEVGGEAL GRLLWYFWT QRFFDSFGDL S NPGAVMGN P
SHEET.. ..
BRIDGE2..
BRIDGE1..
CHIRALITY ++++++++++ ++++++++++ ++++++++++ +++++++-+- -+++++++++ +++++++++- +++--+++++ ++++++++++ ++++++
BEND..... SSSSSSSSSS SSSSSSSS S SSSSSSSS SSSSSSSS SSSSSSSSS SSSSSSSSSS SSS SSSSS SSSSSSSSSS SSSSSS
5-TURN.. . >>555<<
(-TURN... >>XXXXXXXX XXXX<<<< > > 4 4 < X > > >X X < < < < >>>>xxxxx xxxxxxx<<< < > > > > x x xxxxxxxxxx x < < < <
3-TURN.. . >>3xx 3<< >>3<< > >3<<
SUMMARY.. H H H H H H H H H H HHHHHHHTTG GGHHHHSHHH HHHHHTTS THHHHHHHH HHHHHHHHHH GGGS H H H H H H H H H H H H H H H HHHHSS
EXPOSURE. *896706*30 9310900*81 * 9 1 * 4 5 0 2 7 5 8.8528'7'2 '4'2282362 1 0 0 0 0 0 0 6 7 4 1 * * 0 5 2 * 2 0 1 1 0 1 9 1 0 4 1 0 1 *023**5*
2 0 1 SEQUENCE. KVKAHGKKVL HSFGEGVHHL DNLKGTFAAL SELHCDKLHV DPENFRLLGN VLWVLARW GWFTPELQA SYQKWAGVA N A L A H K Y H
511 lLHB HEMOGLOBIN(MET)-CYANIDE V [OXYGEN TRANSPORT] [SEA LAMPREY: PETRWYZON MARINUS]. ......................... ....... lLHB
SHEET.. ..
BRIDGEZ..
BRIDGE1..
CHIRALITY --++----- ---++++++++ ++++++++++ ++++++++++ ++++++++++ +-+++++++- ++++++-+++ ++++++++++ +++++++++- -+++++++++
BEND.. SSS ssssssss sssssssss ssssssssss ssss SSSSS s sss s s ssssss sss ssssssssss ssssssssss s sss
5-TURN..
.... > 5 5 55< ,
4-TURN.. > > 4 > x x > < x < < > x > 4 < < x 4 >>x>xxxxxx x < < < < > 4 4 4 < > 4 4 4 < > > > > xxxxxxxxxx xxxx<<<< >>>>
3-TURN.. >33< > 3 3 < > 3 3 x 3 > < 3< > 3 3 < > 3 3x>3x<3< >33< >> 3 < x 3 3 < >33< >3>x>x<<<
..
SUMMARY.. SSS H H H H H H H H HTTHHHHHTT THHHHHHHHH H H H H TTTTT T GCCTT S SSTTTS H H H H H H H H H H H H H HHHHHHHSSS STTGGGHHH
EXPOSURE. *3 4 2* 6 * 379 2788436'1' 732663.997 ' 7 2 0 1 7 1 6 2 6 1 0 6 7 2 8 4 0 3 * 4 4 9 * 1 * 7 3 * 7 5*70**1780 8 * 4 0 4 * 3 1 9 3 1 4 9 1 2 6 2 1 * * 6 * * 2 3 9 3 * * 5
1 SEQUENCE. PIVDTGSVAP LSAAEKTKIR SAWAPWSDY ETSGWILVK PFTSTPAAEE FFPWKGLTT ADELKKSADV RWHAERIIDA VDDAVASMDD TEKMSSMKDL
SUMMARY.. ..... .H-ALPHA-HELIX.. ..E-BETA-STRAND.. ..B-BETA-BRIDGE. ...G=3-HELIX.. ..I-5-HELIX.. ..T=3-, 4-, OR 5-TURN.. .. SxBEND.. ..
SHEET.. ..
BRIDGE2..
BRIDCE1..
Y
CHIRALITY +++++-+--- -t+ttt+++t tt+tt--+-+ - t + t + + t t t t ++++++
BEND..... SSSSSSSS SSSSSSSSS SSSSSS S SSSSSSSSS SSSSSS
5-TURN.. . 5555<
4-TURN... XX<<X<44< >>>>XXX XXX<<<< >>>>XXXXX X<<<<
3-TURN.. . >>3<< > 3 3 < >>><<<
SUMMARY.. HHHHHTTT CCCHHHHHH HHHHHH S SHHHHHHHH HHHHCCC
EXPOSURE. 83*71****2 62'5182385 221.62378' 2 1 2 4 8 1 1 7 3 2 1421'55'
1 8 1 SEQUENCE. SCKHAKSFEV DPEYFKVLAA V I A M V A A C D AGPEKLLRMI C I L L R S A Y
52) I H B L LECHEMOGLOBIN (ACETATE,MET) [OXYGEN T R A N S W R T ] (YELLOW L U P I N ROOT NODUI.ES: LUPINUS LUTEUS L ) . ................. lHBL
SHEET.. ..
BRIDGE2..
BRIDGEI..
0
CHIRALITY -+-t+tt44 4tttt4ttt+ ttttt+++4+ t t t + t + - 4 t +t t t - t t t - 4 t - - - - t + - + t t t t t t + t t t t t t t t + t t t + + t- t - - - t - + t + t t t t t t t + t -
r
BEND..... S SSSSSS SSSSSSSSSS SSSSSSSSS SSSSSS SSS SSS SSS SS SS SSS SSSSSSSSSS SSSSSSSSSS SSS SSS SSSSSSSSSS
5-TURN.. . >555 5< >555
4-TURN... >>>>XXX XXXXXX<<<< >>4>XX>XXX <XX<4<< > 444X444< > > > > xxxxxxxxxx xxxxxxxx<< << >>>> xxxxxx<<<<
3-TURN.. >33<
. >>3 x<3< >>>x x<<x>3<x33 < >33< >33
SIRIMARY.. T T HHHHHH HHHHHHHHHT THHHHHHHHH HHHHHH GCC CCC CCC T T SS H H H HHHHHHHHHH HHHHHHHHHH HSS HHH HHHHHHHHHT
EXPOSURE. 2.56'61156 89703'7088 654'285818 3103.83981 5.65628'3' 3 * 2 3 * * 4 * * 8 47782'4844 14'114826' 8682578762 99305555"
1 SEOUENCE. CALTESQAAL VKSSWEEFNA NIPRHTHRPF I L V L E I A P A A KDLFSFLKCT SEWQNNPEL QAHACRWKL W E A A I Q L E V T C W A S D A T L KNLCSVHVSK
SHEET.. ..
BRIDGEZ..
BRIDGEI..
CHIRALITY t--ttttttt +ttt+ttttt +ttttt-ttt ttttttt+tt ttt+tt+ttt +
BEND..... SSSSSSS SSSSSSSSSS SS SS SSS SSSSSSSSSS SSSSSSSSSS S
5-TURN... 5<
4-TURN... >44>X>>X XXXXXXXXX< <<< >>>> XXXXXXXXXX XXXXXXXX<< <<
3-TURN... < >>3X<3< >>3<< > 33<
SUMMARY.. T CCCHHHH HHHHHHHHHH HHCCC HHH HHHHHHHHHH HHHHHHHHHH HT
EXPOSURE. 535'863734 4*882'809* 114"54*61 3 4 8 0 3 5 8 2 6 8 119225'41' *4*
1 8 1 SEQUENCE. CVADAWPW K E A I L R T I K E V V G A M S E E L NSAWTIAYDE L A I V I K K E M D DAA
SHEET.. ..
BRIDGE2..
BRIDGE1..
CHIRALITY -++++
BEND.. SSSS
5-TURN..
....
(-TURN... 4<
3-TURN.. .
SUMMARY,. TSSSS
EXPOSURE. 2'11055
1 0 1 SEQUENCE. GSSYFM
56) 3 P TI TRYPSIN INHIBITOR [PROTEINASE INHIBITOR] [CCW PANCREAS: 80s TAURUS).................................... .......3 P TI
SHEET.. .. AAA AAAA A A AAAAA A
BRIDGE2.. B
BRIDGE1.. AAA AAAA A A AAAAA B
CHIRALITY -++++---- --+-+-+-+- ----++-+-- -+----++-- -+++-+-+++ +++++-
BEND.. SSSSS S sssss sss ss sssss ssssss
5-TURN.. >5555<
....
4-TURN.. >444< >>>> xx<<<<
3-TURN.. >>><<< >33<
..
SUPIMARY.. GGGGS S EEE EEEETTTTEE EEEEE S S S SS BSSHHH HHHHHS
EXPOSURE. '5.40'5'59 7295'5'6'5 72145'5482 88272842'5 ''822'33.8 07.5024.
1 SEQUENCE. RPDPaLEPPY TGWKARIIR WYNAKAGW QTFWGGbRA RRNNPKSAED cMRTaGGA
SHEET.. .. E DD F F DDDD E
2 2 ??
BRIDGE2.. JJ
-J J_
BRIDGEI.. K il L L hhhh 4 4 K
CHIRALITY +---++--+- -++++--+-+ ----++++++ ++++-+--++ ----++---+ ++++++++++ ++-++++--- -+--++++++ +--+++-+-+
BEND.. ss sssss sss ssssss ssssss ss s sss ssssssssss ssss ss ssssss ss ssss s S
5-TURN.. >5555< > 5555<
....
4-TURN.. > > 4 > x x > x<<<< >>>>xxxx <<<< > > > > x < <<<
3-TURN.. >33< >3><3<>33< >33< >>3<< >33 < >3><3< > 33< > 3 3 < >3>X3<X33< >>3<<
..
SUMMARY.. B TT EE TTTTB TTS B HHHHHH HHHHTT TT S EEEE SSS STTHHHHHHH HHHT TT EE ETTTHHHHHH HS GGGSB S S
EXPOSURE. 1 8 3 0 * 6 1 5 2 8 2 2 9 7 0 1 4 ' 9 8 5 2 5 3 8 * 8 0 * 5 4 0 * 8 8 * 1 6 4 + ' 6 8 0 0 0 8 2 4 8 0180080000 1 5 4 5 1 1 8 8 8 8 810083204* *167*2353* 5 * *
2 8 1 SEQUENCE. SGH IRGSVNM P F M W LTENG F EKS PE ELRA MFEAKKVDLT KPLIAXRKG VTACHIALAA YLCGKPDVAI YDCSWFEWFH RAPPEWVSQ G KG
62) 2PAB PREALBUMIN [THYROXIN, RETINQL TRANSPORT] (HUMAN PLASMA).....r......................,.......................,..2PAB
SHEET.... AAAAAAA AA B BBBBBB BBB BBBB AA BBB BBBB BBBBBBB AAAAA
BRIDGEZ.. BB cc FFFFF GGG GGGG DDDDD
BRIDGEI.. aaaaaaa CC E EEEEEE EEE EEEE BB F FFFF CGGGGGG aaaaa
CHIRALITY ++------+ +-+---++-- -------+-+ ---+--+--- -+-+-+++-- -++++-+--- ----++++++ ++-+--+-- ----+--+++ -___-___--
BEND.. s ssss s sss s sss s ssss ss ssss ssss s s s ss
5-TURN.. > 5 555< > 5555<
....
4-TURN.. > 4 44< r444i >>>>xx ::::
j-TURN.. >33< >33< >33< > >3<<
..
SUMMARY.. EEEEEEET TTTEE S E EEEEEE TTS SEEEEEEEE TTSEE S TTTS SEEE EEEE HHHHH HHHT S S EEEEEEE S SS EEEEE
EXPOSURE. 6284888785 8.74486'81 0704761"'* 99'*4282*8 5'68418822 B"'16'160 3050515630 '"77693'5' 789274783. *3'2'34040
1 SEQUENCE. CPLMVKVLDA VRGSPAINVA V H W R K A A D D TWEPFASGKT SESGELHGLT TEEQFVEGIY KVEIDTKSYW KALGISPFHE HAEVVFTAND SGPRRYTIAA
SHEET.... AAA AAAAA hAA
BRIDCE2.. DDD m
BRIDGEI.. aa DDDDD DDD 0
CHIRALITY -+-+++--+- -- 0
BEND..... SS
5-TURN.. 3
4-TURN..
..
3-TURN.. . >33<
SUMMARY.. EEETTEEEEE EEE
EXPOSURE. 7843*7866e 799'
101 SEQUENCE. LLSPYSYSTT AVVT
SUMMARY........H~ALPHA-HELIX....E~BETA-STRAND....B~BETA-BRIDCE....G~3-HELIX....I~5-HELIX....T~3-,4-r OR 5-TURN....S-BEND....
i
, 1 1 1 , 1 1 1 1 , 1 1 1 1 ~ 1 1 1 1 ~ 1 1 1 ,1 I l l
100 20 0 300 N
...................................................... Q,
1 CPU Hllyly.......:::ym.
.......lllw........ .......lywy......:::w.............. w
N
.................................. ................................
lRBP ........:.......w w ............ymrmw...... ......UwyMYwH...-., ....... ................
....
......
.........................................................
1HIP ...........IIHH..:..HI..H......rn..~~:..:: .........::::......j: ...:.....:... 11-14)
.................................................
156B ~ l m l..............
. . I H H ...........
...........................................................
lCYT ..IIIHWI(R......................... :..........W(II,,,:..H~IWII.M
.............llmllllll
................................................................
.......:..................... :...w.......................... IIUIHHIHH..
.............................................................
....................................
1FDX ..::..........w( .....:......:......... .....::...
.....................................................................
1 FXC ..................................................................................................
L I 1 I I 1 I 1 1 ~ ~ I ~ ~ ~ 1 ~ I ~ I ~
Fig. A l . Strip maps of secondary structure and solvent exposure for 62 proteins. Top line dots: residues with more than three contacting water molecules.
Vertical bars: short, 3-helix; medium, 4-helix (a-helix);long, 5-helix. Dots above baseline: residue has antiparallel /3-bridge partnerb). Dots below baseline:
P-strand has parallel /3-bridge partner(s). The four-letter code is the Protein Data Bank data set identifier (Table AI).
DICTIONARY OF PROTEIN SECONDARY STRUCTURE 2633
I , r , , , , r , l ' ' ' " ' ' ' ~ 1 I I I I I I I '
N
100 20 0 30 0 Q,
........ ,..... ............................ ..................... w
P
21LN ",.wHyYryI ....... ......::....:.........::...M...... ......yy .....yyyy
......
1PTN
............... T
1SGA
.......................
lSB1
lEST
..................................................................
2RCT
ePAP
.................... .......................... ............................
lFAB ..................................
............ HI.:
....................................
lREI
...................................................................... ..............................
3PGfl
................... ."." .......... ....................................
lTIfl ............
..... :...#NHHrm. ......... ?..UyHIk ................:....... ...::::. y .......-.m .::::....... ........::m.
L 1 I I 1 I I I I I 1 I 1 I I 1 I I 1 I I I I I 1 1 I I
1 DFR
1 GPD
.... .....
..:::::.....Y.._l__
...........
4RDH ...........
.....yI......... .....
.... ,.Iyyy) .................. .....+::.
....
.................................................................................................
4LDH ) ....y:.-
........, ....
................................... . .
2GRs ..................................................
. . ...... . .
............................ ........................
...w...w .........E...
.................
2SOD .......*...:................... v,
............................................................................................
1 nBp( ...~..~.*)...~'..... yy....._._..I........____........I..... I.
.......................... ........................................
1 ECD .,- .... .........)." ....yyc..."'.-.
.......................................................................................................................................................
2fWB ---
I I I I 1 1 I 1 1 I I l 1 I I I l I L I I I L L I l N
a
w
Fig. A l . (continued from the previous page) wl
10
Q,
w
Q,
I r r r , I
l " " l " ' ' l " ' ' ~ ' r ' l ~
100 20 0 30 0
.......................................................................................
lLHB . w~ ............
............~ . . ~ . . ........ ~ Hmmylw....... ......vmyywl...... ywIwm.
................................................................................................ 51) -62)
lHBL
1 CRN
...................................
low0 ...................... ......:.::...
:::....::..IIIHH(II
........................ .......
2.5s I ....llHHlHll..................... ..#rm ........
x
3PT I
......................................
1CTX
1nLT ~IlW~.W#wnrrm~
1 NXB
2RDK
................... ......................... ......................... ...............
1 RHO ..................ry*l.........:..........IIy.....::I'
2PRB
L I L 1 I I L I I I ~ ~ l I l ~ ~ I ~ ~ ~ ~ ~
For reasons of space it is impossible to cite the tremendous amount of work by the crys-
tallographers on which this paper is based; references for each structure are in the Protein
Data Bank. C. Oefner provided computer graphics software. The Deutsche Forschungs-
gemeinschaft gave financial support to the project Protein Structure Theory.
References
1. Pauling, L. & Corey, R. B. (1951) Proc. Natl. Acad. Sci. USA 37,729-740.
2. Bernstein, F. C., Koetzle, T. F., Williams, G. J. B., Meyer, E. F., Brice, M. D., Rodgers,
J. R., Kennard, O., Shimanouchi, T. & Tasumi, M. (1977) J Mol. Biol. 112,535-542.
3. Kuntz, I. (1972) J. Am. Chem. SOC. 94,4009-4012.
4. Lewis, P. N., Momany, F. A. & Scheraga, H. A. (1973) Biochim. Biophys. Acta 303,
211-229.
5. Rose, G. D. & Seltzer, J. P. (1977) J. Mol. Biol. 113,153-164.
6. Chou, P. Y. & Fasman, G. D. (1977) J. Mol. Biol. 115,135-175.
7. Smith, J. A. & Pease, L. G. (1980) CRC Crit. Reu. Biochem. 8,315.
8. Richardson, J. S. (1981) Adu. Prot. Chem. 34,167-339.
9. Lifson, S. & Sander, C. (1980) J. Mol. Biol. 139,627-639.
10. Lee, B. & Richards, F. M. (1971) J . Mol. Biol. 55,379-400.
11. Levitt, M. & Greer, J. (1977) J. Mol. Biol. 114,181-239.
12. Feldmann, R. J. (1976) Atlas of Macromolecular Structure O R Microfiche, Tracor Jitco
Inc., Rockville, Md.
13. Lifson, S., Hagler, A. T. &Dauber, P. (1979) J . Am. Chem. Soc. 101,5111-5121.
14. IUPAC-IUB Commission on Biochemical Nomenclature (1970) J. Bid. Chem. 245,
6489-6497.
15. Rackovsky, S. & Scheraga, H. A. (1978) Macromolecules 11,1168-1174.
16. Shrake, A. & Rupley, J. A. (1973) J . Mol. Biol. 79,351-371.
17. Chothia, C. (1975) Nature 254,304-308.
18. Bode, W., Schwager, P. & Huber, R. (1975) Proceedings of the 10th FEBS Meeting,
Vol. 40, Desnuelle P. & Michelson A. M., Eds., North-Holland, Amsterdam, pp. 3-20.
19. Finney, J. L. (1979) in Water, A Comprehensiue Treatise, Vol. 6, Franks F., Ed., Plenum
Press, New York, p. 93.
20. Brant, D. A,, Miller, W. G, & Flory, P. J. (1967) J. Mol. Biol. 23,47-65.
21. Schellman, C. (1980) in Protein Folding, Jaenicke, R., Ed., Elsevier, Amsterdam, pp.
53-61.
22. Provencher, S. W. & Gloeckner, J. (1981) Biochemistry 20,33-37.
23. Williams, R. W. & Dunker, A. K. (1981) J . Mol. Biol. 152,783-813.
24. Hennessey, J. P. &Johnson, W. C. (1981) Biochemistry 20,1085-1094.
25. Deisenhofer, J. & Steigemann W. (1975) A C ~Crystallogr.,
Q Sect. B 31,23%250.
26. Wuethrich, K. & Wagner, G. (1979) J . Mol. Biol. 130,l-18.
27. Timkovich, R. & Dickerson, R. E. (1976) J. Bid. Chem. 251,4033-4046.
28. Schulz, G. E., Elzinga, M., Marx, F. & Schirmer, R. H. (1974) Nature 250,120-123.
29. Wilson, I. A., Skehel, J. J. & Wiley, D. C. (1981) Nature 289,366-368.
30. Diamond, R. (1966) A C ~ Crystallogr.,
Q Sect. A 21,253-266.
31. Diamond, R. (1971) Acta Crystallogr., Sect. A 27,436-452.
32. Hendrickson, W. A. & Konnert, 3. H. (1980) Computing in Crystallography, Diamond,
R., Ramaseshan, S. & Venkatesan, K., Eds., Indian Academy of Sciences, Bangalore, pp.
13.01-13.23.
33. Dodson, E. J., Isaacs, N. W. & Rollett, J. S. (1976) A C ~ Q Crystallogr., Sect. A 32,
311-315.
34. Jack, A. & Levitt, M. (1978) Acta Crystallogr., Sect. A 34,931-935.
35. Levitt, M. & Lifson, S. (1969) J . Mol. Biol. 46,269-279.
36. Agarwal, R. C. (1978) A C ~Crystallogr.,
Q Sect. A 34,791-809.
37. Chambers, J. L. & Stroud, R. M. (1977) A C ~Crystallogr.,
Q Sect. B 33,1824-1837.
38. Sussman, J. L., Holbrook, S. R., Church, G . M. & Kim, S.-H. (1977) A C ~ Crystallogr.,
Q
Sect. A 33,800-804.