Protein Structure

CY60004 Biophysical Chemistry
Structures of biological macromolecules (proteins and polynucleic acids);
Molecular Mechanics: simulating macromolecular structure;
Spectroscopic (NMR, Fluorescence and Circular Dichroism) methods to study

structure of proteins and DNA; solving macromolecular structures by X-ray
diffraction; structural transitions in polypeptides, proteins and polynucleic acids.
Interactions between macromolecules: thermodynamics of protein

folding/stability by fluorescence and circular dichroism techniques.
Binding of small ligands by biological macromolecules: kinetics and energetics

of protein-drug, protein-surfactant and DNA-drug interactions by fluorescence,
CD and calorimetric methods.
Books :
Biophysical Chemistry, Parts I, II and III, Cantor and Schimmel
Principles of Physical Biochemistry, van Holde, Johnson and Ho
Amino acids Protein Structure
Sidechain (R)
Peptide = a short chain of amino acids
20 common amino acids
O NH2 R side chain Polypeptide = a longer chain of amino acids
C Three letter code Protein = a polypeptide that occurs in nature and folds
CH2 One letter code into a defined three-dimensional structure
CH2
HO H H H H O
C C N H H O
H + N C
O
carboxyl H amino
H N
C C N C C C O-
group group
H H O H O
H H R O R
R
+
C N C
Building blocks H N C C O- The beginning of the protein is known as the
H H
R R amino-terminus and the end of the protein is
known as the carboxyl-terminus.
Peptide bond
Amino Acid Characteristics

Hydrophobicity
Size
Charge
Secondary structure preference
Aromaticity
Special characteristics:
bridge forming by cysteines,

rigidity of prolines,
titrating at physiological pH of histidine,
flexibility of glycines
Nomenclature
Glx can be Glu or Gln
Asx can be Asp or Asn
Polypeptide chains are always described from the N-terminus to the C-
terminus
ISOELECTRIC POINT (pI)

Nomenclature Definition: the pH at which a molecule carries pH
Nonhydrogen atoms of the amino acid side chain are named in sequence no net electric charge 12
with the Greek alphabet

: using the Henderson-Hasselbalch pK2 10
equation H3+NCH2COO-
H2NCH2COO- + H+
pI = (pK1 + pK2) 8
: for amino acids

pI 6
pK1 = pK -COOH
pK2 = pK -NH3 4
Isoelectric Focusing
: separation of proteins according to charge pK1 2
H3+NCH2COOH
: pH gradient set up by ampholytes H3+NCH2COO- + H+
: electric field applied such that one pole is 0
positively charged and one pole negatively 0.5 1.0 1.5 2.0
0
charged
: proteins migrate in the pH gradient until
H+ ions dissociated/molecule
their net charge = 0 (isoelectric point)
+ Titration of glycine
pI
-
Ionization of Amino Acids Acid-base properties of amino acids
The dissociation of first proton The dissociation of the second

Equilibrium dissociation constant from the -carboxyl group is proton from the -amino group
Gly+ + H2O Gly0 + H3O+ Gly0 + H2O Gly- + H3O+
Henderson-Hasselbach Equation
[Gly0][H3O+] [Gly-][H3O+]
K1= [Gly+]
K2= [Gly0]
The pKas of these two groups are far enough apart that they can be
approximated by Henderson-Hasselbalch
[Gly0] [Gly-]
pH = pK1 + log pH = pK2 + log
[Gly+] [Gly0]
Titration curve of glycine Isoelectric Point of Amino Acids
COO-
+ Neutral
H3N C H
form
H
Titration of Gly
COOH COO- COO-
+ +
H3N C H H3N C H H2N C H
H
pK1 pK2 H
H
Gly+ Gly0 Gly-
pH 2.3 pH 9.6
From the pK values we can calculate the pI (isoelectric point) where the
amino acid is neutral.
pI average of (pK below neutral+ pK above neutral)
So, for Gly,

pI = (pK1 + pK2)/2
= (2.3 + 9.6)/2 6
Lysine
The Acid-Base Chemistry of the Amino Acids Lys
Glycine
pI = 2.35 + 9.78 = 6.1 Gly
2
Glutamic acid
Glu
pI = 2.10 + 4.07 = ~3.1 pI = 9.18 + 10.79 = ~10.0

2 2
Charged polar (acidic) side chains
Henderson-Hasselbach Equation
9.5 COOH 2.1 COOH 2.0
9.8
+ +
H3N C H H3N C H
CH2 CH2
Buffering zone = pK +/- 1 pH unit
CH2 C
At pH = pK, [HA] = [A-]
C O OH 3.9
At pH below pK, [HA] > [A-]
At pH above pK, [HA] < [A-]
O OH 4.1 Aspartic acid
Glutamic acid Asp
Glu D
E
Side chain
The 5 Complex Amino Acids are:
R His has 3 ionizing groups, alpha-carboxylic acid (pK 1.8),
Glutamic acid (Glu) H O
+ side-chain amino (pK 6.0) and alpha-amino (pK 9.2):
Aspartic acid (Asp) H N C C
His+2 goes to His+1 via pK 1.8;
Lysine (Lys) H OH
H His+1 goes to His0 via pK 6.0;
Arginine (Arg)
Amino Carboxyl His0 goes to His-1 via pK 9.2.
Histidine (His). group group
Therefore, pI = (6.0 + 9.2)/2 = 7.6.
Each of these 5 amino acids has 3 ionizable groups 3 pKs.
pI = average of (pK-below) + (pK-above)

where pK-below is the pK between AA+1 and AA0
and pK-above is the pK between AA0 and AA-1.
Histidine Titration Histidine Titration

Free Amino acid
COO - Amino acid on
3
H C CH 2
N
surface of enzyme
N
NH 2
H
Mols OH- added per mol histidine
pKa=9.3
-1
-
COO
N O C
2 H C CH 2
H C C
N
N H2
NH 3 +
N
NH
H
0 H 0
Mols OH- added
pKa=6.0 pKa=6.0
-
COO H
N+ O C H
+
1 H C CH 2
H C C
N
+ N H2
NH 3
N
NH
H
+1 H +1
pKa=1.8
COOH H
+
N
H C CH 2
0 0
N
NH 3 +
0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14
H
pH +2 pH
Histidine Titration
Amino acid in a
pocket (active site)
1. Carboxylic acids ionize at acidic pH; ie. Carboxylic acids

give up their protons at acid pHs.
O C
N + 2. Amino groups ionize at basic pH; ie. Amines give up their
pKa<6.0
H C C
H2 + protons at basic or alkaline pHs.
+
N
NH
0
3. Carboxylic acids near an amino group have a more acidic
H
pKa=6.0
Mols OH- added
+ pK than isolated carboxylic acids.

pKa>6.0
O C
N+
H
+ 4. Amino groups near a carboxylic acid have a more acidic pK
H C C
H2 +
NH
N - than isolated amines.
H +1 -
-
0
-
0 2 4 6 8 10 12 14
pH -
-
For Ala-Lys, there are 3 ionizable groups: For the tetrapeptide: Glu-Ala-Lys-Tyr
1) alpha-amino group contributed by Ala - assign pK 9.9. Write out the structure.
2) alpha-carboxylate group from Lys - assign pK 2.2. With the assigned pK values, determine the net charge at pH 1, 3, 5, 7, 10, 11.
3) side chain amino group from Lys - assign pK 10.8. Calculate the pI of this tetrapeptide.
pH 1 pH 5 pH 7 pH 10 pH 12 The pK values of the amino acids are:
Ala- 2.4, 9.9
amino
Glu - 2.1, 4.1, 9.5
-carboxylate Lys - 2.2, 9.2, 10.8
Tyr - 2.2, 9.7
Side chain amino
Net Charge
Peptide Bond - Structure

1) DOUBLE BOND CHARACTER
O O-
.. +
C N C N
: fractional charges
: no rotation between C and N
: HIGH DEGREE OF ROTATION RESTRICTION
2) COPLANAR NATURE
: C-N can rotate through angle R1
: C-C can rotate through angle C
: R groupings can rotate through a H
characteristic angle rotation
C N rotation
: as it rotates, gives 3D structure to
the protein O
: degree of rotation will depend upon R group C
1) BULK R2
eg. free rotation with glycine (R = H) held in
position
2) CHARGED STATE because
two positives repel one another of double
bond
3) WATER REACTION
polarity of R groupings
polar - rotate out to H bond with water
nonpolar - on inside of molecule
Torsion angles phi () torsion angle around N-CA bond
psi () torsion angle around CA-C bond Residue Globular protein Membrane protein
omega () peptide bond; trans ~180 degrees, cis ~0
O R2 Non-polar In interior Surface lipid
VLIMFYW Hydrophobic core anchor
H2N C CH OH Polar charged Surface Hydrophilic core

CH

N
C RKDEH Catalytic sites

Polar neutral H bond network Inside surface
R1 H O
STNQYW part of channel
Steric hindrance:
Most Pro
Least - Gly
Importance of Protein Structure

Amino Acid Partitioning Into Membrane Regions
Sequestering hydrophobic residues in the protein core protects proteins
proteins
from hydrophobic agglutination.
Region Amino Acids
Bulk water Arg, Asn, Asp, Gln, Glu, His, Lys, Pro
Hemoglobin A: Val-His-Leu-Thr-Pro-Glu-Glu-Lys-
Hemoglobin S: Val-His-Leu-Thr-Pro-Val-Glu-Lys-
Bulk water +
Ala, Cys, Gly, Ser, Thr
interfacial
Interfacial Tyr
Hydrophobic Ile, Leu, Met, Phe, Trp, Val
http://chemed.chem.purdue.edu/genchem/topicreview/bp/1biochem/amino2.html
sticky patch causes hemoglobin S to agglutinate (stick together) and form
fibers which deform the red blood cell
Classes of Proteins
Fibrous Proteins
Based on structure and solubility,
Fibrous proteins contain
proteins can be grouped into three large classes: polypeptide chains
organized parallel along a
single axis, producing
long fibers or large sheets.
They are mechanically
strong, play structural
roles in nature;
Difficult to dissolve in
water;
-Keratins and Collagen
are examples of fibrous
proteins
Fibrous Globular Membrane
-keratins are found in hair, fingernails, claws, horns and beaks;
Globular Proteins
Sequence consists of long alpha helical rod segments capped with non-helical
N- and C-termini Globular proteins are classified according to the type and arrangement
of secondary structure
-keratins are found in silk and consist of gly-ala repeat sequences; Antiparallel alpha helix proteins
Ala is small and can be packed within the sheets Parallel or mixed beta sheet proteins
Antiparallel beta sheet proteins
Metal binding sites examples

Protein Architecture
Primary structure (1) : the amino acid sequence.

Secondary structure (2) : helices, sheets and turns.
Tertiary structure (3) : side chain packing in the 3-D structure.
Quaternary structure (4) : association of subunits.
1) M is Fe (rubredoxin) or Zn (aspartate transcarbamoylase)

2) Carboxypeptidase A
3) Catalytic ion in liver alcohol dehydrogenase
4) Azurin and plastocyanin
Protein Structure
Primary Structure Secondary Structure

Physical properties of protein that influence stability &
therefore, determine its fold:
Two major types:
Rigidity of backbone Alpha Helical Regions
Beta Sheet Regions
Amino acid interaction with water
Hydropathy index for side chains Other classification schemes:
Turns
Interactions among amino acids Transmembrane regions
Electrostatic interactions Internal regions
Hydrogen bonds External regions
S-S bonds Antigenic regions
Volume constraints
Features
Secondary Structure Conformations -helix is one of two secondary structures
(the other being the -sheet) predicted and
discovered by Linus Pauling in 1951.
alpha helix -57 -47
alpha-L 57 47 It is a right-handed helix with the following
3-10 helix -49 -26 spatial parameters:
helix -57 -80 = -57
type II turn -79 150 = -47
-sheet parallel -119 113 n = 3.6 (number of residues per turn)
-sheet antiparallel -139 135
pitch 0.54nm (or 5.4)
Residue conformational preferences: -sheet: parallel ( =-119 and =113) or

Glu, Ala, Leu, Met, Gln, Lys, Arg - helix anti-parallel pleated sheet structures
Val, Ile, Tyr, Cys, Trp, Phe, Thr - strand
Gly, Asn, Pro, Ser, Asp - turn
other regular structures
a) 310 helix - i, i+3 hydrogen bonding pattern
b) -helix - i, i+5 hydrogen bonding pattern
c) turns and loops
antiparallel beta-sheet parallel beta-sheet

Properties of the alpha helix
60
All the common Hydrogen bonds
secondary between C=O of
structure residue n, and
conformations fit NH of residue
in the sterically n+4
allowed regions of 3.6 residues/turn
phi-psi space
1.5 /residue rise
100/residue turn
alpha-helix
Properties of -helices
4 40+ residues in length

Often amphipathic or dual-natured
Half hydrophobic and half hydrophilic
Mostly when surface-exposed
If we examine many -helices,
we find trends
Helix formers: Ala, Glu, Leu, Met
Helix breakers: Pro, Gly, Tyr, Ser
the alpha-helix: repeating i, i+4 H- Helix nomenclature: -helix
bonds example
11 1 Hydrogen bond between
C=O H-N
10
12
right-handed helical (residue i) (residue i+4)
9
region of phi-psi space 15 helix
8
2 Repeating unit:
7 5 turns
18 residues per repeat
5
6
185 helix
4 hydrogen bond 3 Loop formed between C=O H-N
13 atoms 1 5
1
3.6 residues per turn
3 Which of residues 1-12 are in the nomenclature used for
2 helix? Does this coincide with the 3.613 helix 310 helix
residues in the helical region of phi-
psi space? helices extend with approximately 1.5 Angstrom per
residue, 5.4 Angstrom per turn.
Helical Wheel
310 helix i, i+3 -helix i, i+4 -helix i, i+5
Helical Wheel
Beta Strands
For each residue the
rotation is 1000
lys his gly val thr val leu thr ala leu gly ala ile leu lys lys
phi(deg) psi(deg) omega (deg)

------------------------------------------------------------------
strand -120 120 180
-----------------------------------------------------------------
beta strands/sheets
beta-strand region of
57 phi-psi space
56
54
53
52
51
50
parallel or anti-parallel sheet?
49
Secondary Structure: Beta Sheets

Turns and Loops
BETA PLEATED SHEET: a result of H-bonding between polypeptide chains Secondary structure elements are connected by regions of
turns and loops
Four-stranded beta Turns short regions
sheet that contains of non-, non-
three antiparallel
conformation
and one parallel
strand.
Hydrogen bonds are

indicated with red
lines (antiparallel
strands) and blue
lines (parallel
strands) Loops larger stretches with no secondary structure. Often
disordered.
Random coil
Sequences vary much more than secondary structure
regions
-turns in proteins:
Secondary Structure: Turns
reversing the chain direction
turn
residues -turn consists of two
residues, where there is a
hydrogen bond between the
carbonyl of the residue
preceding the turn and the Gamma-turn
amide nitrogen following the It involves three amino acid residues and the intraturn hydrogen bond
turn. There are a number of for a gamma-turn is formed between the backbone CO(i) and the
ways to configure the backbone backbone NH(i+2).
to achieve this Beta-turn
A beta-turn involves four amino acid residues and may or may not be
stabilized by the intraturn hydrogen bond between the backbone CO(i)
direction of and the backbone NH(i+3).
polypeptide chain Alpha-turn
An alpha-turn involves five amino acid residues where the distance
between the Calpha(i) and the Calpha(i+4) is less than 7 and the
pentapeptide chain is not in a helical conformation.
Pi-turn
It is the largest tight turn which involves six amino acid residues.
Tertiary Structure: Aggregation of individual protein. Tertiary Structure of Protein
1. Hydrophobic attraction: the close association

attraction of hydrocarbon side-chains.
2. Ionic bond: between positively charged groups and
negatively charged groups.
3. Hydrogen bonds
4. Disulfide bonds
A protein has size and shape as well as unique arrangement

through hydrogen, ionic, hydrophobic and disulfide bonds.
Side chain conformations--canonical staggered forms

Side chain O Newman projections for 1 of glutamate:
conformation O C CO CO CO
3 H C C H H H
side chains differ in their number CH2
of degrees of conformational 2 N H N H N H
freedom CH2 H H C
1
but side chains of very different 1 = 180 1 = +60 1 = -60
C CH NH
size can have the same number of
t g+ g
O
angles.
glutamate t=trans, g=gauche
name of conformation
Side chain angles are defined moving outward from the backbone, starting
with the N atom: so the 1 angle is NCCC, the 2 angle is CCC C ...
IUPAC nomenclature:
http://www.chem.qmw.ac.uk/iupac/misc/biop.html
Rotamers Anfinsens experiments, late 1950s through 1960s
a particular combination of side chain torsional angles 1, 2, etc. for a

particular residue is known as a rotamer.
for example, for aspartate, if one considers only the canonical staggered
forms, there are nine (32) possible rotamers: g+g-, g+g+, g-g-, g-g+, tg+, g+t, tg-,
g-t, tt
not all rotamers are equally likely.
for example, valine prefers distribution of
its t rotamer valine rotamers
in protein structures
CO
H C1
N H
Ribonuclease, an enzyme involved in cleavage
C2
of nucleic acids. Structure has a combination of
and segments and four disulfide bridges
1=180, trans or t 1=0 180 360
What are Disulfide Bridges?
Oxidation
Active,
native
Cys-SH + Cys-SH Cys-S S-Cys structure
Reduction
BME is reducing agent add -mercaptoethanol (BME)

Urea unfolds proteins HS-CH2-CH2-OH
add urea, H2N-C-NH2
SH O
SH
Cys110 Cys110 SH
SH SH
Cys58 Cys58 SH SH SH
Denatured, inactive, random coil, many conformations
SH SH SH SH
SH SH
SH SH SH SH
SH SH SH SH SH SH
Many Conformations Many Conformations
Remove BME Remove BME

Remove urea
S S Remove
urea
- Native structure S S
- fully active S
- 4 disulfide bond S S S
One One
correct Conformation Conformation
Mixture of 105
different conformations, 1% active
Sequence specifies structure
Anfinsens experiments The native structure of a protein is the form we

Denatured ribonuclease find when we isolate that protein in an active state
Spontaneously regained enzymatic from a natural source.
activity
Evidence that it re-folded to native If the protein loses that structure, by unfolding, or
conformation unwinding, it loses activity.
Therefore, native = folded
denatured = unfolded
The essential The native structure is necessary to create the
structure binding pockets that make up the active site of
information is stored
in the primary
an enzyme.
sequence of amino
acids
Structure Stabilizing Interactions
Structure Stabilizing Interactions
Noncovalent
Higher interaction energy than a Van der Waals forces (transient, weak electrical
simple van der Waals interaction attraction of one atom for another)
H-bond Hydrophobic (clustering of nonpolar groups)
Hydrogen bonding
- + -
DH A
Covalent
Disulfide bonds
helix sheet
The hydrophobic effect

H H Quantification of contribution to the hydrophobic effect
H
H H (Hydropathy)
H H
H H H
H H H
H
H Partitioning of the molecules between water and an organic solvent such as octanol
H H H -
H + H H
Amino acid placed in two-solvent system
H H H H aq
H H H H
+ Hydropathy represented by partition coefficient P, P=
H
H
- H
org
H
H H H H H
Hydropathy represented by log( P )
H
H
Non-polar solute Polar solute
The dissolution of a non-polar solute in water is unfavourable

for polar solutes there is favourable hydrogen bonding
Structure Stabilizing Interactions Solving Protein Structures

Only 2 kinds of techniques allow one to get atomic resolution
pictures of macromolecules
X-ray Crystallography (first applied in 1961 - Kendrew &
Asn Perutz)
NMR Spectroscopy (first applied in 1983 - Ernst & Wuthrich)
H C H
H
N C
H
H H H H H +
O H Structure Function
O
Lys C C C C N H O C C Asp
Structure Mechanism
H H H H H H +
H
Structure Origins/Evolution
O
H C H Structure-based Drug Design
Ser Solving the Protein Folding Problem
QHTAWCLTSEQHTAAVIWDCETPGKQNGAYQEDC
HHHHHHCCEEEEEEEEEEECCHHHHHHHCCCCCC
Hydrophobicity scales
Hydropathy plots
An hydropathy plot is a graphical display of the local hydrophobicity of
amino acid side chains in a protein. A positive value indicates a
hydrophobic residue and a
A positive value indicates local hydrophobicity and a negative value negative value a hydrophilic residue
suggests a water-exposed region on the face of a protein.
Hydropathy plots are generally most useful in predicting transmembrane

segments and N-terminal secretion signal sequences.
Hydropathy index
Sliding Window Approach Hydropathy plots

Hydropathy plots
The window size can be changed.
A small window produces
"noisier" plots that more
Calculate property for first accurately reflect highly local
sub-sequence hydrophobicity.
I L I K E I R A window of 9 or 11 is generally
4.50+3.80+4.50-3.90 optimal for recognizing the long
-3.50+4.50-4.50 = 5.40 hydrophobic stretches that typify
transmembrane stretches.
= 5.4/7=0.77 In an -helix the rotation is 100 degrees per amino acid
The rise per amino acid is 1.5
Move to the next position
To span a membrane of 30 approx.
30/1.5 = 20 amino acids are needed
Transmembrane Helix Predictions

Not many structures known
of transmembrane helix
proteins Hydropathy plot for rhodopsin
Hydropathy analysis can be

used to locate possible
transmembrane segments
The main signal is a stretch

of hydrophobic and helix-
loving amino acids

Protein Structure

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Protein Structure

Uploaded by

Copyright:

Available Formats

CY60004 Biophysical Chemistry

Structures of biological macromolecules (proteins and polynucleic acids);

Molecular Mechanics: simulating macromolecular structure;

Spectroscopic (NMR, Fluorescence and Circular Dichroism) methods to study

Interactions between macromolecules: thermodynamics of protein

Binding of small ligands by biological macromolecules: kinetics and energetics

Amino acids Protein Structure

Amino Acid Characteristics

bridge forming by cysteines,

ISOELECTRIC POINT (pI)

with the Greek alphabet

: for amino acids

Ionization of Amino Acids Acid-base properties of amino acids

The dissociation of first proton The dissociation of the second

Gly+ + H2O Gly0 + H3O+ Gly0 + H2O Gly- + H3O+

pI average of (pK below neutral+ pK above neutral)

So, for Gly,

pI = 2.10 + 4.07 = ~3.1 pI = 9.18 + 10.79 = ~10.0

pI = average of (pK-below) + (pK-above)

Histidine Titration Histidine Titration

1. Carboxylic acids ionize at acidic pH; ie. Carboxylic acids

+ pK than isolated carboxylic acids.

Peptide Bond - Structure

H2N C CH OH Polar charged Surface Hydrophilic core

Importance of Protein Structure

Hydrophobic Ile, Leu, Met, Phe, Trp, Val

Metal binding sites examples

Primary structure (1) : the amino acid sequence.

1) M is Fe (rubredoxin) or Zn (aspartate transcarbamoylase)

Primary Structure Secondary Structure

Residue conformational preferences: -sheet: parallel ( =-119 and =113) or

antiparallel beta-sheet parallel beta-sheet

4 40+ residues in length

310 helix i, i+3 -helix i, i+4 -helix i, i+5

phi(deg) psi(deg) omega (deg)

Secondary Structure: Beta Sheets

Hydrogen bonds are

1. Hydrophobic attraction: the close association

A protein has size and shape as well as unique arrangement

Side chain conformations--canonical staggered forms

Rotamers Anfinsens experiments, late 1950s through 1960s

a particular combination of side chain torsional angles 1, 2, etc. for a

BME is reducing agent add -mercaptoethanol (BME)

Remove BME Remove BME

Sequence specifies structure

Anfinsens experiments The native structure of a protein is the form we

The hydrophobic effect

Non-polar solute Polar solute

The dissolution of a non-polar solute in water is unfavourable

Structure Stabilizing Interactions Solving Protein Structures

Hydropathy plots are generally most useful in predicting transmembrane

Sliding Window Approach Hydropathy plots

Transmembrane Helix Predictions

Hydropathy analysis can be

The main signal is a stretch

You might also like