Professional Documents
Culture Documents
Contents
[hide]
1 Primary Structure
o 1.1 Forces that stabilize Protein Structure
o
2 Secondary Structure
o
2.1.1 Structure
2.1.4 SUMMARY
3 Tertiary Structure
o
3.1 Structure
3.2 Domains
3.3 Mutations
3.4 Folding
4 Quaternary Structure
o
4.2 Dimers
4.3 Trimer
4.4 Tetramer
4.7 Analogy
Primary Structure
The primary structure of a protein is a linear polymer with a series of amino acids. These amino
acids are connected by C-N bonds, also known as peptide bonds. The formation of peptide bonds
produce water molecules as a by-product when an amino terminal residue (N-terminal) loses an
oxygen from the alpha-carboxyl group while the other amino acid loses two of its hydrogens
from its alpha-amino group. Thus, polypeptide, or polypeptide chain, is a term that describes the
multiple connected peptide bonds between numerous amino acids. Each amino acid in a
polypeptide chain is a unit, commonly known as a residue. These chains have a planar
backbone, as the peptide bonds have double bond characteristics due to the existence of
resonance between the carbonyl carbon and the nitrogen where the peptide bonds form. The
primary structure of each protein has been precisely determined by the specific genes. The C-N
bond in an amino acids chain has the character of a double bond. This bond has a short length
and stable. It cannot be rotated. This double-bond character can be explained structurally, in that
the R groups in amino acid chains avoid steric clash.
Amino acids are linked by peptide bonds to form polypeptide chain; each amino acid unit is
known as a residue; a polypeptide chain constructed by the same unit is known as the main chain
or backbone and a changing R group, side chains.
recorded by amino acid analyzer. The amino acid analyzer establishes the composition
of a polypeptide by giving a chromatogram, which records the peaks of each amino
acid presents in the sequence. However, the amino acid analyzer can only give the
composition of a polypeptide, not the order in which the amino acids are bound to one
another.
To determine the amino acid sequence, it usually starts from the determination of the
amino terminal of the polypeptide. The procedure is known as Edman degradation,
and the reagent employed is phenyl isothiocyanate.
Phenyl isothiocyanate
In Edman degradation, the terminal amino group adds to the isothiocyanate reagent to
produce a thiourea derivative. Treating with mild acid, the tagged amino acid is turned
into a phenylthiohydantoin, and the remainder of polypeptide is unchanged. Since the
phenylthiohydantoins of all amino acid are known, the amino terminal of the original
polypeptide can be identified easily. However, Edman degradation can only be used to
identify the amino end of the polypeptides; therefore, for polypeptides that are made
up by hundreds of amino acids, it is not a practical method in general. In addition,
multiple degradation rounds will build up impurities which will seriously affect the
yield of peptide. High yield means not completely quantitative, and with each step of
degradation, incompletely reacted peptide will mix with the new peptide, resulting in
a intractable mixture.
Secondary Structure
Secondary structures of proteins are typically very regular in their conformation. They
are the spacial arrangements of primary structures. Alpha Helices and Beta Pleated
Sheets are two types of regular structures. An interesting bit of information is that
certain amino acids making up the polypeptide will actually prefer certain folding
structures. The Alpha Helix seems to be the default but due to interactions such as
sterics, certain amino acids will prefer to fold into Beta pleated sheets and so on. For
example, amino acids such as Valine, Isoleucine, and Threonine all have branching at
the beta carbon, this will cause steric clashes in an alpha helix arrangement. Glycine is
the smallest amino acid and can fit into all structures so it does not favor the helix
formation in particular. Therefore, these amino acids are mostly found where their
side chains can fit nicely into the beta configuration.
Alpha Helix
Structure
The general physical properties of an alpha helix are:
Hydrogen bonding between the 1st carbonyl to the hydrogen on the 5th amino
Ramachandran diagram
Alpha helix falls within quadrant 1 (left-handed helix) and 3 (right-handed
helix) in the Ramachandran diagram
Fibrous Proteins
I. COILED-COIL (-keratin)
An alpha coiled coil consists of two or more alpha helices intertwined, creating a
stable structure. This structure provides support to tissues and cell, contributing to the
cell cytoskeleton and muscle proteins such as myosin and tropomyosin. Alpha keratin
consists of heptad repeats (imperfect repeats of 7 amino acid sequences). This
facilitates bonding between the two or more helices.
II. COLLAGEN
Collagen is another type of fibrous protein that consists of three helical polypeptide
chains. It is the most abundant protein found in mammals, making up a large
component of skin, bone, tendon, cartilage, and teeth. Wrinkles are also caused by the
degradations of this protein. In the structure of collagen, every third residue in the
polypeptide is glycine because it is the only residue that is small enough to fit in the
interior position of the superhelical cable. Unlike normal alpha helices, each collagen
helix is stabilized by steric repulsion of the pyrrolidine rings of the praline and
hydroxyproline residues. However, the three strands intertwined are stabilized by
hydrogen bonding.
Alpha Tertiary
I. MOTIFS
Motifs are simple combinations of the secondary structure such as the helix-turnhelix, which consist of two helices separated by a turn. The helix-turn-helix motif are
usually found in DNA-binding proteins.
II. DOMAINS (GLOBULAR)
Domains, or compact globulars, consist of multiple motifs.They are polypeptide
chains folded into two or more compact regions connected by turns or loops. Their
structure is spherical, which is beneficial for the protein because it conserves space.
Generally, inside the globular protein consist of hydrophobic amino acids such as
leucine, valine, methionine, and phenylalanine. The outside consists of amino acids
with hydrophilic tendencies such as aspartate, glutamate, lysine, and arginine. An
example of a globular protein is myoglobin, which is the oxygen carrier in muscle. It
is an extremely compact molecule made of only alpha helices (70%) except for loops
and turns (30%).
SUMMARY
The alpha-helix consists of a single polypeptide chain in which the amino group (NH) hydrogen bonds to a carboxyl group (C=O) 4 residues away. The alpha - helix is a
rod-like structure. The tightly coiled backbone of the chain forms the inner part of the
rod and the side chains extend outward in a helical array. This results in a clockwise
coiled structure, which is known as a "right handed" screw sense. This folding pattern,
along with the beta-pleated sheets were actually proposed by Linus Pauling and
Robert Corey half a decade before people could actually see it. Most of the alpha
strands are located in the lower left corner or upper right corner of the Ramachandran
diagram . Essentially, most of the alpha helices are found in the right-hand helices
area. An alpha helix is especially suited for cross-membrane proteins because all of
the amino hydrogen and carbonyl oxygen atoms of the peptide backbone can interact
to form intrachain hydrogen bonds while its aliphatic side chains can stabilize in
hydrophobic environment of cell membrane.
Alanine, leucine and glutamic acid (existed as glutamate as physiological pH) are the
most common residues present in alpha-helices.
The alpha-helix content of protein ranges widely, from none to almost 100%.
In general, the alpha helix is the "normal" shape of a polypeptide chain; however,
features of certain amino acids disrupt alpha helix formation and instead favor beta
strand formation. Amino acids with branching at the beta carbon (i.e. valine,
threonine, and isoleucine) are problematic because they crowd the peptide backbone.
H-bond accepting/donating groups attached to the beta carbon (i.e. serine, asparagine,
and aspartate) can bond with backbone amine and carboxyl groups, again interfering
with alpha helix formation.
While individual amino acids may favor one form or another, predicting the 2
structure of even a short (<7 amino acid) peptide strand is only 60-70% accurate. Such
variability suggests other factors, like tertiary interactions with amino acids further
down the chain, influence the folding into its observed 3 structure.
characterized as the beta pleated sheet. The beta strands can be arranged in a parallel,
anti-parallel, or mixed (parallel and anti-parallel) manner.
of beta sheets in a protein is 6 amino acid residues. The actual length ranges from 2 to
22 residues.
defined. Loops and turns generally lie on the surfaces of proteins so they often
participate in interactions between proteins and other molecules. In a loop, there are
no regular structures as can be found in helices or beta strands.
Two hypotheses have been proposed for the role of turns in protein folding. In one
view, turns play a critical role in folding by bringing together interactions between
regular secondary structure elements. This view is supported by mutagenesis studies
indicating a critical role for particular residues in the turns of some proteins. Also,
nonnative isomers of X-Proline peptide bonds in turns can completely block the
conformational folding of some proteins. In the opposing view, turns play a passive
role in folding. This view is supported by the poor amino-acid conservation observed
in most turns. Also, non-native isomers of many X-Pro peptide bonds in turns have
little or no effect on folding.
Fibrous proteins
Fibrous protein such as alpha-keratin and collagen consist of two right handed alpha
helix intertwined to form a type of left handed super-helix called an alpha coiled coil.
The two helices in this type of protein usually cross-linked by weak interaction such
as Van der Waals forces force and ionic interaction. The side chain interaction can be
repeat every seven residues, forming heptad repeats. Another form of fibrous protein,
that of collagen, exists as three helical polypeptide chains. These chains are relatively
long, ~1000 residues, and because of overcrowding, glycine appears once every three
residues. While the helix is stabilized by the steric repulsions, the three strands are
stabilized by hydrogen bonding. These protein usually serve structural roles in
organisms, alpha-keratin is commonly found in the cytoskeleton of a cell, as well as
certain muscle proteins. Collagen is often found in teeth, skin, and tendons.
Torsion Angles
Tosion angles are also called dihedral angles. The torsion angle is the measure in
degrees in bonds between atoms. Folding of proteins are influenced by the degree of
rotation amino bonds can hold. There are two different types of torsion angles existing
in polypeptide bonds. Phi, is the angle between the -carbon and the nitrogen atom
of a peptide bond. The other bond is called phi, which is the angle between the carbon and the carbonyl group. To measure , one must look from the nitrogen atom
towards the -carbon to measure if the angle is negative or positive. The angle is
negative if the -carbon rotates counterclockwise and vice versa. Furthermore, to
measure , one must look from the nitrogen atom towards the carbonyl group.
Likewise, the angle is negative if the carbonyl group rotates counterclockwise and
vice versa.
Ramachandran Diagram
The Ramachandran Diagram, created by Gopalasamudram Ramachandran, helps to
determine if amino acids will form alpha helices, beta strands, loops or turns. The
Ramachandran Diagram is separated into four quadrants, with angle as the x axis
and angle as the y axis. The combinations of torsion angles will put the amino acids
in specific quadrants, which determine whether is it will form an alpha helix, beta
strand, loop, or turn. Those that fall in quadrants 1 and 3 a few times in a row form
alpha helices, and those that repeat in quadrant 2 form beta strands. Quadrant 4 is
generally disfavored because of steric hinderance. Also, it is mostly impossible
because the different torsion angles combinations in quadrant 4 can't exist because
they cause collisions between the atoms of the amino acids. If the amino acids land in
the different quadrants, with no repeats, then they become loops or turns.
Furthermore, the principle of steric exclusion states that two atoms cannot occupy the
same place simultaneously.
Tertiary Structure
The tertiary structure of a protein is the three-dimensional structure of the protein.
This three-dimensional structure is mostly determined by the amino acid sequence,
which is denoted by the primary structure of the protein, however the amino acid
sequence cannot entirely predict on how the three-dimensional structure is formed.
Another contributing factor to the final shape of the tertiary structure is based on the
Structure
Domains
Some polypeptide chains fold into several compact regions. These regions in a
polypeptide chain are called domains and generally range from 30 to 400 amino acids.
On average, domains contain roughly 100 amino acids. Each domain forms its own
tertiary structure which contributes to the overall tertiary structure of the protein.
These domains are independently stable. Stabilization is caused by metal ions or
disulfide bridges that cause the folding of polypeptide chains. Different proteins may
have the same domains even if the overall tertiary structure is different.
There are four types of domains:
Mutations
In order for a protein to be functional (except in food), it must have an intact tertiary
structure. If a tertiary structure of a protein is disrupted, it is said to be denatured.
Once a protein is denatured, it will not be able to perform its intended or original
function. A primary cause for an alteration of the tertiary structure is a mutation in the
gene encoding a protein. The mutation in the gene can cause a domino effect that will
lead to the degradation of the tertiary structure. Degradation can cause several
diseases, one of which is called cystic fibrosis. Cystic fibrosis is brought about by a
mutation of a genes called cystic fibrosis transmembrane conductance regulator
(CFTR). This disease causes the exocrine glands to overproduce mucus. Most
commonly, CF patients suffer from lung failure by the age of early 20-30. Diabetes
insipidus, familial, hypercholesterolemia, and Osteogenesis imperfecta are also
diseases that originate from degraded proteins. A mutation in the tertiary structure
itself, rather than from a mutation in the nucleotide sequence can also lead to diseases.
Such mutated proteins can also aggregate and become insoluble deposits called
amyloids, and therefore lose the ability to function. A common mutation is when a
hydrophobic R group folds in, rather than out, in a hydrophobic environment. The
inherited form of Alzheimer's disease is one disease that is caused by mutated tertiary
structure. Another disease is "mad cow" disease, which is caused due to a-helix
(which are soluble) mutating into b-sheets (which are insoluble and cause amyloid
deposits). [4]
Folding
The folding of a protein is dependent on the amino acid sequence laid out in the
primary structure. It is also dependent on the environment in which the folding occurs.
In a hydrophobic environment, the hydrophobic side chains of the amino acids of the
protein fold out while the hydrophilic side chains fold in and vice versa for a
hydrophilic environment. An example of a protein that is folded in a hydrophobic
environment is Porin. Its hydrophilic side chains are folded in which creates a channel
for water to pass through. Amino acids that have nonpolar/hydrophobic side chains
such as leucine, valine, methionine, phenylalanine, and isoleucine would be folded out
in the folding of the protein in a hydrophobic environment. Likewise, in a hydrophilic
environment, amino acids with polar side chains such as glutamine and asparagine
fold outwards and the hydrophobic side chains would fold inwards.
of the best methods because the wavelength of an x-ray is similar to that of covalent
bonds found throughout proteins, creating a clearer visualization of a molecule's
structure. The scattering of x-rays by electrons is analyzed to determine the structure
of proteins. In order to use x-ray crystallography, the protein in question must be in
crystal form. Some proteins crystallize readily, while others do not. For those proteins
that do not crystallize readily, nuclear magnetic resonance (NMR) spectroscopy must
be used to determine its structure. NMR spectroscopy uses the spin of nuclei with a
magnetic dipole and chemical shifts to determine a molecules relative position.
Quaternary Structure
Atomic structure of the 50S Subunit from Haloarcula marismortui. Proteins are shown
in blue and the two RNA strands in orange and yellow.[11] This is an example of the
tertiary structure of the large unit of a ribosome
A quaternary structure refers to two or more polypeptide chains held together by
intermolecular interactions to form a multi-subunit complex. The interactions that
hold together these folded protein molecules include disulfide bridges, hydrogen
bonding, hydrogen bonding interactions, hydrophobic interactions interactions and
London forces. These forces are usually conveyed by the side chains of the peptides.
These polypeptide chains are the subunits of a protein, capable of taking part in a
variety of functions such as serving as enzymatic catalysts, providing structural
support in the cytoskeletons of cells, and even composing the hair on our heads.
The peptides of the protein can be identical or different. Insulin is a dimer consisting
of two identical peptides, while Hemoglobin is a tetramer consisting of two identical
alpha subunits and two identical beta subunits.
1 subunit = Monomer
2 subunits = Dimer
3 subunits = Trimer
4 subunits = Tetramer
The pattern continues with pent-, hex-, hept-, oct-, and so forth.
Dimers
HIV Protease
o
Dimer
Trimer
Collagen
o Composed of 3 helical polypeptide chains
Tetramer
Structure of human hemoglobin. The protein's and subunits are in red and blue,
and the iron-containing heme groups in green. From PDB 1GZX Proteopedia
Hemoglobin
Hemoglobin
o Consists of 2 alpha and 2 beta groups
Aquaporin
o
The quaternary structure of a protein can be denatured by breaking the covalent and
non-covalent forces that keep it together. Heat, urea or guanidinium chloride will
denature a protein by disrupting the non-covalent forces, while betamercaptoethanol will break disulfide bridges by reducing the bridges.
A protein is never "half folded", at the point where the concentration of the denaturant
is in between that of the folded and unfolded form of the protein, there are two
structures that exist. Folded and Unfolded, at a ratio of 1:1
Proteins are either folded, or not. There does not exist a stage where a protein is
"half-folded". This can be observed by slowly adding denaturant to a protein. This
will result in a sharp transition, from the folded state to the unfolded state,
suggesting there only exist these two forms. This is a result of cooperative transition.
For instance, if a protein is put in a denaturant where only one part of the protein is
unstable, the entire protein will unfold. This is due to the domino effect where
destabilizing one part of the protein will in turn destabilize the remainder of the
structure. When a protein is in conditions which correspond to the middle of the
transition between folded and unfolded, there is a 50/50 mixture of folded and
unfolded protein, instead of 'half-folded' protein.
After all is said about being in one structure or the other, there must be something in
between them on an atomic level. Unfortunately, this is an area that is still under
development, and much research is still being done. Theories such as the condensation
Nucleation Principle are concerned with this area of protein folding.
Analogy
If one takes each student in a class to be a different amino acid, each right hand to be
an alpha-carboxyl group, each left hand to be an alpha-amino group, and the head to
be the R group; then by joining right hands to left hands, the class will form a
polypeptide. The "bonds" joining the hands will be peptide bonds. This can be
considered the primary structure of a protein.
If one then takes students and "attract" them to other students 4 "bonds" away, this
structure will then fold into a secondary structure; namely the alpha-helix. If the
students were put into lines and were attracted to respective students in another line,
they would form a beta-pleated sheet.
Now imagine that the heads, or R groups, vary in areas such as personalities, or
polarity, like will attract like. The people who are more compatible will then gather
together, for instance, hydrophobic areas will usually gather together in the center
while surrounded by hydrophilic areas. This makes up the tertiary structure.
Now add in a different class, the people from the new class would have their own
tertiary structure, these new people will then come in and react with the original class
to form quaternary structures.
gas storage, metal harvest and etc. Many approaches have been developed to control
protein assembling. Following are some of them.
1. Transition metal-directed. Metal centers in protein are important, not only because
they are reactive centers, but also they help stabilize the shape of protein by
coordination. Many amino acids are ligands by themselves. Cysteine, Histidine, lysine
are the common ones. Plus, researchers can engineer inorganic ligands onto proteins
by cysteine substitution. Thus, introducing inorganic ligands much broaden the
horizon of protein assemblies.