Professional Documents
Culture Documents
Purification of
Recombinant
Proteins
6.0.1
Supplement 30
In many cases high-level expression of protein in the E. coli host leads to the formation
of highly aggregated proteins, called inclusion bodies. The determination of whether a
protein is soluble or not is made using cell lysates and simple centrifugation schemes as
outlined in UNIT 6.1 (see also UNIT 5.3). Proteins in inclusion bodies usually have the correct
primary sequences, identical to those of their native and authentic counterparts, but are
aggregated and inactive due to noncovalent or conformational differences. In UNIT 6.3, the
preparation of insoluble inclusion body protein from E. coli lysates by a simple washing
procedure is described.
Before insoluble proteins in inclusion bodies can be purified, they must be extracted and
solubilized; in this respect their purification is analogous to the extraction of intrinsic
membrane proteins from membranes. The solubilization of inclusion body protein,
however, requires denaturation with reagents such as concentrated solutions of urea or
guanidineHCl (see APPENDIX 3A), whereas the extraction of membrane proteins is normally
carried out under nondenaturing conditions using nonionic or zwitterionic detergents (von
Jagow et al., 1994). As the solubilized recombinant protein is denatured or unfolded, it
must be folded into the correct tertiary and quaternary structures in order for the protein
to acquire its functional and biological properties.
Many nonrecombinant proteins that have been isolated under native conditions can be
reversibly denatured and renatured using well-established methods. A concise account of
protein folding is given in UNIT 6.4 (see also Pain, 1994) to provide some of the theoretical
background required to interpret protein folding experiments.
The preparation of folded and active recombinant proteins from inclusion bodies involves
the process known as preparative protein folding. In traditional protein folding experiments, pure native proteins such as ribonuclease were used to study the reversible
denaturation/renaturation process to gain information about folding pathways (Pain,
1994). These experiments are usually carried out using low protein concentrations (in the
micrograms per milliliter range) to minimize nonspecific inter- and intramolecular
interactions that lead to aggregation. In preparative protein folding, the aim is to obtain
folded protein in as high a yield as possible using relatively high starting concentrations
(in the milligrams per milliliter range) and without using very large reaction volumes.
The various strategies usually used to achieve this are illustrated in UNIT 6.5.
In preparative protein folding, apart from the need to avoid or minimize aggregation,
formation of the correct disulfide bonds is required. After extraction of the recombinant
protein from inclusion bodies using a protein denaturant and reducing agent (UNIT 6.3), the
solubilized protein is unfolded and sulfhydryl groups, if present, are reduced. To form the
correctly folded protein, disulfide bond formation and protein folding must both occur,
and usually in a concomitant manner.
Folding pure or partially purified protein may be advantageous, leading to higher yields
of folded protein. In UNIT 6.3, gel filtration in the presence of guanidineHCl is described
as a simple and rapid method for partially purifying extracted and denatured proteins prior
to folding. A recent example of this approach is the production of the ectodomain of HIV-1
and SIV gp41 (Wingfield et al., 1997). Other methods that can be used to purify proteins
in high concentrations of urea or guanidineHCl are discussed in UNIT 6.1.
Introduction
Three examples of the folding and purification of insoluble proteins expressed in E. coli
are described in UNIT 6.5. Bovine growth hormone (somatotropin) is extracted from
inclusion bodies using guanidineHCl; the protein is then folded and then purified by
conventional chromatographic methods. Aggregation during the folding process is minimized by using a solvent additive cosolvent, in this case urea in a limiting (i.e., nonde-
6.0.2
Supplement 30
Purification of
Recombinant
Proteins
6.0.3
Current Protocols in Protein Science
Supplement 30
Another popular gene fusion system is based on the E. coli intracellular enzyme thioredoxin. This 12-kDa protein is very soluble, is monomeric, and has been thoroughly
characterized (UNIT 5.1). The target protein can be positioned either N-terminal (proteinthioredoxin) or C-terminal (thioredoxin-protein) to the fusion partner. This dual orientation, also possible with Hist-tag fusions, represents a difference from GST fusions, as the
GST moiety is normally only N-terminal. It is worth noting that as thioredoxin is a
monomeric protein, it may be a more suitable fusion partner for oligomeric proteins than,
for example, dimeric GST (Hurd and Hornby, 1996). In UNIT 6.7 the construction and
expression of thioredoxin fusion proteins are discussed. Support protocols deal with the
release of the fusion proteins by mechanical lysis and osmotic stress. E. coliderived
thioredoxin exhibits thermal stability, and a heat treatment of cell lysates at 80C may be
a useful initial purification step. This approach is described in the third support protocol.
The commentary section includes a brief discussion of more specific purification methods. However, if ease of purification is of critical importance, the GST fusion system
should be considered as the glutathione-based affinity method is probably more efficient
than the corresponding affinity-based method used for thioredoxin.
It should be emphasized that although most of the protocols described in this chapter are
applied to specific examples, they are fairly representative of the methods and approaches
used in general for laboratory-scale preparation of recombinant proteins from E. coli. The
individual characteristics of a given protein (e.g., solubility, isoelectric point, pH stability,
and number of sulfhydryl residues) will dictate the conditions required for any particular
purification step or folding process. Method optimization is usually empirical unless
characterization of the purified protein provides useful guidelines.
LITERATURE CITED
Ausubel, F.M., Brent, R., Kingston, R.E., Moore, D.D., Seidman, J.G., Smith, J.A., and Struhl, K.S. (eds.).
1997. Current Protocols in Molecular Biology. John Wiley & Sons, New York.
Hurd, P.J. and Hornby, D.P. 1996. Expression systems and fusion proteins. In Proteins LabFax (N.C. Price,
ed.) pp. 109-117. BIOS Scientific Publishers, Oxford, UK.
Kuge, M., Fujii, Y., Shimizu, T., Hirose, F., Matsukage, A., and Hakoshima, T. 1997. Use of a fusion protein
to obtain crystals suitable for X-ray analysis: Crystallization of a GST-fusion protein containing the
DNA-binding domain of DNA replication-related element-binding factor, DREF. Protein Sci. 6:17831786.
Pain, R.H. 1994. Mechanisms of Protein Folding. IRL Press, Oxford, UK.
Ruldolph, R. 1989. Renaturation of recombinant, disulfide-bonded proteins from inclusion bodies. In
Modern Methods in Protein and Nucleic Acid Research: Review Articles (H. Tschesche, ed.) pp. 149-171.
Walter de Gruyter, Berlin.
Simpson, R.J. and Nice, E.C. 1989. Strategies for the purification of subnanomole amounts of proteins and
polypeptides for microsequence analysis. In The Use of HPLC in Receptor Biochemistry (A.R. Kerlavage,
ed.) pp. 210-244. Alan R. Liss, New York.
von Jagow, G., Link, T.A., and Schgger, H. 1994. Purification strategies for membrane proteins. In A Practical
Guide to Membrane Protein Purification (G. von Jagow and H. Schgger, eds.) pp. 3-21. Academic Press,
San Diego.
Wingfield, P.T., Stahl, S.J., Kaufman, J., Zlotnick, A., Hyde, C.C., Gronenborn, A.M., and Clore, G.M. 1997.
The extracellular domain of immunodeficiency virus gp41 protein: Expression in Escherichia coli,
purification, and crystallization. Protein Sci. 6:1653-1660.
Paul T. Wingfield
Introduction
6.0.4
Supplement 30
UNIT 6.1
The expression of recombinant proteins, especially using bacterial vectors and hosts, is
a mature technology. With the appropriate cDNA and PCR methods, expression plasmids
can be rapidly produced. Following sequence determination of the constructs, plasmids
are transformed into expression hosts, single colonies picked, and fermentation performed. With E. coli, a 2-liter fermentation using complex media will generate 50 to 80
g (wet weight of cells). Assuming modest protein expression (2% to 5% of the total
cellular protein), between 100 and 300 mg of recombinant protein is available in the cells.
The problem is, of course, how to isolate it in an active form. Soluble proteins can be
recovered with good yields (>50%), and insoluble proteins, which must undergo a
denaturation and folding cycle, can be recovered with more modest yields (5% to 20%).
Hence, using small-scale fermentations and laboratory-scale processing equipment,
proteins (or subdomains thereof) can usually be produced in sufficient quantities (10 to
100 mg) to initiate most studies including detailed structural determinations. Some
strategies for achieving high-level expression of genes in E. coli have been reviewed by
Makrides (1996) and Baneyx (1999).
Some of the above characteristics also hold true for the production of proteins using yeast
and baculovirus eukaryotic expression systems, although more effort and expertise is
required to construct the vectors and, with the baculovirus system, produce cells for
processing. A yeast expression system may be a wise choice for proteins that form
insoluble inclusions in bacteria, and for the production of membrane-associated proteins
(Cereghino and Clegg, 1999; UNITS 5.6-5.8). The baculovirus system has proven very useful
for producing phosphorylated proteins and glycoproteins (Kost, 1999; UNITS 5.4-5.5) and
for the co-expression of interacting proteins. The construction of stable mammalian
protein expression vectors requires considerably more time and effort but may be the only
approach for producing complex multidomain proteins (UNITS 5.9-5.10). Cells growing to
cell densities of 1-5 109 cells/ml can be expected to typically secrete >10 mg/liter of
product. Alternatively, transient gene expression systems using various viral vectors (e.g.,
vaccinia virus; UNITS 5.12-5.15), can be used to produce lesser amounts of protein, which is
useful for feasibility studies. It is of interest to note that the large-scale transient expression
systems in mammalian cells are being actively developed by biotechnology companies
(Wurum and Bernard, 1999).
The initial choice of host system for the production of recombinant proteins for many
investigators is Escherichia coli. This is due to such factors as ease of genetic manipulation, availability of optimized expression plasmids, and ease of growth. This unit presents
an overview of recombinant protein purification with special emphasis on proteins
expressed in E. coli. Practical aspects and strategies are stressed throughout, and wherever
possible, the discussion is cross-referenced to the example protocols described in the rest
of Chapter 6.
The first section deals with information pertinent to protein purification that can be
derived from translation of the cDNA sequence. This is followed by a brief discussion of
some of the common problems associated with bacterial protein expression (see also UNIT
5.1). Planning a protein purification strategy requires that the solubility of the expression
product be determined; it is also useful to establish the location of the protein in the
celle.g., cytoplasm or periplasm. This unit includes flow charts that summarize approaches for establishing solubility and localization of bacterially produced proteins (see
also UNIT 5.2).
Contributed by Paul T. Wingfield
Current Protocols in Protein Science (2002) 6.1.1-6.1.37
Copyright 2002 by John Wiley & Sons, Inc.
Purification of
Recombinant
Proteins
6.1.1
Supplement 30
Purification strategies for both soluble and insoluble proteins are reviewed and summarized in flow charts (see also Chapter 1). Many of the individual purification steps,
especially those involving chromatography, are covered in detail in Chapters 8 and 9, and
elsewhere (Scopes, 1994; Janson and Ryden, 1998). The methodologies and approaches
described here are essentially suitable for laboratory-scale operations. Large-scale methodologies have been previously reviewed (Asenjo and Patrick 1990; Thatcher, 1996; Sofer
and Hagel, 1997).
A section on glycoproteins produced in bacteria in the nonglycosylated state is included
to emphasize that, although they may not be useful for in vivo studies, such proteins are
well suited for structural studies. The final sections deal with protein handling, scale and
aims of purification, and specialized equipment needed for recombinant protein purification and characterization.
PROTEIN SEQUENCE AND COMPOSITIONAL ANALYSIS
Analyzing the Protein Sequence
The protein sequence translated from the DNA coding sequence is usually available, and
before attempting any laboratory work, it is useful to carry out a literature survey and
basic computer analyses (see Chapter 2). First, if the natural protein has been isolated and
characterized, reviewing the physicochemical properties of the protein and the established
purification techniques used may aid in planning a strategy for isolation from the
recombinant host. Recombinant proteins that accumulate as insoluble aggregates or
inclusion bodies, require folding into native-like conformations (Lilie et al., 1998; De
Bernardez Clark et al., 1999; UNIT 6.4). Information on the conformational properties,
including denaturation/folding curves, can help rationalize the development of preparative folding processes.
Second, for uncharacterized proteins, analyses of related proteins with sequence similarities or known motifs may provide useful clues for selecting purification steps (UNIT 2.1;
see also the PROSITE database of protein families and domains at the ExPASy Molecular
Biology Server at http://ca.expasy.org/prosite). For example, if the protein contains the
well-known kringle domain, lysine affinity chromatography might be a successful purification technique (Cleary et al., 1989). On the other hand, if the protein contains no
recognizable motifs and has no similarity to other proteins, yet contains many cysteine
residues, other strategies and precautions would be warranted as described in UNITS 6.3-6.5.
The amino acid sequence can be used to direct the synthesis of peptides corresponding
to potential epitopes (e.g., 10 to 20 residues; UNIT 2.2). Polyclonal antibodies raised against
the peptides may be suitable for detecting the protein of interest by immunoblotting. This
approach may be especially valuable for monitoring proteins expressed at low levels
e.g., when E. coli secretion vectors are used. The antibodies may also be useful for
immunoaffinity chromatography.
Analyzing the Amino Acid Composition
Purification of
Recombinant
E. coli Proteins
The amino acid composition (UNIT 3.2) of the protein will also allow calculation of some
basic physicochemical parameters. Using average pKa values for ionizable side chains in
proteins (Matthew et al., 1978), the isoelectric point (pI) can be estimated by applying
the well-known Henderson-Hasselbach relationship. The calculations can be performed
using an electronic spreadsheet such as Excel or via the internet using one of the many
molecular biology servers, e.g., ExPASy (http://www.expasy.ch/tools/pi_tool.html). The
values obtained, although only approximate, are useful for guiding the initial selection of
6.1.2
Supplement 30
ion-exchange resins and the pH of column buffers. When eukaryotic hosts are selected
for protein expression, it should be noted that post-translational modifications such as
phosphorylation and glycosylation will affect the pI.
Another parameter that can be estimated from the amino acid composition is the extinction
coefficient (), usually at 280 nm (Pace et al., 1995). Although this information will be
more useful when the protein has been purified, as most columns are monitored by UV
absorption, proteins with an unusually low (no tryptophan and little or no tyrosine) may
be difficult to detect during the early stages of purification.
Other physicochemical parameters that can be calculated include hydrodynamic parameters such as molecular radii and sedimentation coefficients, the program SEDNTERP is
especially useful (http://www.jphilo.mailway.com/download.htm). These parameters may
help in interpreting results of gel-filtration and centrifugational separations.
CHARACTERISTICS OF THE HOST-VECTOR SYSTEM
Choosing an Expression System
Popular protein expression systems include E. coli, yeast, baculovirus-infected insect
cells, and cultured mammalian cell lines (see Chapter 5). If the requirement is to obtain
a protein post-translationally modified via glycosylation (see Chapter 12) or phosphorylation (see Chapter 13), then a eukaryotic expression system must be used. Stable
mammalian expression systems are the most time-consuming to establish and require the
most expertise; however, they may be the only successful system for certain requirements
including, e.g., proteins with authentic glycosylation patterns; large multidomain and
multisubunit proteins, and especially proteins that are insoluble in E. coli. Post-translational modifications may aid purification (e.g., lectin affinity chromatography can be used
for glycoproteins; UNIT 9.1). On the other hand, these modifications may introduce charge
heterogeneityas is commonly observed with glycosylation due to loss of sialic acid
which may then complicate purification, especially with methods such as ion-exchange
chromatography (UNIT 8.2). Specific modification of proteins expressed in E. coli can be
achieved by the co-expression of modifying enzymes, such as phosphorylation of tryrosyl
residues by tyrosine kinase (Ren and Schaefer, 2001; http://www.stratagene.com/manuals/200124.pdf). However, most of the post-translational modifications observed in E. coli
are nonspecific, such as deamidation (Wingfield et al., 1987a) and proteolytic clipping
(Nagata et al., 1986). Other less common sources of protein heterogeneity arising from E.
coli expression are: (1) internal starts in translation (Dale et al., 1994); (2) partial readthroughs
of the termination codon (Danley et al., 1991), and (3) translation errors (Lu et al., 1993).
The initial choice for protein expression is often E. coli but if direct expression of a protein
of interest fails or yields an insoluble product, there are many other options available
including generating fusion proteins and many other approaches discussed elsewhere in
this overview. If other expression hosts are to be screened, there are universal cloning
systems commercially available (e.g., Gateway cloning system at http://www.invitrogen.com) that allow the rapid transfer of the gene of interest into multivector systems
including yeast, baculovirus, and mammalian cells.
Minimizing Proteolysis
If the protein is expressed in the cytoplasm in a soluble state, the purification can be carried
out directly after cell lysis. Soluble recombinant proteins are, however, susceptible to
proteolysis, which can occur before or after extraction from the cell (Maurizi, 1992).
Choosing protease-deficient E. coli host strains (Goff and Goldberg, 1985), manipulating
growth conditions, especially the time of induction for inducible promoters (Allet et al.,
Purification of
Recombinant
Proteins
6.1.3
Current Protocols in Protein Science
Supplement 30
1988), and using exogenous protease inhibitors can minimize this problem. Nevertheless,
more extreme steps may be required, such as inducing the expressed protein to form
insoluble inclusion bodies, using a secretion vector to locate the protein to the periplasm
or medium, and changing to a eukaryotic expression system. In addition, there is a protein
engineering approach that requires knowledge of the proteolytic cleavage site(s) to
stabilize the protein. It requires alteration of one or both of the residues forming the scissile
bond by site-directed mutagenesis (Mildner et al., 1994). For more detailed discussions
on strategies to minimize proteolytic degradation, see reviews by Murby et al. (1996) and
Makrides (1996).
Removing the Amino-Terminal Methionine
Another common problem with proteins expressed directly in E. coli is retention of the
N-terminal methionine derived from the initiating N-formylmethionine (the formyl group
is almost always removed). The N-terminal methionine is generally removed when the
second amino acid is alanine, glycine, proline, serine, threonine, or valine (cleavable
residues), but not when it is arginine, asparagine, aspartic acid, glutamic acid, glutamine,
isoleucine, leucine, lysine, or methionine (noncleavable residues; Sherman et al., 1985).
When recombinant proteins are expressed at very high levels, the N-terminal methionine
can be retained regardless of the nature of the second amino acid, presumably due to
saturation of the processing enzymes or depletion of required metal cofactors. Removal
of cleavable N-terminal methionine can be carried out in vitro by digestion with purified
methionine aminopeptidase (Miller et al., 1987) or by co-expression of the processing
enzyme (Ben-Bassat et al., 1987; Hwang, et al., 1999).
The need to remove noncleavable methionines can be circumvented by incorporating an
N-terminal secretion leader sequence that localizes the protein, minus the leader, to the
periplasmic space (Holland et al., 1990). Other approaches utilize the incorporation of
N-terminal fusions with, e.g., ubiquitin, which can be cleaved in vitro or in vivo with a
processing enzyme (ubiquitin hydrolase). This approach involves co-expression of the
hydrolase (Miller et al., 1989). Finally, it should be noted that it is sometimes possible to
resolve proteins containing N-terminal Met from those lacking it by chromatographic
methods (Wingfield et al., 1987b).
Dealing with Inclusion Bodies
The expression of eukaryotic proteins in E. coli often leads to the accumulation of
insoluble protein called inclusion bodies (UNITS 6.3 & 6.5). Inclusion bodies can be easily
observed by phase-contrast microscopy as dense bodies, usually located at the polar
extremities of the cell and they can be isolated by centrifugation (Georgiou and Valax,
1999).
Purification of
Recombinant
E. coli Proteins
The rate of protein biosynthesis in prokaryotes is about ten times faster than in eukaryotes.
Comparison of the rates of in vitro refolding of orthologous prokaryotic and eukaryotic
proteins indicates that the former refolds six times faster. This suggests that the rate of
folding correlates with the rate of elongation of polypeptide chains. Hence, part of the
problem in expressing eukaryotic proteins in bacteria might be due to combination of fast
synthesis and slow folding, which favors aggregation (Widmann and Christen, 2000).
Proteins in the unfolded state at high concentration, even small rapidly folding proteins,
are prone to aggregation due to exposure of hydrophobic surfaces that are normally buried
in the native state (see Fersht, 1999, for further discussion). Some proteins are helped to
fold in vivo by binding to accessory proteins called chaperones (see Protein-Assisted
Folding and Oxidation in the discussion of Performing Protein Folding).
6.1.4
Supplement 30
The formation of inclusion bodies can occasionally protect proteins against proteolysis
and can also allow accumulation of proteins normally toxic to the cell; some examples
include proteases (HIV-1 protease; Cheng et al., 1990) and membrane-spanning domains
(Jones et al., 2000). The formation of inclusion bodies also simplifies purification of the
protein, albeit in a denatured/aggregated state (see below). The main disadvantage is that
the protein must be extracted with protein denaturants and then folded into a native-like
conformation. For small (10- to 20-kDa) single-domain proteins, this is usually not
problematic, although the overall recoveries may only be 5% to 20% of those of similar
or identical proteins expressed in a soluble state. For large (40- to 70-kDa), multiple-domain proteins, recoveries may be negligible, although there have been a number of
successful cases, such as the 69-kDa tissue plasminogen activator (Grunfeld et al., 1992).
The formation of inclusion bodies can sometimes be prevented by changing the promoter,
host strain, and combinations thereof; controlling the growth conditions (especially the
pH of the culture); adding nonmetabolizable sugars such as sucrose and sorbitol to the
fermentation medium; and changing the temperature of induction, usually by lowering it
(for reviews, see Schein, 1989; Wetzel, 1992; Baneyx, 1999).
The recombinant protein may be located in both the insoluble and soluble fractions of the
cell (mixed-phase expression), and in these cases, better yields may be obtained by
processing the soluble material (discarding the insoluble) even though it might constitute
only a minor portion of the total expressed protein (Thatcher and Panayotatos, 1986;
Wingfield et al., 1987c). Soluble protein purified from mixed-phase expressions should
be carefully analyzed to check its authenticity (e.g., by mass spectrometry) as the
solubility may have resulted from minor modifications such as deamidation or proteolysis
of a few residues from either the N or C terminus (P.T. Wingfield unpub. observ.).
A successful approach for avoiding inclusion body formation is the use of an appropriate
secretion vector (Guisez et al., 1998; Cornelius, 2001) The N-terminal secretion signal
directs protein to the periplasmic space (see Localizing Protein), and translocation across
the plasma membrane results in cleavage of the secretion leader sequence. The periplasm
contains enzymes that accelerate folding and formation of disulfide bonds (for reviews,
see Missiakas and Raina 1997). Purification is also simplified as the protein content of
the periplasmic space constitutes only 4% of the total E. coli protein (Beacham, 1979).
Fusion Proteins
Apart from direct expression, there are many examples of fusion protein expression.
Fusion proteins consist of the protein of interest partnered or tagged with proteins or
protein domains appended to either the N- or C-terminus (or both) (UNIT 5.1; Uhlen et al.,
1992; also see Table 3 in Makrides, 1996). The appended moieties are commonly called
tags and are often linked to the host protein by a short linker sequence containing a
specific chemical (e.g., Met or Asp-Pro) or protease cleavage site (e.g., thrombin). One
of the main purposes of constructing fusion proteins is to facilitate the recovery and
purification of the recombinant protein. The most popular fusion partners are the polyhistidine tag (His-tag) and the glutathione-S-transferase (GST-tag), these are discussed
in more detail in UNITS 6.5 & 6.6. A tag may help maintain the solubility of a protein that is
normally expressed in an insoluble form (LaVallie et al., 1993; Zhang et al., 1998).
Alternatively, the tags may promote insolubility, especially useful for protecting short,
partially structured polypeptides, and for expressing proteins that are normally toxic to
E. coli (e.g., proteins with membrane-associating or -spanning regions). The Gateway
universal cloning system, previously mentioned, has been used to screen for improved
solubility by comparing the effects of six different N-terminal fusion proteins and the
Purification of
Recombinant
Proteins
6.1.5
Current Protocols in Protein Science
Supplement 30
Table 6.1.1
Protein
expressiona
Native sequence
Nativefusion
Nativesecretion
Locationb
Soluble
Advantages
Disadvantages
Yes
High-level expression
Direct purification with good
recovery
No
High-level expression
May protect against proteolysis
Toxicity effects of protein to cell
may be avoided
Easy partial purification (washed
pelletssee Fig 6.1.1)
Yes
High-level expression
Purification aided with
affinity-tagged protein
Solubility and stability of expressed
protein (or peptide) can be enhanced
by fusion partners
Authentic N terminus after
site-specific cleavage (not always
true)
To obtain native
sequence site-specific
cleavage of fusion
protein required
Overall yield of native
protein may be low
No
No
P/M
Yes
Ease of purification
Correct N terminus
Protein folded and oxidized
No
Correct N terminus
Expression level may be high
May be protected against proteolysis
Secretion leader
unprocessed,
purification usually not
attempted
aNative, native protein sequence, including any site-specific mutations or deletions; Nativefusion native sequence with N- or C-terminal
extension sequence (e.g., polyhistidine tag); Nativesecretion, native sequence plus N-terminal leader sequence coding for an E. coli
secretory signal (e.g., OmpA).
bC, cytoplasm; P, periplasmic space; M, medium.
His-tag (Hammarstrom et al., 2002). This type of study using conventional cloning and
expression would represent a major undertaking.
Purification of
Recombinant
E. coli Proteins
The expression of fusion proteins with affinity handles, such as those containing stretches
of polyhistidine (His-tagged; see also UNIT 6.5), has become extremely popular due to the
ease of protein purification under both nondenaturing and denaturing solvent conditions
(for more details, see discussion on Purifying Denatured Proteins). The soluble fusion
proteins often have native-like conformations and are biologically active. It cannot be
assumed, however, that a tag will have no effect on the proteins function or activity. From
6.1.6
Supplement 30
cells
break cells
mechanical: French press
enzymatic: lysozyme
extract
low-speed supernatant:
polymers,
soluble proteins
low-speed pellet:
inclusion bodies,
cell wall components
centrifuge 90 min
at >60,000 x g
extract
(Triton/EDTA/urea)
centrifuge 30 min
at 10,000 x g
repeat two times
high-speed pellet:
membrane vesicles,
ribosomal particles
high-speed supernatant:
cytoplasmic proteins,
periplasmic proteins
purify
low-speed
extract supernatant:
cell wall components
low-speed
washed pellet:
inclusion bodies
solubilize
denatured protein
native protein
fold
purify
Figure 6.1.1 Differential centrifugation of E. coli cell lysates. Cells are broken with a French press
or by lysozyme treatment. Insoluble (inclusion body) proteins, from either the cytoplasm or
periplasm, are located in the low-speed pellet, which is subjected to preextraction to remove outer
membrane and peptidoglycan material. Inclusion bodies are extracted from washed pellets with
strong protein denaturants such as guanidineHCl. The solubilized protein, which is denatured and
reduced (free sulfhydryl residues), is either directly folded and oxidized (disulfide bonds formed) or
purified before folding. Soluble proteins (from the periplasm and cytoplasm) are located in the
low-speed and high-speed supernatants. The latter can be used directly for chromatography,
whereas the former requires clarification by other techniques such as ammonium sulfate fractionation or membrane filtration.
Purification of
Recombinant
Proteins
6.1.7
Current Protocols in Protein Science
Supplement 30
fermentation broth
supernatant:
secreted protein
pellet: cells
S1
pellet:
washed cells
supernatant:
wash
supernatant:
wash
pellet:
plasmolyzed cells
P1
S2
supernatant:
periplasmic proteins
lyse cells
treat with detergent,
sonicate,
or subject to
hypotonic shock
suspend in
hypotonic medium
(10 mM TrisCI)
centrifuge 90 min
at >60,000 x g
supernatant:
periplasmic proteins
P2
pellet:
spheroplasts
cell lysate
pellet
P3
Figure 6.1.2 Localization of secreted and periplasmic proteins in E. coli. Periplasmic protein
produced via a secretion vector can leak into the medium and be recovered by centrifugation
(supernatant, S1) or filtration. Washing cells with an isotonic solution such as lightly buffered 0.15
M NaCl or 0.25 M sucrose can also release protein (S2). The compartmentalized periplasmic
proteins are released by isotonic shock treatment by directly suspending normal cell paste or
plasmolyzed cell paste into hypotonic medium. Plasmolyzed cell paste is derived by suspending
cells in hypertonic medium and then pelleting. (In hypertonic medium the cell contracts, separating
the inner membrane from the cell wall, and is said to be osmotically sensitized.) The hypertonic
wash often releases protein (P1). The supernatant from shocked cells (P2) will contain constitutive
E. coli proteins and the recombinant product. Osmotically sensitized cells can also be treated with
lysozyme to fragment the outer membrane, thus releasing periplasmic proteins (P3). The pellet from
the lysozyme treatment contains spheroplasts (cells with fragmented outer membranes), which are
easily disrupted by detergents, sonication, or hypotonic shock to release cytoplasmic proteins.
Purification of
Recombinant
E. coli Proteins
6.1.8
Supplement 30
cell lysate
(e.g., low-speed supernatant of Fig. 6.1.1)
clarify lysate
centrifuge (90 min at >60,000 x g)
filter
or salt fractionate and exchange buffer
clarified lysate
preference
exchange buffer
order of
perform
affinity methods
perform other
chromatography methods (3)
dye matrix
hydrophobic
hydroxylapatite
chromatofocusing
concentrate
perform gel filtration
sterile-filter
purified protein
Figure 6.1.3 Purification of soluble proteins from E. coli lysates. Abbreviations for ion-exchange
resins are as follows: CM, carboxymethyl; DEAE, diethylaminoethyl; Q, quaternary ammonium; S,
methyl sulfonate. The order of preference for the stages of ion-exchange (2) and other methods (3)
is based on the authors opinion and does not necessarily represent a consensus view. On the other
hand, the use of a DEAE-based matrix at an early stage (1) is common practice. Affinity methods
(see text and Chapter 9) can be performed at any stage following clarification of the lysate.
protein will be translocated to the periplasm or the medium, though often at low
concentrations, and the tagged protein can be readily purified from the culture medium
after osmotically shocking the cells.
Table 6.1.1 briefly summarizes some of the major advantages and disadvantages of the
various expression scenarios using E. coli. If attempts to express the protein in E. coli or
Purification of
Recombinant
Proteins
6.1.9
Current Protocols in Protein Science
Supplement 30
to fold inclusion body proteins fail, then a eukaryotic system must be considered. The
decision of which system to use is often dictated by the expertise available to the
laboratory. Alternatively, many companies offer custom expression services using various
protein expression systemsthis can be an expedient, although expensive, solution.
SOLUBILITY AND LOCATION OF THE PROTEIN
Determining Solubility
Figure 6.1.1 shows a simple centrifugation scheme that indicates how to determine the
solubility of a protein expressed in E. coli (see also UNIT 5.2). The recombinant protein in
the various fractions is assayed by SDS-PAGE (UNIT 10.1); if more sensitive methods are
required, immunoblotting or biological assays may be used.
Cell breakage carried out with a French press (UNIT 6.2) will disrupt both the outer and
inner membranes. The peptidoglycan layer, which lies underneath the outer membrane in
Gram-negative bacteria such as E. coli, will be fragmented into sheets. Low-speed
centrifugation (30 min at 10,000 g) separates unbroken cells, bacterial outer membrane,
and peptidoglycan components, and highly aggregated inclusion body proteins (pellet
fraction) from soluble bacterial proteins, soluble recombinant proteins, and polymeric
materials, including ribosomal protein complexes and inner membrane vesicles (supernatant fraction). High-speed centrifugation (90 min at 100,000 g) of the low-speed
supernatant will pellet polymers. Soluble proteins, derived mainly from the cytoplasm
and periplasmic space, can then be recovered from the clarified supernatant. Soluble
proteins in the low-speed or high-speed supernatant are purified directly using conventional methods (UNIT 6.2).
It should be noted that a working definition of solubility is the presence of protein in the
supernatant after centrifugation for 100 min at 100,000 g. This definition applies to
solvents of viscosity or density close to that of water.
Occasionally, recombinant protein will be found in both the pellet and supernatant
fractions after low-speed centrifugation due to the accumulation of both soluble and
inclusion body proteins. If partitioning is observed only following high-speed centrifugation, then specific self-association or nonspecific association involving E. coli proteins
and nucleic acid may be suspected. Recombinant proteins that normally bind RNA or
DNA often bind nonspecifically to bacterial nucleic acid (Sherman and Fyfe, 1990;
Wingfield et al., 1990). Lindwall et al. (2000) have developed a sparse screen approach
to optimizing the buffer composition for extracting and solubilizing folded (non-aggregated) proteins.
Inclusion body proteins, which are located in the low-speed pellet fraction, can be partially
purified by extracting with a mixture of detergent [usually 1% to 5% (v/v) Triton X-100]
and denaturant, either urea or guanidineHCl. The concentration of denaturant used for
pellet washing is determined empirically and should be below the concentration required
for solubilization of the recombinant protein; the usual ranges are 1 to 4 M urea and 0.5
to 1.5 M guanidineHCl. The cloudy extract will consist of complex carbohydrate from
the fragmented peptidoglycan layer, lipopolysaccharide, and outer membrane proteins.
The inclusion body proteins in the washed pellets are then extracted with solvents that
disrupt protein-protein interactions (e.g., 6 to 8 M urea or guanidineHCl) and processed
as described below.
Purification of
Recombinant
E. coli Proteins
6.1.10
Supplement 30
Localizing Protein
When proteins incorporating a secretion vector are expressed in E. coli, advantage can be
taken of the fact that the recombinant proteins will be located in the periplasmic space
and/or the culture medium. Secretion into the medium is due to leakage from the
periplasm and appears to depend on the level of accumulation and the fermentation
conditions. Figure 6.1.2 summarizes approaches used to recover proteins selectively from
the periplasmic space or the medium (see also UNIT 5.2). High-level secretion into the
periplasm sometimes results in the formation of aggregates, analogous to cytoplasmic
inclusion bodies (Bowden et al., 1991). Periplasmic inclusion bodies can be extracted
from the low-speed pellet fraction following normal cell breakage (see Fig. 6.1.1).
Proteins in the medium can be recovered by subjecting the culture medium to centrifugation or filtration, steps that remove intact cells and large debris. The clarified protein
is usually dilute and is often concentrated prior to purification by affinity or conventional
chromatography. Periplasmic proteins can be selectively released by osmotic shock
(preferred method) or by selective disruption of the outer membrane and peptidoglycan
layer using lysozyme.
Apart from its use in dissecting the bacterial compartments, lysozyme is often employed
to prepare complete cell lysates, especially in laboratories that do not have access to a
French press. Cells treated with lysozyme can be disrupted with detergents or by brief
sonication (UNIT 6.5).
Useful microscale (<1 ml) E. coli cell fractionation schemes have been based on osmotic
shock treatment (Yarranton and Mountain, 1992) or repeated freezing and thawing of cells
(Johnson and Hecht, 1994). UNIT 5.2 describes small-scale (1- to 25-ml) procedures for
preparing samples of periplasmic extracts and extracellular media for analysis by SDSPAGE (UNIT 10.1).
STRATEGIES FOR ISOLATION OF SOLUBLE PROTEINS
There are no set formulas for isolating soluble recombinant or nonrecombinant proteins;
there are, however, some basic strategies and precautions that can be followed. A flow
chart summarizing some of the methods commonly used for E. coli is shown in Figure
6.1.3 (see also Chapter 1). In Figure 6.1.3, the step, Perform Affinity Methods, refers not
only to conventional affinity purification methods (see Chapter 9), but also to affinity
methods based on the use of fusion proteins. A specific protocol detailing the purification
of the soluble protein interleukin-1 (IL-1) is presented in UNIT 6.2 and two more recent
examples are discussed below. Comments on the various stages are given in order of their
application.
Determining the Isoelectric Point
The section on Analyzing the Amino Acid Composition mentions how the isoelectric point
of a protein can be estimated from the pKa values for ionizable side-chain groups. The pI
can also be determined experimentally by subjecting the soluble protein extract to 1-D
isoelectric focusing (UNIT 10.2) or 2-D titration curve analysis (Watanabe et al., 1994; UNIT
7.3). If the recombinant protein is not a major component in the cell extract, specific
detection on the 2-D gel by immunoblotting will be required. The calculated pI can be
used to optimize the buffer pH in subsequent ion-exchange steps.
Purification of
Recombinant
Proteins
6.1.11
Current Protocols in Protein Science
Supplement 30
Breaking Cells
Cells are efficiently broken by high-pressure homogenization using a continuous-fill
French press, which is suitable for processing volumes of 40 to 250 ml (reviewed by
Hopkins, 1991; see UNIT 6.2). (Yeast cells can also be conveniently broken with the French
press, although two passes are required). For volumes exceeding 500 ml, the MantonGaulin-APV homogenizer (APV Gaulin) should be used. Sonication is also useful for
breaking cells but is best suited for volumes <100 ml. Alternatively, the outer cell wall
can be enzymatically digested with lysozyme (200 g/ml) and the cells broken by
detergents, sonication, or both (Kaback, 1971; Burgess and Jendrisak, 1975; UNIT 6.5).
Proteins that are secreted into the periplasmic space can be selectively released by
hypotonic (osmotic) shock (Heppel, 1967).
The viscosity of the cell lysate may be high due to released nucleic acid. Before
centrifugation, the viscosity must be reduced either by sonicating or by adding DNase
(25 to 50 g/ml plus 5 to 10 mM Mg2+) and RNase (50 g/ml; no Mg2+ requirement). A
standard protease inhibitor mixture should be included in the buffercontaining, for
example, 2 to 5 mM EDTA, 0.5 to 1.0 mM phenylmethylsulfonyl fluoride (PMSF) or 5
mM benzamidine, and 1 M pepstatin A. The serine protease inhibitor 4-(2-aminoethyl)benzenesulfonyl fluoride hydrochloride (AEBSF) is a water-soluble substitute for PMSF
with a much longer half-life in aqueous solution and is used at 50 M. (Roche Applied
Science: http://www.roche-applied-science.com. Go to the Biochemistry section to download the booklet: The complete guide for protease inhibition that lists properties for most
commercially available reagents). The addition of 2-macroglobulin (1 g/mg recombinant protein) before the final purification step(s) can protect protease-sensitive proteins
(Ultsch et al., 1991; see also section on MAP30 purification below). The crude extracts
should be kept cold and the recombinant protein taken rapidly to a stage of the purification
process where it is stable against contaminating proteases (e.g., as an ammonium sulfate
precipitate).
Clarifying Cell Extract by Centrifugation or Selective Precipitation
The lysate is subjected first to low-speed centrifugation to remove unbroken cells and
large cellular debris, then to high-speed centrifugation to remove ribosomal material and
other particulates (see Fig. 6.1.1). If an ultracentrifuge is not available, the extract can be
clarified by the following techniques: ammonium sulfate or polyethylene glycol fractionation (reviewed by Scopes, 1994), phase partitioning (reviewed by Walter and Johansson,
1986), and membrane filtration (van Reis and Zydney, 2001; useful guides on filtration
technology are available from Millipore at http://www.millipore.com). A fairly recent
technology is expanded bed adsorption where crude extracts can be directly applied to
adsorbents, for example ion exchangers, without initial clarification (see Amersham
Biosciences for literature at http://www.apbiotech.com).
Proteins that bind tightly to nucleic acid can be selectively precipitated with polyethyleneimine and resolubilized by salt extraction (Burgess and Jendrisak, 1975). In practice,
particular properties of the protein can be exploited at this stage; for example, the protein
of interest may be soluble under conditions where most E. coli proteins are insoluble,
such as acidic pH or high temperature.
Applying Clarified Extract to a Weak Anion Exchanger
Purification of
Recombinant
E. coli Proteins
Fractionating the extract with an anion-exchange resin is a useful first step as it removes
host E. coli proteins, many of which have pI values in the range 5.0 to 7.0 and will thus
bind to a column equilibrated in 50 to 100 mM TrisCl, pH 7.5 to 8.0. The positively
charged matrix will also tightly bind nonproteinaceous materials such as nucleic acids
6.1.12
Supplement 30
denatured protein:
monomeric and reduced protein
purify protein
gel filtration using urea or GuHCI as solvent
HPLC using TFA/acetonitrile as solvent
denatured protein:
purified protein
fold protein
purify protein
product
Figure 6.1.4 Folding and purification of inclusion body proteins from E. coli. The protein is
extracted with protein denaturants such as guanidineHCl (GuHCl), urea, or an organic acid. The
reductant dithiothreitol (DTT) is included to prevent artificial disulfide bond formation (especially
intermolecular bonds). The denatured protein can be purified by various methods and then folded,
or it can be directly folded. Typically, some purification (e.g., gel filtration in GuHCl) prior to folding
is recommended as it often results in higher folding yields. Protein folding and oxidation are carried
out concurrently. Disulfide bond formation is catalyzed by low-molecular-weight thiol/disulfide pairs
such as reduced (GSH) and oxidized (GSSG) glutathione. GSH/GSSG ratios of 5:1 to 10:1 are
normally used, which are similar to those found in vivo in the endoplasmic reticulum (Hwang et al.,
1992). A cosolvent is included to maintain solubility during folding. Folded protein is purified if
necessary (purification is usually needed if the protein is directly folded). Gel filtration is a useful
final step for removing aggregated and or misfolded protein.
and other polyanionic species (e.g., lipopolysaccharide derived from the bacterial outer
membrane). A useful cleanup of the protein will take place whether or not the protein of
interest binds to the column (see UNITS 6.2 & 6.5). The following column sizes are recommended for processing extracts: for 5 g cells, 2.5-cm diameter packed to a height of 10
to 15 cm; for 50 g cells, 5.0-cm diameter packed to a height of 20 cm.
Purification of
Recombinant
Proteins
6.1.13
Current Protocols in Protein Science
Supplement 30
cells
break untreated
cells with
French press
low-speed centrifugation
B
s
lp
ib
c
lp
ib
c
washing steps
ib
c
Figure 6.1.5 Preparation of washed pellets using lysozyme and the French press. Cells are
broken with the French press with or without prior treatment with lysozyme. After low-speed
centrifugation using a fixed-angle rotor, the contents of the centrifuge tubes have the characteristics
shown. The contents of tubes A and B are labeled: s, supernatant; lp, loose pellet; ib, inclusion body
protein; and c, unbroken cells and large cellular debris. The loose pellet material is derived from the
outer cell wall and outer membrane (see text for further details). After washing the insoluble material
(UNIT 6.3), the pellet should consist mainly of the inclusion body layer (tube C), and the supernatant
should be fairly clear.
Purification of
Recombinant
E. coli Proteins
Before repeating ion exchange, the solvent pH and ionic strength usually need adjustment.
This can be carried out by dialysis (UNIT 6.2) or by gel filtration on a desalting column
using, for example, Sephadex G-25 or G-50 (UNIT 8.3). In preparation for cation-exchange
chromatography, dialysis against slightly acidic buffers (pH 5.0 to 6.0) will result in the
helpful precipitation of some E. coli proteins. It may also be advisable to include a
relatively low concentration of urea (0.5 to 2 M) or a nonionic or zwitterionic detergent
in the dialysis buffer to minimize coprecipitation with contaminants (for an extensive
listing of detergents and properties, see http://psyche.uthct.edu/shaun/SBlack/detergnt.html). Basic proteins (pI >9.0), which do not bind to the DEAE column, can be
6.1.14
Supplement 30
applied after dilution to a cation exchanger equilibrated at pH 7.0 to 7.5 without careful
buffer exchange (Allet et al., 1988).
Repeating Ion-Exchange Chromatography
For a second round of ion-exchange chromatography, one of the ion-exchange resins
indicated in Figure 6.1.3 should be used. Selection kits are available for rapidly screening
and selecting the most suitable ion-exchanger (Amersham Biosciences). For cation-exchange chromatography, phosphate buffer (10 to 50 mM) between pH 5.0 and 7.5 should
be tried first. Cellulose Phosphate (a bifunctional cation exchanger manufactured by
Whatman: http://www.whatman.com) is effective for nucleic acidbinding proteins
(Kelley and Stump, 1979). Protein is usually eluted from cellulose phosphate columns
using phosphate gradients.
After two stages of ion exchange, many proteins will be pure enough for the final
gel-filtration step (see Performing Gel Filtration). However, if the sample contains
contaminants close in size to the protein of interest, then further purification is required.
Some of the frequently used methods are listed in Figure 6.1.3. Hydrophobic-interaction
chromatography (UNIT 8.4) is especially useful following ammonium sulfate fractionation
or salt elution from an ion-exchange resin. Screening kits are also available for rapidly
checking protein binding on several different agarose-dye matrices (Sigma at
http://www.sigmaaldrich.com).
Performing Gel Filtration
The final purification step of gel filtration (using a column 1.5 to 5.0 cm in diameter and
60 to 100 cm in length) will provide good separation of the recombinant protein from
higher- and lower-molecular-weight E. coli protein contaminants. Gel filtration will also
separate aggregated or highly associated recombinant protein from the physically stable
form of the protein (e.g., monomer or dimer). Finally, gel filtration chromatography
allows for easy exchange of the buffer. The protein solution is usually concentrated before
being applied to the column. After chromatography, the protein will be diluted three- to
five-fold (or more) and may therefore require repeat concentration.
Other Methods
In addition to the generalized approach described, affinity methods can be applied at any
stage following clarification of the extract. Biospecific affinity can be exploited with
immobilized natural ligands such as antibodies, substrates, and receptor ligands. Affinity
chromatography, which selects for particular classes of proteins, is carried out with
immobilized lectins (for glycoproteins), dyes (for nucleotide-binding proteins), and
nucleic acids or heparin (for RNA- and DNA-binding proteins). Commercially available
antibodies against post-translationally modified residues (e.g., phosphotyrosine) are also
useful. The application of affinity tags or fusions has been previously described. Affinity
methods are most useful when high degrees of purification are requirede.g., for proteins
secreted into the medium, for small-scale isolations, or for rapid purification requirements.
The most commonly used affinity method is immunoaffinity chromatography. The ideal
reagent is a monoclonal antibody that has been specifically selected to have a moderateto-low affinity for the ligand in question, thus allowing elution under nondenaturing
conditions. Antibodies raised against peptides often have lower affinities for the native
protein than antibodies raised against the intact protein. Elution from peptide-antibody
immunoaffinity columns can be achieved using the competing immunizing peptide
(reviewed by Sutcliffe et al., 1983). Directed immobilization of the antibody, where only
Purification of
Recombinant
Proteins
6.1.15
Current Protocols in Protein Science
Supplement 30
the Fc domain is bound to the column matrix and the antigen binding site (Fab domain)
is thus oriented away from the matrix, results in higher binding efficiencies. An oriented
antibody matrix can be made by binding antibody to immobilized protein A-Sepharose
(or protein G-Sepharose) and fixing it in position with a covalent cross-linking reagent
(Schneider et al., 1982; commercial kits can be obtained from Pierce at http://www.piercenet.com/).
Compilations of standard chromatographic fractionation media are available (Table 8.2.2;
Patel, 1993). However, the best source of information is often the literature from the
various manufacturers.
STRATEGIES FOR ISOLATION OF INSOLUBLE PROTEINS
Recombinant proteins expressed in E. coli that are located in the low-speed pellet fraction
(see Fig. 6.1.1) following cell lysis are highly aggregated (i.e., inclusion bodies). Inclusion
bodies are normally derived from protein aggregation in the cytoplasm, or in the periplasm
if a secretion vector was used. As mentioned above, protein can also be located in either
the low- or high-speed pellet fractions because of interaction with bacterial nucleic acids.
Furthermore, if the protein is known to undergo polymerization in vitro (e.g., viral
nucleocapsid subunits), expression in E. coli can also be expected to lead to polymerization in vivo to varying degrees, and such proteins will be partitioned in both the
supernatant and pellet fractions (Wingfield et al., 1995). There are also examples of
membrane proteins that, when expressed in E. coli, associate with the inner cytoplasmic
membrane and can be extracted with nondenaturing detergents (Bibi and Beja, 1994, and
references cited therein).
When apparent insolubility is due to interactions involving folded protein as described
above, extraction under nondenaturing conditions should be attempted, for example, using
various pH buffers containing salt (e.g., 0.25 to 1.0 M NaCl) and nondenaturing detergents
(e.g., 10 mM CHAPS or 2% Triton X-100). Insolubility due to classic inclusion body
formation requires extraction with denaturing solvents, and the remainder of this section
deals with this subject. The flow chart in Figure 6.1.4 illustrates some of the approaches
possible for processing protein extracted from inclusion bodies.
Breaking Cells
Cells can be broken by mechanical means (UNIT 6.2), enzymatically with lysozyme (UNIT
6.5), or by a combination of methods (UNIT 6.5). It is advantageous to break cells as
completely as possible, as any unbroken cells will be located in the low-speed pellet
fraction from which the recombinant, insoluble protein will be extracted.
Preparing Washed Pellets
The object of the initial low-speed centrifugation and pellet washing is to extract as
many E. coli contaminants as possible without solubilizing the recombinant protein. This
is usually carried out as described in the section on Determining Solubility (see also Fig.
6.1.1).
Purification of
Recombinant
E. coli Proteins
When a fixed-angle rotor is used, pellets from the low-speed centrifugation consist of at
least two light-colored layers and a darker, hard-packed pellet at the bottom of the tube
(Fig. 6.1.5). The hard-packed material is probably a small amount of unbroken cells. The
next layer is inclusion body protein, and the top layer (least dense and lightest in color)
is outer membrane and peptidoglycan fragments. Analysis of the top layer by SDS-PAGE
(after heating proteins in SDS sample buffer at >80C) will reveal two strong bands at
35 and 38 kDa representing OmpA and the matrix proteins OmpC and OmpF, respec-
6.1.16
Supplement 30
tively, from the outer membrane (DiRienzo et al., 1978; see also Fig. 6.3.1). The outer
membrane/peptidoglycan layer can be partially removed by resuspending and centrifuging at reduced speed (or time) or by diluting the suspension. Alternatively, the cells can
be pretreated with lysozyme prior to the French-press cell breakage as described in UNIT
6.5. The lysozyme treatment reduces the size of the loosely pelleted outer membrane/peptidoglycan material so it locates predominately in the low-speed supernatant (Fig. 6.1.5).
The recombinant protein in a well-prepared washed pellet will typically be >60% pure
when analyzed by SDS-PAGE (UNIT 6.3).
Extracting Protein
The washed pellets are extracted with high concentrations of protein denaturants such as
6 to 8 M guanidineHCl or urea. It should be noted that some proteins are resistant to
denaturation with high concentrations of these reagents, especially urea. Some washed
pellets extracted with 8 M guanidineHCl can be viscous and unsuitable for direct
chromatography. In these cases, pre-extraction of the washed pellets with a limiting
concentration (0.5 to 2.0 M) of guanidineHCl can often overcome this problem.
Solubilization with the anionic detergent N-lauroylsarcosine (Nguyen et al., 1993; Burgess and Knuth, 1996) and with 10% to 20% acetic acid has also been useful (UNIT 6.5);
other denaturants for extracting inclusion bodies are described elsewhere (UNIT 6.3;
Marston and Hartley, 1990). For background information on the mode of action of protein
denaturants, readers should consult the reviews of Tanford (1968) and Creighton (1993).
If the protein contains cysteine residues, it is essential to include a reducing agent,
preferably 5 to 10 mM dithiothreitol (DTT). Even in the presence of strong protein
denaturants, it may be necessary to sonicate or heat samples briefly to completely disperse
and solubilize the protein.
The extraction process should completely disaggregate and denature the protein into
unfolded monomers. Urea is not recommended for the initial extraction. For example,
even if it is known that a native version of protein can be unfolded with 4 M urea, the
same protein in an E. coli inclusion body will almost certainly not be completely extracted
as unfolded monomers with that same concentration of urea (or in most cases, even with
8 M urea). Initial extraction trials should be carried out with guanidineHCl, which is
more effective than urea. Most proteins will be extracted with 6 to 8 M guanidineHCl.
There should be adequate reductant present to maintain sulfhydryl groups in the reduced
state, and thus prevent artificial disulfide bond formation. The presence of EDTA and a
slightly acidic pH of 6.0 to 6.5 will help minimize cysteine oxidation. The extract may
require clarification by filtration or centrifugation.
Choosing Purification or Folding
The extracted protein can be further purified, or it can be directly folded and then purified.
Protein folding appears to be unaffected by the protein background in bacterial extracts
(London et al., 1974), however, removal of nonproteinaceous material prior to folding has
been reported to be beneficial (Darby and Creighton, 1990). Based on recent work, it is
worth noting that high concentrations of background bacterial protein may promote
aggregation of the unfolded recombinant protein by macromolecular crowding effects
(Ellis, 2001). If purification of protein in the denatured state is possible, use the purified
material to develop a folding protocol. Then use this protocol with clarified protein
extracts, or better still with protein partially purified by DEAE-Sepharose, to observe if
the presence of contaminants has any effect on the yield of folded protein.
Finally, there may be specific reasons for purifying proteins in the denatured state. For
example, some proteolytic enzymes, such as HIV-1 protease, self-digest (undergo auto-
Purification of
Recombinant
Proteins
6.1.17
Current Protocols in Protein Science
Supplement 30
proteolysis) in the uninhibited state (Mildner et al., 1994, and references cited therein)
but can be purified intact in the denatured (inactive) state, then refolded when required.
Other proteins once folded may have low solubilities and be especially susceptible to
aggregation, resulting in poor behavior on column matrices (see VP26 purification below).
However, in general, unfolded proteins are more susceptible to chemical and proteolytic
modifications.
Purifying Denatured Proteins
If the protein is extracted with guanidineHCl, gel filtration is a useful first purification
method; often protein >80% pure can be obtained (UNIT 6.3; Wingfield et al., 1997). The
proteins exist as random coils in the denaturant and their elution from the column should
be in order of their molecular weight and not be influenced by shape. If the protein is
located in several peaks there may have been incomplete solubilization during the
extraction. In this case, 8 M guanidineHCl should be used for the extraction and the
protein dispersed by sonication or by heating if necessary. Another possibility is intermolecular disulfide bond formation, in which case the DTT concentration in the sample and
column buffers should be increased. It is worth noting that the column can often be
equilibrated and eluted with lower guanidineHCl concentrations (e.g., 4 M) than those
used for the actual extraction process. Only monomeric protein should be selected for
further processing. The protein at this stage can be stored frozen, ideally at 80C.
The partially purified protein in guanidineHCl can be directly folded (see Performing
Protein Folding), or the denaturant can be exchanged by dialysis or gel filtration for 1%
to 5% (v/v) acetic or formic acids (acetonitrile at 5% to 10% v/v can also be included)
and then lyophilized. Alternatively, the protein can be acidified with trifluoroacetic acid
(TFA; 0.1% v/v) and further purified by reversed-phase chromatography (Wingfield,
1997; Wingfield et al., 1999). Useful high-flow matrices (Source 15RPC from Amersham
Biosciences) can be purchased as bulk media. These matrices may not have the resolution
of traditional prepackaged silica-based reversed-phase columns, but they have high
capacity, can be eluted at higher flow rates, and are stable over a wide range of pH. Proteins
eluted with acetonitrile/TFA are also suitable for lyophilization.
Proteins tagged with histidine residues can be purified in guanidineHCl-, urea-, or even
SDS-containing buffers, using metal chelate chromatography (UNIT 6.5). There are many
reports of on-column protein folding by binding the unfolded protein in guanidineHCl
or urea and then accomplishing folding using a reverse urea gradient (e.g., Gulnik et al.,
2001).
Proteins in urea and non-ionic or zwitterionic detergents (e.g., CHAPS) can be purified
by ion-exchange chromatography (e.g., Wingfield et al., 1990). For ion-exchange chromatography, better results have been reported using protein that has been first extracted
with guanidineHCl, and then exchanged into urea (Shire et al., 1984).
If urea is used either for extraction or for maintaining solubility during refolding, a cyanate
scavenger such as a glycine- or Tris-based buffer should be included to prevent carbamylation of the protein (Stark et al., 1960). For critical work, urea can be deionized with a
mixed bed ion-exchange resin (see discussion of Protein Folding Reagents in APPENDIX 3A).
Performing Protein Folding
Purification of
Recombinant
E. coli Proteins
Protocols for folding proteins basically involve controlled removal of the denaturant under
conditions that minimize aggregation and allow correct formation of disulfide bonds. For
overviews of the practical aspects of protein folding, see UNIT 6.4; Wetzel (1992); Thatcher
6.1.18
Supplement 30
et al. (1996); Rudolph et al. (1997); Lilie et al. (1998); De Bernardez Clark et al. (1999);
and De Bernardez Clark (2001).
To minimize nonproductive aggregation, folding is normally carried out at low protein
concentrations (e.g., 0.01 to 0.10 mg/ml); for small, single-domain proteins, higher
concentrations (e.g., 0.1 to 1.0 mg/ml) can often be tolerated. Dilution and dialysis are
the most common methods for removing the denaturant. Solubility during folding can be
maintained with co-solvents such as nondenaturing concentrations of urea (1 to 4 M;
London et al., 1974; UNIT 6.5) or guanidineHCl (0.1 to 1.5 M; Orsini and Goldberg, 1978),
arginine (0.4 to 0.8 M; De Bernardez Clark et al., 1999), nonionic detergents and lipids
(Zardeneta and Horowitz, 1994), cationic detergents (Puri et al., 1992), and polyethylene
glycol (PEG; Cleland et al., 1992). These various additives function by minimizing
intermolecular associations between sticky hydrophobic surfaces present in folding
intermediates. For further discussion of aggregation versus folding, see Goldberg et al.
(1991) and Kiefhaber et al. (1991). Additives such as ammonium sulfate, glycerol,
sucrose, enzyme substrates or inhibitors, and ligands have also been used to improve
protein folding (see Table 1 in De Bernardez Clark et al., 1999, for a useful list of additives
used in folding).
Protein expressed in the cytoplasm of E. coli is in the reduced state; this is true for both
soluble and insoluble proteins. Once insoluble protein is solubilized, it needs to be
maintained in a reduced state by the presence of reductant until protein folding is initiated.
The oxidative formation of disulfide bonds (one of the rate-limiting steps in protein
folding) can be catalyzed by low-molecular-weight thiol and disulfide pairs such as
reduced and oxidized glutathione (GSH/GSSG). Redox buffers facilitate oxidation
through thiol/disulfide exchange reactions (reviewed by Wetlaufer, 1984; Creighton,
1984; Gilbert, 1995). Normally GSH/GSSG ratios of 5 to 10 are used with a total
glutathione concentration of 1 to 5 mM (Wetlaufer, 1984). To reduce the rate of GSH loss
due to air oxidation, 1 mM EDTA should be included in the buffer (Wetlaufer et al., 1987).
The optimal concentrations and ratios of reagents must be established in an empirical
manner. Folding and oxidation are normally carried out concurrently (for further details,
see Rudolph et al., 1997). Analogous to the approach commonly used to optimize
conditions for protein crystallization, various screens have been developed to establish
initial conditions for protein renaturation and oxidation (Hofmann et al., 1995; Armstrong
et al., 1999) and kits are commercially available (FoldIt Screen from Hampton Research
at http://www.hamptonresearch.com).
For examples of preparative protein folding, see UNIT 6.5. In addition, some recent examples
from the authors laboratory are given below. The refolding of Fab fragments expressed
in E. coli (Buchner and Rudolph, 1992) is illustrative of the systematic and empirical
approach used to optimize folding conditions. Other examples of interest are described
by Kohno et al. (1990) and Grunfeld et al. (1992).
Protein-assisted folding and oxidation
Protein folding in vivo is assisted in both eukaryotes and prokaryotes by two classes of
accessory proteins: folding catalysts (for a review, see Schiene and Fischer, 2000) and
molecular chaperones (Eisenberg, 1999; Feldman and Frydman, 2000). Folding catalysts
accelerate rate-limiting steps in protein folding such as disulfide bond formation (protein
disulfide isomerases) and the rotation of X-Pro bonds (peptidyl prolyl cis-trans isomerase) during protein folding. Chaperones bind denatured or unfolded proteins thus
preventing misfolding and aggregation. The cytoplasm of E. coli is maintained in the
reduced state by thioredoxin and the glutathione/glutaredoxin pathways. In hosts where
the reduction of thioredoxin and glutathione is impaired by mutations to the thioredoxin
Purification of
Recombinant
Proteins
6.1.19
Current Protocols in Protein Science
Supplement 30
reductases and glutathione reductase genes, the resultant oxidizing conditions allow the
formation of disulfide bonds in expressed proteins located in bacterial cytoplasm (Bessesste et al., 1999; cells and expression kits are commercially available from Novagen). The
periplasm of E. coli also contains protein disulfide isomerases, the Dsb enzymes, which
have thioredoxin-like folds and act as strong thiol:disulfide oxidants (Missiakkas and
Raina, 1997; Braun et al., 1999). Secretion of proteins into the periplasmic space has been
the traditional approach for producing oxidized proteins in vivo, but with the aforementioned advances in cytoplasmic oxidations, this approach is probably best suited for
proteins that are toxic to the cell when expressed in the cytoplasm (Cornelis, 2000).
As mentioned, molecular chaperones prevent aggregation by interacting transiently with
hydrophobic patches on unfolded proteins and suppressing aggregation and promoting
folding (UNIT 6.4; reviewed by Jaenicke, 1993; Ellis and Hart, 1999; Feldman and Frydman,
2000). There are now many examples of chaperone-assisted protein expression in which
the endogenous levels of the bacterial chaperones GroES and GroEL (1%) are increased
up to ten-fold by co-expression with a target protein (Cole, 1996; Goenka and Rao, 2001).
Often, increases in soluble protein expression are observed, but this is not always the case.
Chaperones have also been used in vitro as protein folding reagents and some examples
of folding in the presence of protein disulfide isomerase, peptidyl prolyl cis-trans
isomerase and GroES/GroEL are given in Rudolph et al. (1997). Protocols for the
high-level expression and rapid purification of E. coli GroEL and GroES are described
by Kamireddi et al. (1997).
Purifying Folded Protein
Once the protein has been folded, any of the purification methods discussed in Chapters
8 and 9 can be used. The number of purification steps required should be fewer than those
for a protein expressed in a soluble state because of the purification factor obtained by
preparation of washed inclusion bodies (UNIT 6.3). One of the purification methods that
should be included is gel filtration, which may be the only one required. A correctly
selected matrix should remove any remaining E. coli proteins and separate aggregated
and misfolded protein from the native folded protein. Misfolded protein may be expected
to have a larger molecular radius (higher apparent mass) than the corresponding native
protein.
Monitoring Protein Folding
The restoration of function (e.g., enzymatic or biological activity) is perhaps the best
criterion for detecting successful folding. However, it is not always practical to use activity
measurements to monitor folding. It is also worth mentioning that an unfolded protein
may become activated following the dilution required for many activity measurements.
Conversely, native proteins can be denatured or inactivated during prolonged incubation
at 37C or by adsorption to microtiter plates. The use of antibodies to monitor protein
folding is briefly reviewed by Goldberg (1991), and reviews of common spectroscopic
methods, such as circular dichroism and fluorescence, are provided in Chapter 7 and by
Schmid (1997).
BACTERIAL EXPRESSION OF PROTEINS NORMALLY GLYCOSYLATED
Purification of
Recombinant
E. coli Proteins
6.1.20
Supplement 30
intermediates, and thus enhance folding, while not necessarily affecting the stability of
the native state (Kern et al., 1993, and references cited therein). In eukaryotic cells,
interference with protein glycosylation can lead to the formation of misfolded, aggregated, and degraded protein. This indicates that in vivo glycosylation (N-linked) may also
prevent the aggregation of folding intermediates (reviewed by Helenius, 1994). Detailed
NMR studies on glycoproteins have clearly shown that carbohydrates stabilize folded
proteins and even prevent marginally stable proteins from unfolding (for a review, see
Wyss and Wagner, 1996).
Despite potential pitfalls, many nonglycosylated protein variants have been successfully
folded from E. coli inclusion bodies. Examples include cytokines of biomedical importance such as granulocyte/macrophage colony-stimulating factor (GM-CSF; Diederichs
et al., 1991) and interleukin 5 (IL-5; Milburn et al., 1993). Inclusion body formation was
avoided in some studies by using secretion vectors; examples include GM-CSF (Walter
et al., 1992) and the extracellular domain of the human growth hormone receptor (deVos
et al., 1992). The aforementioned proteins have been crystallized and their structures
determined by X-ray crystallography, supporting the view that the structural integrity and
conformation of the proteins were not affected by the lack of glycosylation and their
respective preparative histories.
If a glycoprotein of interest is available from a eukaryotic recombinant expression system
or if the natural protein is available, then before investing time with E. coli expression, it
may be worthwhile to determine whether the protein can be denatured and refolded in
vitro. Pilot experiments can be carried out on intact protein and on protein enzymatically
deglycosylated with glycosidases and, if disulfides are present, with and without reduction. Of course, if the protein can be secreted to the periplasm, aggregation and the
necessity for in vitro folding may be avoided.
The production of deglycosylated proteins in E. coli expression systems for in vitro
biochemical and structural studies is obviously of great value; however, the proteins may
not always be suitable for in vivo studies due to low biological activity. Compared to
authentic proteins, nonglycosylated variants can have a reduced circulatory lifetime and
can exhibit increased immunogenicity and protease sensitivity (Rasmussen, 1992).
SOME EXAMPLES OF PROTEIN EXPRESSION AND PURIFICATION
Examples of protein expression and purification can be found in most biochemical
journals, two which may be especially useful: Protein Expression and Purification
(http://www.academicpress.com/pep), which covers advances in the expression and purification of recombinant proteins mainly from E. coli although other expression systems
are often included; and Current Opinion in Biotechnology, which regularly provides
updates on various aspects of recombinant protein production as well as useful reference
lists. Detailed protocols are also given in the units of this Chapter and a few recent
examples of protein expression and purification are discussed below to illustrate some of
the general approaches used to deal with soluble and insoluble E. coli protein expression.
Soluble Proteins
HIV Nef
Nef is a 205-residue myristolylated protein expressed at high levels in the early stages of
HIV infection. The protein is important for the induction of AIDS and is being actively
researched as a potential drug target. Unlike most HIV-1 and related proteins expressed
in bacteria, Nef is recovered from the soluble fraction of E. coli extracts. The purification
Purification of
Recombinant
Proteins
6.1.21
Current Protocols in Protein Science
Supplement 30
protocol adopted following cell breakage and low-speed centrifugation is fairly straightforward comprising two stages of ion-exchange chromatography using DEAE-Sepharose
(weak exchanger) followed by Q Sepharose (strong exchanger) and finally gel filtration
using Superdex 75. Characterization of the purified protein yielded the following information.
1. Nef has a maximum solubility of 0.5 to 0.6 mM (10 mg/ml) in low-ionic strength
buffers at pH 7.5 to 8.0, (e.g., 5 mM TrisCl). The solubility can be increased by the
inclusion of nondenaturing concentrations (2 M) of urea, as established by titration studies
monitored by far-UV circular dichroism. Acetonitrile (5% to 10%) also increases the
solubility of protein.
2. The protein contains three cysteines (positions 54, 141, and 205), none of which are
involved in native disulfide bond formation. The cysteines at positions 54 and 205 are
solvent-exposed.
3. Digestion of the purified protein with proteases indicated rapid digestion of the
N-terminal region (residues 1-38). For example, digestion was complete with a few
minutes using relatively low concentration of trypsin (1% w/w).
The above information was exploited to increase the robustness of the purification
protocol. Low solubility was a major issue during purification and this was improved by
including 4 M urea in the extraction buffer and 2 M urea in the two anion exchange column
buffers. For the final gel filtration step, 10% acetonitrile was included to help maintain
both the solubility of Nef and fortuitously cause aggregation of some E. coli contaminants
that eluted in the void volume. Neither the urea nor the acetonitrile at the concentrations
used resulted in Nef denaturation. The problem of cysteine oxidations was circumvented
by mutating cysteines 54 and 205 to alanines. Mutation of cysteine 205 alone and
including 5 mM DTT in all the column buffers was also a satisfactory solution. The high
susceptibly of the N-terminal region to proteolytic processing indicates that it is solventaccessible and likely to be unstructured. In the case of Nef, this region can be deleted
without affecting the folding of the protein and removes the potential for heterogeneity
due to partial processing by E. coli proteases. The NMR structure of HIV Nef was
determined with protein prepared as described above (Grzesiek et al., 1997).
MAP30
MAP30 is a plant protein obtained from bitter melon that has anti-HIV and anti-tumor
activities. The 30-kDa protein is well expressed in E. coli as a soluble protein and is
purified by two stages of exchange chromatography followed by gel filtration. The
clarified extract is first applied to a DEAE-Sepharose column at pH 8.0; the majority of
MAP30 does not bind or weakly binds the exchange resin. The column flow-through and
early eluting fractions are dialyzed against pH 6.5 buffer then fractionated using SPSepharose (strong cation exchanger). The final step is gel filtration using a Superdex 200
column at pH 8.0.
Purification of
Recombinant
E. coli Proteins
There are clear similarities between the MAP30 purification scheme and the one developed for the Nef protein; both utilize an initial clean-up step using DEAE-Sepharose
followed by a second more discriminating ion-exchange step and finally a polishing
step using gel filtration. For Nef, the second ion-exchange step employs an anion-exchange resin while the MAP30 method uses a cation-exchange resin. The choice of resin
for the second step reflects the difference in the isoelectric points of these proteins. Nef
has a calculated pI of 5.95 and is positively charged at pH values greater than this.
MAP30 has a slightly basic pI of 9.00 and is negatively charged at pH values below this.
Thus, Nef binds to DEAE-Sepharose and Q-Sepharose at pH 7.4 and 8.0, respectively.
6.1.22
Supplement 30
On the other hand, MAP30 does not bind to DEAE-Sepharose at pH 7.4 but binds strongly
to a cation exchanger at pH 6.5.
Apart from purification, there is also another similarity between Nef and MAP30, namely
susceptibility to proteolytic processing during purification. As previously mentioned, the
N-terminal region (residues 1-38) of Nef is at risk for proteolysis, and to maintain the
structural integrity, especially during cell breakage and the initial processing, protease
inhibitor cocktails must be included in the buffers. MAP30 also has a region susceptible
to processing, namely, the 20 residues at the C-terminal end of the protein. Again, this
is due to the fact that this region is largely unstructured in an otherwise folded and stable
molecule (Wang et al., 1999). When purifying MAP30, standard protease inhibitors are
included during the early stages of purification and, in addition, -macroglobulin (15 to
2.0 g/mg protein) is added to the protein prior to the gel-filtration step. The macroglobulin inhibits a wide range of proteases by a trapping mechanism (Sottrup-Jensen, 1989).
If proteins are to be used for structural studies, deletion mutants can eliminate unstructured regions at the N- and C- terminal regions. Deletions of such regions from either Nef
or MAP30 do not significantly change the pI of either protein, so the same purification
procedures can be applied to the deletion mutants. Although incremental structural
determination is an important strategy in structural biology, one should always be aware
that regions deleted, even those that appear unstructured, may have important functional
roles. There are many examples of disordered proteins and protein domains that adopt
folded structures upon binding to their biological targets (for a review, see Dyson and
Wright, 2001), and in the case of Nef, it appears that the apparently unstructured
N-terminal region (residues 1-57) mediates binding to the tumor suppressor protein p53,
possibly enhancing HIV-1 replication (Greenway et al., 2002). A dual vector co-expression system for producing heteromeric complexes in E. coli (Johnson et al., 2000) may
be particularly useful for producing proteins requiring binding partners for folding and
stability.
Insoluble proteins
HIV-1 gp41 ectodomain
The membrane-associated glycoproteins of HIV-1 include gp120 and gp41, the latter
mediating membrane fusion with the host cell. These viral envelope proteins have been
the subject of intense structural analysis over the last several years as inhibition of
membrane fusion, hence viral entry, is a potential drug target in the development of
therapeutics for AIDS. A basic strategy in tackling membrane-associated proteins is to
remove the membrane-spanning region by expressing the non-membrane-associated
region or ectodomain.
The gp41 ectodomain is a 150-residue protein that is recombinantly expressed in E. coli
as an insoluble protein. The protein can be extracted from inclusion bodies with 8 M
guanidineHCl and purified by one step of gel filtration in the presence of 4 M guanidineHCl. The guanidine is removed by preparative reversed-phase HPLC and the protein
folded upon dialysis against 50 mM sodium formate at pH 3.0. The yield of folded protein
is >90%. Characterization of the protein indicates that its solubility decreases dramatically below pH 4.0. Between pH 3.0 and 4.0, the protein has an all -helical secondary
structure with a trimeric subunit structure. The protein was demonstrated to have folded
by determining its full structure at pH 3.5 using multidimensional NMR (Caffrey et al.,
1998). The protein was also crystallized from a buffer at pH 3.5 and its structure
determined by X-ray crystallography (Yang et al, 1999).
Purification of
Recombinant
Proteins
6.1.23
Current Protocols in Protein Science
Supplement 30
Other insoluble proteins expressed in E. coli that exhibit acid stability similar to the gp41
ectodomain can be processed and folded using a similar scheme as described above. For
example, the HIV protease can be purified and folded with this method. The HIV protease,
after folding at pH 3.5, exhibits fair solubility up to pH 5.0, with solubility decreasing at
higher pH values. Other proteins may only be partially folded or unfolded at acidic pH
values; in these cases, the reversed-phase HPLC step could be used to simply remove the
denaturant, then the protein can be freeze dried from TFA-acetonitrile solvent and used
for folding trials.
The gp41 ectodomain contains two cysteine residues in a loop region connecting N- and
C-terminal helical domains. These cysteines do not form intramolecular disulfides and
can be substituted by alanine residues. This is a common theme. If a protein contains free
solvent-accessible cysteines that play no structural or functional role, it is often a good
idea to substitute them (usually with Ala), especially if structural studies are planned.
Human Tissue Inhibitor of Metalloprotease-2 (TIMP-2) and hepatocyte growth factor isoforms (NK1 and NK2)
The TIMP families of proteins are inhibitors of the matrix metalloproteases and are critical
effectors of extracellular matrix turnover. The hepatocyte growth factor (HGF) is a
multifunctional protein stimulating a wide range of cellular targets. The HGF gene codes
for three distinct proteins: the full-length form and two truncated isoforms that include
an N-terminal domain (N) and one-kringle (NK1) or two-kringle domains (NK2). TIMP-2
(21 kDa), NK1 (21 kDa), and NK2 (30 kDa) contain multiple disulfides that stabilize the
folded conformations. For example, TIMP-2, apart from having 12 cysteines that form 6
disulfides, contains a cysteine as the N-terminal residue. All three proteins were expressed
in E. coli as insoluble proteins, extracted with guanidineHCl and reductant, and the
unfolded protein separated by gel filtration in a similar manner to that previously
discussed. The partially purified proteins can be conveniently stored frozen in guanidineHCl at 80C for several years without deleterious effects on folding or recovery of
active protein. The folding and oxidation of the proteins are detailed in the respective
publications, Stahl et al. (1997) and Wingfield et al. (1999), but briefly, the protocols
involve equilibrium dialysis incorporating urea as a co-solvent to maintain solubility
during folding, and a glutathione-based oxido-shuffling system (redox buffer) to promote
formation of disulfide bonds (this approach is also detailed in Basic Protocol 1 in UNIT
6.5). The final stage of the purification process is gel filtration of the folded proteins, which,
apart from removing host contaminates, separates folded monomers from any misfolded
and aggregated protein.
Purification of
Recombinant
E. coli Proteins
When recombinant expressed proteins are insoluble in E. coli, the purification scheme
can be very simple as illustrated above where one or two steps of gel filtration may be all
that is required; the challenge is determining a method to fold and oxidize the protein. In
all three examples discussed above, the key to efficient folding is maintaining solubility,
whether it be by taking advantage of the acid stability of the protein and working at pH
3.5, or by including the co-solvent urea. As mentioned above, TIMP-2 has an N-terminal
cysteine residue. When this protein was originally expressed, an alanine was appended to
the N-terminus since it had been observed that partial N-terminal processing occurred
when cysteine was the terminal residue. The alanine residue was added in an effort to
produce homogeneous protein for structural studies. The purified Ala+ TIMP-2 appeared
monomeric and folded, yet was devoid of its normal inhibitory activity (Wingfield et al.,
1999). It was determined that the coordination of a zinc atom by the N-terminal cysteine
stabilized substrate binding and required a free amino terminal group. This was demonstrated by exopeptidase digestion (using aminopeptidase 1) of Ala+ TIMP-2, which
6.1.24
Supplement 30
removed the N-terminal alanine making cysteine the N-terminal residue and, thus,
restoring biological activity.
A GST fusion protein
The protein VP26 is a 12-kDa capsid protein of the herpes simplex virus and initial
attempts to directly express this protein in E. coli failed. It was possible, however, to
produce this protein at fairly high levels in E. coli as a GST fusion (Wingfield et al.,
1997a). The insoluble protein was treated in the usual manner: solubilized with guanidineHCl and partially purified by gel filtration also in guanidineHCl. The usual purification for GST fusion proteins is affinity chromatography using immobilized glutathione,
which requires that the GST moiety be folded (UNIT 6.6). Due to the low solubility of VP26
and its high propensity for aggregation, the following approach was used. First, the
VP26-GST fusion was folded from the guanidineHCl solution by equilibrium dialysis
against buffer containing 2.5 M urea, 10 mM CHAPS, and 0.25 M NaCl, and then against
the same buffer lacking the urea. The buffer additives were included to maintain protein
solubility (solubility is improved with >0.25 M NaCl, but the cleavage of the GST moiety
by thrombin is inhibited by high salt concentrations). Following cleavage of GST and
VP26, the proteins were denatured again with guanidineHCl, separated by gel filtration
and the purified VP26 refolded from urea and CHAPS as described above. As an aside,
the GST moiety is readily refolded from guanidineHCl and does not require high salt or
CHAPS to maintain solubility during the dialysis steps. The purification approach used
here may appear inelegant, but the fusion system was used not to facilitate purification,
but to facilitate expression of the protein.
PROTEIN HANDLING
Storing Purified Proteins
Purified protein should be filter-sterilized prior to storage. Millex-GV 0.22-m filters
(Millipore) employ hydrophilic membranes with low binding capacities and are recommended for most proteins. Proteins are best stored at 80C or may be stored on ice;
freezing at 20C is not recommended. Rapid freezing in small aliquots using dry
ice/ethanol mixtures is preferred to slow freezing at 20C . The addition of sucrose or
glycerol often increases protein stability during storage and during freezing and thawing
cycles (Arakawa and Timasheff, 1985; Timasheff and Arakawa, 1997). Lyophilization is
best for long-term storage; however, care should be taken in choosing the protein solvent
(Franks, 1993).
Promoting Protein Solubility and Stability
If the recombinant protein contains reactive unpaired sulfhydryl groups in the native
conformation, 1 to 5 mM DTT should be included in the column buffers during purification. However, reductant should not be used gratuitously, as the native protein may contain
intra- or intermolecular disulfide bonds, disruption of which can reduce the stability and
solubility of the protein. Reductants should be included, for example, during gel filtration
if dimers or higher aggregates need to be converted to active monomeric protein. The
presence of intermolecular (and occasionally intramolecular) disulfide bonds can be
determined analytically by SDS-PAGE under nonreducing conditions (UNIT 6.5) by pretreating proteins sequentially with iodoacetamide (to prevent artificial disulfide exchange) and then with SDS in the absence of reductant. The use of reductants can best be
rationalized once the native protein has been characterized.
Purification of
Recombinant
Proteins
6.1.25
Current Protocols in Protein Science
Supplement 30
EDTA (1 to 5 mM) is often included in buffers to remove heavy metals that can catalyze
oxidative processes and inhibit certain proteases. It should be noted that EDTA will bind
to anion exchange resins (Scopes, 1994).
Other components often added to buffers to promote protein solubility during purification
include nonionic or zwitterionic detergents, low concentrations of urea (1 to 2 M), and
salt (0.5 to 1 M NaCl). These additives are compatible with ion-exchange chromatography, except for high-salt concentrations, which are compatible with hydrophobic-interaction chromatography (UNIT 8.4), affinity chromatography (Chapter 9), and gel-filtration
chromatography (UNIT 8.3). Solvent pH is one of the most important variables for maintaining protein solubility; in general, proteins are least soluble at or near their isoelectric
points.
Preventing Contamination
Precautions to prevent contamination of the protein of interest are as follows:
1. To avoid cross-contamination, especially from other recombinant proteins, dedicate
one set of chromatography resins for the purification of each protein. If this is not possible,
or if expensive prepackaged matrices are used, be sure to clean resins thoroughly after
each use. Check the manufacturers recommendations and be aware of the chemical
stability of the resin, especially for extremes of pH.
2. Store resins with preservatives (e.g., 1 mM sodium azide) and avoid storage in
phosphate buffers, which provide a good medium for bacterial growth.
3. To generate reproducible protocols using ion-exchange methods, monitor the pH and
conductivity of all buffers and column effluents (the latter ideally in-line).
4. Avoid protein cross-contamination in concentration equipment such as stirred ultrafiltration cells with ultrafiltration membranes.
5. Keep pH and conductivity probes scrupulously clean, especially when used with
solutions containing proteases. Likewise, use care when using cuvettes for UV measurements.
6. Avoid vigorous stirring of protein solutions to prevent shear denaturation, and handle
soft agarose-based column matrices carefully to prevent bead fragmentation.
Removing Pyrogens
Recombinant proteins used for in vivo studies should be free of endotoxins (pyrogenic
lipopolysaccharide derived from the bacterial outer membrane of Gram-negative bacteria). Yeast and mammalian cell hosts do not contain endotoxins; however, exogenous
contamination from water and others must be avoided. Pyrogens can be detected using
the sensitive Limulus amoebocyte lysate (LAL) assay kits available from Sigma and other
suppliers. As endotoxins are negatively charged, they will be removed by anion-exchange
chromatography. Other methods are reviewed in detail by Petch and Anspach (2000).
SCALE OF OPERATIONS AND AIMS OF PURIFICATION
Determining Scale
Purification of
Recombinant
E. coli Proteins
The amount of protein required and the level of purity will vary dramatically from
laboratory to laboratory and study to study. The following guidelines will help in planning
a strategy.
6.1.26
Supplement 30
Purification of
Recombinant
Proteins
6.1.27
Current Protocols in Protein Science
Supplement 30
Purification of
Recombinant
E. coli Proteins
The rate-limiting step in structure determination using X-ray crystallography is production of crystals that diffract to high resolution (UNIT 17.4). The scientists involved in the
production and characterization of the protein are often best situated to crystallize the
protein. Furthermore, once crystallization conditions have been optimized, it can be quite
easy to interest structural groups in collaboration.
6.1.28
Supplement 30
Purification of
Recombinant
Proteins
6.1.29
Current Protocols in Protein Science
Supplement 30
Some investigators have reported problems with His-tagged proteins and it is generally
recommended to either remove this tag for structural work or use another tag.
Biophysical Studies
Low-resolution structural studies using various biophysical methodologies (Jones et al.,
1994) can be made with less material (<1 to 10 mg). Proteins for spectroscopic studies
should be >95% pure and previously fractionated on a gel-filtration column to remove
aggregated and possibly misfolded variants. The removal of aggregates is especially
important for spectroscopic studies including UV/vis, fluorescence and circular dichroism where excessive light scattering must be avoided (see Chapter 7; Colon, 1999).
Various labeling and tagging strategies can be used to aid both structural and functional
studies. The most common approach is to append affinity tags that can then be used to
immobilize the protein in a directed manner (Nilsson et al., 1997). This approach is
especially useful for studying protein interactions. Also, analogous to the in vivo protein
labeling scenarios as described above for selenomethionine, specific residues can be
modified. For example, tryptophan in recombinant proteins can be replaced by 5-hydroxytryptophan by using an E. coli Trp auxotroph. Protein thus labeled has a strong
absorbance at 310 nm that can be exploited in structure-function studies (Laue et al.,
1993).
SPECIALIZED EQUIPMENT
Breaking and Fractionating Cells
For small- to medium-scale work on a regular basis, a French press (Thermo Spectronic,
http://www.thermo.com) with a continuous-fill cell is recommended (UNIT 6.2). It is also
useful for breaking yeast cells. For large-scale work (>500 ml), the Manton-Gaulin-APV
homogenizer (APV Gaulin) is recommended. For further processing of cells and cell
lysates (e.g., UNITS 6.2 & 6.3), an ultrasonic homogenizer is required. An instrument with a
400-W (or higher) capacity is recommended (Branson, http://www.bransonultrasonics.com).
After low-speed centrifugation using standard preparative centrifuges (Beckman Coulter
Preparative Centrifuge, Avanti Series can be found at http://www.beckman.com), highspeed centrifugation is a convenient and rapid cleanup step before column chromatography (Fig. 6.1.3). With Beckman ultracentrifuges, the 45 Ti rotor is recommended. This
six-place rotor has a maximum speed of 235,000 g; with thick-walled polycarbonate
tubes, its capacity is 400 ml.
Chromatographing Proteins
Purification of
Recombinant
E. coli Proteins
Most chromatography is carried out at 4C either in a cold room or, more conveniently,
in a cold cabinet in the laboratory. The basic components of a chromatography system
are as follows: column, column matrix, pumps, a gradient-making device, UV/visible or
other detection system, and a fraction collector. These components can be bought as units
such as the AKTA Explorer or FPLC chromatograph systems (Amersham Bioscience,
http://www.apbiotech.com), which can be used for laboratory-scale to large-scale work.
Systems can also be custom assembled from individual components from Amersham and
other vendors. Column matrices can be purchased prepacked or as bulk media that are
packed in columns by the user. Ion-exchange separations, using standard low- to mediumpressure resins (agarose/dextran/cellulose-based), require at least one narrow (2.5-cm)
and one wide (5.0-cm) column with adjustable flow adapters so that the resin height can
be varied between 5 and 30 cm. Gel filtration requires columns with diameters of 1.25
6.1.30
Supplement 30
and 2.5 cm (5 cm for larger-scale work) and lengths of 60 to 100 cm. Simple gradient
makers with capacities of 150 ml to 2 liters are generally available.
Concentrating Proteins
Stirred ultrafiltration cells are recommended for laboratory-scale work. The cells range
in size from 3 ml to 2 liters and are used in conjunction with variable molecular weight
cutoff membranes (Millipore, http://www.millipore.com). For larger volumes, Millipore
also sells various systems. For smaller volumes (0.5 to 15 ml), centrifugational concentrators are available (Millipore and others). For a review of the equipment used for protein
concentration, see Harris (1989).
Making Analytical Measurements
A protein purification laboratory should have a dependable scanning UV/visible spectrophotometer, ideally an instrument with computerized data collection and analysis.
Hewlett Packard (Agilent) instruments with diode array detectors are recommended for
most routine work (http://www.chem.agilent.com). For laboratories specializing in purifying recombinant proteins from E. coli, access to a spectropolarimeter (e.g., Jasco J-810,
http://www.jascoinc.com) will be helpful for monitoring and developing folding protocols. For rapid chemical characterization and identity check of proteins, access to a mass
spectrometer is also desired (Chapter 16).
Most of the companies mentioned above have excellent Web sites where technical
information is posted. The series of handbooks on chromatographic separations published
by Amersham Biosciences can be conveniently downloaded as pdf files.
Literature Cited
Allet, B., Payton, M., Mattaliano, R.J., Gronenborn, A.M., Clore, G.M., and Wingfield, P.T. 1988. Purification
and characterization of the DNA-binding protein Ner of bacteriophage Mu. Gene 65:259-268.
Arakawa, T. and Timasheff, S.N. 1985. Theory of protein solubility. Methods Enzymol. 114:49-77.
Armstrong, N., De Lencastre, A., and Gouaux, E. 1999. A new protein folding screen: Application to the
ligand binding domain of a glutamate and kainite receptor and to a lysozyme and carbonic anhydrase.
Protein Sci. 8:1475-1483.
Asenjo, J.A. and Patrick, I. 1990. Large-scale protein purification. In Protein Purification Applications: A
Practical Approach (E.L.V. Harris and S. Angal, eds.) pp. 1-27. IRL Press, Oxford.
Baneyx, F. 1999. Recombinant protein expression in Escherichia coli. Curr. Opin. Biotechnol. 10:411-421.
Beacham, I.R. 1979. Periplasmic enzymes in Gram-negative bacteria. Int. Biochem. 10:877-883.
Ben-Bassat, A., Bauer, K., Chang, S.-Y., Myambo, K., Boosman, A., and Chang, S. 1987. Processing of the
initiation methionine from proteins: Properties of the E. coli methionine aminopeptidase and its gene
structure. J. Bacteriol. 169:751-757.
Bessette, P.H., Aslund, F., Beckwick, J., and Georgiou, G. 1999. Efficient folding of proteins with multiple
disulfide bonds in the Escherichia coli cytoplasm. Proc. Natl. Acad. Sci. U.S.A. 96:13703-13708.
Bibi, E. and Beja, O. 1994. Membrane topology of multidrug resistant protein expressed in E. coli. J. Biol.
Chem. 31:19910-19915.
Bowden, G.A., Paredes, A.M., and Georgiou, G. 1991. Structure and morphology of protein inclusion bodies
in E. coli. Biotechnology 9:725-730.
Braun, P., Gerritse, G., van Dijl, J.-M., and Quax, W.J. 1999. Improving protein secretion by engineering
components of the bacterial translocation machinery. Curr. Opin. Biotechnol. 10:376-381.
Buchner, J. and Rudolph, R. 1992. Renaturation and characterization of recombinant Fab fragments produced
in Escherichia coli. Biotechnology 9:157-162.
Burgess, R.R. and Jendrisak, J.J. 1975. A procedure for the rapid, large-scale purification of E. coli
DNA-dependent RNA polymerase involving polymin P precipitation and DNA-cellulose chromatography. J. Biol. Chem. 14:4634-4638.
Purification of
Recombinant
Proteins
6.1.31
Current Protocols in Protein Science
Supplement 30
Burgess, R.R. and Knuth, M.W. 1996. Purification of a recombinant protein overproduced in Escherichia
coli. In Strategies for Protein Purification and Characterization: A Laboratory Course Manual (D.R.
Marshak, J.T. Kadonaga, R.R. Burgess, M.W. Knuth, W.A. Brennan, and S.-H. Lin, eds.) pp. 205-217 and
245-262. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
Caffrey, M., Cai, M., Kaufman, J., Stahl, S., Wingfield, P.T., Covell, D.G., Gronenborn, A.M., and Clore,
G.M. 1998. Three dimensional solution structure of the 44 kDa ectodomain of SIV gp41. EMBO J.
17:4572-4584.
Campbell, I. and Downing, A.K. 1998. NMR of modular proteins. Nat. Struct. Biol. 5:496-499.
Cereghino, G.P.L. and Clegg, J.M. 1999. Applications of yeast in biotechnology: Protein production and
genetic analysis. Curr. Opin. Biotechnol. 10:422-427.
Cheng, Y.-S.E., McGowan, M.H., Kettner, C.A., Schloss, J.V., Erickson-Viitanen, S., and Yin, F.H. 1990. High
synthesis of recombinant HIV-1 protease and the recovery of active enzyme from inclusion bodies. Gene
87:243-248.
Cleary, S., Mulkerrin, M.G., and Kelley, R.F. 1989. Purification and characterization of tissue plasminogen
activator kringle-2 domain expressed in E. coli. J. Biol. Chem. 28:1884-1891.
Cleland, J.L., Builder, S.E., Swartz, J.R., Winkler,
M., Chang, J.Y., and Wang, D.I.C. 1992. Polyethylene glycol enhanced protein refolding.
Biotechnology 10:1013-1019.
Clore, G.M. and Gronenborn, A.M. 1994. Multidimensional heteronuclear nuclear magnetic resonance of
proteins. Methods Enzymol. 239:249-363.
Cole, P.A. 1996. Chaperone-assisted protein expression. Structure 4:239-242.
Colon, W. 1999. Analysis of protein structure by solution optical spectroscopy. Methods Enzymol. 309:605632.
Cornelis, P. 2000. Expressing genes in different Escherichia coli compartments. Curr. Opin. Biotechnol.
11:450-454.
Creighton, T.E. 1984. Disulfide bond formation in proteins. Methods Enzymol. 107:305-329.
Creighton, T.E. 1993. Proteins: Structures and Molecular Properties, 2nd ed. pp. 292-296. Freeman, New
York.
Dale, G.E., Broger, C., Langen, H., DArcy, A., and Struber, D. 1994. Improving protein stability through
rationally designed amino acid replacements: Solubilization of the trimethoprim-resistant type S1 dihydrofolate reductase. Protein Eng. 7:933-939.
Danley, D.E., Strick, C.A., James, L.C., Lanzetti, A.J., Otterness, I.G., Grenett, H.E., and Fuller, G.M. 1991.
Identification and characterization of a C-terminally extended form of recombinant murine IL-6. FEBS
Lett. 283:135-139.
Darby, N.J. and Creighton, T.E. 1990. Folding proteins. Nature 344:715-716.
De Bernardez Clark, E. 2001 Protein folding for industrial processes. Curr. Opin. Biotechnol. 12:202-207.
De Bernardez Clark, E., Schwartz, E., and Rudolph, R. 1999. Inhibition of aggregation side reactions during
in-vitro protein folding. Methods Enzymol. 309:217-236.
deVos, A.M., Ultsch, M., and Kossiakoff, A.A. 1992. Human growth hormone and extracellular domain of
its receptor: Crystal structure of the complex. Science 255:306-312.
Diederichs, K., Boone, T., and Karplus, A. 1991. Novel fold and putative receptor binding site of granulocyte-macrophage colony stimulating factor. Science 254:1779-1782.
DiRienzo, J.M., Nakamura, K., and Inouye, M. 1978. The outer membrane proteins of Gram-negative
bacteria: Biosynthesis, assembly and function. Annu. Rev. Biochem. 47:481-532.
Dyson, H.J. and Wright, P.E. 2001. Coupling of folding and binding for unstructured proteins. Curr. Opin.
Struct. Biol. 12:54-60.
Ealick, S.E., Cook, W.J., Vijay-Kumar, S., Carson, M., Nagabhushan, T.L., Trotta, P.P., and Bugg, C.E. 1991.
Three-dimensional structure of recombinant human interferon-gamma. Science 252:698-702.
Eisenberg, D. 1999. How chaperones protect virgin proteins. Science 285:1021-1022.
Ellis, R.J. 1994. Role of chaperones in protein folding. Curr. Opin. Struct. Biol. 4:117-122.
Ellis, R.J. 2001. Macromolecular crowding: An important but neglected aspect of the intracellular environment. Curr. Opin. Struct. Biol. 11:114-119.
Ellis, R.J. and Hart, F.U. 1999. Principles of protein folding in the cellular environment. Curr. Opin. Struct.
Biol. 9:102-110.
Purification of
Recombinant
E. coli Proteins
Feldman, D.E. and Frydman, J. 2000. Protein folding in vivo: The importance of molecular chaperones. Curr.
Opin. Struct. Biol. 1026-33.
6.1.32
Supplement 30
Fersh, A. 1999. Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein
Folding. W.H. Freeman and Company. New York.
Franks, F. 1993. Storage stabilization of proteins. In Protein Biotechnology (F. Franks, ed.) pp. 486-531.
Humana Press, Totowa, N.J.
Georgiou, G. and Valax, P. 1999. Isolating inclusion bodies from bacteria. Methods Enzymol. 309:48-58.
Gilbert, H.F. 1995. Thiol/disulfide exchange equilibria and disulfide bond stability. Methods Enzymol.
251:8-28.
Goenka, S. and Rao, C.M. 2001. Expression of recombinant -crystallin in Escherichia coli with the help of
GroEL/ES and its purification. Protein Expr. Purif. 21:260-267.
Goff, S.A. and Goldberg, M.E. 1985. Production of abnormal proteins in E. coli stimulates transcription of
lon and other heat shock genes. Cell 41:587-595.
Goldberg, M.E. 1991. Investigating protein conformation dynamics and folding with monoclonal antibodies.
Trends Biochem. Sci. 16:358-362.
Goldberg, M.E., Rudolph, R., and Jaenicke, R. 1991. A kinetic study of the competition between renaturation
and aggregation during the refolding of denatured-reduced egg white lysozyme. Biochemistry 30:27902797.
Goto, N.K. and Kay, L.E. 2000. New developments in isotope labeling strategies for protein solution NMR
spectroscopy. Curr. Opin. Struct. Biol. 10:585-592.
Greenway, A.L., McPhee, D.A., Allen, K., Johnson, R., Holloway, G., Mills, J., Azad, A., Sankovich, S., and
Lambert, P. 2002. Human immunodeficiency virus type 1 Nef binds to tumor suppressor p53 and protects
cells against p53-mediated apoptosis. J. Virol. 76:2692-2702.
Grunfeld, H., Patel, A., Shatzman, A., and Nishikawa, A.H. 1992. Effector-assisted refolding of recombinant
tissue-plasminogen activator produced in Escherichia coli. Appl. Biochem. Biotechnol. 33:117-138.
Grzesiek, S., Bax, A., Hu, J.-S., Kaufman, J.D., Palmer, I., Stahl, S.J., Tjandra, N., and Wingfield, P.T. 1997.
Refined solution structure and backbone dynamics of HIV-1 Nef. Protein Science 6:1248-1263.
Grzesiek, S., Stahl, S.J., Wingfield, P.T., and Bax, A. 1996. The CD4 determinant for downregulation by
HIV-1 Nef directly binds to Nef: Mapping of the Nef binding surface by NMR. Biochemistry 35:1025610261.
Guisz, Y., Fache, I., Campfield, L.A., Smith, F.J., Farid, A., Plaetinck, G., Van der Heydon, J., Tavernier, J.,
Fiers, W., Burns, P., and Devos, R. 1998. Efficient secretion of biological active recombinant OB protein
(leptin) in Escherichia coli, purification from the periplasm and characterization. Protein Expr. Purif.
12:249-258.
Gulnik, S.V., Afonina, E.I., Gustchina, E., Yu, B., Silva, A.M., Kim, Y., and Erickson, J.W. 2001. Utility of
(His)6 Tag for purification and refolding of proplasmepsin-2 and mutants with altered activation properties.
Protein Expr. Purif. 24:412-419.
Hammarstrom, M., Hellgren, N., Van Den Berg, S., Berglund, H., and Hard, T. 2002. Rapid screening for
improved solubility of small human proteins produced as fusion proteins in Escherichia coli. Protein Sci.
11:313-321.
Harris, E.L. 1989. Concentration of the extract. In Protein Purification Methods: A Practical Approach (E.L.V.
Harris and S. Angal, eds.) pp. 125-172. IRL Press, Oxford.
Helenius, A. 1994. How N-linked oligosaccharides affect glycoprotein folding in the endoplasmic reticulum.
Mol. Biol. Cell. 5:253-265.
Hendrickson, W.A., Horton, J.R., and LeMaster, D.M. 1990. Selenenomethionyl proteins for analysis by
multiwavelengh anomalous diffraction (MAD): A vehicle for direct determination of three-dimensional
structure. EMBO J. 9:1665-1672.
Heppel, L.A. 1967. Selective release of enzymes from bacteria. Science 156:1451-1455.
Hoffman, A., Tai, M., Wong, W., and Glabe, C.G. 1995. A sparse matrix screen to establish initial conditions
for protein renaturation. Anal. Biochem. 230:8-15.
Holland, I.B., Kenny, B., Steipe, B., and Pluckthun, A. 1990. Secretion of proteins in E. coli. Methods
Enzymol. 182:132-143.
Hopkins, T.R. 1991. Physical and chemical cell disruption for the recovery of intracellular proteins. In
Purification and Analysis of Recombinant Proteins (R. Seetharam and S.K. Sharma, eds.) pp. 57-83.
Marcel Dekker, New York.
Hwang, D.D.W., Liu, L.-F., Kuan, I.-C., Lin, L.-Y., Tam, T.-C.S., and Tam, M.F. 1999. Co-expression of
glutathione S-transferase with methionine aminopeptidase: A system of producing enriched N-terminal
processed proteins in E.coli. Biochem J. 338:335-342.
Jaenicke, R. 1993. Role of accessory proteins in protein folding. Curr. Opin. Struct. Biol. 3:104-112.
Purification of
Recombinant
Proteins
6.1.33
Current Protocols in Protein Science
Supplement 30
Janson, J.-C. and Ryden, L. 1989. Protein Purification: Principles, High Resolution Methods, and Applications. VCH Publishers, New York.
Johnson, B.H. and Hecht, M.H. 1994. Recombinant proteins can be isolated from E. coli by repeated cycles
of freezing and thawing. Biotechnology 12:1357-1360.
Johnson, K., Clements, A., Venkataramani, R.N., Trievel, R.C., and Marmorstein, R. 2000. Coexpression of
proteins in bacteria using a T7-based expression plasmid: Expression of heteromeric cell cycle and
transcriptional regulatory complexes. Protein Expr. Purif. 20:435-443.
Jones, C., Mulloy, B., and Thomas, A.H. 1994. Microscopy, optical spectroscopy, and macroscopic techniques. Methods Mol. Biol. 22:1-245.
Jones, D.H., Ball, E.H., Sharpe, S., Barber, K.R., and Grant, C.W.M. 2000. Expression and membrane
assembly of a transmembrane region from Neu. Biochemistry 39:1878-1878.
Kaback, H.R. 1971. Bacterial membranes. Methods Enzymol. 22:99-120.
Kamireddi, M., Eisenstein, E., and Reddy, P. 1997. Stable expression and rapid purification of Escherichia
coli GroEL and GroES chaperones. Protein Expr. Purif. 11:47-52.
Kelley, W.S. and Stump, K.H. 1979. A rapid procedure for isolation of large quantities of E. coli DNA
polymerase 1 utilizing a polA transducing phage. J. Biol. Chem. 254:3206-3210.
Kern, G., Kern, D., Jaenicke, R., and Seckler, R.L. 1993. Kinetics of folding and association of differentially
glycosylated variants of invertase from Saccharomyces cerevisiae. Protein Sci. 2:1862-1868.
Kholod, N. and Mustelin, T. 2001. Novel vectors for co-expression of two proteins in E.coli. Biotechniques
31:322-328.
Kiefhaber, T., Rudolph, R., Kohler, H.-H., and Buchner, J. 1991. Protein aggregation in vitro and in vivo: A
quantitative model of the kinetic competition between folding and aggregation. Biotechnology 9:825-829.
Kohno, T., Carmichael, D.F., Sommer, A., and Thompson, R.C. 1990. Refolding of recombinant proteins.
Methods Enzymol. 185:187-195.
Kost, T.A. and Condreay, J.P. 1999. Recombinant baculovirus as expression vectors for insect and mammalian
cells. Curr. Opin. Biotechnol. 10:428-433.
Laue, T.M., Senear, D.F., Eaton, S., and Ross, J.B.A. 1993. 5-Hydroxytryptophan as a new intrinsic probe
for investigating protein-DNA interactions by analytical ultracentrifugation. Study of the effect of DNA
on the self-assembly of the bacteriophage cI repressor. J. Biol. Chem. 32:2469-2472.
LaVallie, E.R., DiBlasio, E.A., Kovacic, S., Grant, K.L., Schendel, P.F., and McCoy, J.M. 1993. A thioredoxin
gene fusion expression system that circumvents inclusion body formation in the E. coli cytoplasm.
Biotechnology 11:187-193.
Lilie, H., Schwartz, E. and Rudolph, R. 1998 Advances in refolding of proteins produced in E.coli. Curr.
Opin. Biotechnol. 9:497-501.
Lindwall, G., Chau, M.-F., Gardner, S.R., and Kohlstaedt, L.A. 2000. A sparse matrix approach to the
solubilization of overexpression proteins. Protein Eng. 13:67-71.
London, J., Skrzynia, C., and Goldberg, M.E. 1974. Renaturation of Escherichia coli tryptophanase after
exposure to 8 M urea. Eur. J. Biochem. 47:409-415.
Lu, H.S., Fausset, P.R., Sotos, L.S., Clogston, C.L., Rohde, M.F., Stoney, K.S., and Herman, A.C. 1993.
Isolation and characterization of three recombinant human granulocyte colony stimulating factor His to
Gln isoforms produced in E. coli. Protein Expr. Purif. 4:465-472.
Markrides, S. 1996. Strategies for achieving high-level expression of genes in Escherichia coli. Microbiol.l
Rev. 60:512-538.
Marston, F.A.O. and Hartley, D.L. 1990. Solubilization of protein aggregates. Methods Enzymol. 182:264276.
Matthew, J.B., Friend, S.H., Botelho, L.D., Lehman, L.D., Hanania, G.I., and Gurd, F.R.H. 1978. Discrete
charge calculations of potentiometric titrations for globular proteins. Biochem. Biophys. Res. Commun.
81:416-421.
Maurizi, M.R. 1992. Proteases and protein degradation in Escherichia coli. Experientia 48:178-201.
Milburn, M.V., Hassel, A.M., Lambert, M.H., Jordon, S.R., Proudfoot, A.E.I., Graber, P., and Wells, T.N.C.
1993. A novel dimer configuration revealed by the crystal structure at 2.4 angstrom resolution of human
interleukin-5. Nature 363:172-176.
Mildner, A.M., Rothrock, D.J., Leone, J.W., Bannow, C.A., Lull, J.M., Reardon, I.M., Sarcich, J.L., Howe,
W.J., Tomich, C.-S.C., Smith, C.W., Heinrikson, R.L., and Tomasselli, A.G. 1994. The HIV-1 protease as
enzyme and substrate: Mutagenesis of autolysis sites and generation of a stable mutant with retained
kinetic properties. Biochemistry 33:9405-9413.
Purification of
Recombinant
E. coli Proteins
6.1.34
Supplement 30
Miller, C.G., Strauch, K.L., Kurral, A.M., Miller, J.L., Wingfield, P.T., Mazzei, G.J., Werlen, R.C., Graber,
P., and Movva, N.R. 1987. N-Terminal methionine-specific peptidase in Salmonella typhimurium. Proc.
Natl. Acad. Sci. U.S.A. 84:2718-2772.
Miller, H.I., Henzel, W.J., Ridgway, J.B., Kuang, W.-J., Chrisholm, V., and Liu, C.-C. 1989. Cloning and
expression of a yeast ubiquitin-protein cleaving activity in E. coli. Biotechnology 7:698-704.
Missiakas, D. and Raina, S. 1997. Protein folding in the bacterial periplasm. J. Bacteriol. 179:2465-2471.
Murby, M., Uhlen, M., and Stahl, S. 1996. Upstream strategies to minimize proteolytic degradation upon
recombinant protein in Escherichia coli. Protein Expr. Purif. 7:129-136.
Nagata, K., Kikuchi, N., Ohara, O., Teraoka, H., Yoshida, N., and Kawade, Y. 1986. Purification and
characterization of recombinant murine immune interferon. FEBS Lett. 205:200-204.
Nash, H.A., Robertson, C.A., Flamm, E., Weisberg, R.A., and Miller, H. 1987. Overproduction of Escherichia
coli integration host factor, a protein with noidentical subunits. J. Bacteriol. 169:4124.
Neidhardt, F.C. 1987. Chemical composition of Escherichia coli. In Escherichia coli and Salmonella
typhimurium: Cellular and Molecular Biology (F.C. Neidhardt, ed.) pp. 3-6. American Society for
Microbiology, Washington, D.C.
Nguyen, L.H., Jenson, D.B., and Burgess, R.R. 1993. Overproduction and purification of sigma-32, the
Escherichia coli heat shock transcription factor. Protein Expr. Purif. 4:425-433.
Nilsson, J., Stahl, S., Lundeberg, J., Uhlen, M., and Nygren, P.-A. 1997. Affinity fusion strategies for detection,
purification, and immobilization of recombinant proteins. Protein Expr. Purif. 11:1-16.
Orsini, G. and Goldberg, M.E. 1978. The renaturation of reduced chymotrypsin A in guanidineHCl. J. Biol.
Chem. 253:3453-3458.
Pace, N.C, Vajdos, F., Fee, L., Grimsley, G., and Gray, T. 1995. How to measure and predict the molar
absorption coefficient of a protein Protein Sci. 4:2411-2423.
Patel, D. 1993. Chromatographic fractionation media. In Biochemistry Labfax (J.A.A. Chambers and D.
Rickwood, eds.) pp. 49-68. BIOS Scientific Publishers and Academic Press, Oxford.
Petsch, D. and Anspach, F.B. 2000. Endotoxin removal from protein solutions. J. Biotechnol. 76:97-119.
Janson, J.-C. and Ryden, L. 1998. Protein purification: Principles, high resolution methods, and applications
(2nd ed.). Wiley-LISS, New York.
Puri, N.K., Crivelli E., Cardamome, M., Fiddes, R., Bertoloini, J., Ninham, B., and Brandon, M.R. 1992.
Solubilization of growth hormone and other recombinant proteins from Escherichia coli by using a
cationic surfactant. Biochem. J. 285:871-879.
Rasmussen, J.R. 1992. Effect of glycosylation on protein function. Curr. Opin. Struct. Biol. 2:682-686.
Ren, Z. and Schaefer, T.S. 2001. Isopropyl-b-D-thiogalactosidase (IPTG)-inducible tyrosine phosphorylation
of protein in E.coli. Biotechniques 31:1254-1258.
Rudolph, R., Bohm, G., Lilie, H., and Jaenicke, R. 1997. Folding proteins. In Protein Function: A Practical
Approach. Second Edition. (T.E. Creighton, ed.) pp. 57-99. IRL Press, Oxford.
Schein, C.H. 1989. Production of soluble recombinant proteins in bacteria. Biotechnology 7:1141-1147.
Schiene, C. and Fisher, G. 2000. Enzymes that catalyze the restructuring of proteins. Curr. Opin. Struct. Biol.
10:40-45.
Schmid, F.X. 1997. Optical spectroscopy to characterize protein conformation and conformational changes.
In Protein Structure: A Practical Approach. Second Edition. (T.E. Creighton, ed.) pp. 261-296. IRL Press,
Oxford.
Schneider, C., Newman, R.A., Sutherland, D.R., Asser, U., and Greaves, M.F. 1982. A one step purification
of membrane proteins using a high affinity immunomatrix. J. Biol. Chem. 257:10766-10769.
Scopes, R.K. 1994. Protein Purification: Principles and Practice, 3rd ed. Springer-Verlag, New York and
Heidelberg.
Sherman, F., Stewart, J.W., and Tsunasawa, S. 1985. Methionine or not methionine at the beginning of a
protein. Bioessays 3:27-31.
Sherman, P.A. and Fyfe, J.A. 1990. Human immunodeficiency virus integration protein expressed in E. coli
possesses selective DNA cleaving activity. Proc. Natl. Acad. Sci. U.S.A. 87:5119-5123.
Shire, S.J., Bock, L., Ogez, J., Builder, S., Kleid, D., and Moore, D.M. 1984. Purification and immunogenicity
of fusion VP1 protein of foot and mouth disease virus. Biochemistry 23:6474-6480.
Skerra, A., Pfitzinger, I., and Pluckthun, A. 1991. The functional expression of antibody Fv fragments in
Escherichia coli: Improved vectors and a generally applicable purification technique. Biotechnology
9:273-278.
Sofer, G. and Hagel, L. 1997. Handbook of process chromatography: A guide to optimization, scale-up, and
validation. Academic Press, San Diego, Calif.
Purification of
Recombinant
Proteins
6.1.35
Current Protocols in Protein Science
Supplement 30
6.1.36
Supplement 30
Wingfield, P.T., Stahl, S.J., Payton, M.A., Venkatesan, S., Misra, M., and Steven, A. 1990. HIV-1 Rev
expressed in recombinant Escherichia coli: Purification polymerization and conformational properties.
Biochemistry 30:7527-7534.
Wingfield, P.T., Stahl, S.J., Williams, R.W., and Steven, A.C. 1995. Hepatitis core antigen produced in E. coli:
Conformational analysis, and in vitro assembly. Biochemistry 34:4919-4932.
Wingfield, P.T., Stahl, S.J., Kaufman, J., Zlotnick, A., Hyde, C.C., Gronenborn, A.M., and Clore G.M. 1997.
The extracellular domain of immunodeficiency virus gp41 protein: Expression in Escherichia coli,
purification and crystallization. Protein Sci. 6:1653-1660.
Wingfield, P.T., Stahl, S.J., Thomsen, D.R., Homa, F.L., Booy, F.P., Trus, B.L., and Steven, A.C. 1997a.
Hexon-only binding of VP26 reflects differences between the hexon and penton conformations of the
VP5, the major capsid protein of Herpes Simplex Virus. J. Virology 71:8955-8961.
Wingfield, P.T., Stahl, S.J., Kaufman, J., Palmer, I., Chung, V., Sax, J.K., Kleiner, D.E., and Stetler-Stevenson,
G.W. 1999. Functional and biophysical characterization of full length, recombinant human TIMP-2
produced in Escherichia coli: Comparison of wild type and N-terminal alanine substituted variant. J. Biol.
Chem. 274:21362-21368.
Wurm, F. and Bernard, A. 1999. Largescale transient expression in mammalian cells for recombinant protein
production. Curr. Opin. Biotechnol. 10:156-159.
Wyss, D.F. and Wagner, G. 1996. The structure of sugars in glycoproteins. Curr. Opin. Struct. Biol. 7:409-416.
Yamazaki, T., Hinck, A.P., Wang, Y-X., Nicholson, L.K., Torchia, D.A., Wingfield, P.T., Stahl, S.J., Kaufman,
J.D., Chang, C.-H., Domaille, P.J., and Lam, P.Y.S. 1996. Three dimensional solution structure of the
HIV-1 protease complexed with DMP 323, a novel cyclic urea-type inhibitor, determined by nuclear
magnetic resonance spectroscopy. Protein Sci. 5:495-506.
Yang, Z.-N., Mueser, T.C., Kaufman, J., Stahl, S.J., Wingfield, P.T., and Hyde, C. 1999. The structure of the
SIV gp41 ectodomain at 1.47 A. J. Struct. Biol. 126:133-144.
Yarranton, G.T. and Mountain, A. 1992. Expression of proteins in prokaryotic systemsPrinciples and case
studies. In Protein Engineering: A Practical Approach (A.R. Rees, M.J.E. Sternberg, and R. Wetzel, eds.)
pp. 303-324. IRL Press, Oxford.
Zardeneta, G. and Horowitz, P.M. 1994. Detergent, liposome, and micelle-assisted protein refolding. Anal.
Biochem. 223:1-6.
Zhang, Y., Olsen, D.R., Nguyen, K.B., Olson, P.S., Rhodes, E.T., and Mascarenhas, D. 1998. Expressoin of
eukaryotic proteins in soluble form in Escherichia coli. Protein Expr. Purif. 12:159-165.
Purification of
Recombinant
Proteins
6.1.37
Current Protocols in Protein Science
Supplement 30
UNIT 6.2
Once a suitable protein expression system involving Escherichia coli is developed and
optimized (UNITS 5.1 & 5.2), large-scale production of recombinant proteins (UNIT 5.3)
generates large quantities of culture material from which the protein of interest must be
purified. Harvesting (UNIT 5.3) produces cell concentrate or culture medium, depending on
the subcellular localization of the protein. Cell paste is the starting material for purification of proteins expressed in soluble form inside cells, such as interleukin 1 (IL-1).
Human IL-1 is a 153-residue (17.4-kDa) protein cytokine of biomedical importance that
plays a central role in immune and inflammatory responses. Purification of human IL-1
is used as an example of the preparation of soluble proteins from E. coli.
Bacteria containing IL-1 are lysed, and the resulting supernatant is clarified to remove
ribosomes and other particulate matter. The sample is then applied to an anion-exchange
column to separate recombinant IL-1 from cellular contaminants, such as E. coli
proteins, nucleic acids, and lipopolysaccharides. The sample is further purified through
salt precipitation and cation-exchange chromatography, then concentrated. Finally, the
IL-1 protein is applied to a gel-filtration column to separate it from remaining higherand lower-molecular-weight contaminants, the purified protein is stored frozen or is
lyophilized.
The purification protocol described is typical for a protein that is expressed in fairly high
abundance (i.e., >5% total protein) and accumulates in a soluble state. With these
expression levels, only about a 20-fold overall purification is required to obtain pure
protein. Therefore, conventional chromatographic methods can be used, and normally
only three or four purification stages are required (see Table 6.2.1 for an outline of the
procedure as applied to IL-1, including time considerations). The process can be
shortened somewhat through the use of the Pharmacia Biotech BioPilot or FPLC systems
(see Time Considerations).
SDS-PAGE (UNIT 10.1) is used to monitor column fractions for the presence of IL-1, which
is detected as a stained band (UNIT 10.5) at the expected 17.4-kDa location. This is common
practice in recombinant protein purification, as the specific assays for many proteins,
including IL-1, are relatively complex. The original expression and fermentation experiments involving IL-1 (Wingfield et al., 1986) followed procedures similar to those given
in UNIT 5.3.
CAUTION: Avoid direct contact with IL-1-containing solutions. Trace material from
aerosols or from hand contact can cause severe inflammation of the eyes. Safety glasses
and gloves should be worn.
PURIFICATION OF A PROTEIN EXPRESSED IN ESCHERICHIA COLI IN A
SOLUBLE STATE: INTERLEUKIN 1
Materials
DEAE Sepharose CL-4B resin (Pharmacia Biotech)
Anion-exchange buffer (see recipe)
0.26% (w/v) sodium hypochlorite/70% ethanol or 5% (v/v) bleach (e.g.,
Clorox)/70% ethanol
E. coli cells (50 g wet weight) from fermentation (UNIT 5.3) containing IL-1
Lysis buffer (see recipe)
Contributed by Paul T. Wingfield
Current Protocols in Protein Science (1995) 6.2.1-6.2.15
Copyright 1995 by John Wiley & Sons, Inc.
BASIC
PROTOCOL
Purification of
Recombinant
Proteins
6.2.1
CPPS
Table 6.2.1
Day
CM Sepharose chromatography
(steps 18-22)
SDS-PAGE of CM Sepharose fractions
(step 22)
aStep numbers in parentheses refer to the Basic Protocol. Chromatography materials as well as the BioPilot and FPLC
6.2.2
Current Protocols in Protein Science
2. Suspend the washed resin in anion-exchange buffer to 75% settled gel/25% buffer by
volume, per manufacturers recommendations. Degas in a filter flask and pour into a
5 50cm chromatography column fitted with a filling reservoir.
After settling, the height of the resin should be 20 to 25 cm (390 to 490 ml packed resin).
For details on packing columns, see UNIT 8.4. Because the solubility of gases decreases with
increases in temperature, it is usual practice to pack the column at room temperature and
then run it in a cold room or cold box.
3. Elute column with anion-exchange buffer at 100 to 150 ml/hr using a peristaltic pump.
Make sure there is no compression of column contents. Monitor the absorbance of
the effluent at 260 or 280 nm with a UV detector. Collect 15-ml fractions in 16
50mm culture tubes using a fraction collector. Check that the pH and conductivity
of the column effluent are the same as the for anion-exchange buffer applied to the
column (this indicates that the column matrix is correctly equilibrated).
The bed height should not change significantly once the column is packed. Compression
indicates that the pressure applied to the column is too high (see manufacturers recommendations for maximum flow rates).
Purification of
Recombinant
Proteins
6.2.3
Current Protocols in Protein Science
5. Assemble the French pressure cell and and chill to 4C either by incubation in ice
or by refrigeration. Install the cell (first dried with paper towels if necessary) in the
Aminco laboratory press.
It is important to cool the equipment because pressurizing will generate heat. The 20K
rapid-fill French pressure cell (1-in.-diameter piston) has a capacity of 40 ml and can be
continuously filled while installed on the press. Before using the pressure cell, replace the
nylon ball at the end of the flow valve assembly or, at the very least, check it for distortion.
For small-scale work, a miniature French pressure cell (3/8-in.-diameter piston) with a
3.7-ml capacity is available.
6. Suspend thawed E. coli cells (50 g wet weight) with 150 ml lysis buffer using a
Waring blender. Place the suspension in a stainless steel beaker and homogenize with
the Polytron tissue-grinder homogenizer until clumps are no longer detected.
IMPORTANT NOTE: Wear disposable gloves and safety glasses while working with E.
coli. The high-pressure homogenization may generate aerosols.
The E. coli cells are stored frozen at 80C as a flattened paste in heat-sealable plastic
bags (UNIT 5.3). The cells are thawed at room temperature. Complete suspension of the cells
with the blender is important, as any visible clumps of bacteria will block the French
pressure cell. A clogged cell may have to be disassembled to clear the blockage.
7. Lyse the cells with two passes through the French press operated at 16,000 to 18,000
lb/in2 (with the high-ratio setting, pressure gauge readings between 1011 and 1135).
Chill the cell suspension to 4C after each pass through the pressure cell by incubation
on ice.
When filling the pressure cell, avoid drawing air into the cylinder to prevent foaming.
If a French press is not available, the cells can be broken by including 200 g/ml lysozyme
(Worthington) and 0.05% (w/v) sodium deoxycholate (Calbiochem) in the lysis buffer and
incubating cells 20 min at 20 to 25C with intermittent homogenization using the tissue
grinder (Burgess and Jendrisak, 1975). Cell breakage by lysozyme treatment and sonication is described in Basic Protocol 3 of UNIT 6.5.
8. Place the suspension (contained in a steel beaker) on an ice bath and, using an
ultrasonic homogenizer, sonicate 5 min at full power with 50% duty cycle (on for 0.5
sec then off for 0.5 sec). While sonicating, stir the suspension using a magnetic stirrer.
IMPORTANT NOTE: Wear sound-protection earmuffs to protect ears from ultrasonic
noise. Because sonication will generate some aerosol, use the sonicator in a microbiological hood if possible.
High viscosity reduces the rate of sedimentation of the various contaminating cellular
material and thus longer (sometimes much longer) centrifugation times are required.
Sonication reduces the viscosity of the suspension prior to centrifugation by shearing the
released DNA and RNA. The viscosity can also be reduced by digesting the lysate 15 to 30
min at 4 to 10C with bovine pancreas DNase I (25 to 50 g/ml) and RNase A (50 g/ml).
If the nucleases are used, EDTA in the lysis buffer should be replaced by 5 mM MgCl2 as
DNase requires Mg2+.
Low-speed centrifugation removes unbroken cells and large cellular debris. Highspeed centrifugation removes smaller particles such as ribosomes and membrane
6.2.4
Current Protocols in Protein Science
vesicles; the Beckman 70Ti rotor (capacity 8 39 ml) can be used in the ultracentrifuge
for smaller-scale work. Clarification of the lysate can also be carried out by salt fractionation (see Background Information, section on determining solubility, for further details).
Pellets are usually discarded immediately; but see Critical Parameters and Troubleshooting, section on protein purification for further comments.
10. Dilute the supernatant from step 6 (160 ml) 1:2 (three-fold) with anion-exchange
buffer and adjust to pH 8.5 (if necessary) with 2 N NaOH. Using a conductivity meter,
measure the conductivity of the diluted supernatant. If it is higher than 5.0 to 5.3
mS/cm, reduce by dilution with water.
The conductivity of the protein solution is carefully adjusted to ensure that the proteins (in
this case contaminants) are bound to the matrix. Too high an ionic strength will reduce or
prevent binding.
The DEAE Sepharose column is normally prepared (steps 1 to 3) before lysis of the cells
is initiated. If there are any delays in applying sample to column, store the clarified lysate
at 0 to 4C (for example, in a covered beaker or flask embedded in an ice bucket). The
same applies to the other chromatographic stages.
12. Assay every second or third column fraction by SDS-PAGE (UNIT 10.1).
See Figure 6.2.1A for an example of results from SDS-PAGE.
For rapid analysis, use precast gels or the Hoefer Pharmacia Phast system. Alternatively,
if speed of purification is important, the entire flowthrough can be used, eliminating the
need to analyze separate fractions. Most protein contaminants bound to the column can
be removed by step elution with 1 M NaCl in column buffer. After use, the resin should be
unpacked from the column and washed on a sintered-glass funnel with 1 liter of 2 M
NaCl/0.5% (w/v) Triton X-100, followed by 10 liters water. If the resin is to be stored,
suspend it in 5% ethanol or 5 mM sodium azide and store at 4C. In order to avoid potential
cross-contamination, dedicate the used resin for repeat purifications of IL-1 only.
6.2.5
Current Protocols in Protein Science
1.0
1.0
.75
.50
0.75
V0
Vi
200
300
400
Elution volume (ml)
500
A280
.25
0
100
0.50
V0
Vi
0.25
P
0
100
200
300
400
500
Figure 6.2.1 Purification of IL-1. (A) SDS-PAGE analysis of samples at various stages. Analysis
was conducted on a gel of dimensions 12 cm 16 cm 1.5 mm. Lane a, purified protein (100 g
loaded); lane b, purified protein (10 g loaded); lane d, CM Sepharose pool (80%); lane e, DEAE
Sepharose pool after ammonium sulfate fractionation (56%); lane f, high-speed supernatant
(starting material for DEAE Sepharose column; 13.5%); lane g, cell lysate (12.0%). The percentages
refer to specific IL-1 contents of the fractions determined by densitomeric scanning of the
Coomassie bluestained gel lanes. Lanes c and h contain the following protein standards (low-range
standards supplied by Bio-Rad) in order of increasing migration distance: phosphorylase b (97.4
kDa), bovine serum albumin (66.2 kDa), hen egg white ovalbumin (45 kDa), bovine carbonic
anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), and hen white lysozyme (14.4 kDa). (B)
Analysis of results from gel filtration on Ultrogel AcA54. The excluded volume (V0) and the fully
included volume (Vi) are indicated. Inset, analytical rechromatography of the protein from the pooled
fractions (indicated P in larger chromatogram).
Preparation
of Soluble E. coli
Proteins
6.2.6
Current Protocols in Protein Science
14. Centrifuge the slightly cloudy solution 30 min at 22,000 g (12,000 rpm in JA-14),
4C. Decant the supernatant into a beaker and add an additional 17 g (NH4)2SO4 per
100 ml solution (77% saturation or 3 M final concentration). Equilibrate with stirring
and centrifuge 30 min at 22,000 g, 4C.
For the addition of (NH4)SO4 follow the same method as described in step 13.
15. Decant the supernatant and drain the pellets by inverting the tubes on a paper towel.
Save the pellets.
Dialyze the fractionated sample
16. Suspend the pellets in 300 ml cation-exchange buffer and dialyze, using Spectra/Por
1 dialysis tubing, against 5 liters cation-exchange buffer. Change the dialysis buffer
at least once.
The dialysis step is conveniently performed overnight; the CM Sepharose column used in
step 18 can be prepared during this period.
The dialysis tubing is prepared by heating 30 to 60 min at 90 to 95C in 5 mM EDTA. The
tubing is then washed well with water and stored in 10% ethanol at 4C prior to use. A
suitable length of tubing is filled to about one-half to three-quarters capacity with solution
(to allow for expansion) and sealed with two knots at each end. Use gloves when handling
the tubing, check for leaks before use, and make sure the magnetic stir-bar does not rub
against the tubing. See APPENDIX 3B for further information concerning dialysis.
17. After dialysis, remove the slightly cloudy solution from the tubing and centrifuge 30
min at 22,000 g (12,000 rpm in JA-14), 4C. Save the supernatant.
Chromatograph dialyzed sample on cation-exchange resin
18. Prepare 200 to 225 ml CM Sepharose CL-4B resin by washing on a sintered-glass
funnel first with water, then with cation-exchange buffer (pH 5.7; wash as in step 1
except using different buffer). Pack the degassed resin into a 5 50cm column as
in step 2.
The packed column will have a bed height of 11 to 12 cm. The comments made in the
annotation to step 3 also apply here.
19. Elute the column using cation-exchange buffer at 100 to 150 ml/hr with a peristaltic
pump. Monitor the column effluent at 280 or 260 nm using a suitable UV detector.
Check that the pH and conductivity of the column effluent are the same as for the
buffer applied to the column.
20. Check the pH and conductivity of the dialysate supernatant from step 17 and, if
necessary, dilute with water so that the conductivity is in the range 1.0 to 1.2 mS/cm
(at 4 to 6C). Apply the clear solution to the CM Sepharose column at a flow rate
of 150 ml/hr. When the UV absorbance of the column effluent approaches baseline,
proceed to the next step.
21. Prepare a 0 to 250 mM NaCl gradient in cation-exchange buffer by adding 500 ml
buffer to the inner chamber of the gradient maker and 500 ml buffer/250 mM NaCl
to the outer chamber. Apply the gradient to the column at 150 ml/hr. Collect 15-ml
fractions.
The total volume of the gradient is 1 liter (4.5 column volumes).
At pH 5.7, IL-1 (pI 6.8) is positively charged and binds to the negatively charged
cation-exchange resin. IL-1 is eluted from the column with 100 mM NaCl, and it will be
located in the major absorbance peak (see Wingfield et al., 1986, for figure of typical elution
profile).
Purification of
Recombinant
Proteins
6.2.7
Current Protocols in Protein Science
22. Monitor the progress of the gradient using an in-line conductivity meter positioned
after the absorbance flow cell. Assay column fractions for IL-1 by SDS-PAGE (UNIT
10.1) and pool fractions containing IL-1.
Deciding what fractions to pool is dictated by the fact that the remaining purification stage
is gel filtration, a method that will not remove contaminants with sizes close to that of IL-1
(17.4 kDa). As IL-1 is a well-expressed protein (>5% total protein), one can afford to be
conservative and pool for purity rather than yield.
The used CM-Sepharose matrix can be cleaned up and stored as described in the annotation
to step 12.
CM Sepharose Fast Flow or SP Sepharose FF (a strong cation exchanger; both resins are
also from Pharmacia Biotech) can be used instead of CM Sepharose CL-4B. Similar results
are obtained with either matrix, with the advantage of faster flow rates.
Preparation
of Soluble E. coli
Proteins
The resin should be free-flowing yet concentrated enough to produce a packed bed with
one pouring. A freshly packed gel-filtration column can be checked for packing
irregularities by prerunning the column with a few colored markers. Blue dextran will
be excluded from the gel matrix and will elute at the void volume (V0, which equals
30% to 35% of the total column volume); cytochrome c (red; 12.4 kDa) will elute close
6.2.8
Current Protocols in Protein Science
to the expected position of IL-1; and potassium dichromate (yellow) will be fully included
in the gel matrix and will elute at the included volume (Vi, 480 ml).
Superdex 75 gel-filtration matrix (Pharmacia Biotech) can be substituted for Ultrogel
AcA54. The former allows higher flow rates and thus is more compatible with the FPLC
and BioPilot systems. Both Superdex and the cation-exchange Fast Flow resins mentioned
above can be purchased in various prepacked columns that are useful for method development.
25. Filter the concentrated protein using a Millex-GV 0.22-m filter unit attached to a
10- or 20-ml syringe and apply to the gel-filtration column. The sample can be applied
either directly to the top of the column using a Pasteur pipet (care is required to prevent
breaking the tip and contaminating the column) or via a three-way valve and syringe
without removing the top flow adapter.
The volume of sample applied to the column (20 ml) represents 4% of the total column
volume. For columns with different dimensions, apply the same proportionate volume of
sample. The column should be 60 cm long. For analytical separations, the sample volume
should not exceed 2.5% of the column volume.
26. Elute the column at 35 ml/hr with gel-filtration buffer and collect 10-ml fractions.
The major eluting peak contains the IL-1; monitor the fractions by SDS-PAGE (UNIT
10.1) and pool fractions that contain pure protein. Save side fractions that contain small
amounts of contaminants; this material can be rechromatographed after concentration
as described in step 23a or 23b.
See Figure 6.2.1A for an example of results from SDS-PAGE.
Once the sample has been run into the column, it can be eluted with any buffer or even with
a suitable column storage solvent such as 5 mM sodium azide in water.
28. For short-term storage (12 months), filter the protein with a Millex-GV 0.22-m
filter unit, divide the solution into sterile plastic vials, and freeze aliquots rapidly with
dry ice/ethanol. Store at 80C. For long-term storage of IL-1 (>12 months),
lyophilize the protein. Dialyze the sample using a volatile buffer such as 50 mM
ammonium bicarbonate or a nonvolatile buffer such as lyophilization buffer.
To circumvent the dialysis step, the phosphate-based lyophilization buffer can be used for
gel filtration instead of the TrisCl gel-filtration buffer.
Purification of
Recombinant
Proteins
6.2.9
Current Protocols in Protein Science
Lysis buffer
100 mM TrisCl, pH 8.0
2 mM EDTA, pH 8.0
5 mM benzamidineHCl (780 mg/liter)
Make immediately prior to use; alternatively, make ahead of time and store up to
several days at 4C.
Conductivity of the solution is 1.57 mS/cm.
It should be noted that a 1C decrease in temperature increases the pH of the Tris buffer by
0.03 pH units. Both TrisCl and EDTA stock solutions are commercially available (e.g., Life
Technologies).
BenzamidineHCl is a water-soluble serine protease inhibitor. An alternative is 50 M
4-(2-aminoethyl)benzenesulfonyl fluoride hydrochloride (AEBSF; Perfabloc SC, Boehringer Mannheim), a water-soluble inhibitor with the same spectrum of activity as phenylmethylsulfonyl fluoride (PMSF).
COMMENTARY
Background Information
Preparation
of Soluble E. coli
Proteins
6.2.10
Current Protocols in Protein Science
Purification of
Recombinant
Proteins
6.2.11
Current Protocols in Protein Science
Preparation
of Soluble E. coli
Proteins
6.2.12
Current Protocols in Protein Science
Purification of
Recombinant
Proteins
6.2.13
Current Protocols in Protein Science
Cell breakage
Efficient cell breakage should be troublefree as long as the French press is operated in
accordance with the manufacturers instructions. It takes a little practice to operate the flow
valve. The aim is to generate as high a flow rate
as possible while maintaining a pressure gauge
reading of 1000. If the flow rate is too fast, the
pressure reading will drop and unbroken cells
will pass into the flow stream. Toward the end
of the run, the flow rate should be reduced, as
it becomes difficult to control the pressure.
After use the French pressure cell should be
cleaned and dried, and the flow valve ball
should be replaced. Store the cell at 4C.
Protein purification
In general, troubleshooting a purification
method will be much easier if fractions are not
discarded until the appropriate monitoring of
the purification steps is complete. When using
an established method, fractions from a chromatographic run are frequently pooled on the
basis of absorbance only and the remainder
quickly discarded. Ammonium sulfate supernatants or pellets are often discarded on the
basis of previous fractionation behavior or pilot-scale work. As stated, do not discard any
fractions until they have been checked, usually
by SDS-PAGE. If there has been a problem, it
can usually be easily sorted out if all the fractions from the various stages are still available.
If in doubt about conditions for storing fractions, freeze selected fractions at as low a temperature as possible (ideally 80C) and discard when appropriate. When freezing material, it is worthwhile to take the extra effort to
dispense small samples (<1 ml) into microcentrifuge tubes amenable for rapid analysis, if
necessary, to avoid having to thaw large volumes merely to take 10 l for SDS-PAGE.
For reproducible results, careful recording
of the pH and conductivity of starting buffers,
protein solutions, and column eluents is necessary. For example, if the protein does not bind
to the cation exchanger (step 19), the ionic
strength of the sample or column buffer may be
too high. Alternatively, the buffer pH may be
too high. Errors of this kind are avoided by
simply measuring the conductivity and pH of
all solutions at each stage. Other precautions
and critical steps relating to chromatography
procedures are mentioned in Chapters 8 and 9.
Preparation
of Soluble E. coli
Proteins
Anticipated Results
Time Considerations
Because IL-1 appears to be stable against
proteolytic degradation and other chemical
modications during purification, the speed of
purification was not critical in this case. The
low-pressure chromatographic method described in the Basic Protocol requires 4 days,
which can be shortened to 3 days using the
Pharmacia Biotech BioPilot or FPLC systems
in conjunction with matrices that allow faster
flow rates. The times required for purification
whether using low-pressure chromatography
(as described in the Basic Protocol) or mediumpressure chromatography in the FPLC or BioPilot systems are summarized in Table 6.2.1.
Literature Cited
Burgess, R.R. and Jendrisak, J.J. 1975. A procedure
for the rapid, large-scale purification of E. coli
DNA-dependent RNA polymerase involving polymin P precipitation and DNA-cellulose chromatography. J. Biol. Chem. 14:4634-4638.
Chrunyk, B.A., Evans, J., Lillquist, J., Young, P., and
Wetzel, R. 1993. Inclusion body formation and
protein stability in sequence variants of Interleukin-1. J. Biol. Chem. 268:18053-18061.
Clore, G.M., Wingfield, P.T., and Gronenborn, A.M.
1991. High-resolution structure of interleukin 1
in solution by three- and four-dimensional nuclear magnetic resonance spectroscopy. Biochemistry 30:2315-2323.
6.2.14
Current Protocols in Protein Science
Dinarello, C.A. 1989. Interleukin-1 and its biologically related cytokines. Adv. Immunol. 44:153205.
Dixon, M. and Webb, E. 1979. Enzyme isolation. In
Enzymes (3rd ed.) pp. 23-46. Academic Press,
New York.
Dyda, F., Hickman, A.B., Jenkins, T.M., Engelman,
A., Craigie, R., and Davies, D.R. 1994. Crystal
structure of the catalytic domain of HIV-1 integrase: Similarity to other polynucleotidyltransferases. Science 266:1981-1986.
Gery, I. and Schmidt, J.A. 1985. Human interleukin
1. Methods Enzymol. 116:456-467.
Hlodan, R. and Hartl, F.U. 1994. How the protein
folds in the cell. In Mechanisms of Protein Folding (R.H. Pain, ed.) pp. 194-228. IRL Press,
Oxford.
Hopkins, T.R. 1991. Physical and chemical cell
disruption for the recovery of intracellular proteins. In Purification and Analysis of Recombinant Proteins (R. Seetharam and S.K. Sharma,
eds.) pp. 57-83. Marcel Dekker, New York.
Johnson, B.H. and Hecht, M.H. (1994) Recombinant proteins can be isolated from E. coli by
repeated cycles of freezing and thawing.
Bio/Technology 12:1357-1360.
Joseph-Liauzun, E., Legoux, R., Guerveno, V.,
Marchese, E., and Ferra, P. 1990. Human recombinant interleukin-1 isolated from E. coli by
simple osmotic shock. Gene 86:291-295.
Kronheim, S.R., Cantrell, M.A., Deeley, M.C.,
March, C.J., Glackin, P.J., Anderson, D.M., Hemenway, T., Merriam, J.E., Cosman, D., and
Hopp, T.P. 1986. Purification and characterization of human interleukin-1 expressed in
Escherichia coli. Bio/Technology 4:1078-1082.
Livi, G.P., Lillquist, J.S., Ferrara, A., Sathe, G.M.,
Simon, P.L., Meyers, C.A., Gorman, J.A., and
Young, P.R. 1991. Secretion of N-glycosylated
interleukin-1 in Saccharomyces cerevisiae using a leader peptide from Candida albicans.
Effect of N-linked glycosylation on biological
activity. J. Biol. Chem. 266:15348-15348.
McMahan, C.J., Slack, J.L., Mosley, B., Cosman,
D., Lupton, S.D., Brunton, L.L., Grubin, C.E.,
Wignall, J.M., Jenkins, N.A., Brannan, C.I.,
Copeland, N.G., Huebner, K., Croce, C.M., Cannizzarro, L.A., Benjamin, D., Dower, S.K.,
Spriggs, M.K., and Sims, J.E. 1991. A novel IL-1
receptor, cloned from B cells by mammalian
expression, is expressed in many cell types.
EMBO J. 10:2821-2832.
Meyers, C.A., Johanson, K.O., Miles, L.M., McDevitt, P.J., Simon, P.L., Webb, R.L., Chen, M.-J.,
Holskin, B.P., Lillquist, J.S., and Young, P.R.
1987. Purification and characterization of human
recombinant interleukin-1. J. Biol. Chem.
262:11176-11181.
Key Reference
Wingfield et al., 1986. See above.
The original publication on which Basic Protocol 1
is based.
Purification of
Recombinant
Proteins
6.2.15
Current Protocols in Protein Science
UNIT 6.3
BASIC
PROTOCOL 1
Bacterial cells are lysed using a French press, and inclusion bodies in the cell lysate are
pelleted by low-speed centrifugation. The pellet fraction is washed (preextracted) with
urea and Triton X-100 to remove E. coli membrane and cell wall material. GuanidineHCl
(8 M) and dithiothreitol (DTT) are used to solubilize the washed pellet protein. Extraction
with the denaturant simultaneously dissociates protein-protein interactions and unfolds
the protein. As a result, the extracted protein consists (ideally) of unfolded monomers,
with sulfhydryl groups (if present) in the reduced state.
Materials
E. coli cells from fermentation (UNIT 5.3) containing the protein of interest
Lysis buffer (see recipe)
Wash buffer (see recipe), with and without urea and Triton X-100
Extraction buffer (see recipe)
250- and 500-ml stainless steel beakers
0.22-m syringe filters (e.g., Millex from Millipore)
20-ml disposable syringe
Additional equipment for breaking cells, homogenizing cells and pellets and
centrifuging at low and high speeds (UNIT 6.2)
Break cells and prepare clarified lysate
1. Place thawed E. coli cells in a stainless steel beaker. Add 4 ml lysis buffer per gram
wet weight of cells. Keep bacterial cells cool by placing the beaker on ice in an ice
bucket.
The cells can be pretreated with lysozyme prior to lysis in the French press. Lysozyme
treatment involves incubating cells 20 min at 20 to 25C in lysis buffer supplemented
Contributed by Ira Palmer and Paul T. Wingfield
Current Protocols in Protein Science (1995) 6.3.1-6.3.15
Copyright 2000 by John Wiley & Sons, Inc.
Purification of
Recombinant
Proteins
6.3.1
CPPS
with 200 g/ml lysozyme, with intermittent homogenization using a tissue grinder. It should
be emphasized that this optional step is carried out before French press breakage and is
not simply an alternative method of cell breakage (compare the comments made in the
annotation to step 4 of UNIT 6.2). Its purpose is to aid removal of the peptidoglycan and
outer membrane protein contaminants during the washing steps (steps 6 to 9; for further
details see UNIT 6.1 and Fig. 6.1.5). An example of this approach is given in Basic Protocol
1 of UNIT 6.5.
2. Suspend cells using a Waring blender and homogenize using the Polytron tissuegrinder homogenizer until all clumps are disrupted, as described in UNIT 6.2, step 3.
3. Lyse cells with two passes through the French pressure cell operated at 16,000 to
18,000 lb/in2 (with the high-ratio setting, pressure gauge readings between 1011 and
1135), chilling the cell suspension to 4C after each pass, as described in UNIT 6.2,
steps 2 and 4.
4. Reduce the viscosity of the suspension by sonicating 5 min at full power with 50%
duty cycle (on for 5 sec, off for 5 sec) using an ultrasonic homogenizer, as described
in UNIT 6.2, step 5.
5. Clarify the lysed cell suspension by centrifuging 1 hr at 22,000 g (12,000 rpm in a
JA-14 rotor in a Beckman J2-21M centrifuge), 4C.
Unbroken cells, large cellular debris, and the inclusion body protein will be pelleted.
The JA-14 rotor uses 250-ml centrifuge bottles. For processing smaller volumes the
Beckman JA-20 rotor (or equivalent) with 50-ml tubes can be used, at 13,500 rpm (22,000
g).
The procedure for dealing with insoluble inclusion-body proteins now diverges from that
for purifying soluble proteins (UNIT 6.2).
7. Centrifuge the suspension 30 min at 22,000 g (12,000 rpm in JA-14), 4C. Discard
supernatant and, using the tissue homogenizer, suspend the pellet in 4 to 6 ml wash
buffer per gram wet weight of cells.
8. Repeat step 7 two more times.
If the supernatant is still cloudy or colored, continue washing the pellet until the supernatant is clear.
9. Suspend the pellet with wash buffer minus the Triton X-100 and urea, using 4 to 6
ml buffer per gram wet cells. Centrifuge 30 min at 22,000 g (12,000 rpm in JA-14),
4C.
Preparation and
Extraction of
Inclusion Bodies
The final wash removes excess Triton X-100 from the pellet.
6.3.2
Current Protocols in Protein Science
If necessary the washed pellets can be stored at 80C. It is better to store material at this
stage rather than after the extraction stage (see comments to step 13).
11. Centrifuge the suspension 1 hr at 100,000 g (30,000 rpm in Ti45 rotor in a Beckman
Optima XL-90 ultracentrifuge), 4C.
For volumes <250 ml the Beckman 70Ti rotor (capacity 6 39 ml) can be used at 32,000
rpm (100,000 g).
12. Carefully pour off the supernatant from the pellet. Filter the supernatant through a
0.22-m syringe filter attached to a 20-ml disposable syringe.
The filter removes unpelleted large cell wall debris that will clog most chromatography
columns.
13. Use the clarified inclusion body extract for preparing folded protein (UNIT 6.5) or purify
further by gel filtration (see Basic Protocol 2).
The extract can be stored at 80C until required. Freeze in plastic (or polyethylene)
containers rather than glass. Divide sample into 10- to 20-ml aliquots instead of freezing
in one large lot and fill containers to only 50% to 75% capacity.
BASIC
PROTOCOL 2
Washed, extracted pellets (see Basic Protocol 1) contain >50% recombinant protein and
are used as the starting material for purification of the protein of interest by gel-filtration
chromatography. Superdex 200 gel-filtration medium, which allows high flow rates, is
washed and packed into a column. The column is equilibrated at 4C and the sample is
applied.
Assay of column fractions by gel electrophoresis in the presence of SDS is complicated
by the fact that guanidineHCl forms a precipitate with SDS. Therefore, preparing samples
for gel analysis involves selective precipitation of protein from guanidineHCl prior to
Purification of
Recombinant
Proteins
6.3.3
Current Protocols in Protein Science
SDS-PAGE (see Support Protocol). The purified (or partially) purified protein is used as
the starting material for procedures (e.g., UNIT 6.5) in which the denatured protein is folded
into a native and biologically active structure.
Materials
Gel-filtration medium: Superdex 200 PG (preparative grade; Pharmacia Biotech)
5% (v/v) ethanol
Gel-filtration buffer (see recipe)
GuanidineHCl extract of E. coli cells containing the protein of interest (see Basic
Protocol 1)
4- to 6-liter plastic beaker
Chromatography column: Pharmacia Biotech XK 16/100, 26/100, or 50/100
Packing reservoir: Pharmacia Biotech RK 16/26 (for 16- and 26-mm-i.d. columns)
and RK 50 (for 50-mm-i.d. column)
Chromatography pump: Pharmacia Biotech P-6000 or P-500
Injection valve (to select between sample loop and pump)
UV monitor and fraction collector
Sample loop (volume determined by size of column)
NOTE: The various components of the chromatography system (pumps, valves, monitors,
and sample loops) listed separately above are supplied as components of the BioPilot
chromatography system (Pharmacia Biotech), which is used to run the XK 50/100
column. The smaller XK columns (2.6 and 2.5 cm i.d.) are run using the FPLC chromatography system (also from Pharmacia Biotech), which is designed for small- to mediumscale work. For further details on this equipment see the manufacturers literature (e.g.,
Process Products, Pharmacia Biotech).
NOTE: Perform steps 1 to 11 at room temperature. After the column is packed, equilibrate
and elute at 4C.
Pack the column
1. Wash the gel-filtration medium in a large plastic beaker with 5% ethanol. Let the
medium settle and adjust the volume of liquid to give a gel slurry concentration of
65% to 75%.
The XK 16/100, 26/100, and 50/100 columns are 100 cm long and have inner diameters of
16, 26, and 50 mm, respectively. Hence, for an XK 50/100 column, column volume = radius
(2.5 cm)2 3.1416 bed height (97 cm) 1900 ml, and 2 liters preparative-grade
Superdex 200 is required. To pack this column, the gel medium is suspended in 5% ethanol
to give a total volume of 3 liters which corresponds to 70% gel slurry (it should be noted
that the RK 50 reservoir has a capacity of 1 liter, so the 3 liters of gel slurry can be poured
in a single operation).
2. Fix the chromatography column in an upright position, using a level to adjust the
position. Attach the packing reservoir.
3. Add sufficient 5% ethanol to displace the air from a few centimeters of the bottom
of the column. Clamp off the bottom of the column.
4. Gently mix the gel-filtration medium in the plastic beaker to an even slurry of 70%
medium suspended in 5% ethanol.
5. Degas the suspension 5 to 10 min using a vacuum flask and laboratory vacuum.
Preparation and
Extraction of
Inclusion Bodies
The ethanol is included to reduce the surface tension and density of the solvent, thus
allowing air bubbles that form to rise to the surface more quickly.
6.3.4
Current Protocols in Protein Science
6. Carefully pour the slurry of medium into the column, introducing material along the
side of the column to avoid creating air bubbles.
7. Let the column stand 5 min and then unclamp the bottom of the column.
8. Attach the chromatography pump to the packing reservoir and pump 5% ethanol
(degassed) into the column at an appropriate flow rate (based on manufacturers
instructions). Pack the column at a pressure greater than the pressure at which the
column will be run (up to twice as high), but not greater than the maximum pressure
rating of the column.
The XK 50/100 column (rated to 0.5 MPa) is packed at 20 to 30 ml/hr and 0.4 MPa.
9. After the medium has settled, turn off the pump and close the bottom of the column.
Pipet fluid from the reservoir and remove the reservoir.
Once the column has been packed, be careful to prevent air from entering the column bed.
Air will disturb the bed and reduce the column separation resolution.
10. Attach the column top adapter to the column. Place the top of the adapter onto the
top of the packed medium and gently compress the medium.
11. Reattach the pump to the column and wash the column with water at a flow rate that
will generate the maximum pressure to be used. If the medium continues to settle,
readjust the top adapter to maintain a firm fit against the gel.
From this point onward, perform all steps at 4C.
Equilibrate the column
12. Equilibrate the column with at least 1 column volume of gel-filtration buffer.
Although the proteins were extracted with buffer containing 8 M guanidineHCl (see Basic
Protocol 1), the gel-filtration buffer contains only 4 M guanidineHCl. The concentration
is reduced to allow faster flow rates and for reasons of economy. Most proteins remain
unfolded at the lower guanidineHCl concentration. If, however, the protein elutes in an
anomalous manner (e.g., in more than one peak or at an elution position not consistent
with its size), and assuming there is adequate reducing agent present, then try increasing
the guanidineHCl concentration in the gel-filtration buffer.
13. Measure the actual flow rate while running the column at a flow rate that generates
a back pressure about one-half of that generated when packing the column (step 8).
For an XK 50/100 column packed using Superdex 200 at 0.4 MPa, a running pressure of
0.2 MPa is used, which generates flow rates of 5 to 10 ml/min that are equivalent to linear
flow rates of 15.3 to 30.6 cm/hr. The linear flow rate equals the flow rate (ml/hr)/cross-sectional area (cm2). At these flow rates it takes between 3 and 6 hr to complete the
chromatography.
14. Connect tubing from the end of the column to the UV monitor and the fraction
collector.
Apply the sample
15. Load the sample loop with the guanidineHCl extract to be separated.
Avoid loading a sample volume >5% of the total column volume; the optimum sample size
is 2% (40 ml for the XK 50/100 column). The sample consists of washed pellets extracted
with guanidineHCl (see Basic Protocol 1). A sample size of 40 to 50 ml is usually derived
from 50 g wet weight cells. With smaller sample sizes, use columns with proportionally
smaller diameters (e.g., XK 16/100 or 26/100 columns). If purchase of only one column is
possible, a 2.5 100cm size is a good compromise for variable sample loading.
Purification of
Recombinant
Proteins
6.3.5
Current Protocols in Protein Science
16. Monitor column effluent with the UV monitor and collect fractions with the fraction
collector.
For an XK 50/100 column, collect 15- to 20-ml fractions in 16 20mm culture tubes.
The eluent from the column is usually monitored at 280 nm or, if the protein has a
particularly low extinction coefficient, at 230 nm (guanidineHCl strongly absorbs below
225 nm). For an XK 50/100 column, fractions need only be collected after 500 ml of
elution. The excluded volume (void volume) is 570 ml. Run one column volume (1900 ml)
to ensure all of the load material is eluted from the column.
17. Prepare the fractions to be assayed for SDS-PAGE (see Support Protocol and UNIT
10.1).
SUPPORT
PROTOCOL
3. Mix the sample and ethanol well. Chill 5 to 10 min at 20C or colder (e.g., 80C).
4. Microcentrifuge the sample 5 min at maximum speed (15,000 g), 4C. Carefully
withdraw the supernatant and retain the pellet.
The pellet may be difficult to see. Be careful not to draw the pellet out of the microcentrifuge
tube with the supernatant.
5. Suspend the pellet with 250 l cold 90% (v/v) ethanol. Mix thoroughly using a vortex
mixer.
The 90% ethanol is made by mixing 225 l ethanol and 25 l H2O.
6. Microcentrifuge the sample 5 min at maximum speed, 4C. Carefully pipet off the
supernatant and suspend the pellet in 25 l of 1 SDS sample buffer.
Some proteins are more difficult than others to suspend from an ethanol precipitate.
Electrophoresis sample buffer containing 8 M urea is helpful for such proteins (UNIT 10.1).
Sonication with a microtip probe can also be used to disperse the sample. A volume of
sample buffer >25 l may be required in this case (e.g., 50 l), and great care must be taken
to prevent foaming of the sample caused by excessive sonication power.
Preparation and
Extraction of
Inclusion Bodies
6.3.6
Current Protocols in Protein Science
Extraction buffer
50 mM TrisCl, pH 7.0
5 mM EDTA
8 M guanidineHCl (764 g/liter)
5 mM DTT (770 mg/liter)
If the buffer is cloudy, filter through a 0.45- to 0.5-m filter (the solution should be
clear if high-quality guanidineHCle.g., ultrapure grade, ICN Biomedicalsis
used; see APPENDIX 3A). Buffer can be stored minus DTT at least 1 month at 4C.
Gel-filtration buffer
50 mM TrisCl, pH 7.5
4 M guanidineHCl (382 g/liter; ultrapure, ICN Biomedicals)
5 mM DTT (770 mg/liter)
Buffer can be stored minus DTT at least 1 month at 4C. Filter (as for extraction
buffer; see recipe) and degas before use.
Higher concentrations of guanidineHCl (up to 8 M) may be required for some proteins (see
comment at step 12).
Lysis buffer
100 mM TrisCl, pH 7.0
5 mM EDTA
5 mM DTT (770 mg/liter)
5 mM benzamidineHCl (780 mg/liter)
Prepare immediately before use
The TrisCl and EDTA are diluted from concentrated stock solutions. The other components
are added to the diluted buffer before use.
Wash buffer
100 mM TrisCl, pH 7.0
5 mM EDTA
5 mM DTT (770 mg/liter)
2 M urea (120 g/liter; ultrapure, ICN Biomedicals)
2% (w/v) Triton X-100 (20 g/liter; Calbiochem-Novabiochem)
Add DTT, urea, and Triton X-100 to the other components directly before use.
Prepare this buffer in two forms: one with and one without the urea and Triton X-100
(the latter for use in Basic Protocol 1, step 9).
Purification of
Recombinant
Proteins
6.3.7
Current Protocols in Protein Science
COMMENTARY
Background Information
The decision of whether to work with insoluble recombinant protein or to put more effort
into generating soluble protein (e.g., by modifying the expression vector or changing the host
strain and fermentation conditions) can be dictated by the nature of the protein. A small
protein (10 to 17 kDa) with only one or two
cysteine residues might be expected to fold in
reasonable yield from extracted inclusion bodies. Larger proteins (>25 kDa) with many cysteine residues may be more problematical, and
lower folding yields can normally be expected.
In the latter case, if only small amounts of
material are needed then yield is not such an
important issue.
It should be emphasized that, unless proved
otherwise, a protein folded from insoluble inclusion bodies can be expected to have the same
structural and conformational integrity as the
same protein directly purified from soluble extracts (also see UNIT 6.1). It is similarly true that
a purified soluble protein can be denatured and
renatured (reversible denaturation) without
structural or conformational modifications (reviewed by Anfinson, 1973; Ghelis and Yon,
1982).
Preparation and
Extraction of
Inclusion Bodies
6.3.8
Current Protocols in Protein Science
centrifugation. For the latter, a benchtop ultracentrifuge such as the Beckman XL-100 or
Airfuge is ideal. The comparative effectiveness
of extractants can determined by measuring the
amount of protein solubilized from the washed
pellets using standard protein estimation methods and/or SDS-PAGE (UNIT 10.1).
It should be noted that solubilized protein
must at some stage be folded into the native
conformation; hence, the most effective solubilizing conditions might not necessarily be the
best, especially if they cause irreversible denaturation. Irreversible denaturation appears to
result from chemical modification of the protein and is induced by such factors as high
temperature, extremes of pH, and tight binding
of denaturants such as SDS.
The extraction process (Basic Protocol 1)
should result in protein that is both monomeric
(assumed to be unfolded) and contains cysteine
residues in the reduced state. This provides a
defined starting point from which to develop a
reproducible folding protocol or from which to
further purify the protein by, for example, gel
filtration under denaturing conditions (Basic
Protocol 2). Extraction of inclusion bodies with
some denaturants, such as urea, may not completely convert the protein to monomers, resulting in physically heterogeneous mixtures. As
pointed out in UNIT 6.1, extraction can be accomplished with strong denaturants (e.g., guanidineHCl) that can then be exchanged by dialysis
or gel filtration for weaker ones (e.g., urea).
This particular solvent exchange often results
in much better yields of folded protein compared to that obtained by the direct removal of
guanidineHCl (e.g., UNIT 6.5).
Gel-filtration chromatography (Basic Protocol 2) is not commonly considered a high-resolution separation technique. However, as the
recombinant-derived protein content of wellprepared washed pellets will be >50% of the
total (see Fig. 6.3.1, lanes h and i), only a 2-fold
purification is required to obtain pure protein.
Protein extracted with guanidineHCl in the
presence of reductant will ideally be in a random coil conformation with all sulfhydryl residues in the reduced state. Under such conditions, the order in which proteins elute from a
gel-filtration matrix in guanidineHCl can be
directly correlated with molecular size (Mann
and Fish, 1972).
Selection of the proper chromatography
resin is critical for success (for detailed discussion, see Critical Parameters and Troubleshooting). The main disadvantage of gel filtration in
Purification of
Recombinant
Proteins
6.3.9
Current Protocols in Protein Science
Preparation and
Extraction of
Inclusion Bodies
6.3.10
Current Protocols in Protein Science
Table 6.3.1 Gel-Filtration Matrices Suitable for Use with Solutions Containing
Guanidine Hydrochloride
Matrixa
Native proteins
Sepharose CL-6B
Bio-Gel A-5m
Sepharose CL-4B
Sephacryl S-100 HR
Sephacryl S-200 HR
Sephacryl S-300 HR
Sephacryl S-400 HR
Superdex 75
Superdex 200
Unfolded proteinsb
10-4,000
10-5,000
60-20,000
1-100
5-250
10-1,500
20-8,000
3-70
10-600
1-80
1-80
10-300
<1-30c
1-50
1-100c
1->100c
<1-25
1-80
Reference
Mann and Fish (1972)
Mann and Fish (1972)
Mann and Fish (1972)
aAll resins are from Pharmacia Biotech except Bio-Gel A-5m, which is from Bio-Rad. The Sepharose and Bio-Gel matrices
are normally run under low pressure; all other resins can be run under low or medium pressure. Medium pressure is achieved
using one of the chromatography pumps indicated in Basic Protocol 2; the pumps are normally included in the Pharmacia
Biotech FPLC or BioPilot systems.
bData on the fractionation range in the unfolded state refer to proteins unfolded with guanidineHCl; however, the guidelines
also apply to proteins unfolded and eluted with urea (assuming they are random coils).
cEstimates based on fractionation range for native proteins.
Purification of
Recombinant
Proteins
6.3.11
Current Protocols in Protein Science
Preparation and
Extraction of
Inclusion Bodies
Ancipitated Results
Cell lysis and preparation of washed pellets
Pelleted aggregates after washing contain
30% dry weight, of which 90% is protein.
SDS-PAGE of a typical washed pellet preparation (Fig. 6.3.1, lanes h and i) indicates that
recombinant bovine growth hormone (21 kDa)
makes up >60% of the total protein. The
6.3.12
Current Protocols in Protein Science
Figure 6.3.1 Analysis by SDS-PAGE of fractions from low-speed centrifugation of E. coli cell
lysates containing aggregated bovine growth hormone. A 12.5% acrylamide gel of dimensions 12
cm 16 cm 1.5 mm was used with the Laemmli buffer system (UNIT 10.1). Lanes a and g contain
standard proteins (low-range standards, Bio-Rad) in order of increasing migration distance:
phosphorylase b (97.4 kDa), bovine serum albumin (66.2 kDa), hen egg white ovalbumin (45 kDa),
bovine carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), and hen egg white
lysozyme (14.4 kDa). After low-speed centrifugation of the clarified lysate and of the washed pellet
homogenate (see Basic Protocol 1, steps 5 and 7), the supernatants will be cloudy (lane f) and the
pellets usually consist of two layers (see Fig. 6.1.5). The bottom layer is inclusion body protein plus
unbroken cells (lanes b and c) and the top layer consists of outer membrane and peptidoglycan
fragments (lanes d and e). The outer membrane proteins OmpA (35 kDa) and OmpF/C (38 kDa)
are indicated by and , respectively. After the washing steps, the growth hormone (marked , 21
kDa) is the major constituent (lanes h and i) together, in this example, with another plasmid-encoded
protein, namely kanamycin phosphotransferase (marked , 30.8 kDa), the product of the gene
conferring resistance to the antibiotic kanamycin.
Purification of
Recombinant
Proteins
6.3.13
Current Protocols in Protein Science
2.00
A280
1.50
S 50 55 60 65 70 75
1.00
0.50
P
0.00
0
20
40
60
80
100
Fraction number
Figure 6.3.2 Gel filtration using Superdex 200 in 4 M guanidineHCl. Column dimensions, 6 60
cm; buffer, 50 mM TrisCl (pH 7.5)/4 mM guanidineCl/2 mM DTT; flow rate, 5 ml/min (300 ml/hr).The
sample was an extract containing HIV-1 protease, which has a mass of 10 kDa. Protein fractions
66 to 72 (pool P) was further purified under the same conditions using a Superdex 75 matrix. The
inset shows SDS-PAGE analysis of selected fractions. The protein markers (lane S) correspond to
standards with mass values of 66.2, 45, 30, 21.5, and 14.4 kDa, respectively (migration order top
to bottom).
Preparation and
Extraction of
Inclusion Bodies
Time Considerations
Cell lysis and preparation of washed pellets
French press lysis of 150 to 200 ml cell
suspension will take 30 to 35 min. Preparation
6.3.14
Current Protocols in Protein Science
Literature Cited
Anfinson, C.B. 1973. Principles that govern the
folding of protein chains. Science 181:223-230.
Belew, M., Fohlman, J., and Janson, J.-C. 1978. Gel
filtration on Sephacryl S-200 superfine in 6 M
guanidineHCl. FEBS Lett. 91:302-304.
Chang, J.Y. and Swartz, J.R. 1993. Single-step solubilization and folding of IGF-1 aggregates from
Escherichia coli. In Protein Folding: In Vivo and
in Vitro (J.L. Cleland, ed.) pp. 178-188 (ACS
Symposium Series No. 526). American Chemical Society, Washington, D.C.
Creighton, T.E. 1993. Proteins: Structures and Molecular Properties, 2nd ed., pp. 293-296. W.H.
Freeman, New York.
Falson, P. 1992. An efficient procedure to dialyze
volumes in the range of 10-200 l. BioTechniques 13:20.
Fish, W.W., Mann, K.G., and Tanford, C. 1969. The
estimation of polypeptide chain molecular
weights by gel filtration in 6 M guanidine hydrochloride. J. Biol. Chem. 244:4989-4994.
Ghelis, C. and Yon, Y. 1982. Simulation of protein
folding: Studies of in-vitro denaturation-renaturation. In Protein Folding, pp. 225-243. Academic
Press, San Diego.
Glazer, A.N., Delange, R.J., and Sigman, D.S. 1975.
Modifications of sulfhydryl and disulfide
groups. In Chemical Modification of Proteins,
pp. 101-120 (Laboratory Techniques in Biochemistry and Molecular Biology Series).
North-Holland Publishing Company, New York.
Kane, J.K. and Hartley, D.L. 1991. Properties of
recombinant protein-containing inclusion bodies in E. coli. In Purification and Analysis of
Recombinant Proteins (R. Seetharam and S.K.
Sharma, eds.) pp. 121-145. Marcel Dekker, New
York.
Purification of
Recombinant
Proteins
6.3.15
Current Protocols in Protein Science
a rate that can keep pace with the rate of synthesis; in vitro, protein molecules can fold with
half-times varying from tens to thousands of
milliseconds.
Consider first proteins that do not contain
disulfide bonds in the native state. Very early in
the folding pathway, in less than a few milliseconds, the chain collapses to intermediates in
which substantial proportions of the secondary
structure and hydrophobic core are assembled.
The stabilization of secondary structure elements and the packing of side chains continues
until a compact intermediate termed the molten globule is formed. This contains essentially all the native secondary structure and, in
many proteins, has been shown to accumulate
to a significant extent (Ptitsyn et al., 1990).
Although the topology of the molten globule is
similar to that of the protein in the native state,
the specific and persistent tertiary interactions
characteristic of the native state are still lacking.
This accounts for its ability to bind the hydrophobic probe 8-anilino-1-naphthalene sulfonic
acid (ANS) and presumably also for its observed tendency to aggregate. The molten globule finally folds to the native state in a reaction
that usually constitutes the rate-limiting step of
folding.
Although several proteins have been shown
to exhibit multiple kinetic phases and hence
small energy barriers during folding, the early
stages are fast (t12 = 1 to 25 msec). Many of the
slower steps observed in folding are rate-limited by the slow cis-trans isomerization of peptide bonds joining X-Pro residues (Nall, 1994).
This isomerization, with half-times on the order
of a minute, occurs at later stages of folding,
and therefore within rather collapsed forms of
the protein. The variable accessibility of the
bonds involved in cis-trans isomerization, resulting from the collapsed state of the polypeptide chain, explains the somewhat variable success of peptidyl prolyl cis-trans isomerase
(PPI) in catalyzing the reaction in vitro.
In comparison to these generally fast folding
processes, disulfide bondcontaining proteins
fold somewhat more slowly in vitro. Where the
noncovalent interactions are relatively strong
and constitute the main factors driving the folding (at least up to a molten globule state), the
stages of folding prior to disulfide formation
will be similar to those described above. In
some disulfide-containing proteins, however,
the noncovalent interactions are weaker; thus
UNIT 6.4
Purification of
Recombinant
Proteins
6.4.1
CPPS
X1
X2 .... Xn
As
Overview of
Protein Folding
Al
Figure 6.4.1 Illustration of the dependence of yield on aggregation. The unfolded protein (U) folds
through the intermediate forms (X1 to Xn), then through the molten globule (I) to the native state
(N). Some or all of the intermediates will be capable of associating, reversibly at first, to form small
aggregates (As). Subsequently, however, these will associate irreversibly to form larger insoluble
aggregates (Al). The rate of aggregation (ra) will depend on the concentrations of the intermediates
(and hence on the rate constants for the individual steps in the folding pathway) and also on the
rate constant (ki,aggr) for aggregation of each intermediate (Xi) according to the equation ra = ki,aggr
[Xi]. Aggregation will therefore depend on the overall protein concentration as well as the solubility
properties of each intermediate.
6.4.2
Current Protocols in Protein Science
110
100
90
80
70
60
50
40
30
20
10
0
10
0
Guanidine-HCl (mol/liter)
bonds are involved, they supplement the noncovalent interactions in contributing to the stability and specificity of the native three-dimensional conformation and the folding intermediates (see discussion of Disulfide-bonded
proteins, below). Covalently bonded ligands
(e.g., heme in cytochrome c) may also contribute to the stability of both the native and intermediate forms and therefore play an important
role in folding (Roder and Elve, 1994).
When such proteins are subjected to dena-
Purification of
Recombinant
Proteins
6.4.3
Current Protocols in Protein Science
Overview of
Protein Folding
Figure 6.4.3 Equilibrium for noncovalent interactions during protein folding, where n is the
number of disulfide bonds in the native, functional protein.
Disulfide-bonded proteins
Disulfide bonds contribute to the stability of
the native conformation to an extent that varies
from one protein to another. The overall stability may be expressed thermodynamically in
terms of the equilibria for the sidechain and
backbone noncovalent interactions (Fig. 6.4.3)
and the covalent interaction (Fig. 6.4.4) in
which two thiol groups produce a disulfide
bond.
U(SH)2n
N(SH)2n
6.4.4
Current Protocols in Protein Science
X SH + HS Y
X S S Y
Figure 6.4.4 Equilibrium for the covalent interactions (formation of disulfide bonds) during
the folding of a protein, X and Y are cysteine
residues in the amino acid sequence of the
folding protein that interact specifically in the
folded structure.
SH
S
+ RSSR
+ 2e + 2H+
SH
S
heavy metals
SSR
+ RS
+ RS
P
S
Figure 6.4.6
Formation of disulfide
bond through mixeddisulfide intermediates.
Purification of
Recombinant
Proteins
6.4.5
Current Protocols in Protein Science
Limiting Aggregation
Aggregates of unfolded proteins are intrinsically more stable than the folded, monomeric
Overview of
Protein Folding
6.4.6
Current Protocols in Protein Science
LITERATURE CITED
Anfinsen, C.B. 1967. The formation of the tertiary
structure of proteins. Harvey Lect. 61:95-116.
Anfinsen, C.B. 1973. Principles that govern the
folding of protein chains. Science 181:223-230.
Craig, S., Schmeissner, U., Wingfield, P., and Pain
R.H. 1987. Conformation, stability and folding
of interleukin-1. Biochemistry 26:3570-3576.
Gilbert, H.F. 1994. The formation of native disulfide
bonds. In Mechanisms of Protein Folding (R.H.
Pain, ed.) pp. 109-111. Oxford University Press,
Oxford.
Goloubinoff, P., Christeller, J.T., Gatenby, A.A., and
Lorimer, G.H. 1989. Reconstitution of active
dimeric ribulose bisphosphate carboxylase from
an unfolded state depends on two chaperone
proteins and ATP. Nature 342:884-889.
Haase-Pettingell, C.A. and King, J. 1988. Formation
of aggregates from a thermolabile in vivo folding
intermediate in P22 tail spike maturation. J. Biol.
Chem. 263:4977-4983.
Hlodan, R., Craig, S., and Pain, R.H. 1991. Protein
folding and its implications for the production of
recombinant proteins. Biotechnol. & Genet. Eng.
Rev. 9:47-88.
Hlodan, R. and Hartl, F.U. 1994. How the protein
folds in the cell. In Mechanisms of Protein Folding (R.H. Pain, ed.) pp. 194-228. Oxford University Press, Oxford.
Lapanje, S., Skerjane, J., Glavnik, S., and Zibret, S.
1978. Thermodynamic studies of the interactions
of guanidinium chloride and urea with some
oligoglycines and oligolysines. J. Chem. Thermodynam. 10:425-433.
assembly of penicillin acylase, an enzyme composed of two polypeptide chains that result from
proteolytic activation. Biochemistry 30:90349040.
Lomas, D.A., Evans, D.Ll., Stone, S.R., Chang,
W.-S.W., and Carrell, R.W. 1993. Effect of the Z
mutation on the physical and inhibitory properties of 1-antitrypsin. Biochemistry 32:500-508.
Mitchinson, C. and Pain, R.H. 1985. Effects of
sulfate and urea on the stability and reversible
unfolding of -lactamase from Staphylococcus
aureus. J. Mol. Biol. 184:331-342.
Nall, B.T. 1994. Proline isomerization as a rate-limiting step. In Mechanisms of Protein Folding
(R.H. Pain, ed.) pp. 80-103. Oxford University
Press, Oxford.
Ptitsyn, O.B., Pain, R.H., Semisotnov, G.V.,
Zerovnik, E., and Razgulaev, D.I. 1990. Evidence for a molten globule state as a general
intermediate in protein folding. FEBS (Fed. Eur.
Biochem. Soc.) Lett. 262:20-24.
Roder, H. and Elve, G.A. 1994. Early stages of
protein folding. In Mechanisms of Protein Folding (R.H. Pain, ed.) pp. 37-40. Oxford University
Press, Oxford.
Thatcher, D. and Hitchcock, A. 1994. Protein folding in biotechnology. In Mechanisms of Protein
Folding (R.H. Pain, ed.) pp. 242-250. Oxford
University Press, Oxford.
Purification of
Recombinant
Proteins
6.4.7
Current Protocols in Protein Science
UNIT 6.5
Purification of
Recombinant
Proteins
Contributed by Paul T. Wingfield, Ira Palmer, and Shu-Mei Liang
Current Protocols in Protein Science (1995) 6.5.1-6.5.27
Copyright 2000 by John Wiley & Sons, Inc.
6.5.1
CPPS
BASIC
PROTOCOL 1
Folding and
Purification of
Insoluble Proteins
from E. coli
The lysozyme in BGH break buffer A aids in removal of the peptidoglycan and outer
membrane protein contaminants (see UNIT 6.1).
6.5.2
Current Protocols in Protein Science
Fold protein
6. Dilute the clear amber-colored solution with an equal volume of BGH folding buffer
A and pour into prewashed dialysis tubing.
Use two or three pieces of tubing and fill dialysis bags only three-quarters of the way to
allow for any volume increase during dialysis.
The BGH concentration from step 5 should be 2 to 4 mg/ml, and after dilution should not
exceed 1.0 to 2.0 mg/ml. If higher concentrations are found, the solution must be diluted
further with folding buffer A. BGH concentration can be estimated by diluting the sample
with 4 M guanidineHCl in water and measuring the absorbance at 280 nm and 260 nm
in a cell with a 1-cm path length. The total protein concentration (mg/ml) is estimated as
1.55 A280 0.76 A260 nm (Stoscheck, 1990). The BGH content may either be assumed to be
60% of the total protein or, more accurately, be estimated by performing SDS-PAGE and
densitometry using the washed pellet from step 4.
Urea is included in folding buffer A as a cosolvent to maintain solubility of the protein
during refolding. Removal of guanidineHCl by dialysis or dilution results in precipitation
if no cosolvent is used. The urea concentration chosen (in this case 4 M) should be low
enough to allow the native structure to form (see UNIT 6.4). Urea unfolding/folding profiles
for BGH (i.e., equilibrium-denaturation curves in which protein conformation is measured
as a function of denaturant concentration; see UNIT 6.4) were available in the literature
prior to development of this method (Edelhoch and Burger, 1966). The urea concentration
that induces protein unfolding can be determined rapidly by urea gradient electrophoresis
(Goldenberg, 1989).
Purification of
Recombinant
Proteins
6.5.3
Current Protocols in Protein Science
Figure 6.5.1 SDS-PAGE of bovine growth hormones on 12.5% polyacrylamide gel. Lane A,
BGH-expressing E. coli cells minus the expression vector; lanes B and C, BGH-expressing E. coli
cells with A-4 and A-9 BGH expression vectors, respectively; lane D, purified recombinant A-9
BGH; lane E, BGH purified from pituitary (supplied by A.F. Parlow, UCLA); lane F, purified
recombinant A-9 BGH with no reductant in sample buffer; lane G, BGH purified from pituitary with
no reductant in sample buffer. In lane E (pituitary BGH), the two bands correspond to full-length
protein and protein truncated at the N-terminus by 4 residues. It can be seen that the bottom band
has the same mobility as E. coli extracts containing the A-4 BGH construct. In lane G, it it may be
noted that the two bands are not resolved under nonreducing conditions.
Oxidation of the protein during dialysis can be monitored by SDS-PAGE (UNIT 10.1).
SDS-denatured oxidized BGH is more compact than SDS-denatured reduced BGH and thus
migrates faster as a result of the lower apparent molecular weight (i.e., 18 kDa for the
oxidized form versus 22 kDa for the reduced form). SDS-PAGE band patterns of oxidized
and reduced BGH are shown in Figure 6.5.1. Any free thiol groups in the sample are
quenched by addition of 20 mM iodoacetamide. SDS sample buffer (UNIT 10.1) minus
reductant is then added. The pH of the SDS-treated sample may have to be readjusted with
dilute alkali. The proportion of oxidized versus reduced protein is finally determined by
densitometry of the Coomassie bluestained gel. It should be noted that this approach does
not prove whether or not the correct disulfide bond(s) have been formed. The shift in
mobility upon reduction occurs because of the formation of the disulfide bond linking
Cys-51 to Cys-163. BGH in which the second disulfide bond (linking Cys-180 to Cys-188)
has been selectively reduced (Graf et al., 1975) still exhibits the gel shift. Despite these
potential pitfalls, the gel method is useful and correlates with other approachese.g., direct
monitoring of disulfide formation using 2-nitro-5-thiosulfobenzoate in the presence of
sodium sulfite after the free sulfhydryls have been quenched and the buffer components
removed (Thannhauser et al., 1984). Sulfhydryl groups can be assayed using Ellmans
reagent (Riddles et al., 1979).
8. Centrifuge the slightly cloudy solution 30 min at 20,000 g. Discard pellet and adjust
supernatant to pH 9.0 with 2 M HCl.
The volume of the supernatant should be 300 to 320 ml.
Folding and
Purification of
Insoluble Proteins
from E. coli
6.5.4
Current Protocols in Protein Science
10. Elute with BGH column buffer A at the same flow rate and continue fraction
collection until A280 or A260 of eluant approaches a baseline value. Assay fractions by
SDS-PAGE and pool fractions containing BGH.
Under the ion-exchange conditions used, BGH does not bind to the matrix and is located
in the flowthrough fractions. The more acidic E. coli contaminants bind tightly to the top
portion of the column, which turns brown. Some of the earlier flowthrough fractions may
be slightly contaminated with aggregated protein that separates from soluble protein as a
result of the gel-filtration effect of the matrix. A BGH concentration of 0.2 to 0.3 mg/ml in
400 to 450 ml of pooled eluant should be obtained. This should exhibit a single band on
the SDS-PAGE gel.
Pooled fractions can be stored 1 to 2 days at 4C and for longer periods at 80C.
13. Elute with BGH column buffer B at the same flow rate and continue fraction
collection until A280 or A260 of eluant approaches a baseline value. Assay fractions by
SDS-PAGE and pool fractions containing BGH.
The protein elutes in a single but slightly asymmetrical peak.
The gel-filtration elution peak has a sharp front edge, whereas the descending portion is
more diffuse and trailing. This elution behavior results from the fact that BGH is a rapidly
associating/dissociating monomer/dimer system (with a Kd of 0.8 to 1.0 105 M; see
Ackers, 1970, for discussion of theory). The apparent molecular weight estimated by gel
filtration is 30 to 35 kDa (with a monomer mass of 22 kDa).
The slightly alkaline pH of the column buffer and folding buffers is required to maintain
solubility of the protein.
14. Repeat step 11, then filter sterilize filtrate using a Millex-GV 0.22-m filter unit.
Store purified BGH in aliquots at 80C.
For long-term storage the protein can be lyophilized. If this is to be done, the pooled
fractions from step 10 should be directly dialyzed (APPENDIX 3B) against 0.1 M ammonium
bicarbonate (pH 9.2 to 9.4, adjusted with ammonium hydroxide), and the dialysate
filter-sterilized and freeze-dried. If an essentially salt-free protein is required, two cycles
of freeze-drying should be performed and the protein reconstituted with water alone after
the first drying cycle.
The concentration of the purified BGH can be conveniently determined by UV absorbance
measurementi.e., 1 mg/ml native BGH has an A280 of 0.7 in a 1-cm quartz cuvette. Protein
concentration may also be estimated in crude extracts using the Bio-Rad Protein Assay Kit
(based on the Bradford method) and in partially purified fractions as described in the
annotation to step 6. Biological activity of the protein is measured as described in Wingfield
et al. (1987a).
Purification of
Recombinant
Proteins
6.5.5
Current Protocols in Protein Science
BASIC
PROTOCOL 2
2. Add 21 g sucrose and mix well with cell paste using tissue homogenizer. Add 34 mg
lysozyme and mix again using tissue homogenizer. Incubate 30 min in a 30C water
bath, then dilute with 100 ml hIL-2 break buffer and cool on ice.
Folding and
Purification of
Insoluble Proteins
from E. coli
3. Break cells by passing suspension through a French press twice as described in UNIT
6.2. Centrifuge 30 min at 13,000 g (10,400 rpm in SS-34 rotor), 4C. Save pellet.
The wet weight of the pellet should be 1.5 g.
6.5.6
Current Protocols in Protein Science
7. Repeat step 5 and pool the two clear supernatants (total volume should be 25 ml).
Isolate hIL-2 monomer from solubilized protein by gel filtration
8. Immediately apply pooled supernatants to column prepared in step 6. Perform gel
filtration chromatography at 4C as described in UNIT 8.3 and construct a chromatogram, pooling the fractions that make up the third peak.
The solution should be applied to the column immediately after extraction to prevent any
covalent modification of the protein.
The column eluate usually shows three major peaks in A280. The first peak (in the void
volume) contains aggregates, the second peak contains dimeric hIL-2, and the third peak
contains monomeric hIL-2 (S.M.L., unpub. observ.). A T cell proliferation assay shows that
the third peak contains the highest biological activity of hIL-2 (Liang et al., 1986; Bottomly
et al., 1991).
9. Using 45-mm diameter Spectra/Por 3 dialysis tubing (MWCO 3500), dialyze the
pooled fractions (60 ml) making up the third peak at 4C overnight against 5 liters
of Milli-Q water, then for an additional 3 to 4 hr against 5 liters of fresh Milli-Q water.
The pooled eluate fractions should be extensively dialyzed against water to remove acetic
acid. The dialysate should have a volume of 60 ml with an hIL-2 concentration of 0.2
mg/ml. After dialysis, the hIL-2 monomer can be stored at 20C until required.
As the acetic acid is removed; sample pH slowly increases, thereby allowing hIL-2 thiols
to form disulfide bonds.
12. Fill the reservoirs of the HPLC system gradient maker with RP-HPLC solvent A and
RP-HPLC solvent B. Carry out a blank run (at room temperature) with the following
gradient program:
0 min: 0% solvent B/100% solvent A
10 min: 0% solvent B/100% solvent A
60 min: 50% solvent B/50% solvent A
90 min: 50% solvent B/50% solvent A
150 min: 70% solvent B/30% solvent A
Purification of
Recombinant
Proteins
6.5.7
Current Protocols in Protein Science
13. Pump sample from step 10 into column at a flow rate of 4 ml/min. Run the gradient
program described in step 12 at a flow rate of 1 ml/min at room temperature. Collect
8-ml fractions at a rate of 1 ml/min and pool the fractions making up the correctly
folded hIL-2 peak, which elutes at 70 min.
Care should be taken to avoid introducing air bubbles while pumping the sample into the
column, as these will generate false peaks in the chromatogram. If the peaks in the HPLC
profile are too broad, the gradient program should be varied to improve the separation (see
Chapter 8).
14. Using 11.5-mm diameter Spectra/Por 3 dialysis tubing (MWCO 3500), dialyze the
pooled hIL-2 fractions against Milli-Q water or 25 mM acetic acid at 4C. Filter the
purified protein solution using a 0.22-m filter unit and store at 20C or below.
SUPPORT
PROTOCOL
Folding and
Purification of
Insoluble Proteins
from E. coli
6.5.8
Current Protocols in Protein Science
A214
0.3
0.2
0.1
0.0
C
D
1
0.3
0.2
0.1
0.0
40
80
120
40
80
120
Time (min)
Figure 6.5.2 Chromatograms illustrating peaks produced by (A) correctly folded hIL-2 (in absence
of denaturant); (B) correctly folded hIL-2 (in presence of denaturant but at pH 3.5, which is too low
for disulfide-bond exchange); (C) scrambled hIL-2 isomers (resulting from denaturant treatment at
pH 8.5); (D) unfolded hIL-2 (resulting from denaturant/reductant treatment).
Properly folded hIL-2 usually elutes at 54% acetonitrile and unfolded hIL-2, which has
no intramolecular disulfide bond, elutes at 57% acetonitrile.
In the presence of denaturants such as guanidineHCl, hIL-2 rapidly scrambles into a
mixture of three disulfide-linked isomers. The incorrectly folded hIL-2 elutes at a lower
acetonitrile concentration (<54%) as shown in Figure 6.5.2. For each separation in Figure
6.5.2, two Vydac C4 columns were connected in series and run at 40C with a 1 ml/min
flow rate. The sample was eluted with an acetonitrile gradient that increased from 0% to
44% in 15 min, and then to 64% in 120 min. In the separation represented by the
chromatogram in panel A, 100 g hIL-2 in 40 l of 50 mM acetic acid was dissolved in
400 l of 20% formic acid and chromatographed. The elution peak (at 57.3% acetonitrile)
represents native hIL-2. In panel B, hIL-2 was added to 300 l of 6 M guanidineHCl/0.2
M acetic acid, pH 3.5, and incubated 10 min at room temperature. 200 l of 20% formic
acid was added and the sample was chromatographed. Note that the chromatogram has
not changed, as the pH is below the point where isomerization (disulfide-bond exchange)
occurs. In panel C, hIL-2 was added to 300 l of 6 M guanidineHCl/0.05 M TrisCl, pH
8.5, incubated 5 min at room temperature, then quenched with formic acid as in the
separation represented by panel B. Peak 1 represents the scrambled isomer with a disulfide
bond between Cys-105 and Cys-125, whereas peak 2 is probably the scrambled isomer
with a disulfide bond between Cys-58 and Cys-125. Finally, in panel D, hIL-2 was added
to 6 M guanidineCl as in panel C. 50 mM DTT was then added and the sample was
incubated 40 min at room temperature, then quenched with formic acid as in panel B. The
peak (at 59.4% acetonitrile) represents completely reduced (unfolded) hIL-2. (see also
Browning et al., 1986).
IMPORTANT NOTE: If the pressure in the HPLC column exceeds 4000 psi, the column
should be cleaned.
Purification of
Recombinant
Proteins
6.5.9
Current Protocols in Protein Science
BASIC
PROTOCOL 3
Folding and
Purification of
Insoluble Proteins
from E. coli
6.5.10
Current Protocols in Protein Science
3. Centrifuge 1 hr at 30,000 g (14,000 rpm in J-14 rotor), 4C. Resuspend pellet with
500 ml suspension buffer and centrifuge 1 hr at 30,000 g.
Extract washed pellet and purify by gel filtration
4. Dissolve pellet in 40 ml IN extraction buffer using tissue homogenizer. Centrifuge
45 min at 100,000 g (35,000 rpm in 45Ti rotor), 4C.
5. Apply the clear supernatant to a 6 60cm Superdex 200 gel-filtration column (UNIT
8.3) equilibrated with IN column buffer A. Elute at 300 ml/hr using IN column buffer
A, collecting 15-ml fractions. Assay fractions by SDS-PAGE (UNIT 10.1) and pool those
containing IN50-212.
IN50-212 should be the major band visible on SDS-PAGE after gel filtration, representing
20% to 50% of the total protein.
Purification of
Recombinant
Proteins
6.5.11
Current Protocols in Protein Science
8. Elute bound protein with a linear pH gradient in a total volume of 980 ml (10 column
volumes) beginning at pH 6.4 and ending at pH 4.5: make the gradient by placing
490 ml IN column buffer C (pH 6.4) and 490 ml of IN column buffer D (pH 4.5) in
the appropriate reservoirs of the gradient maker. Elute column at 300 ml/hr, collect
10-ml fractions and pool those making up the major peak (containing IN50-212; eluted
at pH 5.5).
After elution, the column can be cleaned by washing with 0.2 M acetic acid/6 M guanidineHCl, then reequilibrated with IN column buffer B. Under the conditions described, the
column can be used two or three times before it turns from light blue-green to brownishgray, indicating that regeneration is required. The column can be regenerated by passing
2 column volumes of 100 mM EDTA to strip Ni from the resin, washing the column
sequentially with 2 column volumes each of 0.1 M NaOH and 6 M guanidineHCl, and
finally passing 2 column volumes of 10 mM NiSO4 through the column to charge the resin
with Ni. The column is then washed with water and equilibrated with IN column buffer B.
Further details on use of the Ni-NTA resin are given in the manufacturers literature
(Qiagen, 1992) and in UNIT 9.2.
folding buffer
stir bar
HIV-1
integrase
peristaltic pump
Folding and
Purification of
Insoluble Proteins
from E. coli
magnetic stirrer
Figure 6.5.3 Setup for folding of HIV-1 integrase by dilution into buffer.
6.5.12
Current Protocols in Protein Science
12. Concentrate solution to 1 mg/ml protein using a stirred cell with Diaflo PM 10
ultrafiltration membrane. Filter concentrated solution (25 to 30 ml volume) through
a Millex-GV 0.22-m filter.
If it is not be be used immediately, the concentrated solution can be stored at 80C for
several months.
The largest stirred cell produced by Amicon has a 2-liter maximum capacity (i.e., the Model
2000). This cell can be used to concentrate the solution to 100 ml, and this must be further
concentrated using a smaller cell (e.g., with a 200- or 400-ml capacity). For volumes >2
liters, or for a more rapid process, the Minitan tangential-flow membrane system (Millipore) can be used. This system allows a concentration rate of 0.5 to 1.0 liter/hr.
14. Add 300 l of 2000 NIH unit/ml thrombin (600 NIH units) to 60 ml of diluted protein
from step 13 (30 mg IN50-212). Incubate 30 min at 28C with occasional mixing. Add
an additional 300 l of 2000 NIH unit/ml thrombin and incubate another 30 min.
Cool solution on ice to 4C and proceed immediately to step 15.
Digestion conditions are optimized by conducting small-scale digestions (using <1.0 ml of
protein solution from step 13). See Critical Parameters and Troubleshooting for details.
The substrate/enzyme ratio used here is 0.02% (w/w).
Thrombin digestions are more reproducible when carried out on a relatively small scale
e.g., 20 to 30 mg protein per batch. If more cleaved protein is required, several batches
can be processed concurrently.
6.5.13
Current Protocols in Protein Science
17. Apply digest to equilibrated column from step 15 at a flow rate of 2 ml/min. Collect
the column flowthrough, then wash with IN column buffer F until A280 is close to the
baseline (this usually requires 1 to 2 column volumes).
IN50-212 cannot be separated from thrombin by gel filtration because the IN50-212 (mol. wt.
18,200) undegoes self-association (Hickman et al., 1994), and therefore elutes with a
molecular weight close to that of thrombin (36,000). Affinity chromatography using the
thrombin inhibitor p-aminobenzamidine is therefore employed.
The 1 M urea is added to the sample and column buffers to prevent nonspecific binding of
IN50-212 to the column.
The affinity column may be regenerated by washing with several column volumes of 6 M
guanidineHCl in water to remove the bound thrombin; alternatively, the thrombin can be
eluted under native conditions with 50 mM TrisCl (pH 7.8)/10 mM benzamidineHCl.
18. Pool flowthrough and wash fractions, add 0.1 mM AEBSF, then concentrate to 2
mg/ml (in 10 ml) using a stirred cell with Diaflo PM 10 ultrafiltration membrane.
Peform final purification and assay protein
19. Purify protein by gel filtration as in step 10, using IN column buffer G in place of IN
column buffer E for equilibration and elution.
The column size can be reduced to 2.6 60 cm for fractionation of smaller (30 mg)
quantities of protein; the sample volume should be 5 to 10 ml.
20. Pool IN50-212-containing fractions and filter sterilize with Millex-GV 0.22-m filter.
The final protein solution can be concentrated if required. However, it should be noted that
when CHAPS-containing solutions are concentrated by ultrafiltration, the detergent will
also be concentrated. The excess detergent can be slowly removed by dialysis (APPENDIX
3B; the critical micellar concentration of CHAPS is 8 mM and its micellar size is 8 kDa).
Unfortunately, radiolabeled CHAPS is not available for monitoring the removal of that
detergent; however, an enzymatic assay is available (Talalay, 1960; Coleman et al., 1979).
21. Measure A280 of protein solution in a cuvette with a 1-cm path length.
Native IN50-212 (after the His tag has been cleaved) at a concentration of 1 mg/ml will have
an A280 = 1.54.
Folding and
Purification of
Insoluble Proteins
from E. coli
6.5.14
Current Protocols in Protein Science
Purification of
Recombinant
Proteins
6.5.15
Current Protocols in Protein Science
Folding and
Purification of
Insoluble Proteins
from E. coli
IN break buffer
50 mM TrisCl, pH 7.5 (pH as determined at 4C)
5 mM EDTA (from 0.5 M stock)
5 mM benzamidineHCl (780 mg/liter)
5 mM DTT (770 mg/liter)
0.1 mg/ml lysozyme (100 mg/liter)
Prepare immediately before use
It is preferable that DTT be added as a solid, but it may also be added as 0.2 M stock in
water (30.8 g/liter; store at or below 20C; stable 1 month).
6.5.16
Current Protocols in Protein Science
IN column buffer A
10 mM TrisCl, pH 7.5 (pH as determined at 4C)
4 M guanidineHCl (382 g/liter)
1 mM 2-mercaptoethanol (2-ME; 0.070 ml/liter)
Because of the volume increase on addition of guanidineHCl (i.e., 1 g increases the volume
0.76 ml), buffer components must be added to 420 ml water and total volume adjusted to 1
liter at the end.
Concentrated 2-ME (14.3 M) should be stored at 4C and opened bottles replaced every 2
or 3 months.
IN column buffer B
To 500 ml H2O add:
10 ml 1 M TrisCl, pH 8.0 (pH as determined at 4C; 10 mM final)
0.73 g NaH2PO4H2O (monobasic; mol. wt. 137.99)
13.44 g Na2HPO4 (dibasic; mol. wt. 141.96)
0.07 ml 2-mercaptoethanol (2-ME; 1 mM final)
After salts have dissolved add:
573.2 g guanidineHCl (6 M final)
Bring temperature to 4C and add 4C H2O to 1 liter
Adjust pH to 8.0 (as determined at 4C) if necessary with 2 M NaOH or 2 M HCl
Prepare immediately before use
Because of the volume increase on addition of guanidineHCl (i.e., 1 g increases the volume
0.76 ml), buffer components must be added to 500 ml and total volume adjusted to 1 liter
at the end.
Concentrated 2-ME (14.3 M) should be stored at 4C and opened bottles replaced every 2
or 3 months.
IN column buffer C
Titrate IN column buffer B (see recipe) to pH 6.4 (as determined at 4C) with HCl.
Prepare immediately before use.
IN column buffer D
Prepare 100 mM (13.8 g/liter) NaH2PO4H2O. If necessary adjust to pH 4.5 with 2
M NaOH or 2 M HCl.
IN column buffer E
Immediately before use, prepare IN column buffer B (see recipe), replacing the
2-ME with 5 mM DTT.
It is preferable that DTT be added as a solid, but it may also be added as 0.2 M stock in
water (30.8 g/liter; store at or below 20C; stable 1 month).
IN column buffer F
50 mM TrisCl, pH 7.5 (pH as determined at 4C)
0.25 M NaCl (14.6 g/liter)
10 mM 3-[(3-cholamidopropyl)-dimethylammonio]-1-propanesulfonate
(CHAPS; mol. wt. 614.9; 6.2 g/liter)
1 mM DTT (154 mg/liter)
1 mM EDTA (prepare from 0.5 M stock)
1 M urea (60 g/liter)
Prepare immediately before use
It is preferable that DTT be added as a solid, but it may also be added as 0.2 M stock in
water (30.8 g/liter; store at or below 20C; stable 1 month).
IN column buffer G
Prepare IN column buffer F (see recipe), but omit urea.
Purification of
Recombinant
Proteins
6.5.17
Current Protocols in Protein Science
IN extraction buffer
10 mM TrisCl, pH 7.5 (pH as determined at 4C)
8 M guanidineHCl (764 g/liter)
5 mM DTT (770 mg/liter)
Prepare immediately before use
Because of the volume increase on addition of guanidineHCl (i.e., 1 g increases the volume
0.76 ml), buffer components must be added to 420 ml and total volume adjusted to 1 liter at
the end.
If high-quality guanidineHCl is used, the solution will be colorless and clear (see APPENDIX
3A).
It is preferable that DTT be added as a solid, but it may also be added as 0.2 M stock in
water (30.8 g/liter; store at or below 20C; stable 1 month).
IN folding buffer
50 mM TrisCl. pH 7.5 (pH as determined at 4C)
0.5 M NaCl (29.2 g/liter)
10 mM 3-[(3-cholamidopropyl)-dimethylammonio]-1-propanesulfonate
(CHAPS; mol. wt. 614.9; 6.2 g/liter)
2 mM DTT (308 mg/liter)
Prepare immediately before use
It is preferable that DTT be added as a solid, but it may also be added as 0.2 M stock in
water (30.8 g/liter; store at or below 20C; stable 1 month).
COMMENTARY
Background Information
Folding and
Purification of
Insoluble Proteins
from E. coli
6.5.18
Current Protocols in Protein Science
Purification of
Recombinant
Proteins
6.5.19
Current Protocols in Protein Science
Folding and
Purification of
Insoluble Proteins
from E. coli
6.5.20
Current Protocols in Protein Science
Reversed-phase-HPLC has been used to separate these isomers (Browning et al., 1987; also
see Fig. 6.5.3).
HIV-1 integrase
An early part of the life cycle of all
retroviruses (including the human immunodeficiency virus, HIV) is the integration of a DNA
copy of the viral genome into the host chromosome. This step is essential for viral replication.
Retroviral DNA integration is carried out by a
defined set of DNA cutting and joining reactions, catalyzed by integrase protein (Katz and
Skalka, 1994).
The HIV-1 integrase is a 288-residue protein
(mol. wt. 32,200) which has been expressed in
E. coli (Sherman and Fyfe, 1990). The protein
is located in the pellet obtained by low-speed
centrifugation following cell breakage, but is
apparently not highly aggregated into inclusion
bodies, as it can be extracted into high-ionicstrength solution (e.g., 1 M NaCl) without a
denaturant. The insolubility of the protein is
probably a result of nonspecific binding to E.
coli nucleic acid. The salt-extraction procedure
was originally used by Terry et al. (1988) for
extraction of avian sarcoma-leukosis virus integrase expressed in E. coli.
Dissection of the HIV-1 integrase by preparation of a series of deletion mutants (Bushman
et al., 1993; see also references therein) demonstrated that a central core region of residues
50 to 212 was enzymatically active, carrying
out a subset of the reactions catalyzed by the
full-length enzyme. This deletion mutant is
interesting for structural studies as it has better
solubility than the full-length enzyme, which
is notoriously difficult to handle as a result of
limited solubility in the usual aqueous solvents.
The HIV-1 integrase deletion mutant
(IN50-212) is expressed in E. coli as a fusion
protein with the N-terminal extension sequence
GlySerSerGlyHisHisHisHisHisHisSerSer
GlyLeuValProArgGlySerHisMet. This sequence (a His tag) contains a six-residue histidine repeat which is responsible for the selective high-affinity binding of the fusion protein
to a nickel-chelate column. The boldface portion of the His tag sequence indicates the location of the specific thrombin-cleavage site between Arg-16 and Gly-17, which is exploited
in removal of the tag using thrombin.
In Basic Protocol 3 the His-tagged IN50-212
is expressed as an insoluble (i.e., inclusionbody-type) protein, and is purified under denaturing conditions, taking advantage of the fact
that metal-chelate affinity chromatography
Critical Parameters
Bovine growth hormone (BGH)
Protein extraction. The protein should be
extracted from the inclusion bodies in a
monomeric and fully reduced state. This provides a defined starting point from which to
develop a reproducible folding protocol. Based
on pilot-scale experiments in which washed
pellets (from step 4 of Basic Protocol 1) were
extracted with various protein denaturants
(P.T.W., unpub. observ.; also see UNIT 6.3), 8 M
guanidineHCl was chosen for solubilization of
aggregated BGH. The most effective ones were
Purification of
Recombinant
Proteins
6.5.21
Current Protocols in Protein Science
Supplement 4
Folding and
Purification of
Insoluble Proteins
from E. coli
Interleukin-2
Protein purification. Recombinant hIL-2
should be extracted as a monomer (see Basic
Protocol 2, step 5). The extraction has been
tested with various concentrations of acetic to
determine the optimal condition; 20% acetic
acid has been found to extract the maximum
amount of monomeric hIL-2 from inclusion
bodies. However, in the gel-filtration step (see
Basic Protocol 2, step 8), 20% acetic acid is too
corrosive for the metal parts of the chromatography system (e.g., the stands, pump, and fraction collector); thus 10% acetic acid is recommended.
Protein folding. The hIL-2 monomer that
elutes from the Sephadex G-100 column (see
Basic Protocol 2, step 8) yields two peaks upon
analytical reversed-phased chromatography
(see Support Protocol). These two peaks represent the refolded, oxidized hIL-2 and unfolded,
reduced hIL-2. The proportion of the oxidized
form is increased by the dialysis step (see Basic
Protocol 2, step 9). The time and volume recommended in the dialysis step have been determined empirically to produce the best yield of
oxidized hIL-2. Longer periods of dialysis (e.g.
48 hr) causes do not increase the yield of oxidized hIL-2.
Scaleup. The parameters for larger-scale
production are the same as those for BGH (see
above).
HIV-1 integrase
MCAC. It is important to use low concentrations of 2-mercaptoethanol in the sample and
column buffer (1 mM is safe) and to completely
avoid dithiothreitol, as either reductant will
strip Ni2+ from the MCAC matrix (see UNIT 9.4).
Protein folding. Because HIV-1 integrase is
not a very soluble protein and has a high tendency toward self-association and aggregation
(Hickman et al., 1994), the solubility and stability of the folded protein must be maintained
by including a relatively high salt concentration
as well as the detergent CHAPS in the folding
buffer and in all subsequent column buffers.
The buffer additives indicated for HIV-1 integrase are usually not required for folding of the
average protein.
Thrombin cleavage. It is necessary to perform pilot-scale experiments to optimize the
conditions for removing the His tag using
thrombin. The usual parameters to vary are
enzyme/substrate ratios and incubation time
and temperature. Thrombin is a serine protease,
and can be irreversibly inhibited with either
PMSF or AEBSF. Removal of the tag can be
6.5.22
Supplement 4
Troubleshooting
Bovine growth hormone (BGH)
Most precautions and guidelines for folding
and oxidation are given in annotations to the
individual steps of Basic Protocol 1. It should
be cautioned that, when using a mixture of
oxidized and reduced glutathione for oxidizing
proteins that contain both free and disulfidelinked cysteines in the native state (e.g., IL-2),
the potential exists for glutathionylation, in
which the unpaired cysteine(s) form a mixed
disulfide with glutathione. This modification
will cause charge heterogeneity, and can be
readily detected by electrospray ionization
mass spectrometry (ESI-MS), which will indicate a mass increase of 305.3 for each GSH
moiety incorporated.
Interleukin-2
Low-molecular-weight impurities extracted
together with monomeric hIL-2 in acetic acid
(see Basic Protocol 2, step 5) are usually eluted
earlier than the oxidized form of hIL-2 in RPHPLC (see Basic Protocol 2, step 12) and can
thus be removed. If the RP-HPLC peak containing the oxidized form of hIL-2 is contaminated
by low-molecular-weight impurities, SDSPAGE analysis of each fraction composing the
peak is recommended to avoid pooling fractions
containing impurities. Additional information
regarding troubleshooting is given in annotations to individual steps of Basic Protocol 2.
HIV-1 integrase
MCAC. Protein will not bind to the column
if the His tag has been nonspecifically degraded
or removed by E. coli proteases. SDS-PAGE
(UNIT 10.1) of the cell extracts and fractions
during the purification should indicate if the tag
is present. The His tag adds 2 kDa to the
Anticipated Results
BGH
The purification of A-9 BGH is summarized in Table 6.5.1. About 90 to 100 mg of
protein are obtained from 50 g (wet weight) of
cells with an overall yield in the range 15% to
25%. For larger and/or multidomain proteins,
much lower yields (1% to 5%) may be more
typical. Gel analysis of the BGH in cell extracts
and in purified protein is shown in Figure 6.5.1.
Interleukin-2
If the expression of hIL-2 in E. coli is 10%,
20 g of cell pellet will yield 4 to 5 mg of purified
protein, with a specific activity of 5 106 U/mg.
The purity of the hIL-2 should be >98% as
indicated by analytical C18 column chromatography (see Support Protocol).
Purification of
Recombinant
Proteins
6.5.23
Current Protocols in Protein Science
Table 6.5.1
Total Proteinc
(mg)
Stage of Purificationb
Cells
Washed pellet (step 4)
Dialysis supernatant (step 8)
DEAE-Sepharose pool (step 11)
Sephadex G-100 pool (step 14)
5000
375
200
131
95
Specific BGH
contentd (%)
7.5
60
75
95
99
Total BGH
(mg)
375
225
150
125
94
Yield (%)
100
60
40
33
25
aThe summary refers specifically to a biologically active analog of BGH (-9 BGH) in which the full-length sequence is
truncated at the N-terminus by eight residues and serine is substituted for glycine at the first position. Similar relative
yields were obtained with a -1 analog in which the N-terminal Ala of the native sequence was replaced by Met, but the
amount of this protein expressed in E. coli was several-fold lower than that obtained for -9 BGH (Wingfield et al., 1987a)
bThe numbers in parentheses refer to Basic Protocol 1 steps.
cDeterminations were made using the Bio-Rad Protein Assay Kit, except those for the DEAE and Sephadex pools, which
SDS-polyacrylamide gels.
Folding and
Purification of
Insoluble Proteins
from E. coli
HIV-1 integrase
Figure 6.5.4 shows results of SDS-PAGE of
the purified IN50-212 before (lane B) and after
(lane A) removal of the His tag by thrombin
digestion. The molecular weight difference of
2000 kDa corresponds to the removal of the
6.5.24
Current Protocols in Protein Science
Time Considerations
BGH
Basic Protocol 1 for the purification of BGH
takes 5 days. Protein can be stored at 80C at
the end of step 4 (as washed pellets) or step 10
(as pooled fractions from ion-exchange).
Day 1: Cell breakage, preparation of washed
pellets and extraction with guanidineHCl
(steps 1 to 5) requires 12 day of work. The
protein extract is then dialyzed overnight (step
5).
Day 2: After changing the dialysis buffer
(step 7), dialysis is continued at least 6 hr. At
this step the dialysis can be left overnight or
directly processed by ion-exchange chromatography (steps 8 to 10), which takes 6 to 8 hr
and can be run overnight.
Day 3: Ion-exchange chromatography is
run, or if ion-exchange was performed on day
2, the pooled fractions are concentrated for
several hours (step 11), then applied to the
gel-filtration column and chromatographed for
several hours (steps 12 and 13). The gel-filtration column is run is overnight if a low-pressure
matrix (e.g., Sephadex G-100) is used or on the
same day if a medium-pressure matrix (e.g.,
Superdex 200) is used.
Day 4: The protein is analyzed (e.g., by
SDS-PAGE or isoelectric focusing), concentrated if required (step 14), and frozen or prepared for lypohilization.
Interleukin-2
In Basic Protocol 2, preparation and lysis of
the cells (steps 1 and 2) will take 1 hr, cell
breakage and preparation of washed pellets
(steps 3 and 4) will take 3 hr, acid extraction
(steps 5 and 6) will take 1 hr, Sephadex G-100
column chromatography (step 7) will take 12
hr, dialysis of the pooled fractions will be carried out overnight, and RP-HPLC (steps 9 to
12) will require 7.5 hr. The final dialysis is run
overnight. In the Support Protocol for resolution of native and misfolded forms of hIL-2, it
will take 1 hr to run a sample after the blank
runs to establish the baseline profile have been
completed.
HIV-1 integrase
Basic Protocol 3 is usually carried in two
stages. In stage 1, purification of the unfolded
protein that still contains the His tag (steps 1 to
8) is usually carried out on a relatively large
scale (using 100 g of cells). If the Superdex 200
column used in step 5 and the MCAC column
used in step 6 are run using the Pharmacia
Biotech Biopilot system, this stage will take 3
to 4 days to complete. It will take longer if
low-pressure columns are used. In stage 2,
protein folding (steps 11 and 12), removal of
the His tag by thrombin cleavage (steps 13 and
14), affinity chromatography (steps 15 to 17),
and gel filtration (step 19) are performed. This
stage is carried out repeatedly with relatively
small amounts of protein (i.e., 30 mg; this
represents <10% of the protein produced in the
Stage 1). Using the Pharmacia Biotech FPLC
system for the chromatography steps, stage 2
takes 2 days.
Literature Cited
Abdel-Meguid, S.S., Shieh, H.-S., Smith, W. W.,
Dayringer, H.E., Violand, B.N., and Bentle, L.A.
1987. Three-dimensional structure of a genetically engineered variant of porcine growth hormone. Proc. Natl. Sci. U.S.A. 84:6434-6437.
Ackers, G.K. 1970. Analytical gel chromatography
of proteins. Adv. Protein Chem. 24:343-446.
Bastiras, S. and Wallace, J. C. 1992. Equilibrium
denaturation of recombinant porcine growth hormone. Biochemistry 31:9304-9309.
Bazan, J.F. 1992. Unraveling the structure of IL-2.
Science 257:410-413.
Bell, J.A., Moffat, K., Voderhaar, B.K., and Golde,
D.W. 1985. Crystallization and preliminary Xray characterization of bovine growth hormone.
J. Biol. Chem. 260:8520-8525.
Bogosian, G., Violand, B.N., Dorward-King, E.J.,
Workman, W.E., Jung, P.E., and Kane, J.F. 1989.
Biosynthesis and incorporation into protein of
norleucine by Escherichia coli. J. Biol. Chem.
264:531-539.
Bottomly, K., Davis, C.S., and Lipsky, P.E. 1991.
Measurement of human and murine interleukin2 and interleukin-4. In Current Protocols in Immunology (J.E. Coligan, A.M. Kruisbeek, D.H.
Marguiles, E.M. Shevach, and W. Strober, eds.)
pp. 6.3.1-6.3.12. John Wiley & Sons, New York.
Brems, D.N. and Havel, H.A. 1989. Folding of
bovine growth hormone is consistent with the
molten globule hypothesis. Proteins Struct.
Funct. Genet. 5:93-95.
Browning, J.L., Mattaliano, R.J., Chow, E.P., Liang,
S.-M., Allet, B., Rosa, J., and Smart, J.E. 1986.
Disulfide scrambling of interleukin-2: HPLC
resolution of the three possible isomers. Anal.
Biochem. 155:123-128.
Purification of
Recombinant
Proteins
6.5.25
Current Protocols in Protein Science
6.5.26
Current Protocols in Protein Science
Purification of
Recombinant
Proteins
6.5.27
Current Protocols in Protein Science
UNIT 6.6
This unit describes how pGEX vectors (available from Pharmacia Biotech) can be used
for high-level inducible intracellular expression of polypeptides as fusions with glutathione-S-transferase (GST) in Escherichia coli. GST fusion proteins are easily purified
under nondenaturing conditions by affinity chromatography (Chapter 9) using a glutathioneSepharose 4B conjugate. The amino-terminal GST moiety can then be cleaved
from the protein of interest using a specific protease cleavage site located between the
GST moiety and the recombinant polypeptide. Finally, the GST moiety can be removed
by rechromatographing the sample on the glutathione-Sepharose column.
Potential applications of the system include the expression and purification of large
quantities of individual polypeptides for use in structural determinations using either
nuclear magnetic resonance (NMR) or crystallography, immunological studies, vaccine
production, and structure-function studies involving protein-protein and DNA-protein
interactions.
The basic protocols are chromatography-based adaptations of the manufacturers recommendations and include batch and column purification methods. The experience of this
laboratory is that, although batch purifications may be slightly faster, column-based
purification steps usually provide higher protein yields and higher purity of the final
product.
GST fusion proteins can be expressed at high levels in E. coli grown using an environmental shaker (see Basic Protocol 1). The soluble fusion protein is purified using a
single-step glutathioneSepharose 4B affinity column (see Basic Protocol 2). Fusion
proteins that are found in inclusion bodies can be extracted using denaturants and then
refolded before affinity chromatography using a glutathioneSepharose 4B column (see
Alternate Protocol 1). Soluble fusion proteins can also be purified batchwise using
glutathioneSepharose 4B affinity matrix (see Alternate Protocol 2). The fusion protein
can be cleaved with either thrombin or factor Xa in solution to separate the protein of
interest from glutathione (see Basic Protocol 3). The protein of interest can also be
released by protease digestion of the fusion protein immobilized on a glutathione
Sepharose 4B column (see Alternate Protocol 3), or by batchwise protease cleavage and
separation from the resin (see Alternate Protocol 4). Affinity purification using glutathioneSepharose 4B can be used to remove the GST moiety after enzymatic cleavage
(see Support Protocol 1). HPLC gel filtration is used as a final purification step to obtain
>98% pure protein (see Support Protocol 2). The relationships between the protocols are
shown in Figure 6.6.1.
STRATEGIC PLANNING
A variety of pGEX expression vectors are commercially available (Pharmacia Biotech)
that contain a tac promoter for chemically inducible, high-level protein expression. The
available pGEX vectors have an open reading frame encoding glutathione-S-transferase
(GST) followed by multiple cloning sites. These are followed by termination codons in
each reading frame (Figs. 6.6.2 and 6.6.3). A fragment of DNA containing the genetic
sequence for the polypeptide of interest is ligated into an appropriate pGEX vector and
transformed into E. coli. It should be noted that although expression in E. coli is efficient,
there is no post-translational modification machinery. Successful expression of GST
Purification of
Recombinant
Proteins
6.6.1
Supplement 9
on-column affinity
purification (Basic Protocol
2 or Alternate Protocol 1)
cleavage on column
(Alternate Protocol 3)
elution
batchwise purification
with affinity resin
(Alternate Protocol 2)
elution
cleavage in solution
(Basic Protocol 3)
elution
cleavage in solution
(Basic Protocol 3)
protein purification by
affinity chromatography
(Support Protocol 1) and/or
HPLC (Support Protocol 2)
Figure 6.6.1 Flow chart showing the relationships between the various protocols in this unit.
fusion proteins using baculovirus systems (Davies et al., 1993) and yeast (Mitchell et al.,
1993) have also been reported.
One factor that influences which pGEX vector to choose is whether or not the GST moiety
will ultimately be cleaved away from the protein of interest. The pGEX-2T and pGEX-4T
series of vectors contain a protease cleavage site for thrombin, and the pGEX-3X and
pGEX-5X series of vectors contain protease cleavage sites for factor Xa. A more recently
developed expression vector is the pGEX-6P series, which contains a cleavage site for
PreScission protease (Pharmacia Biotech). PreScission protease has the advantage that it
is effective at low temperatures (5C). It is also a GST fusion protein, a feature that
facilitates removal of the protease from the target protein after cleavage. Fusion proteins
with a thrombin recognition site have the advantage that relatively small amounts of
thrombin and short digestion times at 37C will often cleave the fusion protein with high
efficiency. Thrombin digestions are often the most cost effective on a per milligram of
cleaved target polypeptide basis. Factor Xa is more expensive and typically requires use
of much higher enzyme-to-substrate ratios for efficient cleavage. Solutions of factor Xa
also have a more limited shelf life since freezing and thawing inactivates this enzyme.
Expression and
Purification of
GST Fusion
Proteins
For preparation of cDNA inserts encoding the desired polypeptide, see Ausubel et al.
(1994) or Sambrook et al. (1989). Briefly, a set of oligonucleotides is designed for
polymerase chain reaction (PCR) amplification of the region of interest of a pertinent
cDNA. These oligonucleotides should also contain appropriate restriction sites adjacent
to the desired coding region that are compatible with a restriction site in the cloning site
of the selected pGEX vector (see Figs. 6.6.2 and 6.6.3). PCR amplification is performed,
6.6.2
Supplement 9
pGEX-1T
thrombin
Leu Val Pro Arg Gly Ser Pro Glu Phe Ile Val Thr Asp
CTG GTT CCG CGT GGA TCC CCG GAA TTC ATC GTG ACT GAC TGA CGA
BamHI
stop codons
EcoRI
Tth111I
Aat II
BalI
glutathione-S- transferase
Ptac
pSj10Bam7Stop7
Ap
BspMI
PstI
pGEX
~4950 bp
la
NarI
EcoRV
BssHII
ApaI
Bst EII
MluI
Alw NI
p4.5
cl
pBR322
ori
Figure 6.6.2 pGEX vectors are plasmid expression vectors that express a cloned gene as a fusion
protein with glutathione-S-transferase (GST). The lac repressor gene binds to the lac promoter (ptac)
and represses expression of the GST fusion protein until induction with isopropyl-1-thio--D-galactopyranoside (IPTG). The polypeptide of interest can be inserted immediately after the GST gene
using the polylinker site shown in brackets (pGEX-1T, shown here, is the most common; see Fig.
6.6.3 for other PGEX polylinkers). Protease cleavage sites (brackets above the polylinker sequences) are located between the GST carrier protein and the protein of interest so that the GST
moiety can be removed. Restriction endonuclease sites are indicated below the sequence of the
polylinker and on the plasmid. An important consideration in selecting a vector and appropriate
cloning sites is to minimize the number of extraneous residues introduced into the N-terminal of the
target polypeptide. Vector map courtesy of Pharmacia Biotech.
followed by digestion of the PCR product and the pGEX vector with the appropriate
restriction enzymes. The PCR product is then ligated into the pGEX vector and transfected
into a suitable E. coli host. Several transformants should be grown in minicultures and
induced with isopropyl-1-thio--D-galactopyranoside (IPTG) to check for expression of
the desired fusion protein. Fusion protein expression can be monitored by SDS-PAGE
(UNIT 10.1) or by Western blot (UNIT 10.10) detection of the GST fusion protein using an
antibody specific for either the target protein or the GST moiety. Once successful
expression is achieved, the integrity of the DNA should be verified by sequencing to
ensure that no errors were introduced during PCR.
Before conducting a large-scale purification, it is worthwhile to perform a small pilot
purification (10-fold less than protocol descriptions) to determine optimal conditions.
The purification can then easily be scaled up. All stages of purification should be
monitored using SDS-PAGE (UNIT 10.1). In most cases, GST fusion protein expression is
very high and a major band at the expected molecular weight (the GST moiety contributes
26 kDa to the molecular weight) is obvious when uninduced and induced cells are
compared on SDS gels (Fig. 6.6.4). This band can then be monitored at each step of the
purification. However, as noted above, if the level of protein expression obtained is low
or band identification is ambiguous, the fusion protein can be monitored by Western blot
Purification of
Recombinant
Proteins
6.6.3
Current Protocols in Protein Science
Supplement 9
pGEX-2T
thrombin
Leu Val Pro Arg Gly Ser Pro Gly Ile His Arg Asp
CTG GTT CCG CGT GGA TCC CCG GGA ATT CAT CGT GAC TGACTG ACG
stop codons
thrombin
Leu Val Pro Arg Gly Ser Arg Arg Ala Ser Val
CTG GTT CCG CGT GGA TCT CGT CGT GCA TCT GTT GGA TCC CCG GGAATT CATCGT GAC TGA
stop codon
thrombin
Leu Val Pro Arg Gly Ser Pro Glu Phe Pro Gly Arg Leu Glu Arg Pro His Arg Asp
CTG GTT CCG CGT GGA TCCCCG GAATTC CCG GGT CGA CTC GAG CGGCCG CAT CGT GAC TGA
BamHI
SalI
EcoRI SmaI
stop codons
NotI
XhoI
pGEX-4T-2
thrombin
Leu Val Pro Arg Gly Ser Pro Gly Ile Pro Gly Ser Thr Arg Ala Ala Ala Ser
CTG GTTCCG CGT GGA TCC CCA GGA ATT CCC GGG TCGACT CGA GCG GCC GCA TCG TGA
BamHI
EcoRI SmaI
SalI
stop codons
NotI
XhoI
pGEX-4T-3
thrombin
Leu Val Pro Arg Gly Ser Pro Asn Ser Arg Val Asp Ser Ser Gly Arg Ile Val Thr Asp
CTG GTT CCG CGT GGA TCC CCG AAT TCC CGG GTC GAC TCG AGC GGC CGC ATC GTG ACT GAC TGA
BamHI
factor Xa
EcoRI SmaI
SalI
stop codons
NotI
XhoI
pGEX-3X
Ile Glu Gly Arg Gly Ile Pro Gly Asn Ser Ser
ATC GAA GGT CGT GGG ATC CCC GGG AAT TCA TCG TGA CTG ACT GAC
stop codons
factor Xa
Ile Glu Gly Arg Gly Ile Pro Glu Phe Pro Gly Arg Leu Glu Arg Pro His Arg Asp
ATC GAA GGT CGT GGG ATC CCC GAATTC CCG GGTCGA CTC GAG CGG CCG CAT CGT GAC TGA
BamHI
EcoRl SmaI
Sall
XhoI
Not l
stop codons
pGEX-5X-2
factor Xa
Ile Glu Gly Arg Gly Ile Pro Gly Ile Pro Gly Ser Thr Arg Ala Ala Ala Ser
ATC GAA GGT CGT GGG ATC CCC GGA ATTCCC GGG TCG ACT CGA GCGGCC GCATCG TGA
BamHI
EcoRl SmaI
Sall
XhoI
Not l
stop codons
pGEX-5X-3
factor Xa
Ile Glu Gly Arg Gly Ile Pro Arg Asn Ser Arg Val Asp Ser Ser Gly Arg lle Val Thr Asp
ATC GAA GGT CGT GGG ATC CCC AGG AAT TCC CGGGTC GAC TCG AGC GGC CGC ATC GTG ACT GAC TGA
BamHI
EcoRI SmaI
SalI
XhoI
NotI
Stop codons
Figure 6.6.3 Variations on the polylinker site shown in Figure 6.6.2 (courtesy of Pharmacia
Biotech).
Expression and
Purification of
GST Fusion
Proteins
6.6.4
Supplement 9
95
66
43
36
66
25
17
29
6
18
S
30 C
S
25 C
18 C
analysis (UNIT 10.10) using a GST-specific antibody (Pharmacia Biotech). It is recommended that the lysed cell extract, extracted pellet, and all other collected fractions from
the purification be saved on ice until after careful analysis of the entire purification by
SDS-PAGE and/or immunoblotting to ensure that fractions containing the fusion protein
are not mistakenly discarded.
Basic Protocol 1 describes protein production in cells grown at 37C; however, at this
temperature some fusion proteins may be found in inclusion bodies in a denatured form.
As an alternative to attempting to renature the protein after extraction from inclusion
bodies (UNIT 6.1), expression at lower temperatures, such as 30, 25, 20, or 15C, can be
evaluated to determine whether the protein can be obtained in good yield in the soluble
fraction (Fig. 6.6.5). When expressing fusion proteins at lower temperatures, the initial
overnight culture can be grown at 37C followed by growth at a lower temperature prior
to induction.
EXPRESSION OF GLUTATHIONE-S-TRANSFERASE FUSION PROTEIN
Transformed E. coli cells expressing the glutathione-S-tranferase (GST) fusion protein
of interest are grown in culture in the presence of isopropyl-1-thio--D-galactopyranoside
(IPTG) at the desired preparative scale. Since the expression level of GST fusion proteins
is usually very high, adequate amounts of protein can usually be conveniently obtained
by preparing a few liters of cells grown in shaker cultures. This protocol describes the
preparation of 1.8 liters of transfected E. coli in three 600-ml units using 2-liter flasks in
a shaker incubator. Moderate further scale-up is feasible by using more or larger flasks.
BASIC
PROTOCOL 1
Purification of
Recombinant
Proteins
6.6.5
Current Protocols in Protein Science
Supplement 9
Further scale-up can be accomplished using a fermentor (see UNITS 5.3 & 5.4). This protocol
describes protein production in cells grown at 37C. At this temperature, however, some
fusion proteins may be recovered from inclusion bodies in a denatured form, and culture
conditions may need to be modified to improve protein yield in the soluble fraction (see
Strategic Planning).
Culture growth can be monitored by reading the optical density at 550 nm (OD550) as well
as by analysis of the bacterial culture using SDS-PAGE. Cells should not be allowed to
grow for extended periods of time after induction as cell lysis can occur; this releases
proteases that may degrade the fusion protein. Visual inspection of the cells using a
microscope is a useful method for identifying cell breakage (see UNITS 5.1-5.3 & 6.1-6.5 for
additional details concerning recombinant protein expression in E. coli).
Materials
Luria broth (LB medium; UNIT 5.2; pH adjusted to 7.2)
5 mg/ml ampicillin (see recipe)
Glycerol culture of transformed E. coli cells expressing GST fusion protein of
interest in a pGEX vector
100 mM isopropyl-1-thio--D-galactopyranoside (IPTG; see recipe)
2-liter culture flasks
500-ml culture flasks
Large centrifuge bottles (e.g., 1-liter capacity)
Low-speed refrigerated centrifuge (e.g., Beckman J6-B and JS-4.2 rotor or
equivalent), 4C
Additional reagents and equipment for SDS-polyacrylamide gel electrophoresis
(SDS-PAGE; UNIT 10.1)
Grow bacterial cells
1. Prepare LB medium and add 600 ml to each of three 2-liter flasks and 100 ml to each
of two 500-ml flasks. Autoclave 20 to 30 min at a slow exhaust (liquid) setting.
Flasks should be filled to only 20% to 30% of their capacity to ensure adequate aeration
of the medium during cell growth. Autoclave LB medium immediately after preparing it to
prevent any incidental bacterial growth from occurring. It can then be stored up to 1 month
at room temperature under sterile conditions.
3. Using an inoculating loop, transfer some of the glycerol culture containing the
transfected E. coli expressing the GST fusion protein of interest to the flask containing
100 ml LB medium with ampicillin.
Sterile flame the inoculating loop as well as the opening of all bottles. Allow the loop to
cool before transferring the inoculating culture. If the loop temperature is too high, all the
cells could be killed during the transfer.
4. Incubate the inoculated culture on an environmental shaker set at 250 to 300 rpm
overnight at 37C.
Expression and
Purification of
GST Fusion
Proteins
5. The next morning, remove the culture from the shaker and read the optical density at
550 nm (OD550) using a UV/visible light spectrophotometer.
The OD550 of the overnight culture should be 1.0. Use medium from the second 500-ml
flask that did not receive ampicillin as a reference to zero the spectrophotometer.
6.6.6
Supplement 9
6. Using sterile technique, add 6 ml of 5 mg/ml ampicillin to each 2-liter flask containing
600 ml LB medium (0.1 mM final concentration).
7. Dilute the overnight culture 1:20 by adding 30 ml to each of the three 2-liter flasks
containing 600 ml LB medium.
8. Incubate the 600-ml cultures on a shaker at 37C at 250 to 300 rpm until the OD550
is 0.5 to 0.7.
It should take 2 hr for the culture to reach this early log stage of growth at 37C. If cells
are grown at lower temperatures to shift the expressed protein from inclusion bodies into
the soluble fraction, this time must be increased, since the cells will grow more slowly at
lower temperatures.
10. Remove cultures from the shaker 2.5 to 3 hr after induction. Remove 1 ml from each
culture and save for gel analysis. Check final OD550.
Culture growth can be monitored at OD550. When the cells reach saturation, they will stop
dividing. A typical SDS-PAGE gel of an uninduced control culture and an induced culture
after 3 hr growth is shown in Figure 6.6.4.
6.6.7
Current Protocols in Protein Science
Supplement 9
BASIC
PROTOCOL 2
2. Wash the glutathione column with 5 to 10 bed volumes PBS at a flow rate of 1.5
ml/min to remove the ethanol storage solution.
Expression and
Purification of
GST Fusion
Proteins
A bed volume is one-half the amount of glutathioneSepharose 4B that was added to the
column as a 50% slurry.
6.6.8
Supplement 9
3. Wash the glutathione column with 3 to 5 bed volumes glutathione buffer at 1.5
ml/min.
Previously used columns may become partially oxidized on storage and should be
preequilibrated (steps 3 to 4) within 24 hr before use.
4. Wash the glutathione column with 10 bed volumes PBS/EDTA/PMSF at 1.5 ml/min.
Lyse cells
5. Resuspend each pelleted 600-ml culture in 15 ml ice-cold lysis buffer.
Pellets should be resuspended in 25 to 50 l buffer per milliliter of culture.
6. Sonicate the suspension using a probe-tip sonicator ten times for 10 sec each, with
1-min rests between sonications to lyse the cells. Save a sample (100 l) of the lysate
for gel analysis and transfer remainder to a 60-ml centrifuge tube.
The cells are usually adequately lysed at the point when the suspension turns a slightly
darker color and becomes clearer. To minimize proteolysis in the sample, it is essential to
keep the cells on ice throughout the sonication procedure, and sonication should be
performed in short bursts to minimize sample heating. Avoid excessive sonication, as this
can lead to co-purification of E. coli host proteins along with the fusion protein of interest.
Avoid frothing during sonication, which can denature the fusion protein.
8. Decant the supernatant containing the soluble fusion protein into a clean 50-ml
centrifuge tube.
9. Add a volume of ice-cold wash buffer equal to the volume of lysis buffer used in step
5 to the pellet. Use a dounce homogenizer to resuspend the pellet.
Pellets should be resuspended in 25 to 50 l buffer per milliliter of culture.
10. Analyze the lysate, supernatant, and resuspended pellet using SDS-PAGE (UNIT 10.1)
to verify that the fusion protein is in the supernatant.
If the fusion protein is in the supernatant, proceed to the next step. If the protein is in the
pellet, it is necessary to purify the GST fusion proteins from inclusion bodies (see Alternate
Protocol 2) or to start over, shifting the protein into the supernatant by growing the cultures
at a lower temperature (see Troubleshooting and Fig. 6.6.5).
Purification of
Recombinant
Proteins
6.6.9
Current Protocols in Protein Science
Supplement 9
can be adjusted so that most of the supernatant will be loaded by the next morning. Do not
allow the column to run dry.
SDS-PAGE analysis of fractions collected during sample loading will reveal whether the
fusion protein is bound to the column or is present in the unbound fractions. Absence of
fusion protein in early unbound fractions combined with its appearance in late unbound
fractions indicates that column capacity has been exceeded. If this condition is observed,
reduce the protein load or increase the column size.
12. Wash the column with 5 to 10 bed volumes PBS/EDTA/PMSF at 1.5 ml/min.
13. Wash the column with 10 bed volumes PBS/EDTA at 1.5 ml/min to remove the PMSF.
If samples are to be cleaved with thrombin or factor Xa, any serine protease inhibitor (e.g.,
PMSF) must be removed from the sample before cleavage.
15. Analyze the fractions by SDS-PAGE (UNIT 10.1) and pool fractions containing the GST
fusion protein. Store at 0 to 4C.
Fusion protein should typically be >90% pure at this point.
ALTERNATE
PROTOCOL 1
Expression and
Purification of
GST Fusion
Proteins
1. Preequilibrate the glutathione column, lyse the cells, and separate the lysate pellet,
which includes the inclusion bodies, and supernatant (see Basic Protocol 2, steps 1
to 9).
6.6.10
Supplement 9
2. Centrifuge washed pellet (see Basic Protocol 2, step 9) 20 min at 48,000 g, 4C.
Decant the supernatant and resuspend the pellet in 12 ml freshly prepared U buffer
per 600 ml original culture. Incubate 2 hr on ice.
Pellets should be resuspended in 20 l U buffer per ml of culture.
7. Remove the sample from the dialysis bag and centrifuge 20 min at 4000 g, 4C.
8. Column purify the fusion protein (see Basic Protocol 2, steps 11 to 15).
BATCH PURIFICATION OF GST FUSION PROTEIN
Soluble glutathione-S-transferase (GST) fusion proteins in cell lysate supernatants or
renatured proteins extracted from inclusion bodies can be batch purified on glutathione
Sepharose 4B as an alternative to column-based purification (see Basic Protocol 2 and
see Alternate Protocol 1). Batch purification requires less equipment and is relatively
quick and easy, but resulting protein yield and sample purity are lower than in a
chromatographic separation. In addition, the room temperature incubations recommended by the resin manufacturer (Pharmacia Biotech), especially the batch incubation
of E. coli lysate supernatant or inclusion body extract with glutathione-Sepharose,
increase the risk of proteolytic degradation of the fusion protein.
ALTERNATE
PROTOCOL 2
Purification of
Recombinant
Proteins
6.6.11
Current Protocols in Protein Science
Supplement 9
6. Centrifuge the suspension 5 min at 500 g, room temperature. Discard the supernatant.
7. Repeat the wash and centrifugation steps for a total of three washes with 10 bed
volumes each of PBS.
Elute fusion protein
8. Elute the bound fusion protein by gently resuspending the sedimented resin in 1.0 ml
glutathione buffer per milliliter resin bed volume. Incubate 10 min at room temperature with gentle agitation.
9. Centrifuge the suspension 5 min at 500 g, room temperature. Transfer supernatant
to a separate tube.
10. Repeat the elution and centrifugation (steps 8 to 9) a total of three times. Store at 0
and 4C.
The supernatants may be pooled into one tube or analyzed separately by SDS-PAGE (UNIT
10.1) to monitor for fusion protein content.
The yield of fusion protein can be monitored by measuring the absorbance at 280 nm (A280).
The extinction coefficient will partially depend on the absorbance characteristics of the
experimental component of the fusion protein. For the GST moiety alone, the concentration
can be estimated using 1 A280 = 0.6 mg/ml protein.
As noted above, batch purification at room temperature increases the risk of proteolytic
digestion of the target protein. To minimize such degradation, this procedure can alternatively be performed in a cold room at 4C with incubation times in steps 3 and 8 increased
2- to 4-fold.
BASIC
PROTOCOL 3
Expression and
Purification of
GST Fusion
Proteins
Additional reagents and equipment for dialysis (APPENDIX 3B) and SDS-PAGE
(UNIT 10.1)
6.6.12
Supplement 9
2. Stop preparative digestion by adding a 1:500 dilution of 0.15 M PMSF stock solution.
Incubate sample an additional 15 min at 37C for thrombin or 30 min at 25C for
factor Xa to covalently inhibit the enzyme with the PMSF.
3. Dialyze the sample (APPENDIX 3B) twice versus 2 liters PBS/EDTA/PMSF for a
minimum of 4 hr per buffer change at 4C.
Complete removal of glutathione is important if samples are to be rechromatographed on
glutathione-Sepharose to remove the GST moiety and uncleaved fusion protein. Larger
volumes of dialysate may be necessary depending on sample volume: e.g., if the sample
volume is >40 ml, increase the dialysis buffer volume to 4 liters per change or use three
changes of buffer. In addition, since glutathione equilibrates slowly during dialysis, when
66
36
18
14
6
1
10
Figure 6.6.6 SDS gel stained with Coomassie brilliant blue showing pilot thrombin digestions of
two recombinant glutathione-S-transferase (GST) fusion proteins. Samples were digested 3 hr at
37C in buffer with varying enzyme-to-substrate ratios. Lane 1, a 78-kDa fusion protein. Lanes 2 to
5, thrombin digestion of the 78-kDa fusion protein using enzyme-to-substrate ratios of 1:3000,
1:1000, 1:350 and 1:100, respectively. Lane 6, a 45-kDa fusion protein. Lanes 7 to 10, thrombin
digestion of the 45-kDa fusion protein using substrate ratios of 1:3000, 1:1000, 1:350, and 1:100,
respectively. In each case, an enzyme-to-substrate ratio of 1:1000 was chosen as the optimal
digestion condition
Purification of
Recombinant
Proteins
6.6.13
Current Protocols in Protein Science
Supplement 9
dialysis tubing with a MWCO <12,000 is used, the dialysis conditions should be increased
to at least three changes with 2 liters dialysis buffer per change.
4. Centrifuge the dialyzed sample 20 min at 4000 g, 4C, to remove any precipitated
material. Transfer the supernatant to a clean tube at 0 to 4C and analyze by
SDS-PAGE (UNIT 10.1).
The cleaved protein of interest can be separated from the GST moiety and uncleaved fusion
protein by rechromatographing it on the glutathione column (see Support Protocol 1).
ALTERNATE
PROTOCOL 3
2. Load the thrombin reaction mixture onto the column. When the reaction mixture has
been added, replace the bottom cap and allow the column to stand at room temperature
for 2 to 16 hr.
Incubation times must be empirically determined for each fusion protein.
3. Elute the protein of interest by washing the column with PBS and immediately add
an aliquot of 0.15 M PMSF stock solution to give a final PMSF concentration of 0.3
mM (e.g., 6 l PMSF stock solution per 3.0-ml fraction).
The protein of interest will be in the flowthrough and the GST moiety and undigested fusion
protein will remain bound to the glutathione-Sepharose.
4. Remove the GST moiety and uncleaved fusion protein from the column by washing
with 5 bed volumes glutathione buffer at 0.3 ml/min.
5. Immediately after collecting fractions, store all fractions at 0 to 4C.
Expression and
Purification of
GST Fusion
Proteins
6.6.14
Supplement 9
ALTERNATE
PROTOCOL 4
2. Add the reconstituted thrombin reaction mixture to the pelleted glutathioneSepharose resin containing the bound fusion protein (from Alternate Protocol 2, step
7).
3. Gently resuspend the glutathione-Sepharose and agitate on an orbital shaker 2 to 16
hr at room temperature.
Incubation times must be empirically determined for each fusion protein.
Cleavage may be monitored by removing an aliquot of the slurry from the incubation
mixture at different time points, centrifuging the slurry to separate the resin and supernatant, and analyzing the fractions by SDS-PAGE (UNIT 10.1).
5. Inhibit the reaction by adding a 1:500 dilution of 0.15 M PMSF stock solution (0.3
mM PMSF final). Incubate the sample an additional 15 min at 37C for thrombin or
an additional 30 min at 25C for factor Xa to covalently inhibit the enzyme with the
PMSF. Store the samples at 0 to 4C.
6. Analyze the final fractions by SDS-PAGE (UNIT 10.1).
AFFINITY CHROMATOGRAPHY PURIFICATION OF POLYPEPTIDES
AFTER ENZYMATIC CLEAVAGE
SUPPORT
PROTOCOL 1
6.6.15
Current Protocols in Protein Science
Supplement 9
Materials
Solution of fusion protein that has been cleaved with thrombin or factor Xa and
dialyzed into PBS/EDTA/PMSF buffer (see Basic Protocol 3, step 4)
Glutathione buffer (see recipe)
PBS/EDTA/PMSF buffer (see recipe)
2.5 8cm glutathioneSepharose 4B column (e.g., Bio-Rad Econo)
Additional reagents and equipment for SDS-PAGE (UNIT 10.1)
1. Wash the glutathioneSepharose 4B column with >3 bed volumes glutathione buffer
at 1.5 ml/min.
A bed volume is one-half the amount of glutathione-Sepharose that was added to the
column.
The same 2.5 8cm column used for initial purification of a given fusion protein can be
used for repurification of the same target polypeptide after protease cleavage. GlutathioneSepharose must be fully reduced in order for the GST moiety to bind efficiently. If the column
was washed with glutathione buffer <48 hr before, this step may be skipped.
2. Wash the glutathione column with 10 bed vol PBS/EDTA/PMSF at 1.5 ml/min.
3. Load the solution of fusion protein that has been cleaved onto the column using a
flow rate of 0.1 ml/min. Collect fractions for later analysis by SDS-PAGE (UNIT 10.1)
and store at 0 to 4 C.
The protein of interest will be in the unbound or column flowthrough peak, and the GST
moiety will bind to the glutathione-Sepharose. A low flow rate for sample application, such
as 0.1 ml/min, should be used to ensure complete binding of GST to the column. Faster
flow rates are likely to result in elution of excessive amounts of the GST moiety in the
unbound fraction.
Expression and
Purification of
GST Fusion
Proteins
6.6.16
Supplement 9
Materials
Cleaved fusion protein (see Support Protocol 1, step 6, or Alternate Protocol 3,
step 5)
PBS (APPENDIX 2E)
Centriprep concentrator with appropriate MWCO (Amicon)
Low-speed refrigerated centrifuge (e.g., Beckman J6-B and JS-4.2 rotor or
equivalent, 4C), or 0.22-m low-protein-binding filter (Costar)
Gel-filtration columns (see UNIT 8.3)
Additional reagents and equipment for SDS-PAGE (UNIT 10.1)
1. Concentrate the unbound cleaved fusion protein using a Centriprep concentrator.
Samples should be concentrated to a volume of no greater than 0.5% to 1% of the column
volume to be used for the separation.
2. Centrifuge the concentrated sample 20 min at 4000 g, 4C. Remove the supernatant,
being careful not to disturb any pellet that may have formed (precipitated material
formed during concentration).
Alternatively, a 0.22-m low-protein-binding filter may be used to remove particulates. It
is especially important to remove particulates to avoid clogging of the HPLC column end
frits.
3. Inject the concentrated sample onto gel filtration column(s) equilibrated in PBS.
4. Monitor the absorbance at 280 nm (A280) using an online HPLC detector and collect
fractions.
5. Analyze the fractions by SDS-PAGE (UNIT 10.1) to determine fractions of interest and
to evaluate the purity of the target polypeptide, which should be in the major peak.
6. Pool the desired fractions and store at 0 to 4C.
REAGENTS AND SOLUTIONS
Use Milli-Q-purified water or its equivalent for the preparation of all buffers. For common stock solutions,
see APPENDIX 2E; for suppliers, see SUPPLIERS APPENDIX.
Ampicillin, 5 mg/ml
500 mg ampicillin
100 ml H2O
Filter sterilize using 0.22-m filter
Store up to several months at 4C
Do not autoclave the solution; ampicillin is inactivated above 50C.
Glutathione buffer
50 mM Tris base
10 mM reduced glutathione (Sigma)
Adjust pH to 8.0 with 6 M HCl
Bring to final volume with cold H2O
Prepare fresh daily and store at 4C until needed
Isopropyl-1-thio--D-galactopyranoside (IPTG), 100 mM
1 g IPTG
42 ml sterile H2O
Divide into 6-ml aliquots in sterile tubes
Store up to 1 year at 20C
Purification of
Recombinant
Proteins
6.6.17
Current Protocols in Protein Science
Supplement 9
Lysis buffer
50 mM NaCl
50 mM Tris base
5 mM EDTA
1 g/ml leupeptin
1 g/ml pepstatin
0.15 mM phenylmethylsulfonyl fluoride (PMSF) in isopropanol
1 mM diisopropyl fluorophosphate (DFP)
Adjust pH to 8.0 with 6 M HCl
Bring to final volume with cold H2O
Prepare fresh daily
CAUTION: DFP is a dangerous neurotoxin. Handle the neat reagent with double gloves in
a chemical fume hood only. Carefully follow all precautions supplied by the manufacturer
for this chemical.
PBS/EDTA
1 PBS (APPENDIX 2E)
5 mM EDTA
Adjust pH to 7.4 with 1 M NaOH
Bring to final volume with cold H2O
Store up to 1 month at 4C
PBS/EDTA/PMSF buffer
1 PBS (APPENDIX 2E)
5 mM EDTA
0.15 mM phenylmethylsulfonyl fluoride (PMSF) in isopropanol
Adjust pH to 7.4 with 1 M NaOH
Bring to final volume with cold H2O
Store up to 1 week at 4C
PBS/glycerol buffer
1 PBS (APPENDIX 2E)
20% (v/v) glycerol
1% (v/v) Triton X-100
5 mM 2-mercaptoethanol (2-ME)
5 mM EDTA
0.1 g/ml leupeptin
0.1 g/ml pepstatin
0.15 mM phenylmethylsulfonyl fluoride (PMSF) in isopropanol
0.1 mM diisopropyl fluorophosphate (DFP)
Adjust pH to 7.4 with 1 M NaOH
Bring to final volume with cold H2O
Prepare fresh daily
CAUTION: DFP is a dangerous neurotoxin. Handle the neat reagent with double gloves in
a chemical fume hood only. Carefully follow all precautions supplied by the manufacturer
for this chemical.
Expression and
Purification of
GST Fusion
Proteins
U buffer
5 M urea
50 mM Tris base
5 mM EDTA
5 mM 2-mercaptoethanol (2-ME)
1 g/ml leupeptin
1 g/ml pepstatin
continued
6.6.18
Supplement 9
Wash buffer
50 mM Tris base
5 mM EDTA
1 g/ml leupeptin
1 g/ml pepstatin
0.15 mM phenylmethylsulfonyl fluoride (PMSF) in isopropanol
Adjust pH to 8.0 with 6 M HCl
Bring to final volume with cold H2O
Prepare fresh daily
COMMENTARY
Background Information
The pGEX vectors were designed for highlevel, inducible, intracellular expression of glutathione-S-transferase (GST) fusion proteins
produced in Escherichia coli. GST is a common
26-kDa protein of eukaryotes. The GST gene
used in the development of the pGEX vectors
was originally cloned from the parasitic
helminth Schistosoma japonicum (Smith and
Johnson, 1988).
Purification of fusion proteins from a wholecell lysate is readily achieved through the
strong affinity of the GST moiety for glutathione, which is immobilized on Sepharose
beads. The fusion protein can be displaced
under mild conditions from the glutathioneSepharose beads using neutral-pH buffers containing free reduced glutathione.
The main advantages of this system are the
very high level of fusion protein expression
(often >10 to 50 mg/liter of E. coli culture
grown on an environmental shaker) and the
facile purification methods for both initial isolation and subsequent separation of cleaved
polypeptide and GST moiety. Since nondenaturing purification conditions are employed,
polypeptides that do not normally contain posttranslational modifications usually retain their
functional and antigenic properties. Other advantages of this system include availability of
several alternative protease cleavage sites and
the large number of bacterial hosts that can be
used.
Purification of most soluble GST fusion
proteins is straightforward, and success in the
Troubleshooting
Contamination of the glutathione-S-transferase (GST) fusion protein after affinity purification with E. coli host cell proteins is usually
an indication that sonication has been too severe. Other contaminants may represent de-
Purification of
Recombinant
Proteins
6.6.19
Current Protocols in Protein Science
Supplement 9
Expression and
Purification of
GST Fusion
Proteins
6.6.20
Supplement 9
Anticipated Results
Yields of fusion protein can vary widely.
Typical yields are 10 to 50 mg/liter, but can
occasionally be much lower, especially if the
fusion protein is toxic to the cells or is unstable.
In some cases, >50 mg/liter can be obtained
when expression conditions have been well
optimized. A single-step affinity purification
should yield fusion protein that is >90% pure
in most cases. The relationship between yield
of fusion protein and yield of cleaved, repurified recombinant target polypeptide is in part
due to the mass of the target polypeptide. A
good final yield of a cleaved protein after repurification on a glutathione column followed by
HPLC gel filtration might be 2 mg/liter for a
10-kDa protein and 10 mg/liter for a 50-kDa
protein.
Time Considerations
Protein expression takes 1.5 days of intermittent work requiring approximately 3 to 4 hr
of operator time. Longer induction periods may
be required at temperatures lower than 30C,
but total operator time remains the same. A
small-scale batch purification can be completed in 1 day. A large-scale column purification, cleavage of fusion protein, and repurification of the cleaved peptide by affinity and gel
filtration chromatography will take 5 to 8 days
of intermittent work requiring several hours of
operator time per day. The purification should
be completed in as short a time as practical to
minimize proteolysis, aggregation, and precipitation of impure fractions.
Literature Cited
Ausubel, F.M., Brent, R., Kingston, R.E., Moore,
D.D., Seidman, J.G., Smith, J.A., and Struhl, K.
(eds.). 1994. Current Protocols in Molecular Biology. John Wiley & Sons, New York.
Davies, A.H., Jowett, J.B.M., Jones, I.M. 1993. Recombinant baculovirus vectors expressing glutathione-S-transferase fusion proteins. Bio/Technology 11:933-936.
Frangioni, J.V. 1992. Solubilization and purification
of enzymatically active glutathione-S-transferase (pGEX) fusion proteins. Anal. Biochem.
210:179-187.
Gearing, D.P., Nicola, N.A., Metcalf, D., Foote, S.,
Willson, T.A., Gough, N.M., and Williams, R.L.
1989. Production of leukemia inhibitory factor
in Escherichia coli by a novel procedure and its
use in maintaining embryonic stem cells in culture. Bio/Technology 7:1157-1161.
Grieco, F., Hull, J., and Hull, R. 1992. An improved
procedure for the purification of protein fused
with glutathione-S-transferase. Biotechniques
13:856-857.
Guan, K.L. and Dixon, J.E. 1991. Eukaryotic proteins expressed in Escherichia coli: An improved
thrombin cleavage and purification procedure of
fusion proteins with glutathione-S-transferase.
Anal. Biochem. 192:262-267.
Hakes, D.J. and Dixon, J.E. 1991. New vectors for
high level expression of recombinant proteins in
bacteria. Anal. Biochem. 202:293-298.
Mitchell, D.A., Marshall, T.K., and Deschenes, R.J.
1993. Vectors for the overexpression of glutathione-S-transferase fusion proteins in yeast.
Yeast 9:715-722.
Sambrook, J., Fritsch, E.F., and Maniatis, T. 1989.
Molecular Cloning: A Laboratory Manual, 2nd
ed. Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York.
Smith, D.B. and Johnson, K.S. 1988. Single-step
purification of polypeptides expressed in Escherichia coli as fusions with glutathione-Stransferase. Gene 67:31-40.
Key Reference
Smith and Johnson, 1988. See above.
Original description of the pGEX system.
Purification of
Recombinant
Proteins
6.6.21
Current Protocols in Protein Science
Supplement 9
UNIT 6.7
This unit describes a gene fusion expression system that uses thioredoxin, the product of
the Escherichia coli trxA gene, as the fusion partner. The system is particularly useful for
high-level production of soluble fusion proteins in the E. coli cytoplasm; in many cases
heterologous proteins produced as thioredoxin fusion proteins are correctly folded and
display full biological activity. Although the thioredoxin gene fusion system is routinely
used for protein production, high-level production of peptidesi.e., for use as antigens
is also possible because the prominent thioredoxin active-site loop is a very permissive
site for the introduction of short amino acid sequences (10 to 30 residues in length). The
inherent thermal stability of thioredoxin and its susceptibility to quantitative release from
the E. coli cytoplasm by osmotic shock can also be exploited as useful tools for thioredoxin
fusion protein purification. In addition, a more generic method for purification of any
soluble thioredoxin fusion employs a modified form of thioredoxin (called His-patch
Trx), which has been designed to bind to metal chelate resins. Protein fusions to
His-patch Trx can usually be purified in a single step from cell lysates (see Strategic
Planning).
The first step is construction of a fusion of trxA to any desired gene and expression of the
fusion protein in an appropriate host strain at 37C (see Basic Protocol). Additional
protocols describe E. coli cell lysis using a French pressure cell and fractionation (see
Support Protocol 1), osmotic release of thioredoxin fusion proteins from the E. coli
cytoplasm (see Support Protocol 2), and heat treatment to purify some thioredoxin fusion
proteins (see Support Protocol 3).
STRATEGIC PLANNING
The thioredoxin gene fusion expression vectors pTRXFUS and hpTRXFUS, both of
which carry the E. coli trxA gene (Fig. 6.7.1), are used for high-level production of
C-terminal fusions to thioredoxin. The vector hpTRXFUS differs from pTRXFUS in that
it contains a modified E. coli trxA gene which produces a mutant protein (His-patch
thioredoxin) that can specifically bind to metal chelate matrices charged with nickel or
cobalt, otherwise known as native metal-chelate affinity chromatography (MCAC; UNIT
9.4). The trxA translation-termination codon has been replaced in both vectors by DNA
encoding a ten-residue peptide linker sequence that includes an enterokinase (enteropeptidase; LaVallie et al., 1993a) cleavage site. This highly specific site can be cleaved with
enterokinase following purification of the fusion protein to release the protein of interest
from its thioredoxin fusion partner. Immediately downstream of the DNA encoding the
enterokinase site in pTRXFUS and hpTRXFUS lies a DNA polylinker sequence containing a number of unique restriction endonuclease sites that can be used for forming
in-frame translational fusions of any desired gene to trxA. Downstream of the DNA
polylinker lies the E. coli aspA transcription terminator. Replication of these vectors is
controlled by a modified colE1 replication origin similar to that found in pUC vectors
(Norrander et al., 1983). Plasmid selection and maintenance is ensured by the presence
of the -lactamase gene on the vector. The vector pALtrxA-781 (Fig. 6.7.1) is very similar
to pTRXFUS. However in this plasmid the trxA gene is followed by a translation
termination codon, and the sequences encoding the enterokinase-site peptide linker are
absent. A unique RsrII site, present in both pALtrxA-781 and pTRXFUS, allows for the
easy insertion of short peptide-encoding DNA sequences into trxA within the region that
encodes the active-site loop.
Contributed by John McCoy and Edward LaVallie
Current Protocols in Protein Science (1997) 6.7.1-6.7.14
Copyright 1997 by John Wiley & Sons, Inc.
Purification of
Recombinant
Proteins
6.7.1
Supplement 10
BLA
ori
pALtrxA781
pTRXFUS
p L hpTRXFUS
trxA
aspA
Rsr II
TGGTGCGGTCCGTGCAAA
W
C
G33 P34 C
K
Sfi I
pALtrxA781:
Xba I
Sal I
Pst I
AACCTGGCCTAGCTGGCCATCTAGAGTCGACCTGCAG
N L
A *
aspA
terminator
thioredoxin
pTRXFUS:
Sal I
Pst I
AACCTGGCCGGTTCTGGTTCTGGTGATGACGATGACAAGGTACCCGGGGATCCTCTAGAGTCGACCTGCAG
thioredoxin
fusion
point
aspA
terminator
linker peptide
enterokinase site
Figure 6.7.1 Thioredoxin gene fusion expression vectors pTRXFUS, hpTRXFUS, and pALtrxA-781. pALtrxA-781
contains a polylinker sequence at the 3 end of the trxA gene. pTRXFUS and hpTRXFUS contain a linker region encoding
a peptide that includes the enterokinase cleavage site between the trxA gene and the polylinker. The sequence
surrounding the active site loop of thioredoxin has a single RsrII site that can be used to insert peptide coding sequence.
The asterisk indicates a translational stop codon. Abbreviations: trxA, E. coli thioredoxin gene; BLA, -lactamase gene;
ori, colE1 replication origin; pL, bacteriophage major leftward promoter; aspA terminator, E. coli aspartate amino-transferase transcription terminator.
Expression and
Purification of
Thioredoxin
Fusion Proteins
6.7.2
Supplement 10
Strain
Desired
production
temperature (C)
Pre-induction
growth
temperature (C)
Induction
period (hr)
GI698
15
25
20
GI698
GI698
20
25
25
25
18
10
GI724
GI724
30
37
30
30
6
4
GI723
37
37
Purification of
Recombinant
Proteins
6.7.3
Current Protocols in Protein Science
Supplement 10
BASIC
PROTOCOL
2. Transform the ligation mixture containing the new thioredoxin fusion plasmid into
competent GI724 cells. Plate transformed cells onto IMC plates containing 100 g/ml
ampicillin to select transformants. Incubate plates in a 30C convection incubator
until colonies appear.
Expression and
Purification of
Thioredoxin
Fusion Proteins
Strains GI698, GI723, and GI724 are all healthy prototrophs that can grow under a wide
variety of growth conditions, including rich and minimal media and a broad range of
growth temperatures (see Table 6.7.1). These strains can be prepared for transformation
with pL-containing vectors by growing them in LB medium at 37C. LB medium may also
6.7.4
Supplement 10
be used for these strains during the short period of outgrowth immediately following
transformation. This growth period of 30 min to 1 hr is often used to express drug-resistance
phenotypes before plating out plasmid transformations onto solid medium. Subsequently,
however, these strains should be grown only on minimal or tryptophan-free rich media,
such as IMC medium containing 100 g/ml ampicillin (for expression of the fusion protein)
or CAA/glycerol/ampicillin 100 medium (for plasmid DNA preparations). Except during
transformation, LB medium should never be used with these three strains when they carry
pL plasmids because LB contains tryptophan. The pL promoter is extremely strong and
should be maintained in an uninduced state until needed so that expression of the protein
will not lead to selection of mutant or variant cells with lower expression due to undesirable
genetic selections or rearrangements in the expression strain.
6. Pick a single fresh, well-isolated, colony from the plate and use it to inoculate 5 ml
IMC medium containing 100 mg/ml ampicillin in an 18 150mm culture tube.
Incubate overnight at 30C on a roller drum.
7. Add 0.5 ml overnight culture to 50 ml fresh IMC medium containing 100 g/ml
ampicillin in a 250-ml culture flask (1:100 dilution). Grow at 30C with vigorous
aeration until absorbance at 550 nm reaches 0.4 to 0.6 OD/ml (3.5 hr).
8. Remove a 1-ml aliquot of the culture (uninduced cells). Measure the optical density
at 550 nm and harvest the cells by microcentrifuging 1 min at maximum speed, room
temperature. Carefully remove all the spent medium with a pipet and store the cell
pellet at 80C.
9. Induce pL by adding 0.5 ml of 10 mg/ml tryptophan (100 g/ml final) to remaining
cells immediately.
10. Incubate 4 hr at 37C. At hourly intervals during this incubation, remove 1-ml aliquots
of the culture and harvest cells as in step 8.
11. Harvest the remaining cells from the culture 4 hr post-induction by centrifuging 10
min at 3000 rpm (e.g., in a Beckman J6 rotor), 4C. Store the cell pellet at 80C.
Procedures for further analysis of these cells are outlined in the support protocols.
Verify induction
12. Resuspend the pellets from the induction intervals (steps 8 and 10) in 200 l of
SDS-PAGE sample buffer/OD550 cells. Heat 5 min at 70C to completely lyse the
cells and denature the proteins. Run the equivalent of 0.15 OD550 cells per lane (30
l) on an SDS-polyacrylamide gel (UNIT 10.1).
Purification of
Recombinant
Proteins
6.7.5
Current Protocols in Protein Science
Supplement 10
13. Stain the gel 1 hr with Coomassie brilliant blue (UNIT 10.5). Destain the gel and check
for expression.
Most thioredoxin fusion proteins are produced at levels that vary from 5% to 20% of the
total cell protein. The desired fusion protein should exhibit the following characteristics:
it should run on the gel at the mobility expected for its molecular weight; it should be absent
prior to induction; and it should gradually accumulate during induction, with maximum
accumulation usually occurring 3 hr post-induction at 37C.
SUPPORT
PROTOCOL 1
2. Place 1.5 ml resuspended cell pellet in the 3.5-ml French pressure cell. Hold the cell
upside down with the base removed, the piston fully extended downwards, and the
outlet valve handle that holds the nylon ball seal in the open position (loose).
Before filling the pressure cell, check that the nylon ball, which seals the outlet port and
sits on the end of the outlet valve handle, is not deformed. If it is, replace it with a new one.
Both the condition of the nylon ball and its seat in the pressure-cell body are critical for
the success of the procedure.
3. Bring the liquid in the pressure cell to the level of the outlet port by raising the piston
slowly to expel excess air from the cell. With the outlet valve open and at the same
time maintaining the piston in position, install the pressure-cell base. Gently close
the outlet valve.
CAUTION: Do not over-tighten the valve as this will deform the nylon ball and may
irreparably damage its seat on the pressure-cell body.
4. Turn the sealed cell right-side-up and place it in the hydraulic press.
Expression and
Purification of
Thioredoxin
Fusion Proteins
5. Turn the pressure regulator on the press fully counter-clockwise to reset it to zero
pressure. Set the ratio selector to medium. Turn on the press.
CAUTION: The larger (50-ml) pressure cell is usually used with the selector set on high.
The small (3.5-ml) cell is only used on medium ratio.
6.7.6
Supplement 10
6. Slowly turn the pressure regulator clockwise until the press just begins to move. Allow
the press to compress the piston.
The press will stop moving after a few seconds.
7. Position a collection tube under the pressure-cell outlet. Slowly increase the pressure
in the cell by turning the pressure regulator clockwise. Monitor the reading on the
gauge and increase the pressure to 1000 on the dial, corresponding to an internal cell
pressure of 20,000 lb/in2.
8. While continuously monitoring the gauge, very slowly open the outlet valve until
lysate begins to trickle from the outlet.
The lysate should flow slowly and smoothly, and the cell pressure should not drop more
than 100 divisions on the dial.
At 20,000 lb/in2 and 5 OD550/ml, cell lysis will be complete after one passage through the
press. Lower pressures and/or higher cell densities may require a second passage.
12. Resuspend the pellet in an equivalent volume of lysis buffer. Remove a 100-l aliquot
and freeze at 80C (insoluble fraction).
13. Lyophilize the 100-l aliquots to dryness in a Speedvac evaporator. Solubilize in 100
l SDS-PAGE sample buffer. Analyze 30-l samples by SDS-PAGE (UNIT 10.1).
This crude fractionation provides a fairly reliable indication of whether a protein has folded
correctly. Usually proteins in the soluble fraction have adopted a correct conformation and
proteins in the insoluble fraction have not. However, occasionally proteins found in the
soluble fraction are not truly soluble; instead they form aggregates that do not pellet in the
microcentrifuge. Conversely, sometimes a protein found in the insoluble fraction may be
there because it has an affinity for cell-wall components and cell membranes, and it may
not be intrinsically insoluble. Occasionally proteins can be recovered from these insoluble
fractions by extracting with agents such as mild detergents.
SUPPORT
PROTOCOL 2
Materials
Cell pellet from 4-hr post-induction cultures (see Basic Protocol)
20 mM TrisCl (pH 8.0)/2.5 mM EDTA/20% (w/v) sucrose, ice-cold
20 mM TrisCl (pH 8.0)/2.5 mM EDTA, ice-cold
Additional reagents and equipment for SDS-PAGE (UNIT 10.1)
1. Resuspend cell pellet from 4-hr post-induction cultures at a concentration of 5
OD550/ml in ice-cold 20 mM TrisCl (pH 8.0)/2.5 mM EDTA/20% sucrose. Incubate
10 min on ice.
2. Microcentrifuge 30 sec at maximum speed, 4C, to pellet the cells.
Purification of
Recombinant
Proteins
6.7.7
Current Protocols in Protein Science
Supplement 10
3. Discard the supernatant and gently resuspend the cells in an equivalent volume of
ice-cold 20 mM TrisCl (pH 8.0)/2.5 mM EDTA. Incubate 10 min on ice and mix
occasionally by inverting the tube.
Osmotic release from the cytoplasm occurs at this stage.
2. Lyse the cells at 20,000 lb/in2 in a French pressure cell (see Support Protocol 1, steps
2 to 8). Collect whole-cell lysate in a 10-ml glass-walled tube.
3. Incubate whole-cell lysate 10 min at 80C. Remove 100-l aliquots after 30 sec, 1
min, 2 min and 5 min and plunge immediately into ice. At 10 min, plunge the
remaining heated lysate into ice.
Expression and
Purification of
Thioredoxin
Fusion Proteins
A glass-walled tube (not plastic) provides good thermal conductivity to provide a rapid
rise in temperature to 80C and then a rapid drop in temperature to 4C. A suitable volume
to use in a 10-ml glass tube is 1.5 ml lysate. For large-scale work, a glass-walled vessel
should be used and the lysate should be mixed well during both heat treatment and cooling.
6.7.8
Supplement 10
5. Remove 2-l aliquots of the supernatants and add 28 l SDS-PAGE sample buffer.
Analyze the samples by SDS-PAGE (UNIT 10.1) to determine the heat stability of the
fusion protein and the minimum time of heat treatment required to obtain a good
purification.
REAGENTS AND SOLUTIONS
Use deionized, distilled water in all recipes and protocol steps. For common stock solutions, see
APPENDIX 2E; for suppliers, see SUPPLIERS APPENDIX.
6.7.9
Current Protocols in Protein Science
Supplement 17
M9 salts, 10
60 g Na2HPO4 (0.42 M)
30 g KH2PO4 (0.24 M)
5 g NaCl (0.09 M)
10 g NH4Cl (0.19 M)
H2O to 1 liter
Adjust pH to 7.4 with NaOH
Autoclave or filter sterilize through a 0.45-m filter
Store 6 months at room temperature
SDS-PAGE sample buffer
15% (v/v) glycerol
0.125 M TrisCl, pH 6.8 (APPENDIX 2E)
5 mM Na2EDTA
2% (w/v) SDS
0.1% (w/v) bromphenol blue
1% (v/v) 2-mercaptoethanol (2-ME; add immediately before use)
Store indefinitely at room temperature
Tryptophan, 10 mg/ml
Heat 500 ml glass-distilled H2O to 80C. Stir in 5 g L-tryptophan until dissolved.
Filter sterilize the solution through a 0.45 m filter and store 6 months in the dark
at 4C.
COMMENTARY
Background Information
Expression and
Purification of
Thioredoxin
Fusion Proteins
producing large quantities of any desired eukaryotic protein. However, these gene-fusion
systems still suffer from the pervasive inclusion-body problem. They are thus mainly useful for the production of antigens, rather than
correctly folded, biologically active proteins.
More recently the maltose binding protein
(MBP; Riggs, 1994; UNIT 5.1) and glutathioneS-transferase (GST) gene fusion expression
systems (see UNIT 6.6) have proven more successful in producing soluble fusion proteins;
these systems retain the translation advantage
of the earlier fusion systems. Apart from the
obvious advantages in making a correctly
folded product, the synthesis of soluble fusion
proteins also allows for the development of
generic purification schemes based on some
unique property of the fusion partner.
Why would any particular eukaryotic protein produced in the E. coli cytoplasm be more
soluble when it is linked to a fusion partner than
it would be by itself? It is likely that physical
properties of the fusion-partner protein are important, with efficient self-folding and high
solubility being useful in this role. It is possible
that some good fusion partners (proteins that
fold efficiently and are highly soluble), by virtue of their desirable physical qualities, are able
to keep folding intermediates of linked heterologous proteins in solution long enough for
6.7.10
Supplement 17
Purification of
Recombinant
Proteins
6.7.11
Current Protocols in Protein Science
Supplement 10
10
11
12
MW(kDa)
97.4
66.2
45.0
31.0
21.5
14.4
Figure 6.7.2 Expression of thioredoxin gene fusions. The gel shows proteins found in the soluble
fractions derived from E. coli cells expressing eleven different thioredoxin gene fusions. Lane 1, host
E. coli strain GI724 (negative control, 37C); lane 2, murine interleukin-2 (IL-2; 15C); lane 3, human
IL-3 (15C); lane 4, murine IL-4 (15C); lane 5, murine IL-5 (15C); lane 6, human IL-6 (25C); lane
7, human MIP-1a (37C); lane 8, human IL-11 (37C); lane 9, human macrophage colony-stimulating factor (M-CSF; 37C); lane 10, murine leukemia inhibitory factor (LIF; 25C); lane 11, murine
steel factor (SF; 37C); and lane 12, human bone morphogenetic protein-2 (BMP-2; 25C).
Temperatures in parentheses are the production temperature chosen for expressing each fusion.
This is a 10% SDS-polyacrylamide gel, stained with Coomassie brilliant blue.
Expression and
Purification of
Thioredoxin
Fusion Proteins
Critical Parameters
Lack of protein solubility leading to inclusion-body formation in E. coli is a complex
phenomenon with many contributing factors:
simple insolubility as a result of high-level
expression, insolubility of protein-folding intermediates, lack of appropriate bacterial
chaperone proteins, and lack of glycosylation
mechanisms in the bacterial cytoplasm. Fusion
of heterologous proteins to thioredoxin or to
other fusion partners can help address most of
these solubility issues. However, another important factor contributing to inclusion body
formation is the inability to form essential disulfide bonds in the reducing environment of
the bacterial cytoplasm, which leads to incorrect folding. Thermal lability of even correctly
6.7.12
Supplement 10
Anticipated Results
Thioredoxin fusion protein yields are usually in the range of 5% to 20% of total cell
protein. At these expression levels, a 1-liter
induction culture in a shaker flask will yield
3 g (wet weight) of cells, 300 mg total protein,
and 15 to 60 mg of thioredoxin fusion protein.
The final recovered yield will depend on factors
such as solubility of the fusion protein and the
efficiency of downstream purification procedures.
Time Considerations
From a single colony on a plate, the basic
induction protocol requires an overnight
growth to prepare a liquid inoculum and a
3.5-hr preinduction growth at 30C the next
day, followed by a 4-hr 37C induction period.
These times are significantly longer if lower
induction temperatures are required (see Table
6.7.1). Lysis of a sample in the French pressure
cell should require 5 min, and both the heattreatment and osmotic-shock procedures require <1 hr each. SDS-PAGE takes 2.5 hr.
Literature Cited
Bardwell, J.C.A., McGovern, K., and Beckwith, J.
1991. Identification of a protein required for
disulfide bond formation in vivo. Cell 67:581589.
Edman, J.C., Ellis, L., Blacher, R.W., Roth, R.A.,
and Rutter, W.J. 1985. Sequence of protein disulphide isomerase and implications of its relationship to thioredoxin. Nature 317:267-270.
Hannestad, U., Lundqvist, P., and Sorbo, B. 1982.
An agarose derivative containing an arsenical for
affinity chromatography of thiol compounds.
Anal. Biochem. 126:200-204.
Hoffman, R.D. and Lane, M.D. 1992. Isodophenylarsine oxide and arsenical affinity chromatography: New probes for dithiol proteins. J. Biol.
Chem. 267:14005-14011.
Holmgren, A. 1985. Thioredoxin. Ann. Rev. Biochem. 54:237-271.
Katti, S.K., LeMaster, D.M., and Eklund, H. 1990.
Crystal structure of thioredoxin from Escherichia coli at 1.68 angstroms resolution. J.
Mol. Biol. 212:167-184.
Kelley, R.F., Shalongo, M., Jagannadham, M.V., and
Stellwagen, E. 1987. Equilibrium and kinetic
measurements of the conformational transition
of reduced thioredoxin. Biochemistry 26:14061411.
LaVallie, E.R., Rehemtulla, A., Racie, L.A.,
DiBlasio, E.A., Ferenz, C., Grant, K.L., Light,
A., and McCoy, J.M. 1993a. Cloning and functional expression of a cDNA encoding the catalytic subunit of bovine enterokinase. J. Biol.
Chem. 268:23311-23317.
LaVallie, E.R., DiBlasio, E.A., Kovacic, S., Grant,
K.L., Schendel, P.F., and McCoy, J.M. 1993b. A
thioredoxin gene fusion expression system that
circumvents inclusion body formation in the E.
coli cytoplasm. Bio/Technology 11:187-193.
Lu, Z., DiBlasio-Smith, E.A., Grant, K.L., Warne,
N.W., LaVallie, E.R., Collins-Racie, L.A.,
Follettie, M.T., Williamson, M.J., and McCoy,
J.M. 1996. Histidine patch thioredoxins. Mutant
forms of thioredoxin with metal chelating affinity that provide for convenient purifications of
thioredoxin fusion proteins. J. Biol. Chem.
271:5059-5065.
Lunn, C.A. and Pigiet, V.P. 1982. Localization of
thioredoxin from Escherichia coli in an osmotically sensitive compartment. J. Biol. Chem.
257:11424-11430.
Mazzarella, R.A., Srinivasan, M., Haugejorden,
S.M., and Green, M. 1990. ERp72, an abundant
luminal endoplasmic reticulum protein, contains
three copies of the active site sequences of protein disulfide isomerase. J. Biol. Chem.
265:1094-1101.
McCoy, J.M. 1992. Heat-shock proteins and their
potential uses for pharmaceutical protein production in microorganisms. In Stability of Protein Pharmaceuticals, Part B. (T. Ahern and M.
Manning, eds.) pp 287-316. Plenum Press, New
York.
Purification of
Recombinant
Proteins
6.7.13
Current Protocols in Protein Science
Supplement 10
Mitraki, A. and King, J. 1989. Protein folding intermediates and inclusion body formation.
Bio/Technology 7:690-697.
Key Reference
Riggs, P. 1994. Expression and purification of maltose-binding protein fusions. In Current Protocols in Molecular Biology (F.M. Ausubel, R.
Brent, R.F. Kingston, D.D. Moore, J.G. Seidman, J.A. Smith, and K. Struhl, eds.) pp.16.6.116.6.14. John Wiley & Sons, New York.
Internet Resources
http://www.invitrogen.com/manuals.html
Source for protocols on affinity-based purification.
Expression and
Purification of
Thioredoxin
Fusion Proteins
6.7.14
Supplement 10