Professional Documents
Culture Documents
1
In This Chapter
THE UNIVERSAL FEATURES OF CELLS ON EARTH THE DIVERSITY OF GENOMES AND THE TREE OF LIFE GENETIC INFORMATION IN EUCARYOTES A1 A5 A9
TRUE/FALSE
19 True. Even in eucaryotes where the coding regions of a gene are often interrupted by noncoding segments, the order of codons in the DNA is still the same as the order of amino acids in the protein. False. The nucleotide subunits of RNA and DNA differ in two key ways. First, the backbone in RNA uses the sugar ribose instead of deoxyribose, which is used in DNA. Second, RNA uses the base uracil in place of the base thymine, which is used in DNA. Three of the four basesA, C, and Gare the same in RNA and DNA.
110
THOUGHT PROBLEMS
111 Trying to define life in terms of properties is an elusive business, as suggested by this scoring exercise (Table 12). Cars are highly organized objects, take energy from the environment and transform gasoline into motion, responding to stimuli from the driver as they do so. However, they cannot reproduce themselves, or grow and developbut then neither can old animals. Cacti are not particularly responsive to stimuli, but they display other life attributes. It is curious that standard definitions of life usually do not mention that living organisms on Earth are largely made of organic molecules, that life is carbon based. The first few pages of MBoC emphasize this point and discuss the properties of living cells mainly in terms of their informational macromoleculesDNA, RNA, and protein. Reference: Pace NR (2001) The universal nature of biochemistry. Proc. Natl Acad. Sci. U.S.A. 98, 805808.
A1
A2
Table 12 Plausible life scores for car, cactus, and humans (Answer 111).
CHARACTERISTIC 1. Organization 2. Homeostasis 3. Reproduction 4. Development 5. Energy 6. Responsiveness 7. Adaptation CAR Yes Yes No No Yes Yes No CACTUS Yes Yes Yes Yes Yes No Yes HUMAN Yes Yes Yes Yes Yes Yes Yes
112
Such modules are generally designed to look for organic molecules characteristic of life. The first Mars probe analyzed soil samples for amino acids; none were found. It is extremely unlikely that you created a new organism in this experiment. Far more probably, a spore from the air landed in your broth, germinated, and gave rise to the cells you observed. In the middle of the nineteenth century, Louis Pasteur invented a clever apparatus to disprove the then widely accepted belief that life could arise spontaneously. He showed that sealed flasks never grew anything if properly heat sterilized first. He overcame the objections of those who pointed out the lack of oxygen or who suggested that his heat sterilization killed the life-generating principle, by using a special flask with a slender swans neck, which was designed to allow in oxygen but to prevent spores carried in the air from contaminating the culture (Figure 14). The cultures in these flasks never showed any signs of life; however, they were capable of supporting life, as could be demonstrated by washing some of the dust from the neck into the culture. On the surface, the extraordinary mutation resistance of the genetic code argues that it was subjected to the forces of natural selection. An underlying assumption, which seems reasonable, is that resistance to mutation is a valuable feature of a genetic code, one that would allow organisms to maintain sufficient information to specify complex phenotypes. This reasoning suggests that it would have been a lucky accident indeedroughly a one-ina-million chanceto stumble on a code as error proof as our own. But all is not so simple. If resistance to mutation is an essential feature of any code that can support the complexity of organisms such as humans, then the only codes we could observe are ones that are error resistant. A less favorable frozen accident, giving rise to a more error-prone code, might limit the complexity of life to organisms that would never be able to contemplate their genetic code. This is akin to the anthropic principle of cosmology: many universes may be possible, but few are compatible with life that can ponder the nature of the universe. Beyond these considerations, there is ample evidence that the code is not static, and thus could respond to the forces of natural selection. Deviant versions of the standard genetic code have been identified in the mitochondrial and nuclear genomes of several organisms. In each case one or a few codons have taken on a new meaning. Reference: Freeland SJ & Hurst LD (1998) The genetic code is one in a million. J. Mol. Evol. 47, 238248.
113
114
original flask
115
There are several approaches you might try. 1. Analysis of the amino acids in the proteins would indicate whether the set of amino acids used in your organism differs from the set used in Earth organisms. But even Earthly organisms contain more amino acids than the standard set of 20, for example hydroxyproline, phosphoserine, and phosphotyrosine, which all result from modifications after a protein has been synthesized. Absence of one or more of the common set might be a more significant result.
A3
117
118
CALCULATIONS
119 A. The number (n) of generations of cell divisions required to produce 1013 cells is
A4
120
For calculations such as these, it is useful for purposes of estimation to remember that 45 103 (4n produces the series: 4, 16, 64, 256, 1024; thus, 45 = 1024 103) and that (1/4)5 (1/10)3. Hence, 4 different nucleotides can generate 1024 different DNA sequences, each 5 nucleotides long. Similarly, an 8nucleotide DNA sequence can provide enough diversity to tag 25,000 genes, there being 48 or 65,536 possible 8-nucleotide sequences. However, one would expect that most of these sequences would be present more than once in the 3.2 109 nucleotides of the human genome. Indeed, for a sequence tag to be rare enough to be expected to be present only once, it would have to be at least 16 nucleotides long. A 16-nucleotide sequence would be expected to be present about 0.7 times in the haploid human genome [(1/4)16 (3.2 109) = 0.75]. A probability calculation should properly be used to assess the likelihood that a tag is sufficiently long to be unique in the genome. For a sequence that is present in one gene, what is the probability that it is also present elsewhere in the genome? The probability of a match (PM) in any one comparison is the chance of a match at every nucleotide (1/4)n. Thus, for one comparison PM = (1/4)n Since the probability of all events is 1, the probability of not matching (PN) in one comparison is PN = 1 PM = 1 (1/4)n And the probability of not matching in any number of comparisons (c) is PN = {1 (1/4)n}c
A5
TRUE/FALSE
128 True. Phototrophs provide the major pathway by which carbon in CO2 is incorporated into the biosphere; however, it is not the sole mechanism. Most lithotrophs can also fix carbon, but the amounts are tiny in comparison to the carbon fixed by phototrophs. False. The clusters of human hemoglobin genes arose during evolution by duplication from an ancient ancestral globin gene; thus, they are examples of paralogous genes. The human hemoglobin a gene is orthologous to the chimpanzee hemoglobin a gene, as are the human and chimpanzee hemoglobin b genes, etc. All the globin genes, including the more distantly related gene for myoglobin, are homologous to one another.
129
THOUGHT PROBLEMS
130 Whether its sunlight or inorganic chemicals, to feed means to obtain freeenergy and building materials from. In the case of photosynthesis, photons in sunlight are used to raise electrons of certain molecules to a high-energy, unstable state. When they return to their normal, ground state, the released
A6
131
The hemoglobin of the giant tube worms binds O2 and H2S and transports them to the symbiotic bacteria, which use the H2S as an electron donor and the O2 as an electron acceptor to generate ATP and reducing power to meet their energy needs. The resulting growth of the bacteria benefits the worms by providing increased waste products and dead bodies to live on. Moreover, in the process the toxic H2S is rendered harmless by oxidation to elemental sulfur, thereby preventing it from poisoning the worms. The balanced equation for oxygenic photosynthesis, derived from experiments using water with isotopically labeled oxygen, is 6 CO2 + 12 H2O + light C6H12O6 + 6 H2O + 6 O2 In this form of the equation it is apparent that the O2 derives from H2O, and that all the oxygen in glucose derives from CO2.
132
133
Four (Figure 15). All could have split from the common ancestor at the same time. Eubacteriaarchaea could have split from eucaryotes, followed by the separation of eubacteria from archaea. Eubacteriaeucaryotes could have split from archaea, followed by the separation of eubacteria from eucaryotes. Archaeaeucaryotes could have split from eubacteria, followed by the separation of archaea from eucaryotes. Although horizontal transfers across these divisions make interpretations problematic, it is thought that archaeaeucaryotes first split from eubacteria, and then archaea split from eucaryotes. It is unlikely that any gene came into existence perfectly optimized for its function. It is thought that highly conserved genes such as ribosomal RNA genes were optimized by more rapid evolutionary change during the evolution of the common ancestor to archaea, eubacteria, and eucaryotes. Since ribosomal RNAs (and the products of most highly conserved genes) participate in fundamental processes that were optimized early, there has been no evolutionary pressure (and little leeway) for change. By contrast, less conservedmore rapidly evolvinggenes have been continually presented with opportunities to fill new functional niches. Consider, for example, the evolution of distinct globin genes that are optimized for oxygen delivery to embryos, fetuses, and adult tissues in placental mammals.
134
Figure 15 The four possible relationships for the evolution of archaea (A), eubacteria (B), and eucaryotes (E) (Answer 133).
A7
136
A8
139
It is not a simple matter to determine the function of a gene from scratch, nor is there a universal recipe for how to do it. Nevertheless, there are a variety of standard questions that help narrow down the possibilities. Below we list some of these questions. In what tissues is the gene expressed? If the gene is expressed in all tissues, it is likely to have a general function. If it is expressed in one or a few tissues, its function is likely to be more specialized, perhaps related to the specialized functions of the tissues. If the gene is expressed in the embryo, but not the adult, it may function in development. In what compartment of the cell is the gene expressed? Knowing the subcellular localization of the proteinnucleus, plasma membrane, mitochondria, etc.can also help to suggest categories of potential function. For example, a protein that is localized to the plasma membrane is likely to be a transporter, a receptor or other component of a signaling pathway, a celladhesion molecule, etc. What are the effects of mutations in the gene? Mutations that eliminate or modify the function of the gene product can also provide clues to function. For example, if the gene product is critical at a certain time during development, the embryo will often die at that stage or develop obvious abnormalities. Unless the abnormality is very specific, it is usually difficult to deduce the function or category of function. And often the links are very indirect, becoming apparent only after the genes function is known. With what other proteins does the encoded protein interact? In carrying out their function, proteins often interact with other proteins involved in the same or closely related processes. If an interacting protein can be identified, and if its function is already known (through previous research or the searching of databases), the range of possible functions can be narrowed dramatically. Mutations in what other genes can suppress effects of mutation in the unknown gene? Looking for suppressor genes can be a very powerful approach to investigating gene function in organisms such as bacteria and yeast, which have well-developed genetic systems, but this approach is not readily applicable to mouse or most higher eucaryotes at present. The rationale for this approach is analogous to that of looking for interacting proteins: genes that interact genetically are often involved in the same or a closely related process. Identification of such an interacting gene (and knowledge of its function) would provide an important clue to the function of the unknown gene. Addressing each of these questions requires specialized experimental expertise and a substantial time commitment from the investigator. It is no wonder that progress is made so much more rapidly when a clue to a genes function can be found simply by identifying a similar gene of known function in the database.
CALCULATIONS
140 It takes only 20 hoursless than a daybefore the mutant cells become more abundant in the culture. From the equation provided in the question, the number of the original (wild-type) bacterial cells at time t minutes after the mutation occurred is 106 2t/20. The number of mutant cells at time t is 1 2t/15. At the time when the mutant cells overtake the wild-type cells, these two numbers are equal. 106 2t/20 = 2t/15
A9
143
THOUGHT PROBLEMS
144 Like most questions about evolutionary relationships this one was decided by comparing sequences of genes such as those for ribosomal RNA. These comparisons showed that fungi are more similar in gene sequence to animals than to plants, and probably split from the animalplant lineage after plants separated from animals. Thus, fungi are thought never to have had chloroplasts, and fungi and plants are thought to have invented cell walls independently, as is suggested by the use of cellulose in plant cell walls and chitin in fungal cell walls. Nucleotide sequence comparisons with other species would allow you to decide whether Giardia represented an ancient lineage or a more recent
145
A10
146
Three general hypotheses have been proposed to account for the differences in rate of evolutionary change in different lineages. The individual hypotheses discussed below are not mutually exclusive and may all contribute to some extent. The generation-time hypothesis proposes that rate differences are a consequence of different generation times. Species such as rat with short generation times will go through more generations and more rounds of germ-cell division, and hence more rounds of DNA replication. This hypothesis assumes that errors during DNA replication are the major source of mutations. Tests of this hypothesis in rat versus human tend to support its validity. The metabolic-rate hypothesis postulates a higher rate of evolution for species with a higher metabolic rate. Species with high metabolic rates use more oxygen; hence, they generate more oxygen free radicals, a major source of damage to DNA. This is especially relevant for mitochondrial genomes, because mitochondria are the major cellular site for oxygen utilization and free radical production. The efficiency-of-repair hypothesis proposes that the efficiency of repair of DNA damage differs in different lineages. Species with highly efficient repair of DNA damage would reduce the fraction of damage events that lead to mutation. There is evidence in cultured human and rat cells that such differences in repair exist, in the expected direction, but it is unclear whether such differences exist in the germlines of these organisms. Reference: Li WH (1997) Molecular Evolution, pp 228230. Sinauer Associates, Inc.: Sunderland MA.
DATA HANDLING
147 A. The simplest hypothesis is that gene transfer occurred at the point indicated in Figure 16. Genera in many of the lineages beyond this point have a nuclear Cox2 gene, whereas lineages that branched off prior to this point do not. B. Five genera (Lespedeza, Dumasia, Pseudeminia, Neonotonia, and Amphicarpa) apparently have functional copies of both the mitochondrial and the nuclear genes, as indicated by shaded boxes in Figure 16.
A11
GENE RNA mt nuc mt nuc Pisum Clitoria Tephrosia Galactia Canavalia Lespedeza Eriosema Atylosia Erythrina gene transfer and activation Ramirezella Vigna Phaseolus Dumasia + Calopogonium + Pachyrhizus + Cologania Pueraria Pseudeminia Pseudovigna Ortholobium Psoralea Cullen Glycine Neonotonia Teramnus Amphicarpa + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Figure 16 Summary of Cox2 gene distribution and transcript data in a phylogenetic context, showing the most likely point of gene transfer and the minimal number of points for mitochondrial (squares) and nuclear (circles) gene inactivation (Answer 147). Boxes indicate genera with apparently functional copies of both the mitochondrial and nuclear genes.
+ + + + + + + +
+ + + + +
+ + +
+ +
C. Ten genera (Eriosema, Atylosia, Erythrina, Ramirezella, Vigna, Phaseolus, Ortholobium, Psoralea, Cullen, and Glycine) no longer have a functional mitochondrial gene. The minimum number of inactivation events that could account for the observed data is four, as shown by squares on the tree in Figure 16. D. Six genera no longer have a functional nuclear gene. The minimum number of inactivation events that could account for this is five, as shown by circles on the tree in Figure 16. E. These data argue strongly that transfer of genes from mitochondria to the nucleus is not a one-step process; that is, simultaneous loss of the gene from mitochondria and its appearance in the nucleus. This is an unlikely scenario a priori since nuclear versions of mitochondrial genes must acquire a special targeting sequence that allows the encoded proteins to be delivered to mitochondria (see MBoC Chapter 12). The data in Figure 16 argue that the transfer process begins with the appearance of the gene in the nucleus (presumably followed at some point by its activation via acquisition of a targeting sequence). This first step is not accompanied by loss of the gene from the mitochondria. Once the nuclear gene is activated, there appears to be an intermediate stage in which both genes function. Subsequently, one or the other gene is inactivated. If the nuclear gene is inactivated, the transfer process is effectively aborted. If the mitochondrial gene is inactivated (often initially by point mutations), then the transfer can proceed. The final stage of transfer is deletion of the defective mitochondrial gene, a process favored by the economics of genome replication. Reference: Adams KL, Song K, Roessler PG, Nugent JM, Doyle JL, Doyle JJ & Palmer JD (1999) Intracellular gene transfer in action: Dual transcription and multiple silencings of nuclear and mitochondrial cox2 genes in legumes. Proc. Natl Acad. Sci. U.S.A. 96, 1386313868. 148 If the intermediary in transfer were DNA, you would expect that the nuclear copy of the gene would have Cs at the sites of RNA editing. If the intermediary were RNA, you would expect Ts at the sites of RNA editing.
A12
149 A. Because synonymous changes do not alter the amino acid sequence of the protein, they are not subject to selection pressures, which operate at the level of the function of the protein (and how it affects the overall fitness of the organism). By contrast, nonsynonymous changes, which substitute a new amino acid in place of the original one, have the potential to alter the function of the encoded protein (and change the fitness of the organism). Since most amino acid substitutions are deleterious to the function of the protein, they are selected against. B. The histone H3 gene must be so exquisitely tuned to its function that virtually all amino acid substitutions are deleterious and, therefore, are selected against. The extreme conservation of histone H3 argues that its function is very tightly constrained, probably because of extensive interactions with other proteins and with its unchanging substrate, DNA. C. Histone H3 is clearly not in a privileged site in the genome because it undergoes synonymous nucleotide changes at about the same rate as other genes. Reference: Li WH (1997) Molecular Evolution. Sinauer Associates, Inc.: Sunderland MA. 150 A. The data in the phylogenetic tree (see Figure 13) refute the hypothesis that plant hemoglobin genes arose by horizontal transfer. Looking at the more familiar parts of the tree, we see that the vertebrates (fish to human) cluster together as a closely related set of species. Moreover, the relationships in the unrooted tree shown in Figure 13 are compatible with the order of branching we know from the evolutionary relationships among these species: fish split off before amphibians, reptiles before birds, and mammals last of all in a tightly knit group. Plants also form a distinct group that displays accepted evolutionary relationships, with barley, a monocot, diverging before bean, alfalfa, and lotus, which are all dicots (and legumes). The sequences of the plant hemoglobins appear to have diverged long ago in evolution, at or before the time that mollusks, insects, and nematodes arose. The relationships in the tree indicate that the hemoglobin genes arose by descent from some common ancestor. B. Had the plant hemoglobin genes arisen by horizontal transfer from a parasitic nematode, then the plant sequences would have clustered with the nematode sequences in the phylogenetic tree in Figure 13.