Professional Documents
Culture Documents
3. Consensus trees
5. Bootstrapping / Jackknifing
• There are many consensus methods that differ in: • They show only those relationships that are
1. the kind of agreement unambiguously supported by the data
2. the level of agreement
• The commonest method (strict component
• Consensus methods can be used with multiple consensus) focuses on clades/components/full
trees from a single analysis or from multiple splits
analyses
1
Systematics - Bio 615
includes all and only those full splits found in all the
fundamental trees
unresolved polytomies
• Can be less optimal than any of the optimal trees Simplest to interpret
• May include relationships that are not supported by • Other relationships are shown as unresolved
the most parsimonious interpretation of the data polytomies
• The commonest method focuses on clades/ • Of particular use in bootstrapping and Bayesian
components/full splits Inference (best not to use for single searches)
A B C D E F G A B C E F D G
A B C E D F G
1. Summarizing multiple equally optimal trees from
one search (but they shouldn’t be!)
2. Summarizing the results of a bootstrapping
analysis (multiple searches)
frequency of 100 66
analysis
clades in the 66 66
fundamental trees 66
Don’t confuse these! The numbers on the branches
mean very different things in each case
MAJORITY-RULE CONSENSUS TREE
2
Systematics - Bio 615
Prorocentrum
Prorocentrum Symbiodinium
Loxodes
Tetrahymena
Loxodes Prorocentrum
Euplotes
Gruberia
Spirostomum Spirostomum
Ochromonas Euplotes Tracheloraphis
A B!C! D! E! F! G!
Gruberia
Gruberia
Symbiodin ium
Prorocentrum
majority-rule
A! B! C! D! E! F! Loxodes Ochromonas
Tetrahymena
Spirostomumum
Euplotes Ochromonas
Tracheloraphis
Tetrahymena m
Spirostomumu
66
Spirostomum
Euplotes
Euplotes
100
Tracheloraphis Tracheloraphis
Taxon G is excluded! Gruberia Gruberia
3
Systematics - Bio 615
Ciliate SSUrDNA data Randomly permuted data - Using PAUP* to search for the shortest tree that
Ochromonas Ochromonas lacks the branch of interest using reverse
topological constraints
+27 Symbiodinium +1 Symbiodinium
+45
Prorocentrum
Loxodes
+1
+3
Prorocentrum
Loxodes
- with the Autodecay or TreeRot programs (in
Tracheloraphis Tetrahymena conjunction with PAUP*) - MacClade 4 will also
Spirostomum
+8
Tracheloraphis help prepare for a Decay analysis
+15 Gruberia Spirostomum
+10 Euplotes Euplotes
- An excellent use for the Parsimony Ratchet -
+7 Tetrahymena Gruberia
because finding the shortest tree length is all that
matters (not finding multiple shortest trees)
4
Systematics - Bio 615
- ie Decay Indices are not easily comparable as measures 4. Decay index (Bremer Support)
of branch support
5. Bootstrapping / Jackknifing
- Values < 4 should be considered weak regardless of
branch length 6. Statistical hypothesis testing (frequentist)
DeBry, R.W. (2001) Improving interpretation of the Decay Index for DNA sequence data. Systematic
Biology 50: 742-752. 7. Posterior probability (see lecture on Bayesian)
Bootstrapping (non-parametric)
• Bootstrapping is a statistical
technique that uses computer
intensive random resampling
of data to determine sampling
error or confidence intervals
for some estimated parameter
• Introduced to phylogenetics by
Decay values versus Bootstrap and Jacknife values Felsenstein in 1985
from one empirical study • Based on idea of Efron (1979)
Norén, M. & U. Jondelius. 1999. Phylogeny of the Prolecithophora
(Platyhelminthes) inferred from 18S rDNA sequences. Cladistics 15: 103-112.
Bootstrapping (non-parametric)
5
Systematics - Bio 615
groups A
B
R R Y Y Y Y Y Y
R R Y Y Y Y Y Y
! A
B
R
R
R
R
R Y Y Y Y Y
R Y Y Y Y Y
! multiple analyses with a
! !
majority-rule consensus tree
C Y Y Y Y Y R R R ! C Y Y Y Y Y R R R !
D Y Y R R R R R R ! D Y Y Y R R R R R !
Bootstrap values (BS) are the
Outgp R R R R R R R R Outgp R R R R R R R R frequencies with which groups
• Additional information is given in partition tables (for ! !
Outgroup! Outgroup!
....**.** 12.67!
13.17!
6
Systematics - Bio 615
Huelsenbeck, J.P. and Rannala, B. (2004) Frequentist properties of Bayesian posterior probabilities of
• In other words, although technically they are meant
phylogenetic trees under simple and complex substitution models. Systematic Biology 53: 904-913.
to be a measure precision, they are usually thought
to be at least strongly correlated with accuracy
Paul Lewis
7
Systematics - Bio 615 Hillis, D.M. and Bull, J.J. (1993) An empirical test of
bootstrapping as a method for assessing confidence in
phylogenetic analysis. Systematic Biology, 42: 182-192.
Jackknifing
• Jackknifing is very similar to bootstrapping
and differs only in the character resampling
strategy
Bootstrap - interpretation
Two types of precision (Hillis & Bull 1993):
299-320.
Picket, K.M. and Randle, C.P. (2005) Strange bayes indeed: uniform topological priors imply non-uniform
BS values have been criticized for a variety of clade priors. Molecular Phylogenetics and Evolution 34: 203-211. SEE ALSO: Brandley, M. et al. (2006)
Are unequal clade priors problematic for Bayesian phylogenetics? Systematic Biology 55: 138-146.
reasons:
8
Systematics - Bio 615
Study questions
Describe the difference between a strict and majority
rule consensus tree."
What were the key findings of DeBry in his (2001) paper on
Decay Indices?"