Lecture 11 - Phylogenetic Trees Flashcards

1
Q

what are the four steps of phylogenetic data analysis

A
  1. selection of sequences
  2. MSA
  3. tree building
  4. tree evaluation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what two things evolve which complicates molecular evolutionary studies

A

species and genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

define: additive trees

A

trees that attempt to preserve the sum of distances between terminal elements within the tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

describe distance-based tree methods

A

convert alignments into distances then use this to infer a tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

describe character-based tree methods

A

analyze individual character states and specific positions in the alignment and infer a tree based on observed changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the main idea of UPGMA (unweighted pair group method with arithmetic mean)

A

at each iteration, the two closest groups are combined to form one new group; continue until there is only one group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the result of UPGMA

A

a rooted, non-additive and ultrametric tree
ultrametric - all leaves are the same distance from the root

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the idea of neighbor-joining (NJ)

A

minimize the total length of branches on the tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what do you start with for NJ

A

a star-shaped tree + pairwise distance matrix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are the steps of NJ

A
  • join pair with smallest sum of branch lengths with a new node
  • regard the pair as a single OTU and do pairwise comparisons again
  • repeat until the entire tree is generated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the result of NJ

A

a tree that accounts for variation in branch lengths and fits well with statistical models of evolution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

define: bootstrapping

A

computational technique for estimating the confidence level of a phylogenetic hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are bootstrap values calculated for

A

individual nodes (and/or branches)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what value is often used as a bootstrap value threshold

A

> 70%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how is bootstrapping done for trees

A
  • divide the MSA into a set of N ordered sites and randomly choose N sites with replacement
  • recalculate trees (100 or 1000 replicates is common) and determine the frequency of each node within the replicated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what types of tree methods are bootstrapping usually used for

A

rapid phylogenetic methods such as NJ

17
Q

what do parsimony methods look for

A

the fewest number of changes in an evolutionary tree

18
Q

what is a tree with the fewest number of changes equivalent to

A

a tree of minimum length

19
Q

what is the starting point of parsimony

20
Q

what are the two types of sites within an MSA defined for parsimony

A
  • informative
  • uninformative
21
Q

what is an uninformative site

A

invariant sites

22
Q

what defines an informative site

A

a site with has at least two different nucleotides that are present two or more times

23
Q

what is the general approach to building trees for comparison

A
  • build a starting tree using a fast method
  • make changes to that tree by moving branches or nodes
  • over time, keep trees that have an improved score
24
Q

how does NNI work

A
  • each internal branch is connected to four groups within a tree
  • change how an internal branch is connected
  • if the score is better, keep the new tree
25
what does NNI stand for
nearest neighbour interchange
26
what does SPR stand for
subtree pruning & regrafting
27
how does SPR work
- cut subtree from the original tree and move it to a new location - if the score is better, keep the new tree
28
what does TBR stand for
tree bisection & reconnection
29
how does TBR work
- the starting tree is split into two - all edges for subtrees are recombined, generating a set of new trees - if the score is better, keep the new tree
30
what is an issue with all tree exploration methods
they may be stuck in local minima
31
what is the process of parsimony ratchet
bootstrap-like approach - get a reasonable starting tree - re-weight a subset (10-20%) of columns in your MSA - explore the new tree space - reset to original weights and explore tree space again - keep the best tree so far - repeat (100x or more)