DISTANCE METHODS (UPGMA & NJ) Flashcards

1
Q

2 distance based tree methods

A

UPGMA - Unweighted Pair Group Method with Arithmetic mean
NJ - Neighbor-Joining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When two sequences are similar, they are likely to originate from the same ancestor

A

distance-based methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

sequence similarity can approximate evolutionary distances

A

distance-based methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

assume for any pair of species we have an estimation of evolutionary distance between them

e.g. alignment score

A

distance-based methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

goal is to construct a tree which best approximates these distance

A

distance-based methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

distance based tree

A

pls watch the video

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

can one always represent a distance matrix as weighted tree?

A

there is no way to add d to the tree and preserve the distances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

REMEMBER:

Real matrices are almost never additive

A

ok

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

used to search stochastically for the best-scoring trees in tree space

A

heuristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

heuristics

A

upgma and nj

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

clustering problem: group items with similar properties

  • clusters are homogenous
  • clusters are well separated
A

hierarchical clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

many clusters have natural sub-clusters which are often easier to identify

A

hierarchical clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

combine hierarchical clustering with a method to put weights on the edges

A

UPGMA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

rooted tree with edge lengths where all leaves are equidistant from the root

A

ultrametric tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

often represent the molecular clock which states that the rate of mutation is the same across all lineages of the tree

A

ultrametric tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

the distance from any internal node to any of its leaves is constant and equal

A

ultrametric tree

17
Q
  • assume the same rate of evolution (molecular clock hypothesis)
18
Q

the length from root to each leaf is the same (ultra metric)

19
Q

it is similar to fitch-margoliash algorithm (merge two most similar sequences or clusters first); but the calculation of branch lengths is even simpler

20
Q

minimum evolution - the least total branch length (distance-based)

21
Q

bottom-up clustering method

22
Q

does not assume same rate evolution

23
Q

fast & produce reasonable trees

24
Q

seitou & nel algorithm

25
Q
  1. Calculate pairwise distances
  2. Create distance matrix
  3. Determine net divergence for each terminal node
  4. Create rate-corrected distance matrix
  5. Identify taxa with minimum rate-corrected distance
  6. Connect taxa with minimum rate-corrected distance via a new node, and determine their distance from this new node
  7. Determine the distance of new node from rest of taxa or nodes
  8. Regenerate distance matrix
  9. ReturN to step 2