Lecture 6 Flashcards

1
Q

What is the phenetic approach based on?

A

sequences with short pairwise distances cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the cladistic approach based on? what is the benefit compared to phonetic?

A

sequences with many shared characters cluster, therefore evolutionary process is accounted for implicitly, and it doesnt rely entirely on pairwise distance matrices, and correlations which may not have been accounted in the phonetic approach are accounted for

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Cladistic approach uses —-, which finds the tree needing —?

A

parsimony, smallest number of mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the parsimony score of a tree?

A

The lowest number of mutations required to explain the sequences at the tips of the tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the parsimony tree?

A

the tree with lowest parsimony score : an optimisation problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

T or F : Rooted trees obtained from the same unrooted tree dont have the same parsimony score.

A

False, they do

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the parsimony method do?

A

For n sequences with length m, it considers each unrooted tree ((2n-5)!! trees), and it calculates the parsimony score for each of these trees (4^(n-1)*m), and it finally outputs the unrooted tree with the lowest score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can we improve the second step of the parsimony method?

A

using the fitch algorithm, we pick a cherry, we ask if they first nucleotides are the same, if they are not the same we write down the set containing both nucleotides. if yes, we dont update. if the sets aren’t disjoint, we write down the intersection. Therefore, the parmisony score is the minimal number of mutations required to explain the sequences at the tips.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How many internal nodes, does a rooted tree on n tips have?

A

n-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the running time of the fitch algorithm?

A

(n-1)m

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is the parsimony tree found in the fitch algorithm ?

A

It’s found by calculating parsimony score for each unrooted tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The parsimony decision problem is an — problem

A

NP complete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Is the parsimony problem statistically consistent or not?

A

It’s not since no back substitutions or parallel substitution are considered which lead to long branch attraction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How was the origin of HIV required?

A

incidence data was used which gives impression of the dynamics since the data was collected. virus sequencing data from different host species allows us to infer the phylogenetic tree informing before 1980. ML phylogenetic tree inference was used to investigate early HIV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the input and outputs for the ML tree inference?

A

Input is the sequence alignment. Output is the tree which maximises the probablity of the sequences given the tree and the sequence evolution parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The ML tree inference requires an —- model and the parameters of the model can be—–?

A

evolutionary, co-estimated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

In ML in phylogenetic, what is the parameter?

A

Each unrooted tree with branch lengths

18
Q

What do sequences in phylogenetic in ML evolve according to ?

A

they evolve according to the parameters provided in the rate Q matrix

19
Q

What is the inference in the ML method?

A

determine the best unrooted tree, parameter which best explain the alignment max L(tau, Q;D) where D is the sequence alignments

20
Q

Is the substitution process typically time reversible or not?

21
Q

What is the running time of the likelihood calculation? how did you get it?

A
  • multiply over all sites O(m)
  • sum over internal nucleotides at n-1 internal nodes (O(4^n-1))
  • multiply over 2n-2 branches O(2n-2)
    so over all: O(m4^nn
22
Q

How can we improve the ML calculation?

A

Using felestein’s algorithm

23
Q

What is the time complexity of felsentein’s pruning algorithm? explain how

A

Each recursion step is summation over four time four states : O(n)
-the recursion needs to be performed for each of m sites O(m)
so in total : O(nm)

24
Q

The problem of finding a tree and branch lengths with likelihood value >/L is —.

A

NP complete

25
Q

In ML inference we need to consider all —– trees and try all ——. we also have to integrate over all —-.

A

unrooted, realistic branch lengths, internal node sequences

26
Q

What are the two algorithmic methods from phenetic inference methods?

A

UPGMA and neighbour joining algorithm

27
Q

In UPGMA we obtain a — tree, in neighbour joining we obtain a — tree.

A

rooted, ultrametric tree, unrooted

28
Q

UPGMA assumes a —, where branch lengths correspond to —.

A

strict molecular clock, calendar time

29
Q

UPGMA and neighbour joining algorithm both have —- time complexity and are statistically —-.

A

polynomial, consistent

30
Q

Two optimality methods of phonetic approaches for inference are ?

A

Least square methods

31
Q

Least square methods have time complexity of — and are statistically —-.

A

NP complete , statistically consistent

32
Q

Phenetic approaches disregard information beyond —-.

A

pairwise distances

33
Q

Parsimony is a — approach and is an —- problem , and it is statistically —–.

A

cladistic, NP complete, inconsistent

34
Q

In parsimony what is returned?

A

the tree requiring the least amount of mutations is returned.

35
Q

What method is mechanistic approach for inference? in this method statistically consistent or not? what about it’s required time?

A

ML method, consistent, NP complete

36
Q

What does ML explicity account for?

37
Q

What does the ML method return?

A

It returns the ML tree with branch lengths which correspond to the number of mutations alongside evolutionary model parameters

38
Q

In the Fitch algorithm. do we obtain all most parsimonous ancestral sequences when choosing the different nucleotides in the curly brackets?

A

No we cant. write example like in the solution

39
Q

Does the maximum likelihood tree construction method return estimates for the internal sequcnes? give a reason

A

No, we get the probability of the data given the tree and some sequnces in the roots

40
Q

Does the fitch algorithm return the parsimony score for any phylogenetic tree and any sequence alignment?

41
Q

In a time reversible model, does the position of the root change the probability?

A

No, the model doest distinguish the direction so we obtain the same results.