Virus Evolution Flashcards

1
Q

What are branch lengths an estimate of?

A

Of the time since species diverged

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does an unrooted tree mean?

A

They are related by dont know where common ancestors sit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the root

A

The oldest point in the tree i.e. the common ancestor of all the species in the tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two ways to root a tree?

A
  1. Use a known out group - an out group is a species or group of species that diverged first
  2. Midpoint rooting - assume that all species are evolving at the same rate and put the root in the middle of the tree
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a node?

A

Where two branches join

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the tip?

A

The end of the branch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Clade?

A

Groups e..g A, B and D all share a common ancestor, could say they form a clade

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If there are 4 sequences how many unrooted trees are there?

A

3 unrooted trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If there are 4 sequences and 3 unrooted trees, how many branches and therefore rooted trees are there?

A

5 branches, 15 rooted trees. The root could be on any of the branches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If there are 5 sequences how many rooted and unrooted trees are there?

A

15 unrooted trees
105 rooted trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do distance matrices work?

A

Just count the number of differences between two sequences then convert to a percentage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Go through creating a tree from distance matrix methods

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does minimum evolution distance matrix methods use?

A

Simultaneous equations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does maximum parsimony use?

A

Informative sites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is an informative site?

A

When there are at least two different nucleotides at the site, and each of which is represented in at least two of the sequences under study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why aren’t informative sites very accurate?

A

Not taking into account some evolutionary changes, only looking at informative sites

17
Q

What is bootstrapping?

A

Produce a new sequence of alignment of the same length as the real alignment , by random sampling from the sites in the real alignment
Make a phylogenetic tree from this alignment
Repeat steps 1 and 2 many times
Record how often each partition in the ‘real’ tree occurs among the bootstrapped trees

18
Q

Why is multiple hits important?

A

Models fo sequence evolution attempt to correct for multiple hits i.e. more than on substitution occurring at the same site over evolutionary time
If we sample at time 4 and time 1 and it has changed from A to T then might say it has only changed once but at time 3 and time 2 it may have been G and C so has changed 3 times

19
Q

Are transitions or trans versions more common?

A

Transitions more frequent than trans versions usually

20
Q

What is A to G called (purine to purine)

A

Transition

21
Q

What is A to C called (purine to pyramidine)?

A

Transversion

22
Q

What does JC69 model assume?

A

One rate for transversion and transitions

23
Q

What does K90 model assume?

A

Rates of transversion and transitions are different

24
Q

What is the GTR model?

A

General time reversible - six classes of substitutions, base frequencies vary

25
Q

What is the F81 model?

A

All substitutions are equal, base frequencies can vary

26
Q

What is the K2P model?

A

Transitions and trans versions have different substitution rates, base frequencies are assumed equal

27
Q

What is the HKY85 model?

A

Trnaisiotns and trans versions have different substitution rates, base frequencies can very

28
Q

What is a molecular clock?

A

Rate of evolution of a given protein remains roughly the same over time

29
Q

What does a molecular clock not mean?

A

Does not mean that -
All proteins evolve at the same rate
All sites within a protein evolve at the same rate

30
Q

What is a synonymous change?

A

Silence change, doesn’t change underlying amino acid

31
Q

What is ds?

A

The rate of synonymous (silent) substitutions
It reflects the mutation rate and is similar for different genes in the same virus

32
Q

What is dN?

A

The rate of non synonymous amino acid replacing substitutions

33
Q

Are dN or dS values lower?

A

DN values are lower because of constraints on proteins. DN values vary between proteins because different proteins have constrained to different extents

34
Q

How do you work out the substitution rate?

A

= slope so molecular distance/time