Lecture 16 - Patterns and Models Flashcards
What is the approximate time to fixation for neutral alleles in diploid populations?
4N generations.
What is a substitution model used for?
Used to describe the patterns and rates of sequence change over evolutionary time.
What does Juke’s and Cantor’s one-parameter model say?
All substitutions will occur with equal probability (i.e., no bias in the direction of change).
Only one parameter here - the rate of change of one nucleotide to another (α, alpha).
- Thus 3α is the rate at which one nucleotide can change to any other
- Probability of the nucleotide staying the same after one unit of time = 1 - 3α
Is the assumption made by Juke and Cantor (i.e., that all base substitutions are equally likely) a realistic assumption to make?
NO - because like-to-like changes are more probable.
- Transitions
- Transversions
Is the assumption made by Juke and Cantor (i.e., that all base substitutions are equally likely) a realistic assumption to make?
NO - because like-to-like changes are more probable.
- Transitions
- Transversions
What is a transition?
Like-to-like substitution.
- Purine to purine (A <-> G)
- Pyrimidine to pyrimidine (C <-> T)
What is a transversion?
A pyrimidine to purine change (or vice versa).
What does Kimura’s two parameter model say?
Assumes that like-to-like changes are more probable.
- This is seen in nature, as transitions are observed TWICE as frequently as transversions
- Thus, mutational bias towards transitions
What frequency will each reach nucleotide eventually (in both models)?
25% frequency (i.e., equilibrium).
What is the “multiple hits” phenomenon?
The more time that passes, the higher the chance that the same site will undergo multiple substitutions.
When is it more likely that multiple hits have occurred?
The higher the divergence, the more substitution events occurred, thus the more likely it is that multiple hits have occurred.
What problem arises from multiple hits?
The number of differences that are observed might not necessarily reflect the number of substitutions that have occurred.
Thus, by simply counting the differences, one can extremely underestimate the amount of evolutionary change!
(SEE LECTURE 16 part 1 @ 18:30 mins)
How do sophisticated models attempt to take multiple hits into account?
By looking at the degree of divergence between the two sequences.
What problem occurs in these sophisticated models (that take multiple hits into account) when looking at high levels of divergence?
Saturation.
(SEE LECTURE 16 part 1 @ 21 mins)
When do our substitution models work best?
When the degree of divergence is relatively low.
What are the two major violations to the assumptions made in our models (that are seen in nature)?
- Probability of mutation can vary across the gene or genome regions depending on base pair composition and other factors (e.g., chromatin organisation)!
- Probability of fixation can vary across the gene or genome regions, due to differing strengths of purifying selection!