Measuring Evolution, Patterns and Models of Sequence Change and The Molecular Clock Flashcards
Evolutionary Genetics
Broader study of how molecular and population genetics phenomena bring about long-term evolutionary change, including speciation and adaption
Population Genentics
Study of the genetic composition of biological populations and changes to this composition
Homology
Similarity due to common descent
Problems relying on morphology for reconstructing evolution 3
- Convergent evolution - independent evolution of shared desired traits that were not present in the LCA
- Character reversal - a species may lose the derived trait and revert back to the ancestral form - fleas lost their wings
- Erratic rates of morphological evolution
When was DNA identified as a molecule of inheritence?
1940-50s
When were tools developed to manipulate and sequence DNA to study genetic variation?
1960-70s
What was compared first at a molecular level
Proteins then DNA
Molecular homology
Molecular similarity due to common descent
How is homology ascertained?
Sequence similarity
Hamming distance or degree of divergence
The proportion of differences (n/N) for two sequences of length N that differ at n sites
Analogous sequences
Sequences that are similar but not homologous, due to chance or due to reoccurring evolutionary processes
All observable differences between homologous sequences are due to
Mutation
What is the base-substitution rate across the entire genome in humans?
1 mutation every 10^8 base pairs per generation
The Neo-Darwinian Model (Panselectionism) 4 - false
- selection - strongest force in evolution and drives substitution events
- Mutation - ultimate source of genetic variation but only plays a minor role in evolution
- Polymorphisms - mainly maintained by balancing selection
- Genetic drift - mostly irrelevant
The Neutral theory of evolution 4
- The majority of new mutations are neutral or deleterious
- Neutral alleles have no impact on an organisms fitness and will change in frequency by genetic drift alone
- Most substitution events observed occurred by drift, not selection
- Negative selection also plays a powerful but silent role, removing deleterious mutations and working to keep the status quo
What does the rate of substitution in a population equal to?
The number of new mutations x probability of fixation
What is the rate of substitution for neutral mutations independent of?
Population size
Jukes and Cantor’s one-parameter model
All substitutions occur with equal probability - there is no bias
3alpha = rate of change of one nucleotide to any other
1-3alpha = probability that nucleotide stays the same
Kimura’s two-parameter model
Assumes like-to-like changes are more possible
Transition = pyrimidine to pyrimidine or purine to purine
Transversion = pyrimidine to purine
In vertebrates, how many times more are transitions observed compared to transverions
Twice
In mDNA, how many times more are transitions observed than transversions
20
What does it probably mean when we observe a high degree of divergence between two sequences
Most likely the same nucleotide has undergone multiple substitutions
The simple models of sequences variation assume the probability of substitution is the same across all sites in the sequence of interest. Name two violations to this
- The probability of mutation occurring can vary across gene or genome regions depending on base pair composition and other factors (chromatin organisation)
- The probability of fixation can very across gene or genome regions due to differing strengths of purifying selection
Discuss something that causes variable mutation rates within genomes
CpG dinucleotides which are susceptible to methylation - this means they are easily deaminated to give thymine from methylcytosine
The rate of transition substitutions will be higher in sequences with a lot of CG dinucleotides
Discuss the impact of purifying selection
Regions under purifying selection evolve slower. Sequences of functional importance are more resistant to substitution
The slowest evolving regions of the genome are protein coding sequences and their regulatory regions