W4L1 Molecular Population Genetics and more tests of Neutrality M Flashcards
What is the advantage of neutral theory
it gives us expectations about molecular evolution in the absence of selection
Way of identifying changes in nucleotide
-Measure every polymorphic site
-measure minor allele frequency and count total amount of singleton via frequency spectrum
Measuring nucleotide diversity
-create a table which compare the amount of difference between each alleales polymorphic site
-create a average pi
Descriptive metric unit
- S (the number of segregating sites)
- p (the frequency of a variant)
- MAF (minor allele frequency)
- Haplotypes ( a particular combination)
- p (average pairwise divergence)
- Hobs (the frequency of heterozygotes)
Some theoretical metrics to give expectation
- Ne : The effective population size
- Hexp : the expected number of heterozygotes
- 0: ‘theta’. The expected nucleotide diversity and we expect theta=pi
Calculating theta by waterson
Waterson (1975) found that q = 4Neµ and it can be calculated from S.
Calculating theta under the neutral model
Tajimas D test
compares q (neutral expectation) to p (observed) to address frequency spectrum deviations
Tajimas D = p- q/
Sqr Var(p- q)
-if the value is negative, there is an excess of rare singelton
-if the value is positive, there is a lack of rare singleton
What would the reason be for the deviation between pi and theta
-population size does not stay the constant
coalescence
Looking back in time the lineages of all contemporary alleles will eventually “coalesce” to a single ancestor
There are n-1 event coalescence per sample size of n
On average, it will take 2Ne Generations to go from a large sample down to two lineages and the total for coalescent is 4Ne
The effect of a bottleneck on the coalescent
- a positive tajimaD, fewer mutation than expected if there is a recent bottleneck
-negative tajimaD, more mutation than expected if the bottleneck event is long ago
Coalescence with expanding population size
-a negative TajimaD result can indicate a population boom
What can TajimaD tell
-if there is a size fluctuation in the past
TajimaD and cotton bollworm
Genomic study of 141 individuals of cotton bollworm, which were collected from 13 locations in three cotton-producing regions of China, namely the Yellow River Region (YRR), the Changjiang River Region (CRR) and the Northwestern Region (NR)
* 5,227,071 high-quality SNPs
* Mean Tajima’s D values of different populations ranged from −1.22 to −0.67 show that there is a population expansion recently
Linkage Disequilibrium
the non-random association of alleles at different loci in a population