W4L1 Molecular Population Genetics and more tests of Neutrality M Flashcards

1
Q

What is the advantage of neutral theory

A

it gives us expectations about molecular evolution in the absence of selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Way of identifying changes in nucleotide

A

-Measure every polymorphic site
-measure minor allele frequency and count total amount of singleton via frequency spectrum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Measuring nucleotide diversity

A

-create a table which compare the amount of difference between each alleales polymorphic site
-create a average pi

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Descriptive metric unit

A
  • S (the number of segregating sites)
  • p (the frequency of a variant)
  • MAF (minor allele frequency)
  • Haplotypes ( a particular combination)
  • p (average pairwise divergence)
  • Hobs (the frequency of heterozygotes)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Some theoretical metrics to give expectation

A
  • Ne : The effective population size
  • Hexp : the expected number of heterozygotes
  • 0: ‘theta’. The expected nucleotide diversity and we expect theta=pi
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Calculating theta by waterson

A

Waterson (1975) found that q = 4Neµ and it can be calculated from S.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Calculating theta under the neutral model

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Tajimas D test

A

compares q (neutral expectation) to p (observed) to address frequency spectrum deviations
Tajimas D = p- q/
Sqr Var(p- q)
-if the value is negative, there is an excess of rare singelton
-if the value is positive, there is a lack of rare singleton

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What would the reason be for the deviation between pi and theta

A

-population size does not stay the constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

coalescence

A

Looking back in time the lineages of all contemporary alleles will eventually “coalesce” to a single ancestor
There are n-1 event coalescence per sample size of n
On average, it will take 2Ne Generations to go from a large sample down to two lineages and the total for coalescent is 4Ne

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The effect of a bottleneck on the coalescent

A
  • a positive tajimaD, fewer mutation than expected if there is a recent bottleneck
    -negative tajimaD, more mutation than expected if the bottleneck event is long ago
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Coalescence with expanding population size

A

-a negative TajimaD result can indicate a population boom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What can TajimaD tell

A

-if there is a size fluctuation in the past

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

TajimaD and cotton bollworm

A

Genomic study of 141 individuals of cotton bollworm, which were collected from 13 locations in three cotton-producing regions of China, namely the Yellow River Region (YRR), the Changjiang River Region (CRR) and the Northwestern Region (NR)
* 5,227,071 high-quality SNPs
* Mean Tajima’s D values of different populations ranged from −1.22 to −0.67 show that there is a population expansion recently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Linkage Disequilibrium

A

the non-random association of alleles at different loci in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a haplotype

A

is one combination of allelic states that is inherited together

17
Q

How is haplotype created

A

-at first, there is 3 linked sites, after a few generation, drift may bring the new mutant-bearing allele to higher frequency, creating linkage disequilibrium in preexisting polymorphism. But it can be erode by recombination overtime

18
Q

Loss of haplotype overtime

A

Let’s call the recombination rate between two polymorphisms c
The probability that the haplotype is NOT broken down in one generation
P=1-c
The probability the haplotype is not broken down in G generations is:
P=(1-c)^G

19
Q

Homosapien and Neanderthal cross breeding

A

Admixture: the mixing of two genetically differentiated population
Initial Linkage dismorphia will be porpotional to alleles frequency differences between the two population and is unrelated to the distance between the markers
Extended LD: dependent on migration rate and recombination rate

20
Q

Factors affecting LD

A

Mutation, drift, limited recombination
Demographic effects –Admixture
–Bottlenecks eg. Founder effects
Selective sweeps

21
Q

Selective sweep and LD

A

Left in the patterns of nucleotide polymorphism
* Locally reduced variation
* Skewed frequency spectrum negative TajimaD
* Increased LD/haplotypes

22
Q

FOXP2 selective sweep example

A
  • mutations associated with severe articulation problems, linguistic and grammatical impairment
    -it is a regulator for a lot of gene
    -only 2 amino acid between mouse and chimp but 2 difference between human and chimp
    -but it turned out that the result is due to sampling bias