L28 Human Population Genetics 1 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

How many ancestors do you have?

A

see onenote slides

Can one person be multiple ancestors at once?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How many ancestors are you genetically related to?

A

see onenote

How much did each of them contribute to your genome?

From grandparents we don’t get a clean cut 25% of their genome from them, can be around 23% or 27%
After parents, there is uncertainty of how much of our ancestor’s genome we inherit (inherit 50% from mum and 50% from dad)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Humans are not infinite

A

see onenote

ancestors at generation
n=2^n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Who does your genome come from?

A

see onenote slides

% genome inherited from each

parent: 50%
gparent: 25%
ggparent: 12.5%
gggparent: 6.75%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Human population genetics

A

All humans are related due to our evolutionary history

but the specifics are trickier:

  1. when?
  2. where?
  3. how much?
  4. what does it mean for you?
  5. what does it mean for the species?

population genetics: the extension of Mendelian genetics to evolving populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we talk about pop gen?

A

see onenote

need a formal means to describe processes such as:
- heterozyosity
- relatedness
- diversity
- mutations
- etc.
to synthesise these observations into coherent knowledge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Multiple ways of thinking about a population

A

see onenote

  1. full genealogy
  2. ancestry of extant lineages
  3. ancestry of sampled lineages
  4. coalescent of sampled lineages
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Our most basic tool - a single segregating locus

A

see onenote

single diploid locus
2 alleles: A and a
p = A freq.
q = a freq.
p + q = 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hardy-Weinberg equilibrium

A

see onenote

relationship between genotype and allele frequencies knows as h-w equilibrium

for it to hold, multiple assumptions must be true:

  1. random mating
  2. no selection
  3. no migration
  4. no mutation
  5. no genetic drift (infinite population size)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Beyond loci, to populations: the wright-fisher model

A

see onenote slides

still only one locus but now we explicitly model the passing of time

  1. population size is constant (not infinite)
  2. generations are discrete and non-overlapping
  3. all individuals are equally fit
  4. no mutation
  5. haploid individuals
  6. when extended to diploid mutations
    - non-assortative mating
    - no recombination

Wright-Fisher model, models genetic drift
- Binomial sampling with some extra parameters
Can be extended to incorporate things like selection

WF provides a robust null hypothesis for everything else we discuss

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Fundamental properties of the WF model

A

see onenote

each individual at generation t has 2N opportunities to become an ancestor to generation t+1

the probability of being ancestor to a specific individual is 1/2N

binomial distribution
n = 2N
p = 1/2N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

WF from the gene’s perspective

A

see onenote

there can be multiple copies of a gene in a generation

probability of being transmitted to generation t+1 now depends on number of copies of gene at t

still a random binomial process
n = 2N
p = x/2N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Extending the model: WF with mutation

A

see onenote

introduce mutation into the population with probability “mu” per site per generation

in humans, “mu”~1 x 10^-8 per site per generation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A window into the past: the coalescent

A

see onenote slides

can use assumptions of the WF model to go backwards and reconstruct the past from the present

Can answer questions such as:
- When did two individuals last share a common ancestor? When did they coalesce?
- Coalescent lets us date relatedness in a way that we can track
Can extrapolate current data back into the past to figure out relatedness between individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Coalescent lets us date relatedness and divergence between sequences

A

see onenote

  • given a DNA phylogeny, it allows us to find the time to most recent common ancestor (TMRCA) of sequences in the phylogeny
  • coalescent modelling can be expanded to incorporate selection, expansion etc.
  • gives us the framework against which to test predictions and observations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

But real populations are far more complicated

A

WF model is unrealistic but it gives us a mathematical language to describe expectations under neutrality, as well as mutation, selection, population growth etc.

does the parameter of the model resemble observed data?

WF and coalescent modelling empowers us to make robust judgements about actual populations and their genetic data

17
Q

Describing population structure

A

Population
- Randomly reproducing group of individuals

random mating = random relative to genetics
- meaning there is no underling population structure or subpopulations

but humans violate these assumptions:

  1. geographic structure/isolation by distance
  2. assortative mating
  3. inbreedinge etc.
18
Q

F statistics for relationships between populations

A

see onenote

H statistics = measurements of heterozygosity and diversity

  • Hi = heterozygosity within an individual
  • Hs = heterozygosity within a subpopulation
  • Ht = heterozygosity within the total population

F statistics = measure of population structure and differentiation

  • Fxy = correlation between gametes drawn from x relative to y
  • Fis = describes correlation (or diversity) between gametes from a single individual relative to what is seen in the subpopulation

F and H statistics are hierarchical

Fst is the most widely used F statistic in population genetics
- it describe the proportion of the variance due to between-population rather than within-population differences

19
Q

The international HapMap project

A

see onenote slides

aimed to generate a haplotype map of the human genome to be used in mapping genes associated with any disease

use of trios allowed for phasing of data and identification of haplotypes

20
Q

Haplotype

A

a block of SNPs physically on the same chromosome, inherited together due to LD

haplotypes are used to identify recombination break point, linkage blocks and tag SNPs

tag SNP = representation SNP for an entire haplotype

21
Q

HGDP-CEPH and 1000 genomes

A

see onenote

Human genome diversity project
- for ethnic and evolutionary studies

1000 genomes
- for broadly medical studies

22
Q

HGDP-CEPH - treasure trove or ethics quagmire?

A

see onenote

aim of the project was studies of human history and evolution - not medical research but all project data is publicly available so…

sought to collect DNA from isolated populations “at risk of becoming extinct” without working with the communities to prevent it

biocolonialism has resulted in many indigenous groups worldwide refusing to participate in genetic research when the distribution of benefits is inequitable