Genome wide association studies Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What types of diseases do rare but high impact alleles cause?

A

Mendelian disorders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What types of diseases do common but low impact alleles cause?

A

Multifactorial disorders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What types of diseases do common but high impact alleles cause?

A

Mendelian disorders with a heterozygote advantage. Favourable for heterozygotes but cause disease in homozygotes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is linkage disequilibrium?

A

Non-random association of alleles at two or more loci. A particular grouping of alleles at linked loci occurs more often than by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we determine the max number of haplotypes for a set of loci?

A

2^x, where x = number of loci being examined

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we determine if loci are showing linkage disequilibrium?

A

Determine the frequencies of alleles in the population, then use that to calculate the expected frequencies. Then do a Chi square test to determine if the observed frequencies differ significantly from the expected frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

If the Chi square test was significant, would there be evidence of linkage disequilibrium between the loci?

A

Yes, if the expected frequencies differ from the observed frequencies, there is a set of alleles inherited together more than what is expected by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does D mean in linkage disequilibrium?

A

The deviation, the magnitude of the disequilibrium

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What do we usually use as a measurement of linkage disequilibrium?

A

r^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does an r^2 of 0 mean for linkage disequilibrium?

A

Complete equilibrium and random association of alleles at the loci

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does an r^2 of 1 mean for linkage disequilibrium?

A

Complete disequilibrium

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two factors that will affect how long a region stays in linkage disequilibrium?

A

Time and distance. Will break down in alleles further apart over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are two international projects that have focused on documenting genomic variation in different populations?

A

International HapMap project and the 1000 genomes project. Have identified and mapped millions of SNPs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are haplotype blocks?

A

Regions in a chromosome with high degree of linkage disequilibrium and little diversity in which alleles are present

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What defines a haplotype block?

A

Fewer haplotypes in the population are present than what would be expected by 2^x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why are haplotype blocks useful for studying the association between a SNP and a disease locus?

A

Since they’re always inherited together and we can use one allele in there to predict the others, we can look at a representative SNP for that block and it cuts down our work

17
Q

Are allele frequencies static across populations?

A

Not in the slightest. Need to consider the population being studied

18
Q

What does LD LINK do?

A

Lets us select SNPs in a population and generates a heat map while also calculating r^2 and D. Cooler colours indicate lower r^2 and warmer colours indicate high r^2 and high LD

19
Q

Will the same alleles be in linkage disequilibrium between populations?

A

Nope. Can be in LD in one population and not the other

20
Q

What are LD maps?

A

Triangle heat map plots displaying the degree of LD between select SNP loci. Makes haplotype blocks really obvious (strong LD)

21
Q

What would be the ideal end goal of a GWAS study?

A

Look at every single SNP in the human genome for association with a disease

22
Q

Why don’t we need to look at every single SNP while doing GWAS studies?

A

Haplotype blocks. Some are in such strong LD that we only need to look for association between 1 or 2 of them from that block to know if there’s association with anything else in that block

23
Q

What are tag/proxy SNPs?

A

Representative SNPs from a haplotype block

24
Q

What are the two critical advances that have allowed us to do GWAS studies in the first place?

A
  1. Ability to do high throughput SNP genotyping with SNP arrays
  2. Knowledge that the genome is comprised of haplotype blocks with strong LD
25
Q

How is genotyping done in GWAS studies?

A

SNP chip arrays. One for each person and includes >500 000 SNPs on it

26
Q

How is a GWAS study set up?

A
  1. Get a big group of people with the disease, and another big group of people without it
  2. Genotype every single individual with SNP arrays
  3. Score the allele frequencies in each group
27
Q

How do we determine if a SNP allele and a disease state are associated?

A

Do a Chi square test to see if the SNP allele and the disease state are independent

28
Q

How do we tell if a SNP and disease state are associated with a Chi square test?

A

If the test stat > crit value, reject the null and they are associated

29
Q

Why isn’t it always feasible to have the same sample size for both the control and affected groups in a GWAS study?

A
  1. Some people might die before the study is complete
  2. Some people could revoke their consent
  3. Sometimes can’t recruit enough people for both groups (like if the disease is rare)
30
Q

What is the odds ratio?

A

Compares the risk of a complex disease in two groups with different genotypes

31
Q

What is the typical odds ratio we see for complex diseases? Why?

A

1.1 to 1.2, very small increases in risk. Because these alleles alone don’t do much, but they interact with other alleles and the environment

32
Q

How do you select individuals for your case group?

A

Select those that will best be able to find the associations, like the most severely affected, familial history, early age of onset

33
Q

How do you select individuals for your control group?

A

Select those with similar sex, age, and demographics to the case group, but are at the lowest risk of developing the disease

34
Q

What’s the most common way to display the results of a GWAS study?

A

Manhattan plot

35
Q

How do we interpret a Manhattan plot?

A

All the SNPs on the array are arranged by chromosome on the x-axis. The y-axis is the negative log of the p-value for every Chi square test. Threshold for significance is defined. Look for any SNPs above that threshold