GWAS Flashcards
looking at the threshold model for multifactorial disease: if individuals cross the threshold they will have what?
combination of increased number of risk alleles or increased number of environmental risk factors
the relative contribution of each locus to overall population depends on:
magnitude of the effect of the risk allele
allele frequency
dominance
epistasis
risk alleles may be
common but have a small effect
rare with a large effect
mixture of oligogenes modified by polygenes and modifiers
effect size determines ease of what
identification or mapping
only a small proportion of individuals with the at risk genotype may suffer from the disease T/F
T
there are only a small number of alleles that are associated with a large risk of a complex disease. which locus and mutation is associated with type 1 diabetes
HLA locus
DQβ1 asp57any
if individuals are homozygous for which allele they are mmore likely to get alzheimers
ApoE ε4
what is an SNP
variation in a single nucleotide that occurs at a specific position in the genome
if you compared any 2 genomes you would find a difference around ever how many bases
1kb
in a population comparison with multiple genomes you would expect a difference every…
300bp
how many common SNPs are there
15million
how do we measure the effect sizes of alleles
Odds ratio
how do we calculate the odds ration (OR)
(cases/non cases genotype1) / (cases/non cases genotype2)
an odds ratio less than 1 indicates
condition is less likely to occur with the the first genotype
an odds ratio of more than 1 indicated
condition is more likely to occur with the the first genotype
what is genotype relative risk (GRR)
the ratio of risk with the allele compared to without the allele
what is the difference between OR and GRR
Relative risk is a ratio of probabilities whereas odds ratio is a ratio of odds
OR can overestimate and magnify risk, especially when the disease is more common (eg, hypertension) and should be avoided in such cases if RR can be used. T/F
T
what are the steps for conducting a GWAS
- collect datasets of cases and controls
- genotype each subject with a genome wide panel of SNPs
- calculare the OR for each allele in the panel for each individual in the dataset
- for each allele in genome wide panel - is the OR significantly different in cases and controls
if the OR is significantly different in cases compared to controls you can conclude
such alleles are associated with the disease
what are the 2 reasons of why an allele can be associated with a disease
1) the allele causes the risk
2) the allele is correlated with the causative allele because of linkage disequilibrium
which is more common
1) the allele causes the risk
2) the allele is correlated with the causative allele because of linkage disequilibrium
more common to be in linkage disequilibrium
what are haplotype blocks
set of closely linked alleles/markers on a chromosome that, over evolutionary time, tend to be inherited together
haplotype blocks mean that we can use one tag and easily work out the intervening sequences T/F
T
how can we work out the intervening sequences
because of linkage disequilibrium in haplotype blocks
the HapMap project was an international consortium that did what
used commercial microarrays to map haplotype blocks in the human genome
what do you use the map of haplotype blocks for
to select panel of SNPs that effectively stand as proxy for all other SNPs across genome
in GWAS you can get type 1 false positives - what does this mean
each SNP as individual test p value of 0.05 means the result would occur less than 1 in 20 times by chance. as you test more SNPs there will be more false positives