Genome Wide Association Studies (GWAS) Flashcards
What technological advances allowed GWAS?
- International HapMap project delivering millions of mapped SNPs
- Extension of microarray technology allowed automated genotyping of huge numbers of SNPs
What are GWAS?
Genome wide association studies designed to identify common variants = assume common complex diseases caused by common variants. Using Case-control studies
Why use SNPs over microarrays in GWAS?
- Microarrays only occur 1/50,000 compared to 1/1000 for SNPs
- SNPs may be less informative but have genotyping platforms
What is the workflow of GWAS?
- Using HapMap data, SNPs selected which tag the common haplotypes at each locus
- Tagged SNPs genotyped in disease and control cases using microarray
- Allele frequencies for each SNP compared in 2 groups
- SNPs associated with disease are genotyped in replication test
- Which associations are robust
What are SNP chips?
Detect DNA variation via DNA chips, using known genomic sequence –> DNA hybridizes to chip
(key SNP tech)
What is a whole genome high density chip?
Collection of 223 high density arrays containing more than 10 billion unique oligonucleotides
How is GWAS data visualised?
On a Quantile-Quantile (Q-Q plot)
How to find a gene in GWAS?
- Locate a chromosomal area associated with a particular syndrome
- screen hundreds of patients to find minimal region associated
- map all gene candidates
- check all candidates (try mutations, KO etc)
- study gene functions in the tissue and tumours
What are the limitations of GWAS?
- common disease variants identified by GWAS have weak effects
- even cumulative contributions are small
- available GWAS data explain small portion of genetic variation of complex diseases = missing heritability
What explains the missing heritability in GWAS?
- large numbers of common variants have weak effects
- rare variants have large effect
- gene-gene and gene-environment interactions (doesn’t take into account genetic interactions)
What is the common variant hypothesis?
Different combinations of variants at multiple loci aggregate in specific individuals to increase disease risk. This explains the steep falling away of disease risk in relatives from ancestor with a common disease. Common variants are expected to be of ancient origin
What is the rare variant hypothesis?
Rare variants as recent mutations, and so they are likely to have strong effects on an individual. But natural selection says that rare variants that are super deleterious aren’t likely to stay for more than a couple generations. So they wouldn’t appear on common haplotypes.
With complex diseases, there are subtypes which segregate according to Mendelian ratios where there are rare mutations with strong effects.
So rare variants with moderate effect may not be detected because they’re just below GWAS detection threshold