Lecture 16 - human genetics Flashcards
What are different possible architecture of complex diseases?
- small of dominant alleles confer a large increase in risk
- common disease, common variant model - many alleles confer a small increase in risk
- intermediate - one major allele exerts a large effect, numerous other lower risk alleles
What are single nucleotide polymorphism (SNPs)?
- between 11-15 million common SNPs (Minor Allele Frequency >5%)
- uneven distribution of SNPs in the genome
Where do SNPs occur at?
- coding regions
- non-coding regions
What are coding regions?
- synonymous (no change in encoded amino acid)
- non-synonymous (e.g. missense or nonsense mutation)
What are non-coding regions?
- can affect expression/regulation of associated genes
- complex diseases arise from combinations of multiple SNPs
What is the exome aggregation consortium (ExAC)?
- exomes from unrelated individuals sequences as part of various disease-specific and population genetic studies
- 7.4 million variants mapped
- records frequency of alleles in a population
- records frequency of alleles in a population
- documents rare mutations
- highly pathogenic variants seen with a lower frequency in the general population
- gnomAD aggregates over 125,000 exome and 15,000 genome datasets
What are genome wide association studies?
Population-based studies looking at individuals with a condition against a control population
- examine a panel of SNPs in the genome for association with the disease phenotype
- search for alleles that occur more frequently in disease cases than in matched controls
- requires many participants
- GWAS studies have been performed for most common diseases
- many risk loci have yet to be identified
- missing loci contribute to the ‘missing heritability problem’
What did the genome wide association studies do?
- GWAS compare the allelic frequency across the entire genome in case and control populations
- significant differences in allelic frequency constitutes an association with disease
How are SNPs associated with disease?
- association studies can tell us if an allele is associated with a disease
- the SNP itself
- the SNP correlates with the risk allele due to linkage disequilibrium
What is linkage disequilibrium?
the non-random association of alleles at different genomic sites
What does linkage disequilibrium depend on?
- distance between alleles
- recombination rate
What can patterns of Linkage Equilibrium be summarised as?
Haplotype blocks
What are haplotype blocks?
regions of high linkage disequilibrium that are separated from other haplotype blocks by many historical recombination
What occurs in haplotype mapping?
groups of alleles are clustered so a single SNP can identify the cluster of alleles (Tag SNP)
How does identification of risk alleles occur?
- GWAS studies identify SNPs associated with disease, not necessarily risk alleles
- need integration with functional data on candidate regions to identify causality