Association Analysis Flashcards
Define the term “Genetic Association”
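genetic association = a genetic variant (allele or genotype) occurs together with a trait or disease more often than would be expected by chance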
Describe how a genetic association study is conducted
- recruit cases and controls (well matched for other risk factors)
- genotype the genetic loci of interest: which allele is significantly more frequent in cases than in controls?
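The case/control comparison above can be sketched in a few lines. This is a minimal illustration only: the genotype counts are made up, and the function names (allele_counts, allele_frequency) are hypothetical helpers, not part of any library.

```python
def allele_counts(n_AA, n_Aa, n_aa):
    """Convert genotype counts into (count of A alleles, count of a alleles)."""
    # Each homozygote carries two copies, each heterozygote one copy of each allele.
    return 2 * n_AA + n_Aa, 2 * n_aa + n_Aa


def allele_frequency(count, other_count):
    """Frequency of one allele given the counts of both alleles."""
    return count / (count + other_count)


# Made-up genotype counts (AA, Aa, aa) at one SNP, 500 people per group.
cases = allele_counts(90, 210, 200)
controls = allele_counts(160, 240, 100)

case_a_freq = allele_frequency(cases[1], cases[0])
control_a_freq = allele_frequency(controls[1], controls[0])

# Allelic odds ratio: odds of carrying allele a in cases vs controls.
odds_ratio = (cases[1] * controls[0]) / (cases[0] * controls[1])

print(f"a allele: {case_a_freq:.2f} in cases, {control_a_freq:.2f} in controls")
print(f"odds ratio = {odds_ratio:.2f}")
```

An odds ratio above 1 suggests allele a is enriched in cases; whether that enrichment is more than chance still has to be tested statistically.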
Describe an ideal genetic marker and how SNPs fit this description
genetic markers:
- polymorphic
- random distribution across genome
- frequent and fixed position
- stable with time
- easy to genotype
SNPs:
- occur roughly 1 in every 300 nucleotides on average
- 12 million recorded in database
- generated by DNA replication errors that escape mismatch repair; inherited when they arise in the germline
- described by their MAF (minor allele frequency): the frequency of the less common allele in the population
- location in coding region: no amino acid change = synonymous, amino acid change = non-synonymous (missense), new stop codon = nonsense
- location in non-coding region: promoter, terminator, splice site / intergenic region
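The three coding-region outcomes can be made concrete by translating the codon before and after the SNP. This is a sketch assuming the standard genetic code; classify_coding_snp is a hypothetical helper name, and the example codons are chosen only for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_coding_snp(ref_codon, alt_codon):
    """Classify a single-codon change by its effect on the protein."""
    ref_aa = CODON_TABLE[ref_codon.upper()]
    alt_aa = CODON_TABLE[alt_codon.upper()]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "nonsense"
    # Any other amino acid change (including loss of a stop codon) ends up here.
    return "non-synonymous"


print(classify_coding_snp("GAA", "GAG"))  # Glu -> Glu
print(classify_coding_snp("GAG", "GTG"))  # Glu -> Val
print(classify_coding_snp("TGG", "TGA"))  # Trp -> stop
```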
Describe the principles of a genome-wide association study (GWAS)
GWAS:
- recruitment of large sets of cases and controls
- genotype everyone on SNP microarrays, using SNPs as markers across the whole genome
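- chi-squared test per SNP: compares allele frequencies between cases and controls; a p-value below 5% means a difference this large would arise by chance less than 5% of the time if there were no true association
- because so many SNPs are tested at once, a much stricter genome-wide threshold is conventionally used (p < 5 × 10^-8)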
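As a worked example of the per-SNP test, here is a minimal sketch of a chi-squared test on a 2x2 table of allele counts (cases vs controls). The counts are made up, allelic_chi_squared is a hypothetical helper, and no continuity correction is applied. For 1 degree of freedom the p-value can be computed with math.erfc, so no external library is needed.

```python
import math


def allelic_chi_squared(case_a, case_b, control_a, control_b):
    """Pearson chi-squared test (1 degree of freedom) on a 2x2 allele-count table."""
    observed = [[case_a, case_b], [control_a, control_b]]
    row_totals = [sum(row) for row in observed]
    col_totals = [case_a + control_a, case_b + control_b]
    total = sum(row_totals)

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            # Count expected in this cell if allele and disease status were independent.
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed[i][j] - expected) ** 2 / expected

    # Upper tail of the chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Made-up allele counts: risk allele vs other allele, in cases then controls.
chi2, p = allelic_chi_squared(610, 390, 440, 560)
print(f"chi2 = {chi2:.2f}, p = {p:.2e}")
```

In a real GWAS this test is repeated for every SNP on the array, which is why the significance threshold has to be far stricter than 5%.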
Describe and interpret a Manhattan plot
Manhattan plot:
- y-axis: -log10(p-value) of each SNP; x-axis: SNP position, ordered by chromosome
- alternating colours for each chromosome
- higher = smaller p-value = stronger association; neighbouring SNPs rise together into a peak because of linkage disequilibrium
- skyscrapers = peaks of associated SNPs
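The coordinates behind a Manhattan plot can be computed without any plotting library. The sketch below assumes integer chromosome numbers, made-up p-values, and a conventional genome-wide threshold of p < 5 × 10^-8; manhattan_points is a hypothetical helper name.

```python
import math

GENOME_WIDE_THRESHOLD = 5e-8  # conventional cut-off, assumed here


def manhattan_points(results):
    """Turn (chromosome, position, p_value) tuples into plot coordinates."""
    points = []
    offset = 0          # x-shift so chromosomes sit side by side
    prev_chrom = None
    prev_end = 0        # last position seen on the previous chromosome
    for chrom, pos, p in sorted(results):  # by chromosome, then position
        if chrom != prev_chrom:
            offset += prev_end
            prev_chrom = chrom
        prev_end = pos
        points.append({
            "chrom": chrom,
            "x": offset + pos,
            "y": -math.log10(p),      # small p-value -> tall point
            "colour": chrom % 2,      # alternate shading per chromosome
            "significant": p < GENOME_WIDE_THRESHOLD,
        })
    return points


# Made-up association results for five SNPs on three chromosomes.
results = [(1, 100, 0.2), (1, 500, 1e-3), (2, 50, 1e-9), (2, 300, 3e-8), (3, 80, 0.5)]
points = manhattan_points(results)
for pt in points:
    print(pt)
```

Plotting x against y, coloured by the colour field, gives the familiar skyline; the points flagged significant are the ones that rise above the threshold line.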
Describe and interpret a regional association plot
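regional association plot = zooms into one skyscraper peak: the associated SNPs are plotted by position against the genes in that chromosomal region; the SNP with the highest -log10(p-value) shows the strongest association and is likely to be in the strongest linkage disequilibrium with the causal variant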
Describe meta-analysis and how it is used in genetic studies
meta-analysis = combining the results of many GWAS (your own data plus other studies) into one analysis; pooling multiple small studies increases sample size and statistical power
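One common way to combine studies is an inverse-variance weighted fixed-effect meta-analysis. The notes do not say which method is meant, so this is an assumed illustration with made-up effect sizes; fixed_effect_meta is a hypothetical helper name.

```python
import math


def fixed_effect_meta(studies):
    """Combine (effect size, standard error) pairs from several studies of one SNP."""
    # More precise studies (smaller standard error) get more weight.
    weights = [1 / se ** 2 for _, se in studies]
    beta = sum(w * b for w, (b, _) in zip(weights, studies)) / sum(weights)
    se = math.sqrt(1 / sum(weights))
    z = beta / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p_value


# Made-up per-study estimates (log odds ratio, standard error) for the same SNP.
studies = [(0.10, 0.05), (0.12, 0.04), (0.08, 0.06)]
beta, se, z, p = fixed_effect_meta(studies)
print(f"combined beta = {beta:.3f}, se = {se:.3f}, z = {z:.2f}, p = {p:.1e}")
```

The combined standard error is smaller than that of any single study, which is the gain in power that motivates pooling.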
Describe the known problems with GWAS
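- the variants identified explain only a small part (around 5%) of disease risk
- copy number variants (CNVs) are not covered
- only common SNPs are tested, so rare variants are missed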
Give an example of how GWAS has identified susceptibility variants in genes in a common disease
- plot a Manhattan plot
- identify the peaks
- draw a regional association plot for each peak
- share/combine studies (meta-analysis)
e.g. obesity: variants in the FTO gene
Describe the relationship between genetic association and linkage disequilibrium
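linkage disequilibrium (LD) = non-random association of alleles at nearby loci (they are inherited together more often than expected by chance)
- a genotyped SNP that is associated with disease may not be causal itself; it may simply be in LD with the causal variant
- this is why neighbouring SNPs form a peak, and why a limited set of marker SNPs can cover the genome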
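Linkage disequilibrium between two SNPs is usually quantified with D, D' and r^2. The sketch below assumes both SNPs are polymorphic and uses made-up haplotype frequencies; linkage_disequilibrium is a hypothetical helper name.

```python
def linkage_disequilibrium(p_AB, p_A, p_B):
    """Return (D, D', r^2) from the AB haplotype frequency and the allele frequencies."""
    # D: how far the observed haplotype frequency is from independence.
    D = p_AB - p_A * p_B
    # Largest |D| possible given the allele frequencies.
    if D >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(D) / d_max if d_max > 0 else 0.0
    # r^2: squared correlation between the alleles at the two SNPs.
    r2 = D ** 2 / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return D, d_prime, r2


# Made-up frequencies: allele A at SNP 1, allele B at SNP 2, haplotype AB.
D, d_prime, r2 = linkage_disequilibrium(p_AB=0.4, p_A=0.5, p_B=0.5)
print(f"D = {D:.2f}, D' = {d_prime:.2f}, r2 = {r2:.2f}")
```

An r^2 near 1 means one SNP is a near-perfect proxy for the other, so an association signal at a marker SNP can reflect a causal variant nearby.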