Association Analysis Flashcards

1
Q

Define the term “Genetic Association”

A

The presence of an allele at a higher frequency in unrelated subjects with a particular trait, compared to those that do not have the trait

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do we determine whether variants in the genome are associated with a disease?

A

If we substitute the word “disease” for “trait”.
- With disease = cases
- Without disease = Controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe how a genetic association study is conducted

A
  • Cases are subjected with the disease of interest E.g obesity
  • Definition of the disease must be applied consistently
  • Controls must be as well (similarly) matched as possible for the non disease traits
  • such as age, sex, ethnicity, location
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What other factors need to be catered for?

A

Match for all other risk factors
- affected/unaffected cases
- Measure genetic loci of interest
- Statistical analysis which genetic loci are associated with disease
- Identify genes/genomics region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How would you make the study fair?

A
  • Use a large number of well defined cases
  • use an Equal number of matched controls
  • Reliable genotyping technology (SNP microarray)
  • Standard statistical amalysis (PLINK)
  • Positive associations should be replicated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does using a genetic marker fit well in a genetic association study?

A

Individuals in a population are genetically far more diverse than individuals in a single family

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is this genetic diversity captured?

A
  • This genetic diversity is captured through reliable genetic markers
  • Genetic markers are alleles that we can genotype and assess whether they are associated with the disease
  • Association means <100kb from a casual variant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the ideal Genetic Marker?

A

If it is
- Polymorphic
- Randomly distributed across the genome
- Fixed location in genome
- Frequent in genome
- Frequent in population
- Stable with time
- Easy to assay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is a Single Nucleotide Polymorphism (SNP) used in a genetic association study?

A
  • Common in the genome ~1/300 nucleotides
  • 12 million common SNPs identified in human genome
  • Generated by mismatch repair during mitosis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Where could SNP’s be found?

A

In the gene (Coding region)
- No amino acid change (synonymous)
- Amino acid change (non-synonymous)
- New stop codon (nonsense)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Where else could SNP’s be found?

A

In the gene (Non coding region)
- Promoter: mRNA and protein level changed
- Terminator: mRNA and protein level changed
- Splice site: Altered mRNA, altered protein

Could also be found in the intergenic region (98% of genome)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe what a dbSNP is

A
  • An online database at NCBI, database of SNP’s
  • The rs number is a unique identifier given to each SNP
  • Has two unique flanking sequences between a single polymorphism
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe what a minor Allele Frequency is (MAF)

A

SNP’s have two forms. The major and minor form.
- The less common allele is called the minor allele
- Major allele frequency + Minor allele frequency = 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why are SNP’s chosen for genetic association studies?

A
  • SNP’s are chosen on the basis of their MAF
  • Common diseases are likely to be caused by common variants
  • SNPs with MAF >0.05 (5%) are usually used in association studies - GWAS
  • Exceptions are known monogenic disease SNPs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is a Genome Wide Association Study carried out?

A
  • Recruit large numbers of cases and controls
  • Genotype markers across the whole genome
  • Look for association between disease and alleles of each marker (Chi squared test)
  • Positive association is at p<5x10-8 (multiple testing correction)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How are the GWAS results plotted? VD

A
  • Presented as a single graph called a Manhattan plot
  • All results are plotted typically for >1M SNPs
  • X-axis Is the position of the SNP on the chromosome
  • Y-axis is -log10 (p-value) of the association
17
Q

View the results from the Wellcome Trust Case Control Consortium study. How are these results interpreted?

A
  • The peak of association often does not identify the gene causing the disease
  • The peak identifies the genomic region associated with disease and this is usually smaller than 100kb.
18
Q

When is a meta analysis used?

A
  • When is it difficult to do large studies (>1K cases/controls)
  • Easier to combine smaller studies
  • Meta analysis allows the statistical combination of regulars from multiple studies
19
Q

What are the problems with GWAS?

A
  • GWAS has identified associations that are statistically strong and reproducible
  • However their contribution to the genetic component of disease is estimated to be low (<5%)

Possible answers
- Many common SNPs of very small effect
- Rare SNPs
- Copy Number Variation
- Epigenetic variation

20
Q

What are some of the medical complications of obesity?

A
  • Pulmonary disease
  • Coronary Heart Disease
  • Severe pancreatitis
  • Stroke
  • Cancer
21
Q

How strongly is Obesity linked to genetics?

A
  • Twin studies: 70-80% of body shape is genetically determined
  • Adoption Studies: 30-40%
  • Family Studies: 40-60%
22
Q

What was the results of the large scale meta analysis done on obesity?

A
  • BMI meta analysis in 322k subjects
  • 97 BMI associated loci
  • 125 separate studies
  • > 60l authors and >2000 collaborators
23
Q

What are some advantages of GWAS?

A
  • They have already highlighted hundreds of loci that are associated with obesity or BMI related traits
  • Results are reproducible
  • however there’s still a long way to understand how all these genes contribute to obesity
24
Q

What is the relationship between linkage analysis and recombination?

A
  • During meiosis, recombination occurs at random points along the chromosome
  • Recombination reduces the linkage between the two loci over time in proportion to the distance between them
25
Q

What is Linkage Disequilibrium?

A
  • LD is when two alleles are inherited together more often than expected by chance
  • This is usually because they are close together in the genome
  • Alleles that are physically close together are rarely separated by recombination
26
Q

How is Linkage disequilibrium used in disease gene mapping?

A
  • If a genetic marker allele and an undiscovered allele for disease susceptibility are in LD then the genetic marker will be associated with disease
  • Genotype many marker alleles and we can find the regions of associations with disease - GWAS
  • Remember that the marker alleles are close together in a GWAS so we would expect to see many alleles associated with disease in a region due to LD