Finding Disease Genes Flashcards

1
Q

Candidate Gene Association Study

A

Hypothesis driven approach

Uses markers to test gene/causal variant indirectly

Depends on a priori biological or positional hypothesis (almost always wrong!)

Fatal flaws lead to false positives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Genome Wide Association Study

A

Hypothesis Free approach

Rather than look gene by gene (candidate gene association study) we could do whole genome at one time!

Search for SNPs with significantly different allele frequencies in cases verse controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Genetic linkage study

A

Hypothesis free approach

Search for genome disproportionately coinherited along with disease in multiplex families

Assumes affected relatives within a family share disease susceptiblity genes “identical by descent”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Exome/Genome Sequencing Study

A

Sequence
Compare to reference
Look at common anomalies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Single gene sequencing

A

Sequence hypothesized gene

Most hypotheses wrong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does genome mapping require?

A

Polymorphic DNA markers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Do we sequence the entire genome?

A

No… still too expensive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What do polymorphic DNA markers do for us?

A

They provide “Sign posts” where we can look at differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Polymorphic DNA makers are surrogates for what?

A

Potential disease mutations;

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are three commonly used marker types?

A

Microsatellites
SNPs
CNVs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Gene Mapping: what are physical maps?

A

Maps that tell us absolute positions - this is here and this is here

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Gene Mapping: what are genetic maps?

A

Relative maps based on recombination - across a whole population roughly how far apart are these two things/sequences from each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Microsatellites

A

Simply a repeat sequence in the genome for which the copy number varies
Simple sequence repeats
Used in forensics
Multi-allelic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Single nucleotide polymorphisms

A

Bi-allelic
Used for association studies

Occurrence/allele frequencies differ in ethnic groups/populations

SNPs occur in local context (haplotype) of surrounding SNPs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How frequent are SNP?

A

1/50-300bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

SNP haplotypes

A

Recombination breaks macro-pattern of polymorphic genotypes on the same chromosome into blocks in which SNP alleles are in linkage disequilibrium (makers within blocks tend to be co-inherited because recombination within blocks is uncommon)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

If you genotype enough SNPs to identify a haplotype you can impute other variation that wasn’t genotyped and use this to infer ?

A

causal variation took place in this haplotype, even though SNP may not be causal variant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Copy Number variants

A
Common genomic deletions
Bi-allelic
Multi-allelic 
Unique
Most not causal for human disease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

If we have a common disease allele that has a small effect, what studies are best suited to hunt for the disease gene?

A

Association

Candidate gene or GWAS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

If we have a rare disease allele that has a large effect, which studies are best suite to hunt for the disease gene?

A

Linkage
Sequencing
(track genes through families using linkage)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

To track things that are common but have relatively little effect we use which type of studies?

A

Association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

To track big effect genes that are relatively rare we use which kind of study?

A

Linkage in families

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Hypothesis Driven Studies

A

Candidate DNA Sequencing

Candidate Gene association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Candidate gene DNA sequencing

Where do we come up with our candidate?

A

biological or positional

“hit” from GWAS or other mapping method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

When do Candidate DNA studies work?

A

Single gene Mendelian diseases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Candidate DNA sequencing, are most hypotheses correct?

A

NO! most are wrong!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Which type of study uses markers to test gene/causal variant indirectly?

A

Candidate gene association studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Which genetic study is the most common?

A

Candidate gene association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What do candidate gene association studies depend on?

A

A prior hypothesis

30
Q

What are fatal flaws of candidate gene association studies and what do they lead to?

A
  1. Multiple-testing correction impossible
  2. Ethnically matched impossible
    False positives!
31
Q

Concept behind Candidate Gene Association study?

A

Causal disease variation in candidate gene is tagged by local haplotype of polymorphic DNA markers in Linkage Disequilibrium

Depends on Linkage disequilibrium

32
Q

Candidate gene association studies depend on linkage disequilibrium - in that?

A

DNA sequence variations close together on the same piece of DNA will tend to not be separated by recombination over long periods and so will be non-randomly co-inherited

33
Q

Candidate gene association studies - approach

What kind of study design?

A

Case control

34
Q

Candidate gene association studies - (2)

A
  1. genotype marker in candidate gene in cases nad controls

2. compare allele frequencies in cases and controls

35
Q

G.A.S.

Study size?

A

Hundreds

36
Q

G.A.S. stats?

A

Uses simply stats (chi sqaure, Fisher exact) p

37
Q

Genetic association studies - because we test multiple variants - what must we do?

A

We must apply multiple-testing correction

38
Q

G.A.S.

What does an association imply?

A

Not causation but does imply at least linkage disequilibrium with causal mutation

39
Q

What is the issue with multiple testing correction in G.A.S.

A

Have to take into account every variant of every study ever done - unrealistic
Take that number and divide you P value by the number of variants to get new significance value, which will be much much lower.

40
Q

G.A.S.

2 Fatal Flaws…

A
  1. True multiple testing correction must include all tests, even those done by others and perhaps never published
  2. Must ethnically match cases and control; otherwise, observed differences in allele frequencies may reflect different genetic backgrounds of cases and control, not true disease association - not possible to achieve
41
Q

Why can’t we ethnically match cases and controls in G.A.S.?

A

Because even in homogenous population, occult population differences (stratification) can lead to false positives

42
Q

What percent of published (3x confirmed) genetic association studies ultimately appear to be false positives due to stratification and publication bias?

A

96%

43
Q

Is genetic linkage analysis hypothesis free?

A

Yes!

Search genome for segments disproportionately co-inherited along with disease in “multiplex” families

44
Q

What is the underlying assumption in genetic linkage analyses?

A

Affected relatives within a family share disease susceptiblity genes
“identical by descent”

45
Q

What traits are best suited for genetic linkage analysis?

A

Mendelian (uncommon alleles with strong effects)

46
Q

Genetic linkage analysis for complex traits ?

A

Less powerful

47
Q

What can be said to be a search across the genome for marker(S) that co-segregate with disease in families?

A

Genetic linkage analysis

48
Q

What does genetic linkage depend on?

A

Principle depends on recombination - Loci close to each other (marker and gene) on a chromosome tend not to be separated by recombination vs. loci far apart

49
Q

What is the unit of genetic linkage/recombination?

A

centiMorgan (cM)

1 cM = 1% recombination between two loci per meiosis

50
Q

What is the statistical measure or linkage in genetic linkage analysis?

A

Log of odds score

51
Q

LOD =

A

Log10 (likelihood of data if loci linked at _cM / likelihood of data if loci unlinked)

52
Q

What is the significance level for genetic linkage analysis for LOD score?

A

> or equal to 3 is considered proof of linkage/gene localization

53
Q

How do we localize a gene using genetic linkage analysis?

A

We can follow ancestral haplotypes of linked marker alleles in each family

Through the generations recombination evens prune the haplotype –> Localizing the gene

54
Q

What are you looking for in genetic linkage analysis?

A

You are looking for a region of the genome where there is something that seems to be shared among affected relatives that you assume has been inherited from a common ancestor given the family structure

55
Q

What kind of studies are GWAS?

A

Case-control

56
Q

What do GWAS do?

A

Test hundreds of thousands / millions of markers (SNPs) across the entire genome

57
Q

What are we looking for in GWAS?

A

SNPs with significantly different allele frequencies in cases vs. controls

58
Q

Do we still need to match cases and controls ethnically?

A

Yes, and we are met with the same stratification problem, however, now we can accurately measure and correct for population stratification (whole genome)

59
Q

GWAS are we still faced with multiple testing correction issue?

A

NO! we know the number of tests performed genomewide; so we can perform appropriate multiple testing corrections (usually assume one million tests, so p

60
Q

Because we have a huge multiple-testing correction in GWAS, how big must our study be?

A

Usually at least 1000 cases and controls

61
Q

What happens if we find a significant association in a GWAS study?

A

We require confirmation by independent replication by follow up association study of specific SNPs

62
Q

When are GWAS most effective?

A

Common alleles with moderat effect sizes (ORs) 1.5 to 1.15

63
Q

What limits GWAS?

A

Sample size

64
Q

What is the hope of GWAS?

A

That we will be able to determine genetic architecture of disease - infer th ebiological pathway - and then separate/recategorize disease based on pathway it follows - which could significantly increase our odds ratio - because it would no longer be diluted by irrelevant pathways that cause same disease

We are trying to tease out genes specific to pathways

65
Q

Are most GWAS findings coding?

A

No, most are regulatory in nature, which is good because it may be easier to target and treat

66
Q

Which investigation combine hypothesis based and hypothesis free approaches?

A

Deep re-sequencing

67
Q

What is Deep re-sequencing?

A

High throughput DNA sequencing

  • of Biological candidate genes
  • from GWAS signlas
  • Full genome or exome
68
Q

What is a problem of Deep Sequencing?

A

Difficult to distinguish potentially causal variants from non-pathologic
- Prioritize for follow-up functional analysis

Variants of unknown significance

69
Q

Exome Genome Sequencing

How it works?

A

Pull down genome
Sequence
Reference
What’s different

70
Q

Is noise an issue with Exome/Genome Sequencing?

A

Yes! There is a lot of noise

71
Q

Exome/Genome Sequencing Filtering Schemes

A

Make assumption that disease populace share similar things -
Sequence effected patients / effected family

Look for something rare

Find something? Look at catalogs

Do LOD score