Gene Identification Flashcards

1
Q

General goal of linkage analysis

A

Identify marker alleles or haplotypes that are co-segregated with a disease phenotype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How linkage is detected

A

Determining the amount of recombination between two markers

Rarer recombination between two loci are, the closer together they are on a chromosome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Do linked loci segregate independently during meiosis?

A

No- they are inherited together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Genetic markers definition

A

DNA variants in a given population or pedigree that are linked to a given disease or trait

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Do genetic markers associated with a trait generally cause that trait?

A

No- they generally are close to the causal genes, thus inherited together with them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Two major types of genetic markers

A

Variable length polymorphisms and SNPs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Variable length polymorphisms

A

Type of genetic marker
Characterized by long stretches of repeats
Different individuals have varying lengths of these repeats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Single nucleotide polymorphisms (SNPs)

A

Type of genetic marker
Characterized by 1 base difference between individuals (2 different allele possibilities at a locus: some individuals carry one, some carry another)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How SNP genotyping works

A

Used for already discovered SNPs
Different colored probes hybridize to different nucleotides (ex- homozygous A allele is one color, homozygous B allele is a second color, and heterozygous is a third color)
Hybridized DNA is shown on a plate: every SNP shows up as a dot on the plate
Can show up to several million SNPs on one plate
Each plate is one individual: can have multiple plates on a large plate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Genetic distance

A

How often two markers are separated by recombinations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Physical distance

A

Related to genetic distance

How many bases there are in between two markers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What “1 centiMorgan” refers to

A

1% recombination per generation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Is recombination uniformly distributed across the genome?

A

No- recombination resides primarily in “hot spots” (areas of higher recombination)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Recombination fraction

A

Designated as Greek letter theta
Measures degree of linkage: fraction of times two markers are separated by recombination in a pedigree
Ranges from 0 to 0.5 (in meiosis, 2 chromosomes stay together half the time)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What recombination fraction value (theta) is associated with linkage of two traits?

A

Two loci are said to be linked with theta is less than 0.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does a recombination fraction value (theta) of 0.5 mean?

A

Loci are segregating independently- no linkage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Log of the odds (LOD) score

A

Used to measure statistical significance of linkage

Higher score: linkage more likely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What LOD score is considered to be significant?

A

Greater than or equal to 3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Weaknesses of linkage analysis

A

Requires studying relatives (cannot use unrelated individuals)
Works best for rare, dominant traits (not useful for common traits)
Linkage regions are large and finding “the gene” is hard

20
Q

Do rare variants with large effects (as identified from pedigrees) underlie common genetic diseases?

21
Q

Common disease/common variant hypothesis

A

There are many variants with modest effects at higher frequency in the population that lead to common disease

22
Q

Significance- association analysis

A

Measures whether or not an association between a given SNP and disease is real

23
Q

Effect size- association analysis

A

Measures the strength of variant effect

24
Q

Power- association analysis

A

Measures the likelihood of finding a real association

25
What are p values used to measure?
Statistical significance
26
P value definition
Probability of observing a given association, if there is no real association (null hypothesis)
27
What p value indicates that an association is statistically significant?
p < 0.05
28
Genome-wide association studies (GWAS)
Testing millions of SNPs across the human genome to discover which genetic variations are associated with a given trait/disease
29
Which populations GWAS tests
Population with given trait/disease | Population without given trait/disease
30
Bonferroni correction: what it does and how it is calculated
Used to decrease number of false positives when testing a large number of samples Calculated by dividing 0.05 by number of tests
31
2 ways to quantify effect size
Odds ratio | Relative risk
32
When is odds ratio used?
Case-control designs
33
When is relative risk used?
Cohorts and general population samples
34
Odds ratio formula
``` a/b divided by c/d a- cases with allele b- controls with allele c- cases without allele d- controls without allele ```
35
Relative risk formula
``` a/(a+b) divided by c/(c+d) a- cases with allele b- controls with allele c- cases without allele d- controls without allele ```
36
Interpretation of odds ratio
Used to determine how likely an individual with the risk allele is to have the specific phenotype Bigger odds ratio: greater likelihood of having disease Odds ratio of 2 means that person who inherits risk allele is twice as likely to develop disease
37
Interpretation of relative risk
Relative risk = 1: no difference in risk between experimental and control group Relative risk <1: event is less likely to occur in experimental group Relative risk >1: event is more likely to occur in experimental group
38
How to calculate absolute risk
Multiply population risk by relative risk
39
What is GWAS useful for finding?
Genetic variations contributing to common complex diseases where many common SNPs are associated with small effect sizes as well as large effect size from more rare SNPs for less common conditions
40
How it is possible to see the effects of all the SNPs that were not directly genotyped in a GWAS
Not all theoretical haplotypes for a given combination of alleles on a chromosome exist- haplotype of a given individual can be inferred
41
Linkage disequilibrium
Non-random association of two or more alleles (in GWAS, marker allele and disease-causing allele)
42
How linkage disequilibrium is measured
Correlation coefficient (r^2 value): value ranges between 0 and 1 0: complete equilibrium (random segregation) of two alleles 1: complete linkage disequilibrium
43
Do larger or smaller sample sizes yield higher power?
Larger sample sizes
44
Do larger or smaller effects yield higher power?
Larger effects
45
Is power higher for SNPs with higher or lower allele frequency?
Higher allele frequency
46
3 outcomes of low powered studies (i.e. small sample size)
True effects can be missed Effect estimates can be less precise (even in the wrong direction) Some "detected" effects can be false positives