Linkage Analysis Flashcards

1
Q

What is genetic variation and what are the different effects it can have?

A

Genetic variation refers to differences in the DNA sequence between individuals in a population

Genetic variation can have different effects:
Alteration of the amino acid sequence (protein) that is encoded by a gene
Changes in gene regulation (where and when a gene is expressed)
Physical appearance of an individual (e.g. eye colour, genetic disease risk)
Silent or no apparent effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is genetic variation so important?

A
  1. Genetic variation underlies phenotypic differences among different individuals
  2. Genetic variations determine our predisposition to complex diseases and responses to drugs and environmental factors
  3. Genetic variation reveals clues of ancestral human migration history
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the three different types of mutations?

A

Germline mutations: present in germ cells, passed on to descendants
Somatic mutations: present in somatic cells, not transmitted to descendants, can cause cancers
de novo mutations: new mutation not inherited from either parent, in early development but can then be passed on to next generation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is homologous recombination?

A

Homologous recombination: shuffling of chromosomal segments between partner (homologous) chromosomes of a pair

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is gene flow?

A

Gene flow: the movement of genes from one population to another (e.g. migration) is an important source of genetic variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between a mutation and polymorphism?

A

A mutation is a rare change in the DNA sequence that is different to the normal (reference) sequence.
- The ‘normal’ allele is prevalent in the population and the mutation changes this to a rare ‘abnormal’ variant
By contrast, a polymorphism is a DNA sequence variant that is common in the population. In this case no single allele is regarded as the ‘normal’ allele.
- Instead there are two or more equally acceptable alternatives
The arbitrary cut-off point between a mutation and a polymorphism is a minor allele frequency (MAF) of 1% (i.e. for a variant to be classed as a polymorphism, the least common (minor) allele must be present in ≥1% of the population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When does recombination happen?

A

The creation of haploid gametes (i.e. sperm and eggs)
Genetic recombination takes place during prophase 1 where the crossing over of homologous arms of chromosome happens
This is completely random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the classification of genetic disease?

A

Mendelian / Monogenic: disease that is caused by a single gene, with little or no impact from the environment (e.g. PKD)
Linkage analysis can only be applied to Mendelian/Monogenic diseases
Non-Mendelian / Polygenic: diseases or traits caused by the impact of many different genes, each having only a small individual impact on the final condition (e.g. psoriasis)
Multifactorial: diseases or traits resulting from an interaction between multiple genes and often multiple environmental factors (e.g. heart disease)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is linkage analysis?

A

Linkage analysis is a method used to map the location of a disease gene in the genome
The term ‘linkage’ refers to the assumption of two things being physically linked to each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the principles of genetic linkage?

A

Genetic linkage is the tendency for alleles at neighbouring loci to segregate together at meiosis
Cross-overs are more likely to occur between loci separated by some distance than between loci close together on the chromosome
Therefore to be linked, two loci must lie very close together
A haplotype defines multiple alleles at linked loci. These chromosomal segments can be tracked through pedigrees and populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the method for detecting genetic linkage?

A

Genotype multiple genetic markers across the genome
Genotype multiple family members from families with the genetic trait
Identify which genetic markers co-segregate with the disease (phenotype)
(i.e. which haplotypes are the same in all affected family members)
These genetic markers are therefore ‘linked’ to the disease gene
–> This indicates where in the genome the disease gene is likely to be located
NB: further work is needed to identify the gene and disease-causing mutation!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two different types of genetic markers?

A

Microsatellite markers

Single nucleotide polymorphism (SNP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the features of microsatellite markers?

A

Less common now. Highly polymorphic short tandem repeats of 2 to 6 bp
Microsatellites may differ in length between chromosomes (heterozygous)
Are relatively widely spaced apart

To do a genome linkage analysis you'd be looking at:
400 (200) microsatellite markers
Average spacing 9 cM (20 cM)
PCR-based system
Fluorescently-labelled primers
Manual assignment of genotypes
Labour intensive
Whole genome scan >2-3 months
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the features of SNP markers?

A

Now the genetic marker of choice. Biallelic (a SNP will be one of two possible bases)
Lower heterozygosity than microsatellites, but spaced much closer together
More informative

To do a genome linkage analysis you'd be looking at:
~6,000 SNPs
Spaced throughout the genome
Microarray-based system
Genotypes assigned automatically
Highly automated
Data returned within <1-2 months
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is microsatellite genotyping used for?

A

Typically used for:
DNA fingerprinting from very small amounts of material
Standard test uses 13 core loci making the likelihood of a chance match 1 in three trillion
Paternity testing
Linkage analysis for disease gene identification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is fluorescent genotyping?

A

Fluorescently-tagged PCR primers
Allows for multiplexing of PCR products with different colours and fragment lengths
Fragment sizes separated down to 1bp resolution

Look at the 3 microsatellite genotypes on the left
Which alleles did the patient inherit from each parent?

ADD DIAGRAM

17
Q

What are SNP genotyping microarrays?

A

Provides genome-wide coverage of SNP markers
SNPs are proxy markers; NOT the causal disease variants
Can amplify thousands of markers in a single experiment
Alleles are identified by relative fluorescence
homozygous for allele 1 = green signal
homozygous for allele 2 = red signal
heterozygous (1/2) = yellow signal

18
Q

What are SNP genotyping microarrays used for?

A
Linkage analysis in families (affected vs unaffected relatives)
homozygosity mapping (autosomal recessive) and mapping of Mendelian traits
GWAS in populations (unrelated cases vs matched controls)
non-Mendelian disorders and multifactorial traits
19
Q

How does linkage mapping using genetic markers suggest genetic linkage?

A

Uses an observed locus (genetic marker) to draw inferences about an unobserved locus (disease gene)
If a marker is linked to a disease locus (i.e. M3 and M4), the same marker alleles will be inherited by two affected relatives more often than expected by chance
If the marker and the disease locus are unlinked (i.e. M5 – M8), the affected relatives in a family are less likely to inherit the same marker alleles

20
Q

How can you statistically analyse linkage?

A

The probability of linkage can be assessed using a LOD score
LOD = logarithm of the odds score
The higher the LOD score, the higher the likelihood of linkage
LOD scores can be calculated across the whole genome using genotype data for many genetic markers in multiple members of a family
Parametric analysis specifies the pedigree structure and inheritance pattern (model)
Non-parametric analysis detects allele sharing between affected individuals
LOD scores are additive – different families linked to the same disease locus will increase the overall score
A LOD score ≥ 3 is considered evidence for linkage
Equivalent to odds of 1000:1 that the observed linkage occurred by chance
Translates to a p-value of approximately 0.05
A LOD score ≤ -2 is considered evidence against linkage

21
Q

What is the difference between parametric and non-parametric analysis?

A

Parametric analysis:
- Specifies analysis parameters (e.g. inheritance pattern, disease allele frequency, penetrance)
Non-parametric analysis:
- No parameters specified
- Looks for allele sharing between affected individuals