4. Phylogenetics Flashcards
Cross-Over
- the idea of localising the disease gene is based on the event of ‘cross over’
- during the meiosis stage there is an exchange of genetic material between homologous chromosomes
- this results in exchange of genes, genetic recombination
Haplotypes Example
Outline
- two loci: A and B
- genotypes Af, Am, Bf, Bm where f indicates father and m indicates mother
- formed by haplotypes of two gametes AfBf from father and AmBm from mother
Haplotypes Example
Non-Recombinant
-after meiosis can have
AfBf and AmBm
-i.e. no recombination between the parental haplotypes
Haplotypes Example
Recombinant
- during meiosis if crossing over occurs, can get recombination between parental haplotypes:
e. g. AmBf and AfBm
Recombination Fraction
- usually denoted θ
- the probability that a gamete is recombinant with respect to the locus
Likelihood of Crossober
- for loci in different chromosomes, independent segregation insures that R and NR gametes are equally likely to occur, θ=1/2
- for loci in the same chromosome, separation of two paternal / maternal alleles requires the occurrence of crossover between the two loci
- the closer the two loci the less likely this is, θ<1/2
Linkage Definition
- two loci with a recombination fraction less than 1/2 are said to be in linkage
- the smaller the recombination fraction, the more tightly linked the two loci are
Morgans
-the genetic map distance between two loci is defined as the expected number of crossovers occurring between them on a single chromatid during meiosis, unit Morgans
1cM ~ 1 million bases
Map Functions
- a mathematical relationship that converts map distance (m) to recombination fraction θ is called a map function
- the function connects two key quantities; genetic distance and recombination probabilities
- the most famous map functions are Haldane and Kosambi
The Haldane Map Function
Description
- assumes that crossovers occur at random, independently of each other
- the occurrence between two loci is a Poisson process i.e. they are equally likely at any point between the loci and the number of crossovers between loci follows a Poisson distribution
The Haldane Map Function
Function
θ = [1-exp(-2m)]/2
-with inverse
m = -1/2 log(1-2θ)
The Kosambi Map Function
Description
-a generalisation of the Haldane function
The Kosambi Map Function
Function
m = 1/4 log{[1+2θ]/[1-2θ]}
-with inverse
θ = 1/2 [exp(4m)-1]/[exp(4m)+1]
Genetic Marker
Definition
- genetic variants with known DNA sequence and known location
- for the purpose of linkage analysis these markers need to be easily and reliably detectable
Microsatellites
-repeats of simple DNA sequences, e.g.:
…CACACACACAC…
SNPs
-single nucleotide polymorphisms, e.g.
…CTGGTAGCTA…
…CTGGCAGCTA…
Linakge Analysis
Description
-based on the event of crossover during meiosis
-need a known genetic marker to estimate the recombination fraction θ
-once we have θ^, can estimate the distance between the marker and location of interest
-if R and NR gametes in a random sample can be counted:
θ^ = #R / [#(R+NR)]
-a test for linkage simplifies further to testing:
Ho : θ = 1/2
vs
H1 : θ < 1/2
Identifying R and NR Gametes
- to identify which gametes are R and NR, we need to know their phases
- phase is the situation where one of the alleles in the disease gene is in the same strand as one alleles of the marker
- in lab conditions (with animals) this is easily achievable
- in a human population, we need a three generation pedigree to be able to know the phase with certainty
θ^ Estimate
θ^ = R/N when RN/2
-since θ>1/2 is inadmissable on biological grounds
θ^ Estimate
R < N/2
-then: θ^ = R/N -with approximate standard error: √[θ^(1-θ^)/N] -to test Ho:θ=1/2 vs H1:θ<1/2 can use a chi square test, likelihood ratio lest and LOD score
θ^ Estimate
Chi-Square Test
-under Ho, expected numbers of R and NR are both N/2
-test statistic:
T = [R-N/2]²/[N/2] + [N-R-N/2]²/[N/2]
= [N-2R]² /N
-a one-tailed test with 1DoF
-if R > N/2, T reassigned to 0 and conclude test is not significant
θ^ Estimate
Likelihood Ratio Test
L(θ) = θ^R [1-θ]^(N-R) -can take log for log likelihood l(θ) -likelihood ratio is: Λ(θ) = L(θ) / L(θ=1/2) -test statistic: X = 2logΛ = 2 [ l(θ) - l(θ=1/2)]
θ^ Estimate
LOD Score
Z(θ) = log_10_(Λ(θ))
-the conventional critical value for calling a test significant is Z≥3
Unknown Parental Haplotypes
-two possible phases for parent: AB / ab Ab / aB -a priori these are equally likely with probability 1/2 -so likelihood is: L(θ) = 0.5θ^4[1-θ]² + 0.5θ²[1-θ]^4 -can show MLE of θ is 1/2 -once θ^ is obtained, can use the same likelihood ratio test or LOD score as phase known pedigree -but NOT chi-square test
Model Free Linkage Analysis
Description
- does not depend on prior specification of a model of inheritance for the disease of interest
- genotype frequencies and penetrance need not be known in advance
- several methods:
- -affected sib pair test (ASP)
- -non-parametric linkage (NPL) score
- concepts of allele sharing are needed
Allele Sharing Between Individuals
- IBS and IBD are concepts of allele sharing between individuals
- allele sharing is comparing the DNA sequence or allele at the same locus between two individuals