Selection Flashcards

Question

What different ways can you test selection, and what tests are used for each of these methods?

Answer 1

- **Frequency/distribution tests** (mainly comparisons of theta) - Tajima's D test, Fu and Li statistic - **Haplotype diversity** - mismatch distribution - **Haplotype length based tests** - linkage disequilibrium - extended haplotype homozygosity test - **Codon-based tests** - dN/dS rations, MK test, HKA test

Answer 2

**Tajima's D is the difference between two two estimators of genetic diversity (theta):** - The **average pairwise differences (PI)** in a sample - The **number of segregating sites (Sn) divided by An** These are scaled so that they are expected to be the same in a neutrally evolving population of constant size

Answer 3

- If a sample has an excess of rare variants - theta > PI and so D<0 - this suggests positive selection or population growth - If a sample has an excess of intermediate frequency variants - PI > theta and D>0 - suggests balancing selection or population subdivision

Answer 4

**A comparison of the number of derived singleton mutations and the total number of derived nucleotide variants** - Similar concept to Tajimas D - Uses assumption that the expected number of derived mutations that are present only once in a sample (singletons), n is equal to theta in the neutral case

Answer 5

- Negative value indicates excess of singletons -similar to tajimas D - since selective sweeps tend to generate an excess of singletons - A positive value indicates lack of singletons - e.g., balancing selection - More sensitive than Tajimas D in the genetic sweep scenario

Answer 6

- Comparison of the number of expected haplotypes and the observed number of haplotypes given the number of segregating sites Extremes: - Two or few haplotypes; less haplotypes than number of segregating sites - balancing selection or population structure - Each segregating site defines a new haplotype - selective sweeps or population growth

Answer 7

- High under balancing selection - Low under selective sweeps - (or population growth)

Answer 8

- Spikey distribution = balancing selection or population structure - Modal distribution - possible positive selection or pop growth - Positive selection changes tree shape more than negative selection - Negative selection mostly removes individual terminal branches - Effect of positive selection is a reduction in number of lineages: one lineage is fixed, which then starts diversifying again - Similar to population bottleneck - but it only affects one length of sequence

Answer 9

**Measure and compare multiple sequences to find fraction of non-synonymous sites that are variable (dN) and also fraction of synonymous sites that are variable (dS)** - Main effect of purifying selection is to reduce genetic diversity - But selection only expected on non-synonymous sites - where mutation changes the protein - Synonymous sites should be neutral - codon bias excepted

Answer 10

- dN/dS = Ka/Ks = w - w = 1 - neutral evolution - w > 1 - positive selection - w < 1 - purifying selection

Answer 11

- Nearly all active genes in humans may show some evidence of purifying selection (w<1) - meaning most mutations to proteins sequences are deleterious - A few genes have w >1 - often associated with host pathogen-interactions - e.g., MHC class 1/class 2 - Genes mostly have a low w (around 0.1). An increase in the typical w for that gene could be due to positive selection or a weakening of selection - e.g., pseudogenes

Answer 12

- Major Histocompatility complex class 1 and 2 genes recognise foreign antigens and stimulate immune response - Within species loci typically show dN/dS ratio > 1 suggests **diversifying selection** driven by the need to recognise pathogens

Answer 13

- **Compares dN/dS within and among species to idetify +ve selection in related species lineages** - Test compares ratio of sites fixed by selection versus drift - Find ratio of non-synonymous to synonymous substitutions between and within species - Positive selection - ratio of non-synonymous to synonymous variation within species is lower than the ratio between species - Weakly deleterious mutations can reduce power of MK test - as causes +ve selection to be underestimated - Neutral: NI ~ 1 - Positive: NI << 1 - Balancing: NI >> 1

Answer 14

- FOXP2 - gene with functions strongly linked with cognition and complex and language - Highly conserved across mammals - Enard et al., 2002 - MK test - showed excess of non-synonymous mutations in FOXP2 in modern humans - driven by recent +ve selection - But - more recent study with larger sample sizes suggest this was driven by sample composition - no evidence for +ve selection - Atkinson et al., 2018 - But - high diversity of FOXP2 in echolocating bats - Wang et al 2007

Answer 15

- Similar to MK test - but compares rates of evolution between different loci - Within species diversity (polymorphism) should be correlated to inter-species diversity (divergence) in a neutrally evolving gene - unless selection activity - Requirement for sequencing from multiple loci and data from outgroup species - Based on comparisons of ratio of polymorphism with focal species to divergence with outgroup (rpd) - rpd - contant across neutrally evolving loci - Choose reference locus that is evolving neutrally - Lots of different test carried out on genes - refs in lecture

Answer 16

- Branch-site models detect positive selection that affects only certain sites on predefined lineages within a phylogenetic tree - Comparisons among species - identify non-synonymous mutatins unique to specific lineages (species) e.g., **Bowhead whales:** - Comparison of bowhead whale genome to other cetaceans and mammals - identified 14 genes which showed elevated rayes of evolution - many of these associated ewith cancer susceptibility and aging - ERCC1 - Bowheads are potentially the longest lived mammals with lifespans over 200 years

Selection Flashcards

(40 cards)