Evolutionary Genetics Flashcards

1
Q

What is the goal of phylogenetics?

A
  • To infer evolutionary relationships between species

- Gives insight into timing and sequence of evolutionary events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some practical applications of phylogeny?

A
  1. Flu vaccines:
    - Phylogeny is used to predict the strains of flu that will cause the most disease in the upcoming flu season (the dominant cluster) and allowing this information to inform how the vaccine is produced
  2. Criminal cases of HIV transmission:
    - HIV is an extremely diverse virus genetically; there is even a lot of diversity within an individual patient as the virus can evolve and change
    - Using this diversity it can be proved if one individual gave another HIV (if genetic variance matches)
  3. Determining the origin of diseases:
    - Many emerging pathogens were originally only pathogens in certain non-human organisms
  4. Predicting the function of a gene:
    - When a gene is sequenced and its function is unknown phylogeny can be used to predict its function
    - If the unknown gene protein product is similar to the protein in other organisms (e.g. model organisms) we can hypothesise the unknown gene will have a similar function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a weighted vs unweighted tree?

A

Weighted Trees:

  • There is a different amount of evolutionary change across each of the lineages
  • The branch lengths indicate the amount of evolutionary chance along that lineage

Unweighted trees:
- Shows that lineages share a common ancestor but makes no assumptions regarding the rates of genetic evolution in the branches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What information do rooted trees give?

A
  • The species at the root is the most common recent ancestor (MCRA) for every OTU on the tree
  • You can easily map evolutionary time using fossil records onto these trees
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What data is used to build phylogenetic trees?

A
  1. Molecular sequence data:
    - The most common way is DNA sequence alignment
    - Amino acids of proteins can also be aligned (better if species are more distantly related)
  2. Gene content:
    - What genes an organism has/does not have (presence absence data)
  3. Genetic distance:
    - DNA-DNA hybridisation
    - Immunology
  4. Rare Traits:
    - Gene order
    - Introns
    - SINEs and LINEs
  5. Morphological Data:
    - Measure number of vertebrae etc.
    - Used less and less (unreliable)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the pairwise model of DNA change?

A
  • The simplest means of determining genetic distance = the proportion of differences between any two taxa
  • Calculated by: Number of distances seen/number of nucleotide sites examined
  • Integrated pairwise difference data can make a matrix
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the parameter model of DNA inheritance?

A
  • In this model there are two parameters:
    1. the rate of transitions (a) - changes between ‘like’ bases- between purines and purines
    2. The rate of transversions (B) - changes between ‘unlike’ bases- between purines and pyrimidines
  • Each axis gets its own probability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do we test the reliability of a phlyogenetic tree through bootstrapping?

A
  • If we have constructed a tree we can conduct a bootstrapping procedure to determine the believability and reliability of its nodes
  • This is done by making many (500+) new alignments using the original alignment (randomly sampling one site, then randomly sampling again- until you have the same number of sites as the original piece)
  • These new stretches of hypothetical DNA are constructed into pseudo-trees
  • You look for similarities between the pseduo-trees and the original tree
    e. g. If the original tree has a and b as most closely related, what proportion of the pseudo-trees have a and b as most closely related?
  • If 75% of trees from pseudo-samples show the same grouping of a and b, and c and d, both nodes would have the number 0.75 placed on them
  • This means the node is 75% believable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the features of morphological trait variation?

A
  • Variation is continuous and discordant
  • It is possible to cluster people on the basis of any one trait- but the resulting classification does not allow one to predict clustering for other traits
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What proportion of genetic variance is due to (continent) population dependent differences

A
  • Only 10-15%
    (85% of variance between individuals is random and occurs within populations)
  • This still corresponds to <300,000 variable sites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What genetic variations do we use to determine populations?

A
  1. SNPs:
    - Short nucleotide polymorphisms
    - Present in the human genome- not necessarily related to phenotype
    - Used as signals of relatedness and evolutionary history
    - Okay measure (small amount continent specific)
  2. Haplotypes:
    - Long sections of SNPs that group together (SNPs that are linked)
    - Another measure that can be used to determine relatedness
    - Best measure 29% are continent-specific
  3. CNV:
    - Copy number variation
    - Genes that we have multiple copies of in the genome
    - Not very useful (mutating too rapidly)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Is there more genetic variation in African or non-African populations?

A
  • When grouping populations there is the greatest amount of variation in African populations as these are the oldest, ancestral populations
  • There is a genetic bottle neck in populations that left Africa
  • This is because only a small subset of the African population migrated out of Africa so there is less genetic diversity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the international HapMap project?

A
  • Aimed to identify similarities and differences between humans
  • Used SNPs
  • Only sampled 270 from 4 populations (HGDP has better sampling)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Human Genome Diversity Project?

A
  • Similar to HapMap
  • Used more SNP loci
  • 1043 individuals from 51 populations (much better sampling)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do association studies work?

A
  • The idea is that: if there is a mutation that occurs and it is inherited into groups then these individuals may share a phenotype that relates to the genotype
  • The SNP does not cause the disease- rather it is located close (linked) to the gene that does e.g. a gene that when mutated causes heart disease
  • The SNP can be used as a signal, if individuals inherit the SNP then they most likely also inherit the particular gene mutation being studied that may cause elevated levels of cancer or heart disease etc.
  • If a SNP is found to correlate with a disease, the genes surrounding the SNP can be studied to determine if they are causative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some population specific diseases?

A

Cardiovascular diseases

Type 2 Diabetes

17
Q

What are the modern human groups we focus on in human evolution?

A
  1. Homo sapiens
  2. Homo neanderthalensis
  3. Denisovans
  4. Homo heidelbergensis
18
Q

Did homosapiens, neandethals and denisovans live at the same time?

A
  • Yes
  • Remains dating from the same time have been found in the same cave
  • Interbreeding occurred- modern humans have Neanderthal and Denisovan DNA within their genomes
19
Q

How is Neanderthal DNA sequenced?

What was found?

A
  • The genomes of these ancient hominins are difficult to sequence
  • They are very susceptible to contamination (most people working on sites have their genomes sequenced so they can be ruled out)
  • With more fossil finds we are getting more and more Neanderthal genomes
  • Neanderthal and Denisovan populations emerged from Homo heiderbergensis that migrated out of Africa- there was very little back migration; therefore African populations have very little of their DNA incorporated into their genomes
  • Also because Neanderthals and Denisovans split off together and earlier than modern Homo sapiens they are more related to each other
  • Modern Homo sapiens emerged in Africa separately and then another migration event out of Africa took place
  • Genotyping revealed Neanderthals had light skin and light eyes and often red hair
  • They lived in more Northern latitudes than Homo sapiens so were more adapted to cold climates
20
Q

What are two Neanderthal/Denisovan genes that have incorporated into the human genome?

A
  • FOXP2 gene: required for proper speech and language

- HLA Class I alleles