Lecture 23 (Part 2) – Base Composition Evolution Flashcards

1
Q

What do we mean by “base composition”?

A

The proportions of the four bases (A, C, G, and T/U) present in DNA or RNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is base composition usually expressed?

A

Usually expressed as the percentage of bases that are G and C - the GC content.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What would a GC content that is greater than 50% indicate?

A

Bias or skew in a certain direction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the typical range for GC content in vertebrates?

A

40-45%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the approximate GC content in humans?

A

41% approx.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Where do we see the most variation in GC content?

A

In unicellular organisms.
- Single-celled fungi
- Bacteria
- Archaea

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the two main hypotheses for variation in GC content?

A
  1. Mutationist - GC biases are just a reflection of mutation patterns.
  2. Selectionist - GC biases are an adaption to selective pressures.
    - In other words, the GC content can have a fitness impact and thus should be shaped by selection
    - “Optimal GC content”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the mutationist hypothesis argue?

A

That GC biases are just a reflection of mutational patterns.
1. Involves the propensity of different bases to spontaneously mutate.
- There is a universal bias towards AT pairs.
- This is related to the inherent chemical properties of the bases (e.g., cytosine prone to deamination which leads to GC to AT mutations)

  1. Also involves the fidelity and bias of DNA replication and repair machinery.
    - E.g., biased DNA mismatch repair systems
    - Can differ between species (e.g., GC biased gene conversions in diploids leads to an increase of GC alleles relative to AT)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the selectionist hypothesis argue?

A

GC biases are an adaption to selection pressures.
- GC-rich genomes are more thermodynamically stable (due to stacking interactions)
- UV radiation produces thymidine dimers, thus high amounts of T are more susceptible to damage, and GC-rich sequences are more protected
- Lifestyle influences may lead to selection on GC content (e.g., nitrogen-fixing and aerobic bacteria tend to have higher GC content)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a GC skew?

A

When the freq(G) is very different to the freq(C) on a single strand.
- GC skew = (G - C)/(G + C)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an AT skew?

A

When the freq(T) is very different to the freq(A) on a single strand.
- AT skew = (A - T)/(A + T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why do we observe an abrupt shift in the frequency of G and C at the origin of replication?

A

It is related to whether the strand is a leading strand or a lagging strand!
(See lecture 23 @16:30)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What bases do leading strands tend to be enriched for?

A

G and T.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What bases do lagging strands tend to be enriched for?

A

C and A.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How would the mutationist hypothesis explain the skew?

A

Asymmetrical mutation pressures for the leading and lagging strand.
- Difference in patterns of replication error
- E.g., Leading strand is more prone to cytosine deamination during replication because it is left as a single strand for longer (which would change AT to GC skews)

Evidence:
- The boundaries coincide with origin and terminus of replication (suggests a link with the replication process)
- Bias is stronger at third codon positions and in intergenic DNA (characteristic of mutation bias)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

On which strand is there more genes found?

A

More genes are found in the leading strand than in the lagging strand.
- The selectionist explanation for this is that it may prevent less “head on collisions” between replication and transcription machinery.

17
Q

What is another reason why asymmetric gene content between the leading and lagging strand may give us GC and AT skews?

A

Codon usage bias.

18
Q

What is meant by codon usage bias?

A

The non-random usage of synonymous codons (i.e., one codon being preferentially selected to code for an amino acid, for example CUG codon highly used to code for Leucine in E. coli)

19
Q

Is this preference for a particular codon due to mutation or selection?

A
  1. Mutation bias is a factor
    - AT rich genomes having a preference for codons ending in AT (and vice versa)
    - However, this is not always observed!
  2. Most agree that codon bias is due to selection
    - Two most popular selection hypotheses invoke “translational efficiency” and “translational accuracy”
    - Certain codons helping promote better translation?
20
Q

What is the translational efficiency hypothesis (in terms of codon bias)?

A

Using codons that are recognised by more abundant tRNAs will allow for faster translation and higher protein yield.

Evidence:
- The most highly expressed genes tend to show the highest level of codon bias

21
Q

What is the translational accuracy hypothesis (in terms of codon bias)?

A

Argues that rarer tRNAs are frequently mistranslated. Thus, using common ones prevents errors.

Evidence:
- Codons for highly conserved amino acids tend to show the most codon bias