MT Flashcards

1
Q

What year was the Thomas et al. (?) paper published?

A

2003.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What was the main objective of the Thomas et al. (2003) paper?

A

To compare gene contigs from multiple species. Looking for multispecies conserved sequences (MCSs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the “BAC” in “BAC library” representing?

A

Bacterial artificial chromosome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What were the main findings of the Thomas et al. (2003) study into MCSs? Focus on human, chicken, and pufferfish.

A

The gene content and order was conserved in tetrapods, but the chicken seq. was 1/4 the length and the pufferfish seq. was even shorter, though they both contained all the same genes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Are orthologous regions in genomes all the same length?

A

No, they can vary while still remaining orthologous (conserved seq.s).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are paralogous genes?

A

Homologous genes in the same species that resulted from a duplication event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are orthologous genes?

A

Homologous genes in different species which resulted from speciation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are homologous genes?

A

Genes which derive from the same ancestral sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How did paralogous genes in humans and zebrafish complicate the Thomas et al. (2003) study?

A

An orthologous seq. in zebrafish overlapped with an unexpected gene in humans. That gene turned out to be paralogous to the CFTR region (that they were actually looking at).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What type of alignment allows you to find more distantly related seq.s?

A

Discontiguous BLAST.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Rodents are more closely related to humans than the ungulates are. Why then does the rodent seq. not align better with the human seq.?

A

Rodent seq.s have undergone neutral mutation at a more rapid rate than human or ungulate seq.s have.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a transposable element (TE)?

A

A seq. in the genome that transposes itself at then reinserts that copy somewhere else in the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can analysis of transposable elements (TEs) be used to determine relationship between species?

A

If 2 species have the same Te, it’s unlikely that they got it independently. So they must both have it from a common ancestor. More TEs in common = more closely related.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How did Nikaido et al. (2001) counter the common belief and show that sperm whales are actually in the same family as toothed whales rather than baleen whales?

A

By looking at the TEs that the whales had in common with these other groups!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What can phylogenetic tree branch lengths tell us about seq. divergence?

A

The longer the branch, the more change that seq. has undergone since the ancestral speciation event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What accounts for the length variations between the compared seq.s from different species?

A

The presence or absence of interspersed repeats.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What was the first method Thomas et al. (2003) used to confirm which sequences are more similar than would be expected by chance (MCSs)?

A

Compared neutral evolution at degenerate sites to 25bp regions across the seq. MCSs were anywhere that had > average conservation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What was the second method Thomas et al. (2003) used to confirm which sequences are more similar than would be expected by chance (MCSs)?

A

Mapped entire multi-species alignment onto a tree and picked the ones which minimized the # of substitutions (maximum parsimony).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What did the researchers conclude about changes to the MCSs?

A

That they were selected against (negative/purifying selection).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What parts of the genome are most likely to be conserved?

A

Exons and adjacent UTRs responsible for recruiting TFs and acting as regulatory elements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

If a study identifies a bunch of MSCs in introns with no known regulatory function, what should we conclude?

A

That these are somehow functional, probably involved in splicing and spacing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are 2 alternative hypotheses to the MCS theory?

A
  1. Some parts of the genome are more/less susceptible to mutation
  2. DNA repair efficiency varies by genome region
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is an ancestral repeat (AR)? How have these been used in evolutionary biology in the past?

A

A transposon/TE “fossil” that predates mammalian radiation. Largely nonfunctional, can be used to represent the background rate of neutral evolution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What did Bejerano et al. (2004) find by comparing the human, mouse, and rat genomes?

A

That there were regions of the genome (exonic and non-exonic) that are 100% identical in all three species (low odds that happens by chance).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

How many of the UCEs identified by Bejerano et al. (2004) corresponded to known coding seq.s?

A

111 of 481 (<25%!).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What was the function of the partly exonic UCEs Bejerano et al. (2004) identified?

A

Most encoded RNA binding proteins and proteins which were involved in splicing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Of the UCEs that Bejerano et al. (2004) identified in introns, where were the majority located in relation to known genes? What about the others?

A

Most were located within genes that regulate transcription (homeodomain-containing genes). ex: Hox and Pax.
Others were near annotated genes, and may have a regulatory role.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Where were the 3 longest UCEs that Bejerano et al. (2004) identified located?

A

In POLA introns. POLA encodes a subunit of DNA polymerase (very important).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Are UCEs present in all species? Elaborate.

A

Nope, they’re only present in amniotes (so not fish) and rare outside the vertebrates. Something must have happened to make them favourable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

At what rate do UCEs change?

A

1% per million years.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How are the UCEs in Pax2 and Pax5 related?

A

Both originated from a duplication in a common ancestor but are more closely related to orthologous UCEs than they are to each other. Since tetrapods and fish diverged they’ve stopped changing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

If you compare any 2 species with humans the UCEs you find might not all be the same. What does this tell you?

A

UCEs do occasionally go missing in one group or another. Also, any comparison with rodents is subject to their high rate of change.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What do Stephen et al. (2008) conclude about the origin of UCEs?

A

That they might have originated during the transition from marine to terrestrial living environments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What DNA seq. might we expect to find in a rock from Mars?

A

Only the most conserved, since any organism would have been isolated millions of years ago. So 16S and 23S rRNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What are 4 possible explanations for high gene variation between species?

A
  1. High mutation rate
  2. Gene conversion
  3. Frequency-dependent selection (favours rare alleles)
  4. Overdominant selection
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Why might the 6 major histocompatibility complex subunits be under selection for variability (overdominance)?

A

If it has more kinds of subunits then it can generate more kinds of proteins and be more likely to be compatible.

37
Q

How does the dN/dS ratio indicate selective pressures?

A

<1.0 is purifying, ~1.0 is neutral, >1.0 is positive selection.

38
Q

What is a substitution at a synonymous site?

A

A substitution at a part of the sequence which does not change the identity of the encoded AA.

39
Q

What is a substitution at a non-synonymous site?

A

A substitution at a part of the sequence which changes the identity of the encoded AA.

40
Q

How do we calculate the dN/dS ratio?

A

(# of ns subs / # of ns sites) / (# of s subs / # of s sites)

41
Q

What is one part of the genome that might be under positive selection rather than purifying selection?

A

The region encoding the MHC protein subunits (want high variability).

42
Q

What is the Jukes and Cantor correction actually attempting to correct for?

A

We can’t be certain that a given site hasn’t undergone multiple subs (“invisible”). We can try to account for these based on the amount of subs we do see.

43
Q

What did the paper by Hughes and Nei (1988) conclude about changes to the human MHC sequence?

A

That there was strong diverge in the antigen recognition sequence (ARS) but very little in other regions.

44
Q

If purifying selection acts on all positions what happens to the dN/dS ratio?

A

It is artificially closer to 1.0 than if purifying selection was only acting on the non-synonymous sites.

45
Q

What did the Resch et al. (?) paper show about selection at synonymous sites?

A

That 28% of genes experienced -ve selection, 12% experienced +ve, and the rest (60%) experienced neutral selection at syn sites.

46
Q

What is codon bias? What are the implications for postranslational modification?

A

Some codons are rarer than others and so slow the translation rate. This gives proteins more time to correctly fold.

47
Q

Give 1 reason why there might be selection at a synonymous site?

A

Some codons are more prevalent, so preferentially using them actually optimizes transcription. Can select for fast (rapid expression) or slow (correct folding) transcription.

48
Q

In general terms, how did the notothenioids develop a natural antifreeze that allows them to survive when other fishes blood freezes?

A

Gene coding for digestive enzyme diverged and gained a new function. First preventing the gut contents from freezing and then the whole body.

49
Q

What genetic event occurred in the ancestral trypsinogen gene of notothenioids to allow the AFGP to develop?

A

A duplication and then insertion of large repetitive sequence that became the polyprotein antifreeze sequence.

50
Q

What does a Blosum matrix tell us?

A

Which changes have taken place between evolutionarily divergent protein sequences.

51
Q

In “Blosum62”, what does the 62 represent?

A

The % identity used during calculation of the similarities between aligned sequences.

52
Q

What does the score in a Blosum matrix represent?

A

The frequency with with these AAs were aligned in a set of seq.s. > # indicates > frequency. 0 is just random.

53
Q

In a Blosum matrix, why do scores vary for perfect AA pairs?

A

Because for rare AAs the chance of it being aligned by chance is lower.

54
Q

What did Susumu Ohno (1970) famously write regarding evolution? Was he the first?

A

That new gene loci were required for evolution of new function. He unknowingly plagiarized a bunch of older people.

55
Q

When was the Lynch and Conery paper on gene duplication published?

A

2000.

56
Q

How can gene duplication be identified from looking for similarities at silent sites?

A

Multiple similarities at silent sites in several genes indicates that these genes were the result of a duplication event, since they all have the same amount of “wear” (car analogy).

57
Q

What conclusion did Lynch and Conery (2000) make about the fates of duplicated genes?

A

That genes are duplicated all the time but they have a very low survival rate.

58
Q

What conclusion did Lynch and Conery (2000) draw regarding dN/dS ratios of duplicated genes?

A

That initially the ratio is ~1.0, but that over time the ratio goes <1.0 and the genes undergo purifying selection.

59
Q

Define neofunctionalization of a duplicated gene. Give an example.

A

Duplicated genes get a whole new function! Very rare, ex: AFGP in notothenioids.

60
Q

Define subfunctionalization of a duplicated gene.

A

Duplicated gene splits ancestral function with other copy (share the load).

61
Q

Define Nonfuncitonalization of a duplicated gene.

A

Duplicated gene is lost. Selected against (most common).

62
Q

How might nonfunctionalization contribute to speciation?

A

A population with a duplicated gene splits. Each population loses different copies of that gene. If this happens enough the populations might become reproductively isolated.

63
Q

How can whole genome duplication be identified in the paralogs of a single species?

A

If all the paralogs have the same amount of “wear” (car analogy) then the whole genome likely got duplicated. The same proportion of changes.

64
Q

How can whole genome duplication (WGD) lead to speciation?

A

Duplication of hox genes can cause evolution of new/modified body plans (ex: invertebrates have 1 hox cluster but mammals have 4).

65
Q

How can we determine the age of whole genome duplication events?

A

Look for other species with or without the WGD and then narrow it down based on when these species diverged.

66
Q

What percentage of the ribosome is RNA (by mass)?

A

~60%!

67
Q

What is an operon?

A

A collection of genes that must be transcribed together because they have interdependent functions.

68
Q

What structure defines the end of an RNA transcript?

A

A hairpin.

69
Q

Why might it be advantageous to introduce a new hairpin in an RNA transcript?

A

If you want to end translation early in response to some changing condition.

70
Q

How can an antiterminator hairpin prevent termination in the presence of low tRNA for the currently desired AA?

A

It will pause the ribosome and give time for that tRNA to be found, rather than having the RNA read straight through to the stop codon.

71
Q

What is the function of iron response elements (IRE) in the body?

A

Control Fe2+ levels.

72
Q

What is the function of transferrin receptor? When is it upregulated?

A

Controls uptake of iron by the cell. Upregulated if iron levels are low to allow more uptake.

73
Q

What is the function of ferritin? When is it upregulated?

A

To bind potentially harmful free iron. Upregulated if iron levels are high to protect the cell.

74
Q

How does IRE-binding protein regulate ferritin/transferrin receptor production if iron levels are high?

A

Ferritin: Fe2+ binds IRE-bp, preventing it from binding and blocking production.
Transferrin R: Fe2+ binds ERE-bp, mRNA not stabilized, no production.

75
Q

How does IRE-binding protein regulate ferritin/transferrin receptor production if iron levels are low?

A

Ferritin: binds to hairpin before coding seq., preventing production.
Transferrin R: binds to hairpin after coding seq. stabilizing and allowing production.

76
Q

What does the IRE-bp bind in low iron? What about in high iron?

A

Low: binds to mRNA hairpins
High: binds to Fe2+

77
Q

Do most organisms have IREs? How are these conserved? Elaborate.

A

Yes, but not all the same ones. The loop bases are conserved but the stem bases are not.

78
Q

What are the main IREs in all vertebrates? What about in just mammals?

A

Vertebrates: Tfr, ferroportin
Mammals: DMT1

79
Q

What sequence is conserved in the loops of the IREs?

A

CAGUG.

80
Q

Do IREs regulate transcription or translation?

A

Translation! Act on the mRNAs.

81
Q

What parts of riboswitches are conserved?

A

The bases in the stems, but not the loops.

82
Q

What is a riboswitch?

A

An RNA sequence which binds to the metabolite and influences the secondary structure (forms hairpins etc.).

83
Q

How do riboswitches influence which genes get transcribed?

A

They bind under certain conditions and can prevent certain sequences from being spliced out. If this sequence is a STOP it prematurely ends termination.

84
Q

How many riboswitches have been identified? How many in eukaryotes?

A

~40, but only 1 in eukaryotes.

85
Q

Where would we expect to find riboswitches if we went looking for them?

A

Conserved sequences within introns.

86
Q

How can ribosomes almost act as transcription factors if we take into account the formation of antiterminator hairpins?

A

I will stall with low conc. of some tRNA which can then cause the transcription of more sequence coding for tRNA in response to low levels.

87
Q

How will dS or dN be influenced by gene duplication events?

A

A large number of paralogs will have approximately the same dS.

88
Q

How does polyploidy influence cell morphology? Why might this be desirable?

A

Increases nucleus and cell size. Agriculture favoirs polyploidy because it leads to larger fruit size.