Mark Isalan Flashcards

1
Q

General organisation of DNA in Eukaryotes?

A

Eukaryotic DNA is Heavily organized

DNA highly condensed to form chromosome –> organization driven by histone complexes which are positively charged so that they can interact with the negatively charged DNA

In the highly supercoiled form the DNA is not accessible to transcription factors and other enzymes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

General organisation of DNA in Bacteria?

A

Even though the Bacterial genome is less organised than eukaryotic DNA. bacterial genome is not organised randomly

DNA is still has heavily organised supercoiled circular nucleoids –> Refers to DNA is associated with several proteins such as Histone-like nucleoid structuring (H-NS) protein (positively charged) as well as other proteins such as transcription factors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

General Overview of DNA synthesis?

A

DNA synthesis - Occurs in 5’ to 3’ direction

Remember that DNA has polarity and is anti-parallel

5’ end has a free phosphate

3’ -OH has a free -OH

Synthesis of more DNA requires DNA polymerase (works in a 5’ to 3’ direction) which uses ATP, Mg++ and nucleotide triphosphates

During synthesis of the 3’ -OH performs a nucleophilic attack on the incoming deoxynucleotide triphosphate in order to add the base pair to the growing chain

Product - Add Nucleotide + Disphosphate released

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Is DNA a stable molecule?

A

No clear answer

  1. In one sense yes, as it can last for 100’s of years in permafrost –> providing readable DNA
  2. In another sense no, as DNA is always very active –> replication, transcription, unwinding –> resulting in many sources of damage and error.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are mutagens?

A

Mutagen –> Refers to a physical (i.e. U.V.) or chemical (i.e. Free radical species) agent that changes the genetic material

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How frequently is DNA damaged in one cell per day?

A

DNA is damaged approximately 10 000 times per cell per day –> all of which act as a source of potential diseases/disorders

Hence, this explains why the cell devotes a lot of energy to prevent DNA damage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Two main impacts of DNA damage?

A
  1. Block replication and/or transcription
  2. Cause alterations in the genetic code (mutation) –> impact the organism itself or offspring if germline cells are mutated.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the two main types of DNA damage?

A
  1. Chemical alteration to DNA –> which may be exogenous** (source of damage located outside the cell - environmental mutants such as UV radiation) or **endogenous (internal source of damage - internally generated damaging agents such as hydroxyl radicals – most common)
  2. Spontaneous damage to DNA –> DNA reacts with itself Includes deamination (losing an amine group) and depurination (losing a purine base) –> these changes tend to occur regardless of what you do, meaning they are inevitable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Outline how UV may lead to the formation of pyrimidine dimers (exogenous - chemical change).

A

Example of exogenous agent causing DNA damage

  1. UV light induces formation of pyrimidine dimers (T-T (most common) C-T and C-C) –> 2 adjacent pyrimidines are joined by a cyclobutane(4C) ring structure (Double C=C converted to single and remaining electron forms C-C bond with adjacent pyrimidine)
  2. Consequence - The bases no longer function as normal Watson and Crick base pairing –> no longer serve its function as a base - no information

This explains why Solar UV irradiation is the cause of most human skin cancer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Outline how DMS/EMS (Exogenous) causes DNA damage.

A

DMS/EMS both are alkylating carcinogens

Alkylation is the addition of methyl or ethyl groups to various positions on the DNA bases

For example,

Alkylation of the O6 position of guanine results in formation of O6-methylguanine –> this changes the W-C base pairing potential. Normal G forms three H-bonds but this no longer occurs when carbon 6 has become methylated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Is normal cellular DNA methylation a source of DNA damage?

A

No! Different to the alkylation due to DMS/EMS

Normal methylation from the cell is normally not carcinogenic

Takes place on CPG islands and effects the major groove whereas carcinogens disturb the Watson and Crick base pairing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Outline how benzo-(a)pyrene (exogenous agent) can damage DNA.

A

Many carcinogens (e.g.benzo-(a)pyrene) react with DNA bases, resulting in the addition of large bulky chemical groups to the DNA molecule

Result? –> messing up the normal W-C base pairing –> Base is misread by DNA polymerase (as thymine)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Are many carcinogens activated endogenously?

A

Yes!

Many carcinogens are activated endogenously, become free radicals, by reactions with cytochrome P450 enzymes

P450-important in clearance of compounds – ‘molecular dustbin’ –> may result in the formation of reactive intermediates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the two main types of Spontaneous Damage that occur in DNA?

A

A) Deamination of adenine, cytosine and guanine –> E.g. Amine to carbonyl –> change from hydrogen bonding potential (donor –> acceptor)

Deamination examples - Cytosine to Uracil and Adenine to Hypoxanthine –> changes the W-C base pairing potential.

(B) Depurination –> removal of purine group –> resulting from cleavage of the bond between the purine bases and deoxyribose, leaving an apurinic (AP) site in DNA.

Depurination - losing an entire base –> leaves sugar behind with empty –OH group –>

Note - same thing can happen with pyramidines (depyrimidination)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does changed W-C base pairing potential mean?

A

Basically, DNA damage leads to altered base pairing - chemical change to base results in difference base preference (Non-WC base pairing) - when it gets replicated the wrong complement base gets included.

Example

Adenine deamination changes the C-NH2 on carbon 6 to C=O, which changes the molecule from adenine to Hypoxanthine.

Hypoxanthine behaves more like guanine therefore it preferentially binds to cytosine.

Thus, in the short term Hypoxanthine does not bond well with Thymine creating a bulge in the DNA.

In the long term, during DNA replication hypoxanthine will bind to cytosine –> resulting in a permanent mutation which may result in deleterious effects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are common functional groups found in carcinogens?

A

Functional groups in carcinogens?

It can be a variety but all of them are reactive groups –> One common example would be aldehydes

Sometimes carcinogens aren’t even covalently reacting - e.g. intercalating agents sit in between the bases –> disrupting the DNA synthesis resulting in the wrong base being introduced –> e.g. ethidium bromide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two general types of DNA repair mechanisms?

A
  1. DIRECT REVERSAL of chemical reaction responsible for DNA damage (in-situ) –> more common in bacteria
  2. EXCISION REPAIR –> Removal of damaged bases, replacement with newly synthesised DNA more common than direct repair in humans

Why? Because humans often lack some of the enzymes required for direct reversal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Outline the direct reversal of U.V. DNA damage in E. Coli?

A
  1. DNA damage via U.V. radiation
  2. Pyrimidine dimer created (cyclobutene rings)
  3. Photoreactivating enzyme uses photoreactivation (process driven by light as energy source) to break open cyclobutene ring to restore normal bases

How does the photoreactivating enzyme do this?

  1. Enzyme has a chromophore that can absorb light 300nm-500nm
  2. Transfers the energy to a non-covalently bound FADH-, which then transfer an excited electron to the pyrimidine dimer –> in turn splitting the ring.
  3. The now pyrimidine anion transfer its electron to FADH. (Radical) making the DNA good as new.

Note - Occurs in E. coli, yeasts, some plant and animal cells but NOT in humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Outline how dealkylating enzymes can be used against the direct reversal of DNA damage.

A

Dealkylating enzymes –> removes alkylating group - enzyme has methyl acceptor group (–SH group)

  1. Enzyme comes in
  2. Finds bulge in DNA
  3. pulls of methyl group using –SH
  4. Base returns to its original form
  5. Enzyme needs to be regenerate using other proteins (reform –SH)

Specific Example

Alkylation - O6-methylguanine –> How can we fix?

  • Can be repaired by enzyme: O6-methylguanine methyltransferase which is widespread in prokaryotes and eukaryotes
  • Transfers the methyl group to its own cytosine residues -> reaction deactivates the protein which therefore cannot be strictly classified as an enzyme.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Types of Excision Repair?

A
  1. Base-excision repair –> Smallest type of excision - Base is removed leaving deoxyribose backbone intact so that another series of enzymes can add the correct base

2. Nucleotide-excision repair –> Nucleotide is removed leading to a gap in one strand (an oligonucleotide is usually removed) which is filled in by DNA polymerase

  1. Mismatch Repair –> Mechanisms for post DNA replication mismatches.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Outline how base excision repair is used to fix deamination of cytosine to uracil.

A

Base excision repair example - Convert Uracil back to Cytosine

  1. Uracil formed by deamination of cytosine, leads to a G:U mismatch –> results in DNA bulge
  2. Bulge is recognized by DNA glycosylase resulting in the bond between uracil and deoxyribose being cleaved by uracil DNA glycosylase –> leaves a sugar with no base attached in the DNA (an AP site)
  3. This site is recognized by AP endonuclease (cuts into intact DNA), which cleaves/nicks the DNA chain
  4. The remaining deoxyribose is removed by deoxyribose-phosphodiesterase –> leaves perfect gap for new nucleotide
  5. The resulting gap is filled by DNA polymerase and sealed by ligase - leads to incorporation of C opposite G.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Outline how nucleotide excision repair of thymine dimer (UV damage)

A

Nucleotide excision repair of TT dimer - Main mechanism in Humans

Note - This also occurs in many organisms or alternatively they use photo repair enzymes.

  1. Recognition of thymine dimers occurs by assembly of RPA, XPA and XPC-TFIIH at sites of damage - specificity is achieved mainly by the kinetic proofreading activity of TFIIH
  2. Helicase action by the XPD subunit of TFIIH generates a bubble around the dimer - creating the requisite DNA substrates for the structure-specific endonucleases XPF and XPG
  3. Cleave on both sides of the thymine dimer by 3’ and 5’ endonucleases creating nicks - XPF and XPG
  4. Oligonucleotide 24–32-nt in length (also known as the “canonical 30-mer”) dissociates from the duplex.
  5. The resulting gap is then filled by DNA polymerase and sealed by ligase (DNA pol I in E. coli/DNA pol β in human)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How does Nucleotide-excision repair differ between E. Coli and Eukaryotes?

A
  1. E. Coli - Catalysed by 3 gene products – uvrA, B, C
    a) UvrA recognises damaged DNA (Helix distortion),
    b) UvrB and UvrC (endonucleases) cleave at 3’ and 5’ sides, excise 12-13 bases oligonucleotide
    - UvrA, UvrB and UvrC form complex
    c) uvrD (Helicase II) binds to the oligonucleotide segment in order to displace it –> allowing for DNA pol and DNA ligase binding.
    - Mutations of these genes leads to high sensitivity to UV
  2. Eukaryotes - Catalysed by RAD gene products in yeast (7 different repair genes involved – highly conserved) –> analogous to humans

Mutations lead to xeroderma pigmentosum rare genetic disorder, affects 1:250,000 people, extreme sensitivity to UV light leading to skin cancers - all of which is due to the deficient ability to repair DNA by nucleotide-excision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

In Mismatch repair in E. coli, how does the E. Coli strand distinguish between parental and new synthesized DNA strand?

A

Main idea - Mismatch repair system detects and excises mismatched bases in newly replicated DNA

Important characteristic is that this system must distinguish parental strand from newly synthesised daughter strand because you want to change the base in the new strand (not change genetic code on the parental strand)

Different species use different methods to accomplish this…

In E. coli DNA is methylated by Dam methylase (Adds methyl group to Adenine in GATC) –> following replication the newly synthesised daughter strand will not be methylated resulting in hemi-methylated DNA –> so the cell can recognize any bulge on any unmethylated new DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Outline the key players in DNA mismatch repair in E. Coli

A

Mismatch repair system in E. Coli is called MutHLS

Three enzymes:

  1. MutS recognizes bulges
  2. MutL is a helicase that walks along DNA (looping it), until it finds a methyl group so it can recognizes parental from daughter,
  3. MutH is a endonuclease that cleaves and nicks the new strand.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How does this table below highlight the importance of DNA methylation on mismatch repair?

A

Dam methylation is essential for mismatched repair in E. Coli –> conditions for repair shown in table

Table highlights how methylation of the strands directs repair

What about the last two rows?

Used a virus (PhiX174) to probe this system

  1. This virus lacks GATC sites so it doesn’t get methylated or cleaved by enzymes in the E. Coli cell
  2. But when we introduce GATC sites we achieved directed repair of the new strand during replication
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Outline the different steps of the MutHLS Mismatch Repair Mechanism in E.Coli.

A

MutHLS Mismatch Repair Mechanism

  1. MutS – moves along the DNA strand until it recognizes a mismatch (bulge)
  2. Recruitment of MutL and MutH
    a) MutL (translocase) binds to MutS –> Translocase with ATPase activity and forms DNA loops, walking along DNA looking for hemi methylated Dam sites close to mismatch to allow for strand discrimination
    b) MutH (endonuclease) can then bind to MutL which can guide the cleavage on the opposite side of hemi-methylation (allows for discrimination between New and Old)

Note - Cut site can be away from the mismatch (12-13 nucleotides) either upstream or downstream –> doesn’t matter as the entire region is removed (stochastic) –> the presence of the MutHLS complex indicates the presence of a mismatch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Once a nick has been made by the MutHLS complex, how is the DNA region containing the mismatch removed?

A

Once the cut has been made exonucleases are used to excise the mismatched region –> the kind of exonuclease will depend on the nick is made upstream or downstream of the bulge.

Either end up with an exonuclease eating away at the DNA..

  1. 5’ to 3’ direction - Nick is made upsteam –> Exo 7 or Recj
  2. 3’ to 5’ - Nick is made downstream –> Exo 1

The gap that is created filled by DNA Pol III and sealed by ligase.

Note - Exonuclease is used as we are chewing away from an end of DNA (nick)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

In Mismatch repair in mammalian cells, how are the parental and daughter strands distinguished from eachother?

A

Instead of CH3 (methylation) groups you are looking for strand breaks

The newly replicated strand is distinguished from the parental strand because it contains strand breaks since eukaryotic DNA contains many replicons it will have many strand breaks due to…

  1. Leading stand –> no Okazaki fragments but there are multiple replication initiation points –> creating nicks that can be recognised at each replicon.
  2. Lagging strand –> presence of Okazaki fragments means that the daughter strand will have many breaks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

WHat is the equivalent of the MutHLS system in Eukaryotes?

A

Eukaryotes –> MSH complex responsible for mismatch repair

MutS

MutL

Helicase

Exonuclease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

In humans, what can mutations in hMsh2 and hMlh1 genes cause?

A

Mutations in hMsh2 and hMlh1 genes are a cause of inherited non-polyposis colorectal cancer

Affects 1:200 - Causes ~15% of UK colorectal cancers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What are the two ways that a double stranded break in DNA can be repaired?

A

When double stranded breaks occur we can have one of two things happen:

  1. Non-Homologous end joining - NHEJ
  2. Homologous Recombination - H.R.

Note – Normally Mismatch repair is not used as this happens after replication whereas H.R. happens during.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

How can double stranded breaks be created?

A

Double strand breaks can occur due to DNA damage

There are many factors that can cause a double stranded break (Physical/Chemical factors) - Frequently occur during DNA replication.

Note - Apart from Physical/chemical factors we also have nucleases that come in and cut DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What are the two common ways that double stranded breaks occur during DNA replication?

A
  1. DNA damage leading to a nick in the DNA backbone –> during DNA replication DNA polymerase action is forced to stop –> thus creating a gap or dsbreak in the newly synthesized daughter DNA

Image context - Remember that there is going to be another replication bubble which will replicate towards that gap

  1. DNA damage causes DNA lesions –> Results in blocking DNA polymerase during synthesis which will stall at the lesion –> We are left with a single stranded region that is more prone to breaking –> increasing the likelihood of a full dsDNA break
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Outline what NHEJ is?

A

DNA double stranded break –> What know?

One method of repairing the double stranded break –> Non-homologous end joining (Simplest Outcome)

NHEJ - When a series of proteins bind and stabilize the DNA ends and bring them together to allow them to rejoin and re-ligated.

More and more evidence is showing that bacteria are able to carry out such processes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Explain the step by step process by which NHEJ occurs.

A
  1. DNA double stranded break
  2. Ku (Ku70/K80 heterodimer) is a protein that binds to DNA double-strand break ends - high affinity for DNA ends - sequence independent since it binds to the sugar backbone - ring shaped structure that can accomodate DNA
    - Note that Ku is evolutionarily conserved from bacteria to humans
  3. Ku70/K80 heterodimer binding leads to the recruitment of DNA-PKcs - DNA-dependent protein kinase, catalytic subunit –> Activation of kinase activity –> Shown to lead to the formation of a synaptic complex that holds the two ends of the broken DNA molecule together
  4. Depending on the nature of the break, different DNA end processing enzymes may be required, including those that resect DNA ends, fill in gaps, remove blocking end groups, and make the ends ligatable.
    e. g. Artemis, PNKP, APLF, Polymerases μ and λ, etc.
  5. The final step in the repair of a DSB is ligation of the broken ends by DNA Ligase IV
  6. Dissolution of the NHEJ complex

(Anthony J. Davis and David J. Chen, 2013)

5.

  1. S
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Is NHEJ an error prone process?

A

Yes, it is an error prone process

Quite often we end up with flaps that associate incorrectly due to microhomologies (as shown) or the ends fold over forming hairpin or even sometimes reverse Hoogsteen base pairing.

e.g. GGG/G forming triplex and quadruplex structures –> source of slippage when DNA polymerases passes - resulting in base deletion (extra flaps are resolved/removed) or addition

One would have to predict knockouts by examining on a case by case basis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Pros and Cons of NHEJ high error rate?

A

Good - or evolution - Increase diversity

Bad - Mutations may reduce the fitness of the organism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What cells in the body exploit the high error rate of NHEJ?

A

Immune system - antibody production

Some cells exploit this high mutation rate –> cells involved in antibody production use (Known as VDJ recombination) –> increase the rate of mutation in antibody genes - creates new mutants - makes us more resilient to invaders

Note - Modern genome editing technology (e.g. CRISPR/Cas9) to make targeted mutations and knockouts  NHEJ is used after CRISPR makes a break in the DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Outline how homologous recombination can be used to repair ds DNA breaks?

A

Homologous Recombination (HR) –> refers to the exchange of DNA with homologous DNA - can be used to repair double stranded DNA breaks.

Classic way of looking at it is homologous recombination between chromosome in meiosis

BUT!

It can also take place in the cell between homologous DNA - remember we have two copes!

Idea - we are missing a piece of DNA –> Where can we get the correct DNA? Homologous DNA acts as a template

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Outline the two main ways in which HR is used to fix dsDNA breaks.

A

H.R. essentially lines up homologous DNA (readily found at the replication fork and swaps it between strands - relatively error free.

Variations of homologous recombination can occur to maintain the replication fork but two examples are…

  1. Fork regression –> lesion blocks polymerase –> end up getting the single strands coming off and priming on each other –> creating extra copies of homologous DNA
  2. Strand invasion –> Blue/Red strands are complementary to opposite strands so they can act as primers –> meaning that one strand can rip off and act as a primer for the other strand
    - So homologous strand invades and acts as a template for missing region
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Outline the general use of HR?

A

Basically, H.R provides a general mechanism for repair where intramolecular template information has been lost

Used in a variety of situations…

  1. Double strand breaks –> most common
  2. Lesions bypassed during replication - Repaired
  3. In the Long term - source of evolution –> due to imperfect recombination leading to parts of the genome being duplicated/deleted.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Outline the general process of Homologus recombination

A

Blue - Parental / Purple - Daughter

  1. Chemical lesion in one parental strand –> results in missing information during replication as lesion is not copied –> gap has been created which means that informarion has to be retrieved from homologous DNA.
  2. Break open the homologous strand
  3. Reform phosphodiester bond so that the genetic information is swapped (Homologous recombination)
  4. Anys gaps are then filled in with DNA polymerase and Ligase.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Outline the different possible HR combinations that are possible.

A

H.R. can occur in a variety of ways but involves the same basic processes –> as long as there is complementarity (5’to 3’ or 3’to 5’)

  1. Single crossover –> large segments swapped
  2. Double Crossover –> everything between the crossover is swapped.
  3. Intramolecular (within the same strand)
    a) Direct repeats - Deleting everything between the two repeats
    b) Inverted repeats - Invert the red region –> red region changes orientation relative to surrounding DNA
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Outline the different steps in the E. coli RecBCD pathway.

A
  1. Ds Break
  2. Eaten away by exonuclease RecBCD (5’ to 3’) until you reach chi site –> this leaves 3’ overhang
  3. RecA binds and stabilizes 3’ overhang
  4. RecA invades homologous DNA in genome –> strand invasion allows it to base pair with the homologous DNA - displacing the other strand forming a D loop.
  5. Once homology has been found - you have to nick and cut homologous strand using enzyme (allow for strand exchange)
  6. Fill in and ligate the nicks that have been formed - swapping the strands
  7. End up with 2 homologous strand that are paired up with a cross over between them which is known as a holliday junction (unstable structure that can migrate)
  8. Holliday junction can migrate (Branch migration driven by molecular motors) - when this happens the DNA is swapped –> swapping genetic info –> fundamental mechanism of HR
  9. Once missing DNA is exchanged - Holliday junction cut and re-ligated –> Known as resolution (random process – endonucleases that come in and cut the crossover region)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Further elborate how RecA binds and facilitates strand invasion for HR?

A

Homology search and strand invasion - RecA protein structure carriers this out

  1. It can bind to single stranded DNA (3’ overhang)
  2. Not 100% understood how it mediates strand invasion but we know that the first step is triplet formation.
  3. For triplet formation - Homologous DNA needs to be unwound –> short stretches doesn’t require much energy but longer stretches do require ATP hydrolysis.
  4. RecA coats DNA but allows enough space so that the DNA bind to it complementary homologous DNA
  5. Once homology is found, homologous DNA is nicked and ligated –> forming holiday junction –> allows for branch migration and geentic information exchange

Note - Helical nature of RecA filament means that it can form a triplex structure with a homologous DNA duplex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Is the RecBCD pathway used for single and double crossover?

A

Yes, it can be used for either single or double crossover

Note - double crossover is possible but hasn’t been studied as much

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Explain what a Holliday junction is?

A

Holliday junction

During strand invasion, the invaded duplex is nicked which creates 2 x 3’ –OH strand which can then recombine with the opposite 5’ to 3’ homologous strand

This junction between strand (crossover) is known as the Holliday junction

Junction is a dynamic structure - moves along DNA unwinding in one direction and reforming in the other.

In the middle the DNA is not base-paired as they are being exchanged

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What proteins are present to stabilize the Holliday junction?

A

The Holliday junction stabilized by recombination proteins (RuvA/B) which are also essential to drive branch migration  which promotes exchange of strands.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Outline the what exactly RuvA and RuvB are and their specific roles.

A

We know that both RuvA and RuvB stabilize the junction but more specifically…

  1. RuvA - flat structure that binds the 4-way Holliday junction –> A hydrophobic ‘pin’ in the middle helps to separate the strands as it is a lubricating hydrophobic pin in the middle as the DNA backbone is hydrophilic (charge) - Energy dependent process (ATP)
  2. The RuvB ‘motors’ bind either side of the RuvA complex and uses ATP to translocate DNA, opens it up and thus allows genetic information to be swapped.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

How much does the Holliday junction migrate?

A

How much migration? –> random/stochastic process – creates genetic variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What happens after branch migration? How do we resolve the structure?

A

Resolution of the holliday junction refers to the removal of holliday junciton

In order to return to original state –> There are two options..

Cut Horizontally or Vertically –> leads to two outcomes..

Horizontal cut –> also known as gene conversion (one gene jumps from one to another) - patches

Vertical cut –> Also known as crossover - information from one chromosome is swapped with the other chromosome – splice

Check

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What is the role of RuvC?

A

RuvC is the endonuclease in Bacteria - other organism have different endonucleases.

RuvC is responsible for the cleavage of the holliday junction

Position of RuvC binding and cleavage delineates/decides the outcome –> It is not known how this is coordinated between the patch and splice variations –> we think it’s a random process but there is a difference between different cell types.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What does a heteroduplex refer to?

A

After cuttting the holliday junction we get Heteroduplexes (DNA that has inherited DNA from other DNA regions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Is the RecBCD (HR - Strand invasion) prone to errors?

A

Process is prone to getting bulge/mismatches –> mismatch repair is used to repair any errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

Generally speaking, what important processes is H.R involved in?

A
  1. Contributes to much of the variation in offspring –> meiosis - gene shuffling –> Scrambles the genes of maternal and paternal chromosomes leading to non-parental combinations
  2. Forms physical links between homologous chromosomes to allow chromosome alignment during meiotic prophase
  3. Evolution – horizontal gene transfer –> i.e. bacteria taking in foreign DNA
  4. Important in DNA repair –> Alternative way to bypass lesion
  5. Exploited in biotechnology: genome editing with CRISPR/Cas

Note - H.R. is conserved in almost every single cell –> exceptions very minimal organisms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Does the meiotic recombination in yeast follow a similar recombination mechanism as seen in E. Coli (RecBCD)?

A

Yes!

Note that this time we get a double stranded break allowing for two overhangs to invade the homologous DNA

Forms a Double crossover during Meiosis since there are two branch points

Used to create offspring with different DNA combinations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Do H.R. and mismatch repair occur in isolation?

A

No! All these mechanisms of DNA repair may occur simultaneously

However, whether or not they do occur we can end up with different outcomes as shown in the attached image

Hypothetical…

Following recombination there is a possibility of a mismatch, two situations…

  1. Mismatch repair is repaired –> DNA sequence is restored to parental sequence - Homozygous ‘A’
  2. Not repaired (happens by random) –> Changes will persist and following recombination will be inherited by one of the daughter progeny –> Heterozygous
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What is site specific recombination? How can it occur?

A

Specific recombination mediated by viral elements - Similar mechanism to normal H.R. –> uses enzymes called recombinases

Recombinases recognize sequence specific targets that are homologous and allows for recombination.

There are two main outcomes…

  1. Inverted repeat changes orientation
  2. Insertion or deletion which either adds/removes DNA.

Result of reaction is dependent on the orientation of the repeat sites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Difference between site specific and general HR?

A
  1. Site specific Homologous DNA region that is targeted is very short i.e. 10/20 B.P –> Enzymes only recognize these specific homolgous sequences
  2. H.R. targets larger regions –> Basically you don’t need specifc homologous regions –> Enzymes function more generally
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What are integrases?

A

Integrases are an important class of transposition enzymes - Which allow for Site specific recombination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Outline how a Lambda phage integrase (Int) leads to the integration of a plasmid into the E. Coli Genome.

A
  1. Site specific Att sites (homologous regions) POP’ (phage) and BOB’ (Bacteria) line up with Lambda phage integrase

Note the region that is Homologous is ‘O’ - present in the middle of the Att sites.

  1. Lambda phage integrase allows for the crossover the of DNA
  2. Leads to the insertion of the Phage plamsid

Apart from the integrase itself - The process also requires some enzymes from the host cell to complete the process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Does Lambda phage integrase require energy to perform site-specific recombination?

A

No!

No ATP hydrolysis needed as energy is transduced using tyrosine in the enzyme active site –> traps energy in a high energy phosphate intermediate –> drives final crossover

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Outline the more precise mechanism behind Integrase Enzyme used for site specific recombination.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

Are there many different kinds of integrases and recombinases?

A

Yes!

Each recognizing specific homologous sequences

Example - Cre recombinase from yeast

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What are Transposons?

A

Transposable genetic elements –> Transposons (‘jumping genes’) are mobile genetic elements that move randomly around the genome

Sequence homology is not required - Known as ‘selfish DNA’

Different types:

  1. Simple transposons contain only the genes required for their transposition (transposases) –> like mini viral like elements
  2. Complex transposons carry other genetic information e.g. antibiotic resistance

Note - Transposases similar to recombinases - cuts and swaps DNA around.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

General overview of the Bacterial genome structure?

A
  1. Bacterial genomes are parsimonious (efficient)
  2. Organized into operons
  3. No gaps - No introns
  4. Regulation/co-regulation - highly organized

Why?

Explained by increased rate of evolution –> due to their short life cycle (20 min) - allows for genome to evolve quicker –> optimizing it

68
Q

How do eukaryotic genome sizes compare to bacterial?

A

Eukaryptic genome tends to be a lot Bigger than the Bacterial genome

Humans - About 3,234 Mb in length

E. coli has roughly 3 Mbp (1Mbp – 1 million base pairs)

On average - Eukaryotic is 1000x bigger than E.Coli

Generally, eukaryotic genome is considered big and ugly  making it much harder to understand

69
Q

How has the number of estimated protein coding genes changed in the human genome?

A

Upon intially sequencing there was thought to be 20,000 protein coding genes

But…

Nowadays, there is thought to be 18,000 protein coding genes to create human –> as many genes turn out to be pseudogenes (aren’t expressed and made)

70
Q

Why is it hard to estimate the number of genes?

A
  1. Small genes can be hard to locate.
  2. Rarely-expressed genes are hard to detect via Expressed Sequence Tags –> basically we normally examine RNA transcripts to identify genes but it the gene is rarely expressed this makes the gene hard to detect.
  3. It turns out that many genes produce functional RNAs instead not proteins
  4. Gene density surprisingly low resulting in large number of gaps –> many gaps have unknown function
  5. Alternative splicing gives protein diversity rather than raw open reading frame number (i.e. like in E. Coli) —> more difficult to determine the different splice variants
71
Q

If Protein coding genes are not as common what is the rest of the genome made of?

A

Non-coding repeats make up at least 50% of the human genome

72
Q

What does a karyogram of a human genome show us?

A

Karyogram shows chromosomes - all 23 pairs (Mother + father) - diploid

  1. Ordered in size
  2. Shows banding - known as G-banding –> Tryptic digest then Giemsa stain
    a) Dark bands correspond with AT rich regions
    b) Light bands correspond with GC rich regions

Banding pattern can be used to…

  • identify unique patterns of bands on each chromosome, which can help…
  • identify chromosomes
  • diagnose chromosomal rearrangements
73
Q

What do sister chromatids and centromere refer to?

A

During prophase… replicated DNA comes together to form chromosome…

Which is formed of two sister chromatids held together at a constriction point called the centromere.

74
Q

What does chromosome number tell us? Is there any correlation we can draw?

A

No, Not really –> chromosome number has nothing to do with the complexity of the organism –> There is no particular reason for 23 Chromosomes, we assume that the numbers are simply evolutionary accidents

One pattern

  • More DNA in cell –> correlated with bigger cell –> allows for the production of a more robust/metabolically active cell
  • Prokaryotic cells normally have a smaller genome –> E. Coli 3 Mbp
75
Q

What do B chromosomes, Holocentric chromosomes and Extrachromosomal DNA refer to?

Other physical structures

A
  1. B chromosomes are extra (supernumerary) chromosomes to the standard complement that occur in many organisms - not essential for life

Originate in a number of ways including derivation from autosomes and sex chromosomes in intra- and interspecies crosses - Example in deer.

  1. Holocentric chromosomes – the entire chromosome acts as a centromere. Best known example C. elegans
  2. Extrachromosomal DNA  Plasmids  Organelle DNA in mitochondria/chloroplasts  important for function
76
Q

Do organelles have genomes?

A

Yes - Mitochondria & Chloroplast genomes

  1. Mitochondrial DNA (all eukaryotes) –> Covalently closed circular DNA (rarely linear, e.g. Chlamydomonas)

In humans: 16,569 bp, only 37 genes with only 13 protein ORFs.

  1. Chloroplast genome (oxygenic phototrophs) - Typically 120 – 170 kbp

Single closed circular DNA (rare exceptions), typically codes around 100 proteins, mainly to do with maintenance of chloro and photosynthesis.

How did they arise?

Mitochondria/chloroplasts –> evolved from parasitic and symbiotic bacteria that invaded eukaryotic cells. Hence, explaining their own genome. Overtime, they became better symbiotes and lost most of their genome.

Note: Genes that are lost from the organelle genomes either have been completely lost or migrated to host genome.

77
Q

Are there trends that can be seen on the chromosomal level in terms of gene distribution? What specific terminlogy is used to help describe the layout of genes on chromosomes

A

Generally speaking, the gene distribution is uneven and differ between eukaryotes

But!

Generally speaking, we observe a lower density noticeable around centromeres (less dense)

Terminlogy

  1. Gene-rich regions – High density
  2. Gene Deserts – Low density
  3. Multi Gene Families – Genes that are related occurring in more than one place (10 different places)
  4. Gene Superfamilies – 100/1000s of copies of a particular gene spread across the chromosome i.e. zinc fingers
78
Q

Is there a link between gene density and whether a eukaryote is “complex” or not?

A

One observation - Gene Density is possibly lower in more “complex” eukaryotes

Why?

This might be due to more complicated eukaryotes with a bigger body plan –> need more shuffling around so they need more space to do so –> However, there is no strong statistical data

What evidence is there to support this?

a) Introns etc. - “Simpler” eukaryotes have fewer introns e.g. yeast. But no eukaryote has NO introns.
b) Many more repeat regions in “complex” eukaryotes

79
Q

Why is it highly debated to call an organims more “complex” or “simple”?

A

Depends on your definition on complexity

It is hard to classify organisms as “simpler” vs. more “complex” –> debated topic as one could argue that yeast is more complex as it has a more organised/highly evolved genome

80
Q

Example of gene-rich region in the human genome?

A

Major histocompatibility complex – part of the immune system - codes for cell surface receptor.

A 700 kb class III region of MHC located on chromosome 6 which contains 60 genes in 700 Kb + 1 pseudogene (missing start codon).

This region also has very high GC content (54%) relative to the 41% over the whole genome.

81
Q

Exact definition of a gene-desert?

A

Some parts are Gene-deserts –> Defined as 1 Mb with no genes

Using this definition there are 82 deserts identified (3% of genome, 144 Mb) - Largest is 5.1 Mb.

82
Q

Different possible explanations for the high degree of DNA in the human genome that does not contain genes?

A

25% genome (500,000kb) has no genes.

  1. Regulatory regions
  2. Gene deserts be present for an evolutionary purpose - drive gene exchange via H.R in order to move one gene closer to another
  3. Could these gaps contain big genes (very large genes that span over large areas) - Dystrophin (spans 2.3Mb) –> big genes have slow synthesis - hard to detect
83
Q

What proportion of the DNA in the human genome is associated are protein coding genes/associated with genes vs. not associated with genes (intergenic)?

A

Genes or gene-related sequences –> Less than half (38%) of the genome has DNA associated with genes

Integenic (Between genes - i.e. not genes associated) –> 62 % of the human genome is made of intergenic region which is classified into two categories:

  1. Unique
  2. Repeats
84
Q

What two categories can intergenic DNA be divided into?.

A
  1. Unique - microsatellites (still considered a repeat - number of repeats differs between people though) as well unique bigger repeats
  2. Repeated - found all over –> include selfish elements - Lines, SINEs, LTR and transposons - selfish elements acting like viral elements.

Non-coding repeats making up to 50% of the human genome.

85
Q

What are characterisitics of Repeated Intergenic Regions?

A
  1. Has a different GC content than the rest of the genome (lower)
  2. Present in all eukaryotes
  3. Normally categorized by length
  4. Can take up roughly 10% up to 50% of genome
  5. No coding function
  6. Can be separated by CsCl centrifugation due to different physical properties
  7. Potentual roles - Some may play a structural role:
    a) Centromere
    b) Telomere - regulating mortality of cell  In Vertebrates TTAGGG ~ 2500 times in humans
86
Q

What is the simple/general way of dividing repeated intergenic regions into?

What is the alternative 5 group subdivision of repeat intergenic regions?

A

General way classificaton of intergenic repeats…

  1. Tandem Back to back next to each other which are normally caused by slippages by DNA poly
  2. Interspersed –> spread around - separated by big distances normally caused by jumping genes.

But….

Some researches classify into 5 categories:

  1. Transposon derived repeats (45% of genome) - enzyme associated that allow them to propagate.
    2) Inactive gene copies (processed pseudogenes) - RNAs are reincorporated - no regulatory sequence.
    3) Simple sequence repeats (2 to 5 nucleotides)
    4) Segmental Duplications (5% of genome): Inter- & Intrachromosomal - large chromosomal segments
    5) Repeated structural sequences (telomeres, centromeres etc)
87
Q

What is Satellite DNA?

A

Example of Repeated Intergenic Regions in Human DNA

  • No coding function
  • Often found in the heterochromatin in centromeres (tightly packed, mainly repeats)

Examples:

a) α-satellite family –> 171 bp repeat unit (alphoid DNA, mainly in centromeres).
b) β-satellite family –> 68 bp repeat unit interspersed with 3.3 kb repeat (including pseudogenes)

88
Q

How is the idea of Satellite DNA further broken down?

A

Tandemly repeated DNA –> Also known ‘Variable Number Tandem Repeats’ which can be classified into two categories:

  1. Minisatellite - 10 to 100 bp, Form clusters up to 20 Kb, Associated with structural features: centromeres.
  2. Microsatellite - Even smaller - Can be called ‘Simple Tandem Repeats’/’Simple Sequence DNA’, found in Telomeres (5’-TTAGGG-3’), typically <13 bp in length (cluster short < 150 bp), Interspersed with non-repetitive DNA.

Very common forms are Dinucleotide Repeats - 140,000 versions in the genome (chromosome 12: CACACACACACACACACACACACA)

89
Q

Why does Satellite DNA form?

A
  1. DNA replication

Both Microsatellites & Minisatellites are unstable - mutations in DNA repair systems can permit expansion which is a common occurrence in cancer cells.

Why is it unstable –> due to the way that DNA is replicated

During DNA polymerase action we can get mistakes –> priming further back add copies whereas priming further forward removes copies

  1. DNA recombination

Unequal Crossing Over (prophase I of meiosis) –> homologous DNA incorrect prime on eachother

Take away message - You get different number of repeats due to processes such H.R. and replication errors.

90
Q

How is satellite DNA used in forsenics?

A

The number of repeats vary from human-to-human and are useful in DNA-fingerprinting –> no 2 humans have the same number of VNTR in one region

91
Q

What are some examples of genome wide repeats?

A

Genome Wide Repeats - Transposon derived repeats

Characteristics: more than 100bp (some > 1 kb), Moderately repetitive (< 106 per haploid human genome), sometimes occur as: closely spaced clusters or as dispersed single copies, most copies are in intergenic regions whereas some are in introns.

Transposon-derived repeats which can be divided into 4 classes:

  1. Long Interspersed Nuclear Elements (LINES) - RNA based
  2. Short Interspersed Nuclear Elements (SINES) - RNA based
  3. Long Terminal Repeat (LTR) Retrotransposons - RNA based
  4. DNA transposons - DNA based

Really common - 42 to 45% of Human Genome = 3 types of old retrotransposons

92
Q

Outline the process of DNA transposition.

A

First discovered in Maize by Barbara McClintock

Different to other genome wide repeats - No RNA intermediate

93
Q

Is DNA transposition mutagenic in nature?

A

Yes, DNA transposition is Mutagenic in nature

Its mutagenic cause it causes double stranded breaks and creates mutations at insert site - disrupts genes

94
Q

Outline the process of LINES are created.

A
95
Q

Outline the process by which SINES are created

A
96
Q

What are the two other interspersed repeats?

Hint - Associated with Retroviruses

A

1. LTR retrotransposons - Fossil of Viruses (Retero viruses)

No recent evidence of transposition

They have viral elements like elements - long terminal repeat

Possible that they propagate in the presence of other retroviral infection - take advantage

  1. HERV (human endogenous retrovirus) - inactive and more complete retrovirus in the genome

Humans have evolved the KRAB domains that are transcription repressors in order to silence these ERV genes

97
Q

Is the phenomenon of Junk DNA true? Why would the cell hang on to all of this non-coding DNA?

A

Junk DNA? - Not true it is not Junk!

If it were junk then we wouldn’t have it because it simply becomes a metabolic burden –> rate of propagation counteracts rate of elimination - select for it

Maintenance of this DNA suggest value, for example:

  1. transposition an engine of evolutionary change? - mutations
  2. Discovering roles for non-coding DNA all the time
  3. Micro-RNAs/long intervening non-coding RNAs - control on RNA level.
  4. ENCODE shows nearly all genome is transcribed - so it makes sense that they have function.
98
Q

How can genomes acquire new genes?

Hint - Three ways

A
  1. Horizontal - genes jumping from other species/organisms (non-offspring)
  2. Exon shuffling - different splicing combinations
  3. Duplication and divergence - Low chance of gene duplication (1 % chance for 1 gene in 1 million years)
99
Q

How does gene duplication result in the a genome acquiring a new gene with a new function - what normally happens?

A

How gene duplication can lead to a new gene…

  1. Ancestral gene has promiscuous activity (i.e. main reaction + side reaction which has no biological function)
  2. Overtime the side reaction which had no initial function gains a function (bifunctional)
  3. We then get gene duplication and divergence - each gene carry out one of the functions from the bifunctional gene –> Results in increased fitness
100
Q

What are the 5 ways in which we get DNA duplication?

A
  1. Recombination - Unequal Crossing Over in meiosis between homologous chromosomes
  2. Recombination - Unequal Crossing Over between sister chromatids in mitosis or meiosis
  3. DNA Amplification during replication - Type of errors that occurs in replication
  4. Replication slippage - Smallest type of error creating microsatellites
  5. Retro-transposition - RNA transposon –> gene gets transcribed, reverse trasncribed into cDNA and inserted back into the genome
101
Q

Outline how recombination - unequal crossing over leads to DNA duplication

A

Remember there are two types of recombination

  • Between Homologous chromosomes in meiosis (they don’t pair up during mitosis - no formation of bivalent pairs)
    or. ..
  • Between sister chromatids in mitosis & meiosis

In both cases…

Unequal crossing occurs when similar sequences that are not in the same locus as each other on each chromosome undergo recombination.

e.g. repeat sequences that don’t align up properly which is then followed by recombination - resulting in duplication and deletion.

102
Q

Outline how DNA Amplification during replication leads to DNA duplication?

A

Error that occurs in the replication Bubble during DNA replication

DNA recombination with the misaligned repeat regions leading to duplication and deletion - daughter cell will inherit 2 copies or no copies

103
Q

Outline how Replication slippage leads to DNA duplication.

A

Replication slippage - Smallest type of error creating microsatellites

  • Random process - replication is a dynamic process/enzymes are not always present at the stand (they detach)
    1. Base pairing breaks/melts due to ambient temperature (bound to happen as its in a dynamic equilibrium)
    2. DNA loops out but primes/attaches at the wrong base
    3. Polymerase reattaches and starts adding bases - one strand will receive a insertion but the other a deletion - evident after another round of replication.

Probability of slippage is also dependent on bases present - AT less hydrogen bonds than GC.

104
Q

Generally speaking, what are the characterisitcs of genome duplication?

A

Characteristics

  1. Size of duplication ranges from 1 to 400 kb
  2. Bias towards centromeres
  3. As the average human genes are only 25 kb long duplicated regions tend to carry few genes statistically speaking unless it takes place in a gene rich region.
  4. Genome duplication may exchange exons  resulting in new gene combinations.
105
Q

Is genome duplication always a good thing?

A

No Obviously not!

On the one hand, it might increase fitness by acting as a source of variation - creation of new genes/functions

But

Recombination can lead to Genetic Disease… Due to the disruption of gene function

Example - Charcot-Marie-tooth syndrome

  • Physical weakness and difficulty walking
  • Results from CMT1A ectopic (abnormal) recombination between a pair of 24 kb long repeat elements on chromosome 17.
106
Q

Under what circumstances can we say that successful gene duplication occurs?

A

Under the following circumstances…

  1. 1 copy at least retains original sequence
  2. Gene gets duplicated and doesn’t change - increasing Synthesis rate
  3. 1 copy remains selectively neutral - It may accumulate mutations (~0.1% per million years) to form gene relics (‘Pseudogenes’) or become non-functional.
  4. 1 copy acquires new function or a sub-function ‘Neofunctionalisation’ e.g. new gene undergoes new/different selective pressures resulting in adaptation to provide a fitness advantage.
107
Q

Example of Neofunctionalization?

A

Example of Neofunctionalization - New gene function: Trypsin vs. Chymotrypsin

Trypsin cuts at Arginine and lysine

Chymotrypsin cuts at phenylalanine’s, tryptophan’s and tyrosine’s

108
Q

What are pseudogenes?

A

Pseudogenes - Characteristics:

  1. Not rare
  2. Copies of functional genes with altered/missing regions - often contain frameshift/nonsense mutations
  3. May serve regulatory roles - antisense RNA strands (RNA-RNA control).
  4. Increase genome size (cost/benefit)…

Normally speaking, evolution selects for certain things as the benefits outweighs the cost, but it is possible that things (e.g. pseudogenes) are just a by-product of evolution. Nevertheless, the fact that many organism have them would argue disagree with that.

109
Q

What are the two types of pseudogenes?

A

Types of Pseudogenes

  1. Non-processed’ pseudogenes - genomic region (DNA) is duplicated to form pseudogene –> pseudogene is non-functional due to incomplete duplication or inactivating mutations - not selected for on a evolutionary basis. Furthermore, pseudogene is missing regulatory region.
  2. Processed’ pseudogenes - DNA is transcribed to form RNA. RNA gets reverse transcribed to form cDNA which gets inserted into the same or different chromosome.

Evidence:

  1. Lack introns/promotors
  2. Contain poly(A) tail/flanking repeats

Why does this occur? Random stochastic process - we get random priming allowing reverse transcriptase bind form cDNA.

110
Q

Are Pseudogenes more common than nomral genes? Which gene family has a lot of pseudogenes?

A

Yes!

More pseudogenes in the human genome than actual genes - roughly 20,000 pseudogenes

Many pseudogenes in the ribosomal family (2000 copies) - They are processed pseudogenes meaning that they are created via retro-transposition (LINEs) –> they have a high expression rate.

Ribosomal pseudogenes are highly conserved –> 2/3rd human Ribsomal pseudogenes shared with chimpanzee genome - Implies possible recent origin

111
Q

Can pseudogenes be reactivated to gain function?

A

Sometimes Pseudogenes can be activated and regain new functions – 2 types:

  1. ‘Neofunctionalised’ - Take on new function - diverging from parental genes
  2. Subfunctionalised’ - Taking on a subfunction of parent gene

Example of pseudogene with Neofunctionalisation - Makarin1-p1

Pseudogene RNA is used to affect expression/regulation of original parent gene –> when Makorin1-p1 is expressed it is used to stabilize short transcripts of Makorin-1 gene (parent gene)

How does it stabilize?

Titrates out trans-acting destabilizing factor & prevents degradation (prevents binding) - basically Pseudogene thus supports active parent gene.

Note –> when a pseudogene gets reintegrated it can either pick up a promotor in forward or reverse direction - Make RNA (sense) to produce gene of interest or reverse RNA transcript (antisense) which can serve a regulatory function - alter parent gene splicing or degrade target.

112
Q

Can Pseudogenes can be reactivated to give a new protein?

A

Yes - retrogenes

Retrotransposition - This normally happens if a retrogene gets reintegrated by chance near a regulatory region –> over time the gene tends to evolve and become more and more functional (correct reading frame, etc.)

Retrogenes defining feature - missing introns

Example: Testis specific pyruvate dehydrogenase

Note –> when a pseudogene gets reintegrated it can either pick up a promotor in forward or reverse direction - Make RNA (sense) to produce gene of interest or reverse RNA transcript (antisense) which can serve a regulatory function - alter parent gene splicing or degrade target.

113
Q

When a pseudogene gets reintegrated into the genome, what are the conseqeunces of reverse transcriptase having a high error rate?

A

Consequence is that the sequence changes slightly..

Resulting…

  1. New Gene function
  2. Change in stability of any RNA-RNA interactions - when having a regulatory function
114
Q

What is non-neofunctionalism?

A

When a gene is copied and maintaining the same function

Example:

rRNA genes (Mycoplasma genitalium)/KRAB domains in Humans –> All copies nearly identical

115
Q

How are multigene families created?

A

Multigene families - groups of genes from the same organism that encode proteins with similar sequences.

Duplication allows for creation of multigene family

116
Q

What are the two arrangements of multigene families in the genome?

A
  1. Tandemly located (back to back): α-globin genes (chromosome 16) & β-globin (chromosome 11)
  2. Dispersed - e.g. 3 aldolase genes on different chromosomes.
117
Q

What are the 3 way in which multigene families evolve overtime?

A
  1. Concerted evolution - evolve in a similar fashion/share homology –> Known as paralogous - chance that they evolve in different ways is unlikely.
  2. Ectopic (non-allelic) Gene conversion - occurs when homologous recombination changes some of the members of a multigene family - may convert all or part of a gene’s sequence - results in evolution that is not independent.
  3. Allelic Gene conversion - This is when homologous recombination converts one allele to another.
118
Q

Outline the importance of the gene duplication in the globin superfamily.

A
119
Q

What type of clustering does the globin superfamily show?

A

Globin Cluster structure –> Globin family shows both Co-location of genes in the clusters as well as dispersal of clusters

Similar organization in all vertebrates but lengths vary and different distribution of pseudogenes.

Changes between embryonic and adult genes?

120
Q

Are ther benefits gene duplication in the globin family?

A

YiesshHHH –> natural selection favors genes to take up specific functions –> e.g. Different affinities for oxygen allowing

  1. Hemoglobin has separate proteins domains for O2 storage and transport (with different binding affinities!) –> allows for cooperative binding and allosteric regulation –> low affinity at low [O2] and high affinity at high [O2] –> this support the metabolic needs of larger animals (hence, allow them to evolve/exist) –> delivery of oxygen to oxygen deprived tissues
  2. Myoglobin simply has a high affinity for oxygen –>makes sense for muscle tissue as they want to hold on to as much oxygen as possible even when oxygen deprived –> can pick up oxygen from Hemeglobin at low oxygen conc.
  3. Human developmental Pathway: 3 stages: embryonic, foetal and adult –> during different points in the human developmental pathway different forms of haemoglobin (only possible because of gene duplication) are expressed

Why is this important?

Slightly different hemoglobin between fetal and maternal blood –> this allows for optimal oxygen transfer - this has to be controlled as oxygen is needed for survival of the fetus but can be toxic at high conc. (oxidizing agent)

121
Q

Are larger genome duplications possible, larger than genes/DNA segements?

A

Yes! Larger duplications than genes and segments are possible! –> have affected genome architecture in some species (including microorganisms and larger plants and animals)

But….

This is normally refers to duplicating more than one chromosome not singular chromosome duplication

This is because singular chromosome duplication is associated with a host of diseases/disorders –> Down syndrome (‘trisomy’ chromosome 21), Edwards syndrome, trisomy 18, Patau syndrom, trisomy 13 - Leads to Gene product imbalance and Reduced life expectancy

122
Q

Does whole genome duplication (WGD) ever occur?

A

Yes it is possible!

Genome sequencing etc. suggested major metazoan lineages have undergone WGD –> Organisms have different capacities to cope with this e.g. flowering plant use this as a major mechanism to acquire new properties + source of speciation.

123
Q

What does polypoidy refer to?

A

Polyploidy - Multiple complete sets of chromosomes

Example

Eukaryotes contain 2 Haploid gene sets (2n) - ‘N’ number of chromosome + number before = ploidy

124
Q

What organisms have a high degree of polyploidy?

A

It is thought that ≤ 80% of flowering plants species originated via recent polyploidy –> examples include: Oats, cotton, potatoes, bananas, coffee, sugar cane, peanut, apples

Useful as it Generates new properties - Larger genome results in large cell size - not really understood why

But…

It turns out that polyploidy can also occur in vertebrates –> More common in invertebrates, fish & amphibians but rare in mammals.

125
Q

What is the mechanism for Polyploidization?

A

Two main scenarios

  1. ‘Autopolyploidy’ (‘Auto’ = same) - Due to mistake (meiosis error) in forming gametes the cells aren’t fully reduced (production of diploid gametes) - upon fertilization –> result in new ploidy

If its viable can lead to speciation - 1-40% frequency of formation.

Theoretically, Endoreplication can also be a cause of polyploidy in autopolyploidy - replication of the genome without going through mitosis

  1. ‘Allopolyploidy’ (‘Allo’ = other) –> polyploid offspring is derived from two distinct parental species
126
Q

Outline what happens during the autopolyploidization by mutation that takes place in potatoes

A

Example driven by a recessive mutation - results high error rate during pollen formation - element of genetic control

Consequence of this mutation?

During 1st and 2nd division the spindles ends up being in the same direction instead of being at right angles to each other so they pull the chromosomes in the same direction - leading to 2n gametes.

Alternatively…

There is also a a recessive mutation that affects ovule formation - eliminate 2nd division in meiosis –> 2n gametes

Takeaway –> Both cases you end up with 2n offspring - if two 2n gametes come together we produce 4n gametes which are fertile.

127
Q

Is the mechanism for Polyploidization normally controlled?

A

We are not sure how the mechanisms are controlled, if they are controlled or if it’s a stochastic process.

128
Q

Problems with polyploidy?

A

May induce disease symptoms - Phenomenon known as ‘Genomic Shock’

Characterized by widespread activation of transposons, gene expression (Num. of transcription factors changes) and recombination

However, plant species deal with it quite well –> Hence, Short term it can be detrimental but long term drives evolution.

129
Q

If a 2n gametes fertilizes with an n gamete, can it reproduce with the parents?

A

No!

If due to Autopolyploidy –> you end up with 2n combining with n creating 3n offspring –> this offspring can not reproduce with the parents

Why? Unequal number of chromosomes during cellular divison

130
Q

Are autopolyploids or allopolyploids more viable?

A

Autopolyploids tend to be more viable that Allopolyploids - This is because the genes are similar, each chromosome will have a homologous pair which can then form a bivalent in meiosis.

131
Q

Why is autoploidy uncommon in animals with 2 distinct sexes?

A
132
Q

Elaborate on the idea of allopolyploidy? What are the two models proposed for how allopolyploidy occurs (conceptually)?

A
133
Q

What is endoreplication?

A
134
Q

Why are triploids in allopolyploids no good?

A

They are tolerated in first generation but offspring are infertile

Example - 2n + n –> 3n - mature adult plants

When 3n tries to reproduce and produce offspring via meiosis - only 2 chromosomes (out of three) can form bivalents –> chromosomes don’t come together correctly and we end up with unbalanced gametes.

135
Q

Outline how the concept of polyploidy is used to create fertile Triticale plants.

A

Example of Alloploidy - Triploid Wheat-Rye hybrid - two species

Wheat (triploid) produces a high yield and Rye (diploid) has high disease tolerance - useful to combine to produce Triticale

Problem! F1 generation has 21 chromosomes - lead to meiotic problems (improper bivalent formation) - infertile

Solution? Add chemical colchicine - cancer drug to F1 generation –> interferes with chromosomes - leads to doubling of F1 gametes

Hence upon fertilization of doubled gametes - 21 (gametes) to 42 –> 42 is even so all chromosomes can pair up we get allotetraploid

136
Q

Definition of monoploid and haploid?

A

Definitions:

Monoploid: basic set of chromosomes multiplied (the number of unique chromosomes in gametes).

Haploid: set of chromosomes present in gamete (irrespective of species number)

137
Q

What is an example of repeated polyploidy?

A

Chrysanthemum genus (Flower) - it Is believed that the single diploid ancestral species underwent 3 whole genome duplications.

Of the Chrysanthemum genus you have: Monoploid 9, Diploid: 18, Tetraploid: 36 and Hexaploid: 54 species –> all form bivalents during meiosis - produce fertile gametes

This indicates that the species is drifting away from ancestor –> behaving as a new species -

138
Q

After Whole Genome duplication occurs, how does the cell attempt to reach balance?

A

After WGD there are Drastic change in gene expression –> Hence, the cell attempts to achieve chromosome expression balance - meaning that it restores diploid like behavior.

This is achieved in several different ways:

  1. Genetic control - transcription factors are regulated via feedbacks loops to ensure correct expression
  2. Epigenetic - DNA methylation/chromatin reorganization  changes gene expression accordingly.

Consequence of doubling the number of genes –> allows organism to carry new mutations –> as you already have the wild type - new functions/ sub-functions

139
Q

What happens when there is a loss of balance?

A

Two outcomes…

  1. Crashed system - Gene expression not under control, increased transposon activation, etc.
  2. Alternatively –> if the organism is lucky it may lead to new variability which creates new functional cells.
140
Q

Why is it hard to identify ancestral polyploidisation?

A

Genome duplication takes place –> Over time we lose particular genes copies –> Another Genome duplication takes place –> Over time we lose particular gene copies

Hence, the number of gene copies is not constant - changes –> hence, making it hard to identify WGD

141
Q

Different ways to detect WGD?

A
142
Q

How to discern auto and allo-ploidy?

A
143
Q

Is there evidence for WGD of Saccharomyces cerevisiae?

A
144
Q

What is the Hox gene family? What does it do?

A

Example of WGD –> Hox gene family - homeobox genes involved in animal development –> they occur in clusters.

Hox genes - DNA binding proteins/code for transcription factors –> turn genes on/off in order regulate development

Hox gene T.Fs utilize the Homeodomain

Hox gene family –> provides a very quick way for evolution to change the body plan of an organism

Any mutations in the cluster - results in major changes in the body plan

145
Q

Generally speaking, how do Hox clusters influence the development of the embryo’s body plan?

A
146
Q

How does the Hox gene family provide evidence for WGD?

A

Genome duplication and sequence convergence (evolution) in genes and regulatory regions has resulted in morphological complexity thus allowing complex body plans to be formed –> Duplication & manipulation of Hox genes allow for variety the spice of life bbbyyyy

Example: Insects have 1 Hox cluster whereas vertebrates have 4 - indicating that WGD has allowed for the increased copies of these clusters to exist and to be fine tuned through evolution.

147
Q

What is the 2R hypothesis?

A
148
Q

Outline what is going on in the attached picture?

Hox clusters

A
149
Q

Genome Duplication Benefits?

A
150
Q

Barriers/problems associated with WGD?

A

Barriers normally occur in the short term….

  1. Infertility
  2. Genomic Shock
151
Q

What are the different ways evolution is using to increase genome complexity?

A
152
Q

Difference between Exon Shuffling and alternative splicing?

A

Note - They are NOT the same thing!

153
Q

What is the exon shuffling theory?

A

Note - Exon duplication and Shuffling are different concepts

But…

They come hand in hand

154
Q

How do the rates of exon duplication + exon shuffling compare to gene duplication?

A
155
Q

How can Illegitimate non-homologous recombination lead to exon shuffling?

A

Illegitimate non-homologous recombination is one-way exon shuffling takes place –> only requires a short primer/overlap/homology.

Structural domains of different proteins are joined together - due to the low levels of homology for this process can occur quite easily even though we are dealing with different genes.

Topoisomerases - nicks DNA and ligates non-homologous region –> leading to the shuffling of exons

156
Q

How can Domain/Exon duplication occur?

A

Domain Duplication - Similar to Gene Duplication

  1. Unequal crossing over
  2. Replication slippage + recombination in the replication bubble
157
Q

How can LINES (Long Interspersed Nuclear Elements) jumping around the genome lead to exon shuffling?

A
158
Q

How can DNA Transposons result in exon shuffling?

A
159
Q

What is something that you have to consider during exon shuffling (or any insertion/deletion in the genome)?

A
160
Q

What is the splice frame rule?

A
161
Q

Is there evidence for exon shuffling in genomes?

Is there any correlation between organism complexity and exon class? Why does this arise?

A

YES bby

162
Q

Exon shuffling example?

A
163
Q

Why does evolution favour exon shuffling? What is particularly beneficial?

A

Futhermore…

Shuffling promotes self-interaction capacity (very common) - Allows formation of homodimeric proteins –> These protein domains self-interact

These domains tend to be stuck in exon shuffling compatible structures –> leads to formation of families and superfamilies.

164
Q

Main differences and similarities between exon shuffling and gene duplication?

A
165
Q
A