Gene structure Flashcards

1
Q

Splicing (revision)

A
  1. OH from one branch attacks the P link between the last base of exon and first base of intron
  2. OH of last base of exon performs hydrophilic attack on P bond between last base of intron and first base of next exon
  3. Now have 2 exons ligated together; left w/intron in lariat structure
    * Does not require free energy (ATP)
    * Can splice over large distances (10’s of kilobases)
    * Exon skipping → ligate exon 1 and 3; skip exon 2
    * Cryptic splice site → splicing of exon 1 into middle of exon 2 → can cause frameshift in protein
    * Co-ordinated by spliceosome and co-factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Spliceosome

A
  • 5 snRNAs (small nuclear) and ~50 proteins make up the spliceosome
  • Core proteins = 4 snRNPs (small nuclear ribonucleic particles)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Assembly and action of the spliceosome

A
  1. U1 and U2 assemble onto pre-mRNA in a co-transcriptional manner
    - U1 binds at 5’ of intron (donor site)
    - U2 binds at 3’ of intron (acceptor site)
  2. U1 and U2 snRNPs form the pre-spliceosome (complex A)
  3. Pre-assembled snRNP U4-U5-U6 is recruited to form complex B
  4. Complex B undergoes series of rearrangements to form catalytic B’
    - U1 and U4 are kicked out
  5. Complex B’ catalyses first step of splicing, generating complex C (contains exon free exon 1 and intron-exon 2 lariat intermediate)
  6. Complex C undergoes rearrangements catalysing 2nd step ligating 5’ exon to 3’ exon → post-spliceosomal complex contains the lariat intron and spliced exons
  7. Release of spliced mRNA and lariat w/help of RNA helicases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Different organisms have different levels of splicing

A
  • Number of genes in an organism’s genome is not a good assessment of protein diversity
  • E. coli only 0.1% of genes undergo AS; humans 95%
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Multiple forms of mRNA transcript variation –> diversifying the proteome

A
  • Exons retained or skipped
  • Introns excised or retained
  • 5’ & 3’ splice site positions moved: exons longer or shorter
  • E.g. DSCAM gene in drosophila
    o Has multiple exons; forms v. specific immunoglobin regions within the protein
    o DSCAM = membrane-anchored cell surface protein; role in neural development for axon and dendrite self-avoidance; 24 exons permit over 38,000 variants
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Alternative splicing (AS)

A
  • Allows related but different protein forms in different tissues
  • Supplements transcriptional control to control expression/function of gene etc.
  • Insertion/deletion of specific domains
  • E.g. it can regulate antibody and neuropeptide production
  • Can regulate expression by including exon w/no stop codon
    o Triggers nonsense-mediated RNA degradation
    o Regulates balance of functional to non-functional RNAs
  • Unprocessed RNA not transported to cytoplasm, if transported and translated, the protein is truncated due to stop codon in the intron
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Effect of AS on mRNA and protein

A
  • Rate of translation of mRNA
  • mRNA degradation susceptibility
  • Insertion/deletion of amino acids
  • Insertion/deletion of functional domains
  • Polypeptide truncation due to stop codon
  • Protein properties and functions:
    o Soluble or membrane bound
    o Subcellular location changes
    o Affinity changes for substrate
  • Consider:
    o Selection of alternative splice sites can be tissue and developmental-stage specific
    o Splice site selection must be tightly regulated → many genetic diseases can be caused by point mutations that activate cryptic splice sites or delete splice sites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

3 groups of alternatively spliced transcripts

A
  1. 5’ transcript ends differ from one another
    - Due to different transcriptional start sites
    - Then pre-mRNA processed differently
  2. 3’ ends differ from one another
    - When different poly(A) sites are utilised for transcriptional termination
    - Different use in different tissues
  3. Middle portions differ
    - Can’t be explained by using different transcription start or termination sites
    - Example: troponin T gene (involved in function of skeletal muscle)
    o With different internal exons there is 64 different ways they can be combined to express different mRNAs (found in different muscle types)
    o Regulated by tissue-specific splicing factors acting on the pre-mRNA
    o Used in heart attack: take blood sample; do PCR to detect presence of heart version of troponin T → heart cells ruptured releasing that mRNA into bloodstream
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Alternative splicing regulation

A
  • Through splicing factors = proteins that recognise cis-acting sequences within the RNA transcript
    o They “Promote” or “inhibit” splice sites in different cases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Example: determination of sex in fruit flies (Drosophila)

A
  • Different protein products in males and females
  • Splicing cascade
  • Top of hierarchy = Sxl (sex lethal) gene → can autoregulate its own splicing
    o Sxl is master sex determination gene in somatic cells → inhibits male-specific binding
    o Sxl binds intron and inhibits binding of U2AF factors at splice acceptor sites so stop codon at exon 3 is excluded; splicing goes to next exon producing fully functional Sxl protein
    o Have positive feedback to make sure that whenever Sxl is expressed in females it does not include stop codon
    o Only expressed in females
  • Next: tra (transformer) gene
    o In males (no Sxl) → get normal splicing of exon U1 to beginning of exon U2AF; within that region there is a stop codon so get small, truncated protein product w/no function
    o Sxl binds at proximal splice site (intron 1); prevents U2AF binding and binds cryptic site in exon 2 (known as the ‘distal splice site’); allows the splicing out of a stop codon → produces fully functional transformer protein
  • Then regulates dsx (doublesex gene) transcription factor
    o Males: tra2 binds to binding site in exon 4 (ESE – Exon splicing enhancer); skip exon 4 (contains sequence that promotes transcriptional termination) so have splicing exon 3 and 5; get longer protein product; have transcription factors w/slightly different properties because of inclusion of exon 5 → they then bind to different targets and recruit different complexes to regulate gene expression
    o Females: tra binds to tra2 and recruit members of U2AF family to promote splicing to to exon 4; results in transcriptional termination; shorter protein
  • Tra then regulates the splicing of the transcription factor gene fruitless
    o Fruitless encodes a transcriptional regulator that determines development; it has isoforms expressed in males and females
    o Has male specific transcription start site at P1
    o When Tra is present in females, it promotes splicing form end of exon 2 onto exon 3 resulting in inclusion of stop codon
    o In males, splicing occurs before stop codon onto exon 3 → get a fruitless isoform w/male-specific region at N-terminus → this can regulate male-specific phenotypes incl. behaviour
  • Sxl can also directly inhibit MSL2 involved in dosage compensation
    o Males only have 1X chromosome so all genes on X would only be expressed in ½ ; to compensate that, MSL2 increases the expression across the whole chromosome
    o Because males do not have Sxl, translation of MSL2 is not inhibited, so it can upregulate expression across the X chromosome
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Splicing factors

A
  • Act positively (e.g. Tra) to promote the use of a splice site
  • Act negatively (e.g. Sxl) to inhibit the use of a site
  • Some are constitutively expressed, others tissue/cell-type -specific

Gene hierarchy:
* Each gene product controls splicing of the next gene in hierarchy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Fruitless gene in males

A
  • Has many aspects of male courtship behaviour
  • Orienting → tapping → singing → licking → attempting copulation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Regulating alternative splicing: response to signals

A
  • Levels of intracellular Ca2+ can impact splicing of a gene
  • When neurons depolarize → get high Ca2+ → get phosphorylation of specific factor which binds CaRRE site within pre-mRNA → STREX domain is left out; get splicing and formation of gene products less sensitive to calcium
  • Activation of neurons results in the alternative splicing of the K+ channel gene SLO
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Regulating factors

A
  1. SR proteins (serine-arginine rich domain)
    - Usually at C-terminal
    - Constitutively expressed SR proteins can interact with specifically expressed factors
    - Can influence splicing in 2 ways:
    a) Bind 5’ splice site & promote U1 snRNP binding
    b) Bind (ESEs) exonic splicing enhancers within downstream exon and promote U2AF binding
  2. Heterogenous nuclear Ribonucleoproteins (hnRNPs)
    - Inhibit splicing in general so prevent binding of U1 or U2AF factors

ESE = exonic splicing enhancer
ISE = intronic splicing enhancer

ESS = exonic splicing silencer
ISS = intronic splicing silencer

SR proteins (serine arginine repeats) = stimulate splicing
hnRNPs (heterogenous nuclear RiboNuclearProtein) = hinder splicing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Alternative Splicing dictated by

A
  • RNA sequences,
  • Constitutive or tissue-specific trans-acting factors
  • Splice site strength:
    o Ability to bind general factors (e.g. U1 snRNP)
    o Presence or absence of ESEs/ISEs (presence of ESE = stronger)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Link to transcription

A
  • Splicing occurs as transcription is still going on
  • Rate of elongation can affect splicing pattern
    o If polymerase is moving slowly = more likely to include exons with weak acceptor sites
    o Faster = more likely to skip exons with a weak acceptor site
    o E.g shown in vivo in Drosophila, using a mutant line w/slower RNA Pol II
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Identification of alternative splicing

A
  • Microarrays, now next-generation sequencing
    o Can sequence all mRNA produced in cell → analyse to find which exons are being included/excluded → ca do this in tissues, different cell types, etc
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Future research

A
  • Questions:
    o Alternative Splicing in complex tissues – how do you know which cells have a specific splice pattern?
    o How can you rapidly identify the target genes of certain splice factors in specific cell types?
  • What they did → Targeted DamID
    o Tagged U2AF50 w/a protein from E. coli DAM (DNA adenine methyltransferase -methylates specific A within GATC sequences)
    o Allows to profile protein-DNA interactions in cell-specific manner within Drosophila
    o Adapt to splicing factors; maybe see where they are directly interacting w/DNA as genes are transcribed and spliced
    o Looked promising; observed strong peaks at 3’ ends of introns; unfortunately, could not repeat it
  • They also:
    o Cloned other splicing factors and performed Target DamID on them
    o Most did not show any association w/DNA but Sxl strongly associates w/transcriptional start sites (surprise as not expected)
    o Think about whether Sxl could also bring sexual dimorphism in the nervous system through transcriptional regulation, as well as alternative splicing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Intron early theory

A
  • Introns found in all eukaryotic genomes
    o Except ‘nucleomorph’ in a species of free-swimming, biflagellate monads
  • Walter Gilbert
  • Introns originated in prokaryotes, then lost by ‘genome streamlining’ (compressing genome)
  • Early introns gains thought to be invasive and deleterious
  • Could help promote the ‘Exon theory of gene evolution’
    o If you have intronic sequences between protein domains, allows greater rates of recombination
    o Can shuffle protein domains in genome to create proteins w/new functions
    o Shuffling permitted by introns
  • Allowed the creation of complex genes (and a large protein collection!)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Intron late theory

A
  • Introns only evolved once eukaryote formed
  • Archaea and bacteria never had introns or spliceosome
  • Though that introns would have jumped into random placement in genes
    o Not necessarily corresponding to protein structural elements
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Possible ‘in-between’ model

A
  • Prokaryote with group 2 introns (self-splicing retroelements) → invaded archaea-like cell → ended up forming mitochondria known as last common eukaryotic ancestor (has intron-rich genes)
    o Introns formed after endosymbiotic event (ie. formation of mitochondria)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Roles of introns during life phases

A
  • Introns can be a host burden
    o Spliceosome complex is huge
    o Energy & time cost
    o Vulnerability e.g. need recognition of cis-regulatory sequences
  • Roles can be classified as:
    o ‘Sequence-dependent functions’
    o ‘Length-dependent functions’
    o ‘Splicing-dependent functions’
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Life phases

A
  1. Genomic intron
    o In DNA form sitting between exons
  2. Transcribed intron
    o From DNA to RNA
  3. Intron being spliced
  4. Excised intron
    o In lariat form
  5. Exon junction complex -harbouring transcript
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q
  1. Genomic intron
A

Introns within genome harbour transcription initiation sites (e.g. cis-regulatory elements_
Elements for which transcription factors can bind and regulate transcription of gene intron is located in
Can be enhancers (promoting transcription), silencers (repressing expression), TF binding sites
Often found in 5’
~40% of binding sites → introns
Example: AFP (α-fetoprotein)
Plasma protein made in the liver and yolk sack in the foetus
Regulates osmotic pressure
Has tissue specific expression
Can have P1 promoter before exon 1 or P2 promoter in first intron → leads to formation of different proteins
Introns harbour transcription termination sites
Intron sequences can regulate Polyadenylation + cleavage
eg. in Flt-1 gene
Soluble version inhibits angiogenesis by binding extracellularly to vascular endothelial growth factor
Has 2 transcriptional termination sites
Full length form (terminates after exon 14) → have membrane formed
If termination is between exon 13/14 → form soluble form inhibiting angiogenesis
Different termination sites identified using PCR
Use primer binding upstream at polyA site; extract RNA; size of PCR bands represent form being formed
Can harbour nested genes (whole genes)
~800 in Drosophila melanogaster
May have their own promoter & different expression profile
Are often non-coding RNA & protein-coding genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q
  1. Transcribed introns
A
  • RNA polymerase II: elongation rate up to 50 kb min -1
    o Intron transcription may take hours
    o Time delay between gene activation and translation of the protein
  • HES7 gene (helix-loop-helix transcription factor expressed in mice)
    o Oscillation of HES7 protein levels is important for directing mesoderm cells to form somites during embryonic development
    o Forms negative feedback loop
    o HES7 can repress its own transcription
    o Unstable protein
    o Delays due to time for transcription, splicing and translation; have initial expression of HES7; as levels increase, feedbacks to own promoter to inhibit its own expression; levels go down so mRNA goes down so protein levels go down; HES7 no longer repressed so cycle can start again
    o Introns are v. important for ensuring correct period of oscillation; length of intron can affect timing between peaks
    o Mutant where introns removed from HES7 → disrupted body plan → somites formed in tightly clustered manner → lethal to developing organism
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q
  1. Spliced introns
A
  • Splicing linked to transcription
  • Linked via RNAPII C-terminal domain
  • Splicing can affect: Initiation, Elongation and Termination
  • Initiation → U1 binds at donor site of intron → helps recruit splicing factors (TFIID and TFIIH)
    o TFIID = first protein to bind to DNA during formation of pre-initiation transcription complex of RNA Pol II
    o U1 can help promote initiation → exploited to increase expression of transgenes in cell culture/transgenic organisms
  • Elongation → U1 interacts w/RNA Pol II subunit (TAT-SF1) to increase efficiency of elongation
  • Termination → Endonucleolytic cleavage and poly(A) tail addition to mRNA
    o U2 binds at 3’ end and promotes action of CPSF protein to cause termination →CPSF can also help recruit and improve efficiency of U2 acting at specific donor sites
    o U1 inhibits action of CPSF
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q
  1. Excised introns
A
  • Once intron is excised it often undergoes debranching and degradation
  • Can have embedded RNA genes which are expressed upon intron removal
    o Often non-protein coding RNAs (ncRNAs) e.g. microRNAs (miRNAs), small nucleolar RNAs (snoRNAs)
    o miRNAs can help autoregulation → cleaved from intron of gene and feeds back on to regulate expression of parent gene
    o Intron containing miRNA sequence (mirtron) → exported from nucleus → dicer acts to cleave miRNA duplex and to unwind it → ss miRNA can be loaded to RISC complex → acts upon mRNA targets to either degrade them or inhibit their translation
    o Recent studies suggest miRNAs can be processed before splicing → ‘transcribed intron function’
  • snoRNAs (class of RNA often contained within introns)
    o 60-150 nucleotides long
    o Fundamental to RNA modifications in archae & eukaryotes
    o Can modify rRNA, snRNA, and tRNAs
    o snoRNAs are released after splicing
    o snoRNA loaded onto complex → transported to nucleolus → modify rRNA by methylation
28
Q
  1. EJC-Harbouring transcripts
A
  • Exon junction complex = proteins that bind to mRNA close to sites of where intron was
  • Formed of 4 core proteins (MAGO, YI4, eIF4AIII, MLN51)
    1. Splicing factor CWC22 recruits eIF4AIII to site fo where intron was → rest of complex assembles
  • EJC binds ~25 nts upstream of exon-exon junction on mRNA transcript
  • Complex present from splicing until translation
  • Roles:
    1. Nuclear transport
    o ECJ important for efficient export of mRNA out of nucleus
    o Mature mRNA bind to specific transport factors (e.g ALY/REF); shuttled through nuclear pore complexes; transport rates x10 faster for spliced transcripts
    o EJC can increase rate at which transcript is exported from nucleus/cytoplasm so increased levels of mRNA in cytoplasm thus protein expression
    2. Translation activation
    o EJC on mature mRNA enhances translation
    o Now known before that EJC core component MLN51 interacts with eIF3 (translation initiation factor)
    3. mRNA localization
    o Subcellular regions (targeted within cytoplasm)
    o Localisation permitted by shuttling proteins
    o oskar mRNA needs EJC for location
    4. nonsense-mediated decay (NMD)
    o A surveillance mechanism
    o Main role is to degrade mRNAs containing a premature stop codon
    o Important to prevent dominant-negative/gain of function proteins
    o Dominant -ve proteins can act antagonistically to wild type protein → mutations usually result in altered molecular function (often inactive) → e.g. may be part of receptor that can bind to ligand but do not relay any downstream information for signalling cascade
    o Normal mRNA translation, ribosome binds to transcript and begins AA chain elongation → continues until reaches EJC → displaces it → complete when reaches stop codon
    o In NMD, mRNA transcript contains premature stop-codon (possibly due to nonsense mutation) → if before EJC, mRNA decay is triggered → SURF and UPF1 recruited to ribosome; UPF2/3 recruited to EJC → re-organization of protein complex → phosphorylation of UPF1 which inhibits translation in cis and promotes interaction w/SM6 and endonuclease that cleaves mRNA which promotes mRNA deadenylation and decapping
    o EJC position acts as regulator determining whether transcript is defective or not
    o If it is >50 nts downstream of termination codon = premature
29
Q

Overlapping genes & examples

A
  • Original definition: adjacent genes, located on either DNA strand, sharing one or more nucleotides in coding sequence
    o Imprecise definition because have non-coding regions at ends of genes and non-protein coding genes
  • Originally discovered in ssDNA of phages and thought to be rare (not anymore)

Examples:
* Genes sharing the same locus on the same strand, however coding for different proteins
o Same first exon but majority of rest of gene is different;
* Genes sharing same promoter region
o Not physically overlapping
* Nested genes
o Gene is overlapping within single intron
* Embedded gene
o Exons of small overlapping gene fall within introns
* Genes on opposite strands with overlapping locus but no overlap in the exonic region
* Tail-to-tail overlap in the exonic region
* Head-to-head overlap involving 5′-UTRs and coding sequence

30
Q

Types of overlap

A
  • 2 strands of DNA in a helix so two sites of overlap
  • “Same-strand” overlapping → uni-directional
    o 3’ end of one gene overlaps with 5’ of other
    o The genes may be regulated by a common promoter
    o V. common in bacteria
  • “Different-strand” overlapping
    o Convergent = 3’ ends overlap
    o Divergent = 5’ ends overlap; could have active bidirectional promoters driving expression of both at the same time?
  • Genomic distribution of gene pairs is species specific → may depend on evolutionary constraints
  • Can have ‘Complete’/Internal’/Embedded’/ `Nested’ Overlaps
  • “Partial” or “Terminal” overlaps → Involving only small 5’ or 3’ overlap of coding sequence.
31
Q

Gene Phase

A
  • One gene is the ‘reference gene’ – base comparisons from that
  • “In phase” = overlapping genes result in same reading frames; produce same amino acid
    o Common in bacteria and viruses
    o 2 categories: Involving different initiation and different termination of translation
32
Q

“In Phase” genes

A

1) Initiation
* Alternative translation start site?
* New internal promoter formation
* Genes share terminator
* Different N-terminals
* Identical C-terminals
* Could be proteins that bind same substrate in C but catalyse different reactions in N domain
2) Termination
* Same initiator codon
* Termination at distinct codons
* e.g. CS3 Pili genes in E. coli → can form 5 polypeptides

33
Q

Example in phase overlap: Thermus flavus aspartokinase genes

A
  • Different starting places for translation
    • AskA (405aa protein); α subunit
    • AskB (161aa protein); β subunit
    • Initiation of translation dependent on Shine Delgarno sequence
    • Could be to regulate activity of askA
34
Q

“Out of Phase” genes

A
  • Overlap does not result in identical reading frames; on same or different strand
  • Phase 1 and 2 → dependent on nucleotide shifted
    o If both in Phase 0 → ‘in phase’
  • Common in prokaryotes and phages
  • Short overlaps often in phase 2 (greater occurrence of stop codons)
  • Large overlaps in phase 1 (due to genetic code probabilities)
35
Q

Example out of phase overlap: CDKN2A gene in eukaryotes

A
  • V. potent umour suppressor gene → when mutated can result in cancer
  • Identified 2 genes: Arf (p14) and Ink4A (p16)
  • Alternative first exons (1α and 1β) that are transcribed from different promoters
  • These are spliced to the same acceptor site in exon 2, which is translated in alternative frames
36
Q

Partial/Terminal overlap of genes

A
  • Small overlaps on 5’ or 3’ end
  • Common for prokaryotes with functionally dependent genes
  • Terminator site of 1 gene overlaps with initiator of another
  • Example: trytptophan operon
    o trpE-trpD- and trpB-trpA one base overlap
    o Proteins synthesised in equimolar ratios
    o Translation coupling dependent on this overlap
    o Proximity of the trpB stop codon to the trpA start influences trpA translation
37
Q

Translational recoding

A
  • Ribosomes can be directed to:
    o utilise alternative start sites
    o bypass or recode termination codons
    o Or site specific ‘Programmed Shift of Reading Frame (PSRF)’ → change reading frame when it reaches specific sequence; can make 2 different proteins from same mRNA
  • Ribosomal frameshift:
    o Happens when it pauses on mRNA and can shift backwards/forwards 1 nucleotide (or more)
    o Shifting depends on: mRNA regulatory sequence & structure; all mRNA must be unfolded → disrupts codon/anti-codon binding and lead to uncoupling
38
Q

-1 ribosomal frameshifting:

A
  • V. common in prokaryotes
  • mRNA requires:
    i) Slippery sequence
  • 7 nucleotides where shift takes place
  • X XXY YYZ (original reading frame)
  • XXX YYY Z (shifted reading frame)
  • XXX: 3 identical nucleotides, YYY: AAA/UUU, Z: not often G
    ii) Spacer sequence
  • 12 nucleotides or less
    iii) Downstream stimulatory sequence
  • Pseudoknot / kissing stem-loop structure
  • Provides energetic barrier for ribosome to overcome
  • It aids positioning over slippery site (makes ribosome pause)
  • Slippage may occur during distinct points of translation elongation cycle:
    o During accommodation of the A-site tRNA
    o During EF-G catalysed translocation
    o Just after before peptidyl transfer
39
Q

+1 programmed ribosomal frameshifting (PRF) example:

A
  • Saccharomyces cerevisiae: OAZ1
    o Mammalian equivalent: ornithine decarboxylase antizyme (OAZ)
    o Antizyme regulates activity of ornithine decarboxylase (ODC) which produces polyamines
    o OAZ stimulates ubiquitin-independent degradation of ODC
    o PRF plays key role in homeostatic mechanism that controls polyamine levels (negative feedback)
    o Polyamines can stabilise pseudoknot
    o Get +1 PRF which bypasses stop codon and allows mRNA to be translated into full length fully functional OAZ protein
    o Without polyamines have truncated product
  • HIV Virus
    o Gag produced (55kDa) → precursor protein that forms virus particle
    o Gag is translated from viral genomic DNA
    o Can have -1 PRF resulting in longer version of Gag (PolyGag ~160kDA) which includes sequence for protease, reverse transcriptase, and integrase; only happens ~5% of times
    o Researchers interested in targeting -1PRF mechanism in HIV to come up w/drugs that can bind to RNA and disrupt this mechanism; developed a drug that could stabilise stem-loop structure to have about ~50% of each Gag → due to disproportionate levels makes less efficient for virus to assemble itself
40
Q

Why have overlapping genes and PRF?

A
  • Allows for genome compression
  • PRF provides another method to increase the diversity of the proteome
    o Mechanism acting on single mRNA can make many different proteins
  • Stoichiometry: PRF and shared promoters allow proteins to be expressed at stable levels relative to each other – coordinated control
41
Q

Advantages/disadvantages of overlapping genes

A
  • Specially important for viruses
  • Useful as Capsid enforces limitations on virus:
    o Can’t package larger genome
    o Especially with icosahedral capsid
    o Can only increase size only with increased subunit number
    o Fitness cost (eg. Use more resources and takes longer to replicate)
  • Mutations = in theory, might mitigate the detrimental effects of mutation
  • Evolution = overlapping genes may be subject to evolutionary constraint
    o E.g. if one gene needs to change to adapt to circumstances, it is constrained by other gene (this other gene may be vital for function)
42
Q

Difference between prokaryotes and eukaryotes:

A
  • Eukaryotes:
    o Larger genomes
    o Contain introns so overlapping genes may be located in introns (less problem w/regards to evolutionary constraints)
    o More abundant different strand overlaps: could be due to more complex genome structure and sharing of different promoter enhancer elements
    o Avoidance of exon sharing retains flexibility of gene to adapt
    o A lower proportion of divergent different strand overlaps → maybe because 5’ region is more delicate
  • Prokaryotes:
    o Features exons primarily so exon overlapping is common
    o Unidirectional overlapping is the most common layout
    o Operons are a driving force
    o Have one promoter driving expression of 5 genes; overlap important for correct stoichiometry
    o PRF move prevalent → selected for more due to genome size restraints
42
Q

Difference between prokaryotes and eukaryotes:

A
  • Eukaryotes:
    o Larger genomes
    o Contain introns so overlapping genes may be located in introns (less problem w/regards to evolutionary constraints)
    o More abundant different strand overlaps: could be due to more complex genome structure and sharing of different promoter enhancer elements
    o Avoidance of exon sharing retains flexibility of gene to adapt
    o A lower proportion of divergent different strand overlaps → maybe because 5’ region is more delicate
  • Prokaryotes:
    o Features exons primarily so exon overlapping is common
    o Unidirectional overlapping is the most common layout
    o Operons are a driving force
    o Have one promoter driving expression of 5 genes; overlap important for correct stoichiometry
    o PRF move prevalent → selected for more due to genome size restraints
43
Q

Gene regulation by antisense transcription

A
  • Antisense transcription can impact on gene expression at 3 different stages:
    1) Transcription initiation
  • CKN2B and CDKN2A has v. long non-coding RNA expressed on antisense strand (ANRIL)
  • When ANRIL is transcribed it can bind to polycron complex; RNA can also bind to complementary DNA sequences
  • RNA acts to recruit polycron complex to chromatin resulting in histone modification (e.g. H3K27me, a repressive mark)
  • ANRIL can feedback and repress expression of tumour suppressors
  • Can also lead to DNA methylation (also repressive mark)
    2) Transcription elongation
  • Can get inhibition of expression of a gene from antisense expression of another gene → RNA Pol collision and stop so neither gene will be transcribed anymore
  • Ubiquitin directed proteolysis can remove RNA Pol so transcription can occur again
    3) Post transcription
  • Parent gene BACE1 (ß-secretase 1) → high levels in common late sporadic alzheimer’s disease
  • BACE1-AS (antisense expression) prevents BACE1 being degraded by microRNA
44
Q

Sense-antisense pairs as self-regulatory circuits

A
  • Fine tuning → antisense expression slightly modulates expression of the sense gene
  • Bistable switch → strong mutual repression (off and on states)
45
Q

What is RNA editing

A
  • Diverse mechanisms that change the sequence of the RNA transcripts encoded by genes in a wide range of organisms
  • Only in eukaryotes
  • Discovered by Rob Benne in 1986; followed up by Ken Stuart and colleagues
46
Q

Similarities between RNA editing and splicing

A
  • mRNAs, tRNAs and rRNAs can all be substrates for RNA editing and splicing
  • Alternative splicing and editing generate protein diversity
  • Splicing and editing are developmentally regulated; can change over time and be different in different cell types
47
Q

Differences between RNA editing and splicing

A
  • Splicing removes (large) RNA sequences encoded by a gene; editing only removes few bases from RNA
  • Splicing just cuts bits out; editing adds/changes the information encoded by a gene
  • Splicing is often a RNA catalysed reaction; editing is always protein catalysed
48
Q

Discovery of RNA editing

A
  • Trypanosome brucei –species of saliva trypanosome that causes African trypanosomiasis
    o Transmission between mammals by vector tsetse fly
    o Has 1 mitochondria; kinetoplast holds mitochondrial genome which has 4 million base pairs but only encodes 18 genes
    o Kinetoplast has maxicircles (~50) and minicircles (~10,000)
    o Maxi-circles encode components of the mitochondrial oxidative phosphorylation machinery
  • Found premature stop codon in cytochrome oxidase II (COXII) gene –surprising because highly conserved across genomes
    o Sequences RNA of gene after being transcribed
    o Found 4 U inserted → changed reading frame → stop codon was not read as stop codon → read through until predicted stop codon later
  • Looked at COX III gene → performed DNA sequencing → shorter than expected → sequenced RNA → more than half of the gene is comprised of inserted U
  • Now all genes in mitochondrial genome studied for RNA editing
    o 12 genes encoded edited mRNA
    o 6 genes encoded unedited mRNA
49
Q

RNA editing is an RNA repair process

A
  • Insertion/deletion of uridines can:
    o Form start codons
    o Correct frameshift mutations
    o Create complete open reading frames
    o Remove premature stop codons
    o Form appropriate stop codons
50
Q

Where does information come from to perform RNA editing?

A
  • Minicircles: encode for gRNA → instructs editing of transcripts in maxicircles after transcription
    o G-U base wobble → does not fit Watson-Crick base pairing
  • Maxicircles contain coding sequence for mitochondrial genes
    o Encodes mRNA and rRNA
  • Both cases have transcription of polycistronic RNAs → then cleaved into various genes and into individual guide RNAs (gRNA) → gRNA will bind to mRNA and edit them → translate into protein
51
Q

Anatomy of guide RNA

A
  • 5’ triphosphate –primary transcript
  • Anchor sequence –base pairs w/ pre-mRNA
  • Guiding sequence –directs insertion/deletion
  • 3’ polyU tail –post-transcriptionally added
52
Q

Mechanism of Trypanosome RNA editing

A
  • Addition of oligoU tail to gRNA
  • Annealing of gRNA and pre-edited mRNA
  • Can be endonuclease cleavage at mismatch site → uridine insertion (by TUTase; then ligation) or deletion (by exonuclease; then ligated)
53
Q

Editosome

A
  • Exact structure not known
  • ~20 proteins make up the editosome
  • Accessory proteins that bind gRNA and pre-edited mRNA
  • Core part → TUTase, exonuclease, etc
54
Q

Alternative RNA editing generates protein diversity

A
  • Can help organisms adapt to changing environments
  • Trypanosome life cycle:
    o Has changing mitochondrial activity → could be due to RNA editing
  • COX III gene can be alternatively edited → has different N-terminal because of different reading frame → makes AEP1
  • AEP1 localises to kinetoplast; involved in maintenance of kinetoplast DNA network
55
Q

RNA editing in Physarum (slimemold)

A
  • Additions of cytosines and uridines
  • Additions of GU and CU
  • C to U changes (deamination)
  • Extremely accurate
  • Occurs co-transcriptionally
56
Q

Mechanisms of RNA editing in plant mitochondria and plastids

A
  • Recent studies have identified the role of PPR proteins in site recognition
  • Some have deaminase activity → can perform recruitment and RNA editing
  • Appears to be a great diversity of “editosomes” in plants
57
Q

Apolipoprotein B in mRNA editing

A
  • Protein involved in transporting lipids through circulatory systems
  • C terminus contains low-density lipoprotein receptor domain
  • Unedited → CAA → longer (~x2 larger) → Apo-B100 expressed in liver
  • Edited → UAA (stop codon) → truncated version → Apo-B48 expressed in intestine
  • In ribosomal frameshifting, changing frame to include stop codon; here changing single base to create stop codon
58
Q

Adenosine deaminases that act on RNA (ADARs)

A
  • Most common type of editing
  • All ADARs contain 1-3 double stranded RNA binding motifs
  • ADARs found in a wide range of organisms, from yeast to mammals
  • Converts adenosines to inosines (A to I)
    o I can be interpreted as G
  • mRNAs, tRNAs, viral RNAs & non-coding RNAs can all be substrates
  • Alters specific codons to change amino acids or change/incorporate/remove stop codons
  • ADAR forms specific RNA secondary structure
59
Q

Editing of serotonin receptor in mammals

A
  • Editing can change the efficient of signalling of this receptor
    o More editing = less signalling efficiency (same as low serotonin)
  • Over- and under-editing of this mRNA are associated with the genetic disease Prader-Willi syndrome and with depression-associated suicide
  • Recent study: different levels of editing in receptor correlate w/alcohol preference in mice
59
Q

Editing of AMPA glutamate receptor in mammals

A
  • A lack of editing of GluA2 Q to R results in an excess of Ca2+ influx into neurons and causes postnatal death in mice
  • Low levels of GluA2 Q to R has been observed in patients with major depressive disorder and schizophrenia
  • ADAR 2 involved in these changes
59
Q

RNA editing and development

A
  • ADAR1 mutants are embryonic lethal
  • Phenotypes include impaired haematopoiesis and defects in liver formation
  • Embryonic stem cells have high editing levels and ADAR1 levels impact on efficiencies of cellular reprogramming
  • Exact roles and contributions to these developmental defects are not yet clear
60
Q

RNA editing and cancer

A
  • ADAR1 down regulation can lead to regression of chronic leukaemia in mice
  • ADAR2 down regulation inhibits cellular proliferation in different types of brain tumours (glioblastoma and pediatric astrocytoma)
  • ADAR silencing in breast cancer cell lines led to less cell proliferation and more apoptosis
61
Q

Key roles: RNA editing in mammals

A
  • Neurotransmitter receptor regulation
  • Embryonic development, including brain development
  • Cancer – regulation of proliferation and apoptosis
62
Q

Higher levels of RNA editing in nervous system

A
  • ADAR mutants in Drosophila and C. elegans lead to brain related phenotypes
  • 3 ADAR proteins in mammals
    o ADAR2 mutants die from seizures
    o Inactive ADAR3 is expressed exclusively in the brain
  • Genes that are recoded by RNA editing are enriched for neuronal genes
  • Alu repeats in the genome are heavily edited – Alu harbouring genes are also enriched for neuronal genes
63
Q

RNA editing as a driving force of brain evolution

A
  • Intensity of RNA editing is 35 fold higher in humans than in mice
  • Examination of 6 transcripts suggests an increase of editing from monkeys to chimps to humans
  • It is plausible that the expansion of RNA editing in humans may have led to increased diversity in the brain - driving neural evolution and higher cognition → still only a theory…
  • A lot of editing in repetitive regions (not much difference in editing between different tissues)
  • Editing in brain is v. unique
  • Higher levels of editing in arterial tissues
64
Q

RNA as a tool for acclimatisation

A
  • Cold-blooded organisms may utilise extensive RNA editing to respond to temperature changes and other environmental variables
  • Evidence: As temperature decreases ↓, editing increases ↑ at I321V in the octopus potassium channel Kv1.1, and at most Drosophila editing sites
  • An A → G change, and the sequence preferences for ADARs, often results in the replacement of a large –R group for a small one
  • George Somero et al. – hypothesised substitution of small residues at hinge regions within proteins tends to reduce activation energies to allow enzymes to work as efficiently at lower temperatures