RNA processing (13/14) Flashcards
What does the term RNA processing cover?
5’ end capping and 3’ end polyadenylation
Splicing
Editing
Surveillance (quality control)
Where does most of the processing happen?
Inside the nucleus where mRNA is present as pre-mRNA. This processing is specific to eukaryotes.
Heterogenous nuclear RNA (hnRNA)
pre-mRNA: RNA that compromises transcripts of nuclear genes made by RNA pol II. Only exists in eukaryotes.
Pre-mRNA processing (short summary)
The RNA is modified in the nucleus by additions to the 5’ and 3’ ends and by splicing to remove the introns. This is essential for the stability, export and translation of the mRNA strand.
5’ end capping (when it happens, simplified process, functions)
RNA capping generally occurs before further processing events. It happens as soon as the first 25-30 nucleotides are incorporated into the nascent transcript.
The process is catalyzed by various enzymes, such as RNA triphosphatase, gunylyltransferase and methyltransferase. Most mRNAs are modified by addition of 7-methylguanylate cap (m7G), which attaches to the phosphate bridge in an unusual 5’-5’ linkage involving 3 phosphates.
Functions:
- A capped mRNA is more resistant to 5’-3’- exonucleases, hence the de-capping stimulates mRNA degradation (decay).
- The cap is recognized by various protein factors (like the cap-binding complex CBC for export, or the translation initiation factor eIF4E) to facilitate: Splicing, export and translation (in the cytoplasm)
CBC
Cap-binding complex
3’ end polyadenylation (definition, where it happens, function, simplified process)
The addition of a poly-A tail (typically around 100-200 nucleotides) to a pre-mRNA. Happens both in the nucleus and in the cytoplasm.
It influences virtually all aspects of mRNA metabolism, conferring mRNA stability (protection from 3’-5’- exonucleases), promoting export and translation.
The process of polyadenylation is directed by a poly(A)-signal, on which the mRNA strand is cleaved. The cleavage is coupled to the addition of about 200 adenosines (As) to the 5’ cleavage product.
The polyA signal (in mammals)
Consists of the hexanucleotide AAUAAA and the G-U-rich elements. The cleavage occurs between these two elements.
Step 1 of 3’ end polyadenylation: RNA cleavage (proteins and auxilary cleavage factors involved)
The polyA signal functions as a landing platforms for different proteins.
- CPSF binds to AAUAAA
- CstF binds to the U-rich element
- CFI and CFII function as auxilary cleavage factors
CPSF
Cleavage and Polyadenylation Specificity Factor. Binds AAUAAA.
CstF
Cleavage Stimulation Factor. Binds the U-rich element.
CFI & CFII
Auxilary cleavage factors
Step 2 of 3’ end polyadenylation: Addition of the poly(A) tail
Once the cleaving occurs, the polyA tail is added to the mRNA. This addition is dependent on PAPs, which can be both canonical and non-canonical (in mitochondria).
As the poly-A tail grows, it associates with PABII for protection. Nuclear or cytoplasmic poly-A binding proteins depending on whether it is nuclear or cytoplasmic polyadenylation.
PAP
Template-independent poly(A) polymerase.
Canonical and non-canonical, e.g. in the mitochondria.
PABII
Nuclear poly(A) binding protein. Increases the PAP affinity to RNA, allowing efficient polyadenylation, and protects the RNA.
Alternative cleavage and polyadenylation (APA) (defintion, frequency)
APA occurs when a single gene has multiple polyA sites, and can result in multiple RNA transcripts and/or protein isoforms.
APA is widespread -> More than 70% of all human genes are subjected to APA.
Simple example of APA
There are 2 potential cleavage polyadenylation sites (2 possible cleavage events) in pre-mRNA. Which one is used depends on the associated proteins (like poly-A binding proteins) and additional proteins associated with the 3’ UTR.
Downstream is usually a strong poly-adenylation signal used to induce cleavage and addition of a polyA tail. This will result in a long 3’ UTR that has more potential for regulation.
Another upstream site will produce a very short mRNA when used. This will be too short to contain anything to regulate (no landing site). Various binding proteins (RBPs) or small RNAs like miRNAs will influence translation and stability of the mRNA (either repress or promote translation).
Various types of APA (general explanation, 3 scenarios, in-depth)
APA can produce a great variety of different mRNAs, but also at protein level depending on where the polyadenylation site is. If the alternative sites are in noncoding region, the protein doesn’t change. However, if they are within the coding region, then you also produce protein variants.
Contitutive polyadenylation: Single protein product. The 3’UTR will produce various mRNA variants depending on which of the cleavage sites the mRNA chooses. The protein, however, will not change.
UTR-APA: Single protein isoform, but protein output may vary between mRNA isoforms. Alternative cleaving and polyadenylation sites can interact with introns and exons. Introns can be maintained (instead of spliced) if there is a polyadenylation site within it. A poly-A tail added to this noncoding region makes it actually coding, which will produce a variant which is different in both the 3’UTR and also the protein sequence (with others than the usual exons).
CR-APA: Similar as above happens in alternative exons. Exons can be spliced in several different ways, which may produce a protein with extra bits and alternative 3’UTRs.
Pre-mRNA splicing (eukaryote/prokaryotes)
Excising introns and connecting the exons into a continous (pre)mRNA -> The protein-coding regions of pre-mRNA are joined together.
Mostly in eukaryotes, rare in prokaryotes. Here the genetic information is straight-forward and relates directly to the information translated. It becomes more complicated with splicing in for example mammals (gibberish has to be made into the actual code). Function of splicing is then to make sense of the code.
Relating number of introns with an organism’s complexity
There is a large variation within eukaryotes in number and distribution of introns. Complex organisms have intron-rich genes. Budding yeast has essentially no introns, whereas humans have 8 introns per gene.
Certain genes are very long but mostly consist of introns, e.g. the gene dystrophin with 98% introns.
Example of a spliceosomal RNA: U1 snRNA (Sm proteins)
All snRNPs, except U6, contain a conserved sequence that binds the Sm (Smith antigen) proteins.
Sm proteins are recognized by own antibodies (anti-SM) generated in an autoimmune disease (lupus erythematosus, SLE). These proteins are interacting with the Sm binding site in snRNAs and are involved in splicing.
RNA-RNA interactions that are key in splicing (+ what happens at the branch point?)
There are many interactions between the sequences in the splice sites that are mediated by the snRNPs.
snRNA engage in the interaction at the 5’ splice site by guiding the snRNPs’ interaction with the branch site. There are also some interactions between the snRNAs which are important for stability.
The branch site on the mRNA can be bound by either branchpoint binding proteins (BBPs) and later U2 snRNAs. This leads to the flipping out of the A which can engage in the nucleophilic attack.
The process of splicing (goal, sequence motifs, 2 transesterification steps)
Goal: To produce spliced mRNA which can be translated by discarding introns.
Splicing is mediated by special sequence motifs. An intron is framed in by a 5’ exon and a 3’ exon, with a 5’ splice site and 3’ splice site spanning the intron. Some of the residues in these motifs are conserved. Within the intron there is also a branch site A.
Splicing proceeds via 2 transesterification steps.
- Variant A in the splice site performs a nucleophilic attack on the phosphoryl group in the 5’ splice site. As a consequence, the intronic sequence is conformed into a conformation called a lariat (cowboy’s loop), which leaves a free 3’ hydroxyl at the end of the 5’ exon.
- The free 3’OH performs the second nucleophilic attack on the phosphate group in the 3’ splice site. The result is a lariat (spliced intron) and the joining of the two exons together. 2 phosphodiester bonds are broken, while 2 new are formed (doesn’t require energy).
Splicing is very energetically consuming, but it mostly comes from the spliceosome.
Spliceosome
The splicing workhorse. The spliceosome is a huge complex (ca. 12 MDa) containing many proteins and also some RNA: 5 x snRNAs (small nuclear). They also associate with particular proteins called snRNPs (“snurps”). In addition, there are some other core splicing factors and some auxilary proteins.
Most important are the snRNPs.
Biogenesis of snRNPs (snurps)
snRNAs are exported into the cytoplasm with the help of export factors. In the cytoplasm, they interact with various factors including Sm proteins which form (stabilizing) rings around the Sm sequence (Sm core assembly). With the help of other proteins, they go back into the nucleus and are processed in the cajal bodies, before they end up in the nuclear speckles.