Transcription Flashcards
Define transcription
The process by which cellular DNA directs the synthesis of a complementary strand of RNA which reflects the genetic information encoded within the DNA
Three types of RNA
Messenger RNA (mRNA) Ribosomal RNA (rRNA) Transfer RNA (tRNA)
How is transcription carried out?
The transcription of DNA to RNA is carried out by RNA polymerases
Similarities between RNA and DNA polymerase
LIKE DNA polymerase, RNA polymerase:
- Uses nucleoside 5’-triphosphates as precursors
- Catalyses phosphodiester bond formation between NTPs
- Uses DNA as template
- Base-pairing interaction determine corrects base
- 5’ → 3’ growth of new nucleic acid chain
- Growing strand is the reverse complement to template
Differences between RNA and DNA polymerase
UNLIKE DNA polymerase, RNA polymerase:
- Uses RIBOnucleoside 5’-triphosphates (ATP, GTP, CTP, UTP)
- Can initiate de novo synthesis of a new strand (no primer needed)
- A single strand of RNA is produced (only one strand of DNA is used at a time for RNA synthesis)
- Only short stretches of DNA are transcribed
Transcription in Bacteria vs Eukaryotes
- All cells contain RNA polymerase
- In bacteria, one RNA polymerase synthesises all of the cell’s RNA
(except for the short RNA primers used in DNA replication) - In eukaryotic cells, there are around five different RNA polymerases. Each synthesises a different class of RNA
- The process of transcription in bacterial cells is less complex
Structure of E. coli RNA Pol
E. coli RNA polymerase, a large 449 kD protein complex, consists of:
- A core tetramer of subunits (involved in DNA template-binding and phosphodiester bond formation)
- A sigma subunit that only associates with the core enzyme for its critical role in transcription initiation
Four Main Steps of Transcription
- Binding of RNA polymerase to the specific site where a gene starts (called the promoter)
- Initiation of transcription at the start site
- Elongation in the 5’ → 3’ direction
- Termination of transcription
Start of Transcription
- RNA synthesis is initiated at specific sites on the DNA
- The DNA strand that serves as the template is known as the antisense or non-coding strand
- The other DNA strand is known as the sense or coding strand and has the same sequences as the transcribed RNA (except for the replacement of U with T)
Step 1 - RNA Pol Binds to Promotors
- RNA polymerase binds to transcription initiation sites known as promoters
- Specific promoters are recognised by the corresponding sigma factor
- E. coli promoters are approximately 40 bp in length and are located on the 5’ side of the transcription start site
Step 2 - Promotor Recognition and Transcription Initiation
- The complete RNA polymerase holoenzyme (core tetramer plus sigma) binds to non-promoter DNA LOOSELY, allowing the enzyme to move rapidly along the DNA in search of the sigma subunit’s corresponding promoter
- At the promoter, the core enzyme binds TIGHTLY and the sigma factor dissociates from the core RNA polymerase which carries on with elongation in the 5’ → 3’ direction
- Note that different sigma factors recognise different promoters
Step 3 - Elongation: RNA Chain Growth
- As the polymerase advances, the DNA unwinds ahead of the RNA’s growing 3’ end and rewinds behind it
- In the region being transcribed, the DNA double helix is always unwound by ~ 17 bp, forming a “transcription bubble”
Step 4 - Termination of Transcription
- In many E. coli genes, the transcription termination signal is an inverted repeat of GC-rich sequence followed by 4 or more adenines
- The transcribed RNA forms a self-complementary stem-loop “hairpin” structure that is terminated by several U residues
- Weak hydrogen bonding at site of transcription results in RNA being released from DNA template and from the enzyme
Transcription in Eukaryotes
- The fundamental principles of transcription are similar in bacteria and eukaryotes
- However, eukaryotic transcription is distinguished by having multiple RNA polymerases and complicated control sequences
- The eukaryotic transcription machinery is also far more complex
Eukaryotes have several RNA Polymerases
Eukaryotic nuclei contain three distinct types of RNA polymerase:
- RNA Polymerase I
- Located in nucleoli (dark straining nuclear bodies where ribosomes are assembled)
- Synthesise most rRNA precursors - RNA Polymerase II
- Located in nucleoplasm
- Synthesises mRNA precursors - RNA Polymerase III
- Located in nucleoplasm
- Synthesises 5s rRNA precursors, tRNAs and others
Note that eukaryotic cells also contain separate mitochondrial and chloroplast RNA polymerases
Different polymerases recognise different types of promotors
- In eukaryotes the RNA polymerase complex does not include a removable sigma factor
- Instead, a number of accessory proteins (e.g. transcription factors) identify promoters and recruit RNA polymerase to the transcription site
- Eukaryotic promoters are more complex and diverse than bacterial promoters
- The three main eukaryotic RNA polymerases recognise different types of promoters
RNA Pol II
RNA Polymerase II synthesises mRNA precursors for both constitutively and non-constitutively expressed genes
- Constitutively transcribed structural genes are expressed in all tissues
- Selectively transcribed structural genes are expressed in fewer cell types
Sequence/Promotor Elements in Typical Eukaryotic Gene
TATA box (TATAAAA)
- Located approximately 25-30 bp upstream of the +1 start site
- Determined the exact start site (not in all promoters)
- Binds the TATA binding protein (TBP) which is a subunit of TFIID
GC box (CCGCCC) - Binds Sp1 (specificity factor 1)
CAAT box (GGCCAATCT) - Binds CTF (CAAT box transcription factor)
Other Regulatory Elements
- Other regulatory elements of RNA polymerase II act to determine the frequency or efficiency of transcription
- Can be located many bp UPSTREAM or DOWNSTREAM of start site or even within genes
- Such transcription elements include:
- Enhancers (increase transcription rate)
- Silencers (decrease transcription rate)
- Response Elements (target sequences for signalling molecules)
- Regulatory proteins (activators or repressors) may bind to these elements
Post-Transcriptional RNA Processing
- Eukaryotes lack precise transcription termination sites
- This is largely due to the fact that most primary transcripts undergo various post-transcriptional modification processes in order to acquire biological activity
- The three major classes of RNA are altered in different ways:
- mRNA undergoes 5’ capping, 3’ polyadenylation and splicing events
- rRNA undergoes endonucleolytic cleaving, methylation and splicing
- tRNA undergoes nucleotide removal, addition and modification
mRNA Processing
- 5’ capping
- 3’ Polyadenylation
- Splicing
mRNA Processing - 5’ Capping
- Eukaryotic mRNAs have a cap structure consisting of 7-methylguanosine residue joined to the transcripts initial 5’ nucleotide via a 5’ - 5’ triphosphate bridge
- Capping involves several enzymatic reactions
- The cap is added to the growing transcript when it is ~ 30 nucleotides long
- The cap identifies the eukaryotic translation start site
- The cap also protects the polynucleotide from degradation by 5’ exonucleases
mRNA Processing - 3’ Polyadenylation
- Mature eukaryotic mRNAs have well defined 3’ ends terminating in poly(A) tails of ~250 nucleotides
- The poly(A) tail is generated from ATP through the action of poly(A) polymerase
- Multiple copies of a poly(A) binding protein associate with the tail and help protect the 3’ end of the mRNA transcript from 3’ exonucleases
- The poly(A) tail may also serves as a handle for the proteins that deliver mRNA to ribosomes
mRNA Processing - Splicing
- A major difference between the structural genes of eukaryotes and bacteria is that the coding sequences of most eukaryotic genes are interspersed with unexpressed regions
- The primary transcripts are also called pre-mRNAs or heterogeneous nuclear RNAs (hnRNAs)
- Pre-mRNAs are processed by the excision of non-expressed intervening sequences (introns), following which the flanking expressed sequences (exons are spliced (joined together))