Dr. Robert Weinzierl (TF Focus) Flashcards
Definition of a core promoter?
A core promoter is defined as the minimal DNA region that is sufficient to direct low levels of activator‑independent (basal) transcription by RNAP II in vitro
Core promoters typically extends approximately 40 bp up‑ and down‑stream of the start site of transcription (+1) and can contain several distinct core promoter sequence elements
Are core promoters very diverse in Higher eukaryotes?
Core promoters in higher eukaryotes are highly diverse in structure and each core promoter sequence element is only found in a subset of genes
What does an ideal TATA-box promoter sequence look like?
TATA-box is a core promoter element that is typically located 25-30 bases upstream from TSS and is defined by a A/T rich consensus that is flanked by G/C rich sequences (emphasising the position of the TATA box)
Example shown is derived from adenovirus major late promoter –> One of the best examples of highly active TATA-boxes
What is one of the main problems around using a TATA box element as a core promoter element in higher eukaryotes?
Eukaryotic genomes are huge in comparison with bacterial genomes
Hence, given that a TATA motif can arise by chance every 4 Kb, it is too short/weak to confer specificity
Thus, normally TATA boxes are accompanied by other core promoter cis-regulatory elements + TATA boxes are only found in a small proportion of core promoters
What protein complex recognises the TATA box motif?
The TATA-Box is specifically recognised by the TFIID protein complex.
TFIID contains a TATA-binding protein (TBP) that recognises the DNA motif
How large is TFIID and how does it recognises the TATA box?
TFIID is a large multiprotein complex (~10-12 different subunits; species-dependent)
Only one of these subunits is responsible for recognizing the TATA box: the ‘TATA-Binding Protein’ (‘TBP’)
Hence, TBP can specifically bind to TATA-boxes on its own –> none of there other proteins are involved in TATA recognition and binding
Outline the structure of TBP and what properties allow it to bind to DNA.
TBP is a crescent/U-shaped protein that is able to wrap around the DNA double helix
TBP contains +ive charged Lys and Arg residues which interact with the -ive charged backbone + conserved phenylalanine residues are placed in the DNA minor groove - allowing it to make hydrophobic contacts (reasonably strong binding)
Hydrophobic interactions cause the DNA to kink by 80o
Is TBPs saddle structure highly conserved across species?
Sequencing of TBP-encoding genes from many organisms has shown that the ‘saddle’ structure is highly conserved
This saddle structure is present in TBPs from all species
On the DNA level → in the saddle region we see a drop in % conservation in distant species whereas the N-terminal region tends to be be more divergent.
In higher eukaryotes there is a substantial N-terminal extension present that may contain poly-glutamine (poly”Q”) stretches
Why do bacteria promoters tend to be A/T rich?
In bacterial promoters the –10 element is also AT-rich, but this element is not functionally equivalent to eukaryotic TATA boxes
The bacterial –10 region facilitates the localized DNA strand-separation (‘promoter melting’; ‘transcription bubble formation’) to allow RNAP to initiate transcription. The eukaryotic TATA-box is located much further away from the transcription start site (-25 to –30!) and this part of the promoter is never part of the transcription bubble!)
What is the functional importance of the TATA (A/T rich region) box in the promoter region?
In eukaryotic TATA boxes the deformability of the minor grove width in regions of A/T base-pairs is important
This increase deformability allows for more intimate contacts between the TBP saddle and minor groove → lack of protruding -NH2 in G bases allows for this
What does physical data on DNA bending at TATA boxes reveal to us?
TATA region already has a tendency to bend
Oligo DNA strand with TATA region will bend on its own without the presence of TBP → upon addition of TBP the bending angle increases
Recap - What are the two main factors that are important for TBP binding?
TATA-boxes work on two levels:
- Sequence-specific contacts between TBP and the TATA-box consensus sequence
- TATA-boxes on their own influence the higher-order structure of DNA by bending it slightly. This localised bend attracts the binding of TBP to the TATA-box (matches conformation).
Note that pre-bending can be achieved with a variety of A/T rich sequences, thus explaining (at least to a certain extent) the high degree of sequence heterogeneity of eukaryotic TATA-boxes
What important fact should you always keep in the back of your mind when thinking about TFIID/TBP binding to a TATA sequence?
Always important to consider DNA accessibility which depends on the chromatin environment
Most TATA-like sequences are ‘hidden’ by chromatin structures and are thus never accessible to TBP
After TFIID binding to the TATA-box, what other two players bind to the protein complex?
TFIIB and A bind either side of TFIID → stabilising the complex (synergistic effect)
What does X-ray crystallographic data reveal about TFIIB binding to the TFIID complex/DNA?
TFIIB is a protein made from a single polypeptide that folds to form two similar domains (Note - TBP is a dimeric protein fused together)
TFIIB contacts DNA either side of the TATA box → conveys additional specificity as it recognises GC elements upstream and downstream of the TATA box called BRE – B recognition elements upstream
Evidence → structural and biochemical data
How do TFIIA and TFIIB interact with TBP?
How is TBP activity regulated within the cell?
TBP negatively auto regulates it’s accessibility to promoters through homodimerization, meaning…
- TBP monomers can dimers together forming a homodimer that does not bind to DNA in turn regulating activity levels → dimerisation defective yeast mutants show high levels of activator independent gene exrpession
- Furthermore, TBP in it’s monomeric state is more vulnerable to degradation
Hence, dimerisation prevents excessive levels of TBP binding/activity and also protects it from degradation in the cell (safe storage)
What is an example of a TBP antagonist?
Some transcription factors prevent formation of a productive transcription initiation complex
NC2 (Dr1-DRAP1), a negative cofactor, inhibits transcription initiation via direct interactions with the TBP-DNA binary complex
Function → prevents pervasive transcription (stops random transcription of genes) → streamlining the transcription machinery
What is the structure of NC2 and how does it prevent the transcriptional activity at the ?
NC2 is composed of small two subunits (alpha and beta) that are conserved among eukaryotes
NC2 bindings to TBP to form a stable complex → NC2 blocks binding and assembly of TFIIB (required for RNAP binding) in turn blocking RNAP binding → preventing initiation complex formation
What does the textbook description of RNAPII transcription initiation via a TATA motif?
Do most promoters have TATA boxes?
Bioinformatic analysis of metazoan genomes suggests that the prevalence of the TATA box has been overestimated in the past, and that the majority (>75%!) of core promoters do not contain TATA boxes
Many of the TATA-motifs even deviate form the consensus
What are examples of other promoter elements can be present in the promoter region?
>80% of RNAPII-transcribed genes lack TATA-boxes
There are further consensus motifs that may be present to various extents in different promoters:
Initiator Element (‘Inr’)
Downstream Promoter Element (‘DPE’)
Motif 10 Element (‘MTE’)
TCT motif (polypyrimidine initiator; in ribosomal protein gene promoters)
These other promoter elements are even less clear than the TATA box
What does promoter positional information reveal about TATA, Inr, DPE and Dref elements?
TATA and Inr → conserved dense peak at a specific position → conserved location
DPE and Dref (drosophila) → broader peak
Summary picture of the proximal promoter elements?
MTE – minus ten element
Review - Kadonaga, J.T. (2012). Perspectives on the RNA polymerase II core promoter. Wiley Interdiscip. Rev. Dev. Biol. 1, 40-51.
What roles do other proteins in the TFIID complex play?
TBP is responsible for the sequence-specific binding to TATA-boxes
Some of the other subunits of the TFIID complex, ‘TBP-Associated-Factors’ (‘TAFs’) recognise the additional sequence elements TAF2, TAF6, TAF9
TAFs making sequence specific contacts with MTE and DPE
Inr element binding is not fully understood
What are the two main types of transcription initiation patterns?
The terms “focused” and “dispersed” refer to opposite ends of a continuum of transcriptional patterns
Focused initiation, transcription starts from a single nucleotide or within a cluster of several nucleotides → associated with regulated genes (on/off)
Dispersed initiation, there are several weak transcription start sites over a broad region of about 50 to 100 nucleotides → housekeeping genes
Picture of the two different models of transcription initiation patterns?
How does chromatin structure influence the transcriptional state (stable & labile)
Nucleosome positioning/density influences the type of transcription
Labile → focused transcription → all the core elements are recognized
Stable → dispersed → requires gene-specific transcription factors to drive transcription
How does TBP poly-glutamine expansion change over evolution?
TBP Polyglutamine expansion takes place as move through the vertebrate evolutionary lineage → more complex organisms = longer Polyglutamine repeat tail
Evolution progresses → Polyglutamine expansion
What types of diseases are associated with the TBP polyglutamine tail?
What is cerebellar ataxia?
What does DNA analysis of the TBP polyglutamine tail reveal about the healthy and pathological forms?
CAG - codes for glutamine
Most of us have around 35-36 glutamine residues at the TBP N-terminus
Some people have a pathological amount → corresponding to 47-55 residues
The number of CAG repeats is inversely correlated with the age of onset for cerebella ataxia → Meaning higher repeat number corresponds to a younger age of onset
Under the microscope, what does cerebella ataxia pathology look like?
Main message - TBP aggregates accumulating in cells.
- TBP containing an expanded polyQ stretch is expressed at the same level as its normal counterpart and forms neuronal intra-nuclear inclusions containing other proteins involved in protein folding or degradation
- polyQ expansion reduces in vitro binding of TBP to DNA
- polyQ expansion causes abnormal interaction of TBP with the general transcription factor TFIIB
Note - SCA17 is another gene that is responsible for driving disease progression
What do mouse models reveal about polyglutamate expansion and cerebella ataxia pathology?
All WT mice survived
But mice with glutamine expansion had significantly reduced lifespan → correlated with the level of polyglutamine expansion
How does TBP binding differ between a healthy and pathological state (cerebella ataxia)?
What link was found between TBP and depression?
CAG repeat sizes in the TBP gene were investigated in two well-characterized Dutch cohorts → including 2165 depressed and 1058 non-depressed individuals aged 18–93 years
A CAG repeat length exceeding the median in both alleles was associated with an increased risk for lifetime depression → showing us that repeat polymorphisms act as complex genetic modifiers of depression
What are Super-core promoters?
Using knowledge on pre-existing promoters we are able to construct artificial ‘Super-core promoters’ that drive high levels of gene expression (higher than strongest natural promoters i.e. viral promoters)
Mediated by introduction of expression vectors → useful for a wide range of scientific and biotechnological applications
What does the in-vitro assay using the detergent Sarcosyl reveal about super-core promoters?
In vitro transcription reactions can be limited to a single round of transcription by adding the detergent Sarcosyl → Sarcosyl prevents further initiation but does not interfere with transcript elongation (1 round of initiation - no re-intiation)
The super core promoter is ‘stronger’ than the two other promoters because it supports a higher frequency of transcript initiation → ~40% of available DNA templates are used in the in vitro assay (this is very efficient!)
Likewise, Footprinting studies show that the super core promoter bind TFIID with higher affinity than other promoters
What does the construction of the super-core promoter reveal about the nature of promoter regions?
Shows us that all of the core promoter motifs contribute to the strong binding of TFIID to the super core promoter → meaning higher specificity of each TF to the promoter elements contributes to an overall stronger interaction (additive)
Example → Super core promoter (SCP1) compared to viral promoters (CMV and AdML) in a Luciferase assay
Do the number of RNAP’s differ between species? Do mitochondria and chloroplasts have their own RNAP?
Eukaryotes
RNAP1 → exclusively transcribe ribosomal RNA
RNAP2 → very diverse → mRNA product
RNAP 3 → more diverse than 1 but still not as diverse as RNAP1
Archaea - RNAP similar to RNAP 2
Bacterial - RNAP – most distinctive → divergent evolution
What does the following image highlight about the evolution of RNAPs?
When examining the different RNAP structures we can start to understand how closely related they are based on structural similarities
Takeaways…
LUCA – Last universal common ancestor + Color categorizes similar subunits
Bacteria RNAP branched off earlier → evolved separately relative to Archaea and Eukaryotes
Even though bacteria subunits evolved separately there are still several homologs (as shown by color)
Characteristics of Eukaryotic RNAPII?
Outline the 3D structure of yeast RNAPII?
What are the two main conformations for RNAPII?
RNAPII has a ‘open’ and ‘closed’ clamp conformation.
The conformation is found prior to DNA binding whereas the closed conformation correlates to the DNA bound state.
X-ray crystallographic data suggests that there is a highly localised hinge mechanism required for RNAPII activity.
What does the electrostatic potential of the RNAPII surface show us about it’s interaction with DNA?
Inside of RNAP is positively charged whereas outside is negatively charged → favours the bound state of RNAP to the negatively charged DNA
Outline how RNAPII catalyses the synthesis of a novel mRNA molecule?
- Downstream DNA enters RNAPII
- Inside the DNA molecule is unwound forming ssDNA, exposing the template strand for RNA synthesis
- Nucleotide triphosphate enter into RNAPII via the pore and binds to the complementary base present on the DNA template
- Mg2+ ion catalyses nucleophilic addition of the nucleotide to the growing RNA strand → results int the formation of a short (10B.p) DNA-RNA hybrid
- As RNAPII moves along, the nascent RNA strand dissociates and exists the protein complex + the template and non-coding strand bind back together
What happens to the angle of the DNA strand as it moves through the RNAPII complex?
DNA that goes in and out is bent by 90 degrees → Strand separation allows the bending to be energetically favorable (difficult if it were DS)
What is a problem with the conventional NTP funnel that feeds the nucleotides into the RNAP active site?
Problem with the classical funnel structure that leads up to the catalytic site → Pore is very narrow (1 nucleotide wide) → if the wrong nucleotide diffuses into the active site (¼ chance of being correct) it would have to diffuse back out before another nucleotide can move in → inefficient
What is the alternative model for NTP entry into the RNAPII active site?
Additional channel → wider & exposes 3 + 4 nucleotides at a time to incoming NTPs
What structure in RNAPII is responsible for the translocation of RNAPII across DNA?
Remember translocation is an active process that needs to be directed
Who’s responsible → Long alpha-helix - bridge helix → pushes RNA-DNA hybrid through the protein complex
Outline the bridge helix trigger loop translocation mechanism for RNAPII.
Bridge helix (green) bends (kinks), which pushes the DNA-RNA hybrid out of the catalytic site → subsequently, it returns to relaxed conformation
Trigger loop (blue also involved)
Further reading?
Are large quantities of abortive transcripts produced during transcription intiation?
The process of RNAPII breaking free from the initiation complex appears to be a difficult process → evident by the high levels of abortive (short) transcripts that are produced
1-5% of initiated transcript will end up as full length RNA but once RNA breaks free from the initiation complex the % of successful transcripts increases
What has FRET analysis revealed regarding the cause of the large number of abortive transcripts?
Initiation → DNA comes together as shown by FRET distance decrease (scrunching) which leads to the creation of DNA Loops. This creates tension in RNAP which results in two outcome which are that RNAP continues or aborts
Outline the domains in TFIIB and their importance in transcription initiation?
- The N-terminal Zn-ribbon domain of TFIIB interacts with the RNAPII (via the ‘dock domain’ on RNAPII)
- Helps to open up the DNA at the start site using the B-linker (N-terminal region of TFIIB enters the catalytic cleft of RNAP) → The DNA template strand slips into the cleft and is scanned for the transcription start site with the help of the ‘B-reader’ that approaches the active site (important for start site recognition and positioning in active site)
- Synthesis of the RNA chain and rewinding of upstream DNA displace the B-reader and B-linker, respectively, to trigger TFIIB release and elongation complex formation
Do we understand how the transition from the intuition to elongation complex takes place?
What id the CTD tail in RNAPII and why is it important?
CTD grows with evolution → longest stretch in humans
2x Proline in consensus – high density → bad news for secondary structures → meaning that it doesn’t form regular structure (flexible tail)
Function → Y, S and T contain free -OH groups → targets for Post-translational modifications –> phosphorylation, ubiquitination → provides an opportunity for additional regulation i.e. phosphorylation by TFIIH kinase
Is the CTD tail present in all RNAPs?
The CTD is only present in eukaryotic RNAPII, not in RNAPI, III, IV and V!
Prokaryotic (bacterial, archaeal) RNAPs also lack an equivalent structure
What phosphorylation states of the CTD tail are associated with initiation and elongation? What molecular processes is the phosphorylation of CTD associated with?
Degree of phosphorylation changes during the process of transcription → hypo to hyper
TFIIH is a versatile factor – helicase activity for DNA melting, kinase activity for phosphrylation of CTD)
How does DNA melting during transcription initiation take place?
Outline the structure of TFIIH.
Very large complex (multiple polypeptides) → uses ATP hydrolysis to unwind DNA start site
The TFIIH core forms a crescent-shaped complex spanning from Ssl2 to Rad3
Ssl2 binds downstream DNA consistent with its role in DNA opening → Ssl2 uses ATP hydrolysis to translocate on DNA. If the Ssl2 location is fixed, Ssl2 action results in a reeling of DNA into the active centre
What does a ribbon structure of the initiation complex look like?
Huge molecular machinery → provides a decent explanation to why there is a high degree of abortive transcripts → a lot going on
What other things are happening alongside transcription elongation that play an important role in mRNA formation?
RNAPII transcription and pre-mRNA processing are coordinated events (downstream processes are equally as complicated)
5’ Capping, splicing and 3’ poly-adenylation are occurring during transcription and RNAPII (and the CTD particularly) plays a role in the regulation of these events
Why is it important to understand the various experimental techniques for transcriptional assays?
To obtain insights into gene expression mechanisms a number of different technical approaches need to be applied → Biochemistry, molecular biology, structural methods, cell biological methods etc.
Every method has inherent limitations and artefacts, so a coherent picture can only be built up by looking for consistencies emerging from a number of techniques
What are eukaryotic In-vitro transcription assays?
This is when purified DNA templates are transcribed in the test tube with nuclear extracts and/or highly purified transcription factors
The ultimate goal is the re-construct regulated gene expression from recombinant (i.e.completely defined) transcription factors
But! This goal has up to now not been achieved with any eukaryotic system → For example, human RNAPII and TFIIH cannot currently be assembled from recombinant subunits in active form!
Hence, they need to be purified from eukaryotic cell line instead
What does the overall strategy for in-vitro transcription assays look like?
Basically you are trying to get all the components required for transcription into a test tube so that you can create an experimental system whereby you can manipulate variables (i.e. presence of specific TFs).
Hence, it would be ideal if you all the required proteins using a recombinant system → not that this had not been achieved with RNAPII and TFIIH (large complexes).
How were the different core TFs in the basal initiation complex named?
Nomenclature for names come come from the original experiments
Ion-exchange chromatography and increasing salt conc was used to remove different subunits from the phosphocellulose column → systematic names given based on the time they exited the column
Things to consider when performing in-vitro transcription assays?
What are the two most common ways of quantifying the level of transcripts produced from in-vitro transcription assays?
Requirement for techniques that allow quantitation of RNA originating from a specific region of the DNA template
Two major methods are used for quantitating in vitro transcripts: primer extension assay & ‘G-less Casette’ assay
What is the primer extension assay?
A radiolabelled DNA oligonucleotide primer is hybridized to the RNA and extended by reverse transcriptase
This will result in the appearance of a single-stranded, radiolabelled cDNA product of a specific size
Gel electrophoresis allows quantitation + allows identification of transcription start site (Reverse transcriptase will polymerise in the 5’ direction to the start of the DNA strand)
What is the G-less Cassette assay?
Theory
G-less cassette → template strand has all C removed (mutagenesis) → RNA will not have any G nucleotides
Hence, when we introduce RNase T1 which requires a G residue to cleave → all other RNAs that initiated at other sites will be removed (removing background transcription) → giving us an accurate idea of the desired transcript which can be quantified after running on a gel
What are the dis- & ad-vantages of using a in-vitro assays?
How are in-vivo transcription assays performed?
In-vivo transcription assays → carried out inside living cells (cell line, organism, organdie, etc.)
Basics → Transfer DNA of interest into cell → see whether it is transcribed by coupling our gene of interest to a reporter gene to tell us that the DNA is being transcribed (GFP, luciferase, etc. )
Luciferase → sensitive + good for quantification
Can be transient or stable assays
What is the difference between in-vivo transient and stable assays?
Transient vs stable assay
Transient → introduction of plasmid à resulting in rapid expression – quick
Stable → transfection with stable integration into the genome
Advantages and disadvantages of transient in-vivo transcription assays?
What is an example of a commercial plasmid used for transient transfection?
pGL3-Control Vector
How can these plasmid constructs be used for in in-vivo transcription assays?
Both cases we are examining the ability of a given inserted DNA sequence in promoting/enhancing levels of luciferase expression
Looking for promoter sequence (Left) → Insert DNA into restriction site → if the inserted contains a DNA promoter then elevated levels of Luc are expressed which can be visualised/quantified
Looking for enhancer sequence (Right) → No enhancer → we have a low basal level of expression but we can introduce our DNA and if expression increases, we know that we have an enhancer element
What do we need to perform stable transfection assays?
Early stages identical to transient transfection assay (transfection using plasmid), but plasmid must contain a dominant selectable marker → creates selective pressure for integration in the genome
Example: Drug resistance gene (G418 drug resistance)
Result - DNA becomes stably integrated into genome and is expressed in chromosomal environment
Note → process of stable integration takes much longer than transient transfection assay
Advantages and disadvantages of in-vivo transcription assays?
How could one use performing identification of control sequences in in-vivo systems?
Basically, incorporate gene (with reporter) into in-vivo system and examine the gene is expressed
If proper expression/regulation is obtained → methodically delete regions of the sequence and repeat
How could one use performing identification of control sequences in in-vivo systems?
Basically, incorporate gene (with reporter) into in-vivo system and examine the gene is expressed
If proper expression/regulation is obtained → methodically delete regions of the sequence and repeat
What experimental method is used to investigate the human transcriptome?
Transcriptome → collection of all the transcripts in a cell → use RNAseq
Counting frequency → indicates abundance in samples
RNAseq → can be used to detect transcript isoforms
Why is analysing the transcriptome of cancer useful? What problem are we faced with when trying to do so?
Compare cancers to normal tissue → provides insight to transcript dysregulation (over/under-expression of specific proteins)
Problem → cancer cells are not homogenous cell populations - meaning that healthy and cancerous tissues are mixed together
Consequence → When performing RNAseq we end up with a mixture of transcriptomes
Ideally we need to isolate the cancer tissue → Laser capture microdisection
What is Laser capture microdissection (LCM)?
Laser capture microdissection → dissect out cells based on specific characteristics – i.e. morphology
Using IR to dissect out the cancerous cells
What needs to be performed after laser capture microdissection when analysing the transcriptome of the selected cell?
How is mRNA amplification carried out?
What are the different technologies for performing transcriptomics and their relative throughput & capacity?
Number of Samples - Capacity
Number of genes queried - throughput
RNAseq → has a high capacity & throughput
What is qPCR with Taqman?
TaqMan is a quantitative PCR technique (qPCR) → It can be used to quantitate transcripts, if coupled with RT-PCR
- In the intact TaqMan probe, energy is transferred (via FRET) from the short-wavelength fluorophore on one end (green circle) to the long-wavelength fluorophore on the other end (red circle), quenching the short-wavelength fluorescence. After hybridization, the probe is susceptible to degradation by the endonuclease activity of a processing Taq polymerase.
- Upon degradation, FRET is interrupted, increasing the fluorescence from the short-wavelength fluorophore and decreasing the fluorescence from the long-wavelength fluorophore
- The rate at which fluorescence increases is directly correlated to the initial transcript level
Does transcription elongation impact levels of gene expression?
Initially, initiation was considered the driver behind differential gene expression but it turns out that transcript elongation has also an influence (pausing/arrest site)
What are the different ‘decisions’ that can be taken by RNAPII during transcription elongation?
Outline the role of the bridge helix in RNAPII translocation.
Nano-mechanical process – using bridge helix to push DNA-RNA hybrid along
What are the two main models that are used to explain RNAPs translocation during transcription elongation?
Powerstroke → rNTP hydrolysis drives movement
Brownian Rachet → kinetic energy from aq environment provides the energy for the kinking of the bridge helix (random kinetic input) → more passive
Explain both the powerstroke model and the brownian ratchet model using the attached diagram.
Possible to have a mixture of both → Not mutually exclusive
In the Brownian model RNAP is initially quite mobile until NTP moves in favoring forward movement → then allows for condensation + pyrophosphate release
Why do elongation rates differ between in-vivo and in-vitro systems?
Using the example of c-myc explain how elongation in different stages of the cell cycle influence it’s expression?
Basically…
- C-myc (oncogene) regulates 100’s of genes for cell proliferation
- Cell needs to regulate intracellular myc expression
- How? Make it hard to transcribe the gene - excessive levels can lead to cancer formation
What does the attached image show us (relation to C-myc)?
Pausing and arrest sequences in c-myc
P (pause) → does not need Efs to continue
P/A → arrest site → requires elongation factors to continue
A/T → Arrest + possible termination
Past the first 500 nucleotides the DNA transcript lacks the P and A sites → large initial hurdle
What basal factors are able to stimulate transcription elongation?
What do the large number of stalling events near the initiation site tell us?
In drosophila → stalled sites at the +25-50 site quickly after initiation sites → prevents other RNAPs latching onto DNA and initiating transcription (blockage) → mechanism to downregulate expression
What do large scale studies in Drosophila reveal about RNA polymerase stalling?
What is required to convert stalled RNAPs into actively elongating enzymes?
What two main roles do elongation factors perform to keep the transcription party going?
What is transcriptional arrest?
What does transcriptional arrest look like diagrammatically?
RNAP takes a couple steps back once arrested (slides back) → mechanism not fully understood
Consequence? → Some of the nascent RNA hangs out of the catalytic site (into the cone shaped channel) due to the backwards movement → the 3’ end moves away from the catalytic site (not in close contact with Mg2+) → impossible for the RNAP to add RNAPs