Transcription in eukaryotes Flashcards
How does transcription termination occur in eukaryotes
Tissue specific control
- So, if we take him as are eukaryote
- He will have red blood cells which will make loads of beta globin
- he will have muscle cells that produce actin and myosin
- He will also have neurons which will make neuropeptides
- But essentially these cells contain all the same DNA
- They contain the exact same DNA content there is no difference in these cells in terms of DNA, yet these cells are very different both structurally and functionally
Why are cells genetically the same but different structurally and functionally?
- Make sure right genes are switched on in the right cells at the right time
- But essentially these cells contain all the same DNA
- They contain the exact same DNA content there is no difference in these cells in terms of DNA, yet these cells are very different both structurally and functionally
- Neurons
- Cells differentiate due to transcription of different genes e.g. actin in muscle
- Eukaryotes need to make sure right genes are transcribed in right cells known as tissue specific control – the ability to switch on genes in right cell type
Transcription control is exerted at 4 main levels, what are these?
- Binding of RNA polymerase: promoters and transcription factors
- Long range control: locus control regions
- Chromatin remodelling: histones and histone deacetylases
- DNA methylation: CpG islands and imprinting
When do promoters differ?
What does this help with?
Promoters for different genes are different
Each contain a combination of sites to which specific protein factors bind
All of these factors help RNA polymerase to bind in the correct place and to initiate transcription
The eukaryotic system is complex, what are the types polymerase and what genes do they transcribe?
Transcription still involves RNA polymerase
In eukaryotes however there is not just one, there are three:
- 1 transcribes the ribosomal RNAs
- 2 the mRNAs
- 3 the tRNAs
All genes that are transcribed and expressed via mRNA are transcribed by RNA polymerase II. Has a Zn binding site for DNA to bind to, has 12 subunits instead of 5
Tell me about the structure of eukaryotic RNA polymerase II
- Similar structure to Bacterial Polymerase
- Larger - 12 subunits instead of 5
- Unlike Bacterial Polymerase, it cannot initiate transcription - no sigma factor
- Requires many transcription factors
- Has to deal with DNA packed in nucleosomes
What do transcriptional activators help attract?
RNA polymerase II to the promoter, which helps to regulate rate and tissue specificity of gene expression
Other proteins control unwinding of chromatin to allow access for transcription factors
How do proteins control gene expression?
- They do this by binding to DNA
- In the major groove of the double helix
- The same in both prokaryotes and eukaryotes
Structure of eukaryotic promotors (recognised by RNA pol II)
- The promoter of eukaryote looks like this
- Divide into 3 parts
- Core promoter followed by upstream sequence element and an enhancer
- Start by talking about core promoter region
- This is where transcription begins
- Start point arrow like bac plus 1 and usually A like bac
- Region around state highly conserved and usually pyrimidine rich
- The TATA Box located approximately 25 bp upstream of the start-point of transcription is found in many promoters. The consensus sequence of this element is TATAAAA (so it resembles the TATAAT sequence of the prokaryotic -10 region but please do not mix them up). The TATA box appears to be more important for selecting the start-point of transcription (i.e. positioning the enzyme) than for defining the promoter.
- The Initiator is a sequence that is found in many promoters and defines the start point of transcription.
- The GC box is a common element in eukaryotic class II promoters. Its consensus sequence is GGGCGG. It may be present in one or more copies which can be located between 40 and 100 bp upstream of the start point of transcription. The transcription factor Sp1 binds to the GC box.
- The CAAT box - consensus sequence CCAAT - is also often found between 40 and 100 bp upstream of the start point of transcription. The transcription factor CTF or NF1 binds to the CAAT box.
- In addition to the above elements, Enhancers may be required for full expression. These elements are not part of the promoter per se. They can be located upstream or downstream of the promoter and may be quite far away from it. The mechanism by which they work is not known. They may provide an entry point for RNA polymerase or they may bind other proteins that assist RNA polymerase to bind to the promoter region.
Tell me about Core promotor TATA box
- General transcription factors for RNA Pol II (TFII)
- Position RNA Pol II, separate DNA - initiation
- Release RNA Pol II from promoter – elongation
- Needed for all genes
TFII: transcription factor for RNA pol II
What happens with the core promotor TATA box first?
What happens with the core promotor TATA box second/next?
Tell me about the structure of the PIC?
- Pre- initiation complex (PIC) is assembled
- Elongation
- TFIIH
- 9 subunits: ATPase, Helicase, Protein kinase
TFII and elongation
Tell me about elongation and the TFIIH central?
- C-Terminal domain (CTD) phosphorylated
- Conformation change – tightens grip
- General TFs dissociate
- Acquires new proteins – including elongation factors that help process the RNA and increase elongation rate
Formation of RNA polymerase II pre-initiation complex
The core promoter- TATA less promoter has what?
Where are these located?
- Have an INR (initiator) and DPE
- DPE is a downstream promoter element
- Located +28 to +32 (3’ relative to the start site)
- DPE have the sequence AGAC
- Recognised by TFII I
Structure of eukaryotic promotors (recognised by RNA pol II)
- The binding of PIC is however week just like in bac activators
- It needs other proteins to help it stabile bind
- Upstream bund interacts with PIC and stabilise interaction just like in bacteria
- These sequences are known as use
- The GC box is a common element in eukaryotic class II promoters. Its consensus sequence is GGGCGG. It may be present in one or more copies which can be located between 40 and 100 bp upstream of the start point of transcription. The transcription factor Sp1 binds to the GC box.
- The CAAT box - consensus sequence CCAAT - is also often found between 40 and 100 bp upstream of the start point of transcription. The transcription factor CTF or NF1 binds to the CAAT box.
- In addition to the above elements, Enhancers may be required for full expression. These elements are not part of the promoter per se. They can be located upstream or downstream of the promoter and may be quite far away from it. The mechanism by which they work is not known. They may provide an entry point for RNA polymerase or they may bind other proteins that assist RNA polymerase to bind to the promoter region.
How do Upstream sequence elements affect transcription?
Transcription can be enhanced by the binding of transcription factors to sites upstream of the PIC
Tell me about upstream sequence elements- growth hormone deficiency?
- Growth hormone (GH) is required for normal growth
- GH deficiency results in reduced growth - 1 in 5000 infants
- In 1990, deficiency was found to be due to a mutation in Pit-1 transcription factor
Tell me some and also provide an explanation about some upstream sequence motifs
1. Motifs bound by general transcription factors
e.g. the general TF, Sp1 binds to GGGCGG
Sp1 is found in all cell types
2. Motifs that confer tissue specific expression
e.g., MyoD binds to CANNTG (N=any base)
MyoD is a muscle-specific transcription factor
Note all cells have CANNTG but only tissue specific cells have the MyoD TF expressed.
3. Motifs that confer response to particular stimuli
e.g., Oestrogen receptor binds to AGGTCANNNTGACCT
We now know a lot about tfs and sequences they bind to
What we now know is that there are sequences or motifs that are bound by general tfs
For example, the tf sp1 which binds the sequence GGGCCGG, sp1 is a tf found in all cell types
The gene needs GGG at its 5 prime end and sp1 binds here to ensure a high level of transcription
Myod only in muscle no other cell types
Gene has sequence more rna produced if estrogen is present
Typically, rna pol 2 will have 1 or 2 of these sequences in the first 100 bp up stream of transcription start site - general or cell specific or respond to stimuli
Sequence determines wot tfs bind and when and wot cell types gene switched on and also in responseto hormones etc.
What are enhancers?
Regulatory sequences that act at a distance
How do enhancers work?
Where were they first discovered?
- Transcriptional activators bind
- Help RNA Pol II bind
Discovering first enhancer:
- Simian Virus 40 (SV40) – promoter
- found that the deletion of a 72 bp sequence led to a 100-fold
- decrease in expression
- The first enhancer was found in a virus
- If you take away a sp1 site, you get a 2-fold decrease in expression whereas this is a massive decrease – a major effect
- Hence, they called it an enhancer
Properties of enhancers elements
- They can activate transcription when placed thousands of bp away from the TATA box
- They act in either orientation
- Can act when placed upstream or downstream of the TATA box, or when placed within an intron
Mechanism of enhancers action
- Enhancers are sequences of DNA to which a large number of transcription factors bin
- 8 proteins and 6 different subunits. Bind to different region in the 72 BP sequence