Structure and expression of eukaryotic gene - Professor Latchman Flashcards
At what level does gene regulation primarily occur?
At the level of transcription
What are untranslated regions of an mRNA
Segments of a gene that are transcribed into mRNA but are not translated into the final protein structure
How many RNA polymerases are there in prokaryotes and eukaryotes?
1 in prokaryotes and 3 in eukaryotes
Describe the 3 RNA polymerases in eukaryotes, what they transcribe and their sensitivity to alpha amantin
RNA polymerase I - rRNA (45S precursor of 28S, 18S and 5.8S), insensitive to alpha amanitin
RNA polymerase II - all protein coding genes, snRNAs U1,U2,U3, very sensitive to alpha amanitin
RNA polymerase III - tRNA, 5S rRNA, snRNA U6 etc, moderately sensitive to alpha amanitin
Describe how researchers found out which RNA polymerase transcribed the protein coding genes
Alpha Amanitin is lethal to life as it inhibits RNA polymerase, different RNA polymerases have differing sensitivities to the compound. Researchers took a cell extract and gradually added small amounts of Alpha Amanitin and looked at what type of transcription was prevented. They knew that RNA polymerase II was the most sensitive so when the first type of transcription they lost was of all the protein coding genes they new that RNA polymerase II transcribes these genes.
Why are the other RNA polymerases not regulated as much as RNA polymerase II?
Because RNA polymerase II is the RNA polymerase that transcribes protein coding genes, these genes need to be specific to each cell type. Other polymerases that transcribe rRNA or tRNA need to be present in all cells i the body and therefore are not regulated to the same degree.
What is the most highly conserved promoter in eukaryotic protein coding genes? describe it
The TATA box, it is a region around the -30 region of a gene and it has the bases TATA in that order. It is found in the vast majority of protein coding genes, this tells us that it is very important but it does not tell us in what way it is important.
Describe the Biochemical tests that were carried out on the TATA box and what conclusions were drawn from it
Researchers mutated the TATA box using site directed mutagenesis and they made the sequence read GCGC. They found that this mutation in the TATA box prevented transcription from occurring. Next they moved the wild-type TATA box down upstream form the -30 region to the -60 region. They found that transcription did occur but the start point of transcription had moved to the -30 position. They drew two conclusions, The TATA box not only controls the fact that transcription can occur, it also controls where transcription starts.
Describe the process by which the TATA box leads to transcription
TFIID associated with TFIIA recognises the TATA box and TFIID binds to it. TFIIA facilitates the binding. TFIIB recognises the TFIID bound to the TATA box and binds to TFIID. RNA polymerase II is in complex with TFIIF and RNA polymerase recognises the complex and binds. TFIIH and TFIIE bind to the RNA polymerase. TFIIH is a kinase that phosphorylates the RNA polymerase and this allows transcription to start. The RNA polymerase in complex with TFIIF shoots off from the basal transcription complex and starts transcribing the gene. TFIIH phosphorylates the RNA polymerase at the C-terminal domain of the largest subunit at a serine molecule in a highly conserved region. TFIIE and TFIIH dissociate, TFIIA and TFIID are left on the TATA box, this allows the cascade to happen again without another DNA recognition event.
What is the Basal transcription complex?
The full complex that is made during the cascade. This complex allows transcription to occur at a certain rate and this is known as the basal rate of transcription.
What are upstream promoter sequences?
These are sequences that are upstream of the promoter that increase the rate of transcription by interacting with the basal transcription complex and the TATA box.
Two examples;
GC rich sequences
CAAT sequence - also termed the CAT box
What binds to the GC rich sequence?
SP1, stands for species one
What binds to the CAAT box?
CTF (CAT transcription factor)
Are GC rich sequences and CAT boxes rare or common?
These upstream promoter sequences are very common, they are both found in the majority of protein coding genes. They are usually found on housekeeping genes that all cells need to transcribe as all cells create SP1 and CTF, so by having a GC rich sequence or a CAT box these two can bind and increase the rate of transcription. A gene that is only transcribed in a specific cell type would not have these sequences as the cell will probably make SP1 and CTF so if the gene had these sequences it will be turned on in all the cells that don’t need it.
How do SV40 virus genes get transcribed in Human cells?
They have promoter sequences that are recognised by the host genome. SV40 has a TATA box and a GC rich sequence that SP1 can bind to