Eukaryotic gene transcription Flashcards
What are 6 major differences from prokaryotic transcription?
- 3 types of RNA pol instead of 1
- mRNA undergoes processing
- compartmentation of transcription and translation = slower gene expression
- more complex regulation
- presence of introns
- DNA is packaged into chromatin
what is the relationship between the DNA sequence and RNA transcript?
RNA will look like non transcript strand aka complementary to the template strand except it will have U instead of T
how do Eukaryotic RNA polymerases differ from DNA polymerases?
- do not need a primer to initiate nucleic acid synthesis
- use nucleotide triphosphates as substrates (NTPs vs dNTPs, UTP instead of dTTP)
- Generally slower in catalytic rate
NTPs vs dNTPs
- ribonucleotide is converted to deoxyribonucleotide by nucleotide reductase
- ribonucleotide has 2 OH groups, deoxyribonucleotide has 1 OH
- nucleoside is composed of a base and sugar without phosphates
- nucleotides have at least one phosphate
- nucleotide triphosphate has 3 phosphates
DNA replication vs RNA transcription
Replication: synthesis of both strands, helicase pulls template strands apart, no rewinding after replication fork passes, requires primer
Transcription: synthesis of one strand, RNA pol moves to find initiation site and unwinds DNA, re-winding occurs as the bubble passes, no primer is needed
conservation between prokaryotic and eukaryotic RNA polymerases
both have highly conserved cleft domain, diversity occurs on exterior regions
describe the 3 RNA pol in eukaryotes
RNA pol I transcribes rRNA
RNA pol III transcribes tRNA, 5S rRNA, and other small RNAs
RNA pol II transcribes mRNA - CTD terminal tail is important
how do things inhibit transcription?
- bind DNA at transcription initiation complex site which prevents elongation of RNA chain by RNA pol II
- inhibitors can serve as protection mechanisms such as actinomycin D is produced by streptomyces strains
How big are RNA polymerases?
- consist of multiple (11-15) subunits
What is a requirement for eukaryotic RNA polymerases?
they can’t identify promoters on their own, so they need other proteins to bind to the promoters first to form a docking site
5 important facts about RNA pol I
- transcribes only one type of gene: rRNA gene (but NOT 5S rRNA)
- accounts for 50% of transcriptional activity
- rRNA gene is transcribed and processed into three RNA products: 18S, 5.8S, and 28S RNA
- Organization of the rRNA gene is conserved across different species
- there are 200 copies of the rRNA gene in most genomes arranged in tandem repeats
Structure of eukaryotic ribosomes
- made up of large and small subunit (which are made separately)
- 40S and 60S make 80S (not additive)
characteristics of human genomic rRNA variants
- variant rRNA alleles give rise to physically and functionally heterogenous ribosomes that contribute to pathology and human disease
- mutations can alter how quickly ribosomes make proteins or can make ribosomes work better
5 basic parts of rRNA gene transcription and processing
- transcriptional units occur in tandem (number of repeats varies)
- Only one strand is template/transcribed, the other is non-template
- Transcript is produced that still contains spacers
- Spacers are removed (akin to intron splicing)
- Mature rRNA molecules are generated (18S, 5.8S, and 28S)
where does transcription of rRNA gene take place?
- in the nucleolus (inside nucleus)
- each small nucleolar ribonuclear protein (snoRP) complex includes enzymes that modify nucleotides, ribosomal proteins, snoRPs, and exo/endo-ribonucleases
what are the 4 steps of RNA modification/folding?
- 90S pre-ribosome is modified by methylation or pseudouridine formation
- initial cleavage of pre-rRNA into pre-40S and pre-60S
- additional cleavages and export from nucleus to cytoplasm
- final maturation occurs to produce 40S and 60S rRNAs
What is the role of promoters (4 points)?
- define where transcription starts
- determine direction of transcription
- determine rate of transcription
- determine regulatory properties of transcription of the particular gene
What does frequency of initiation depend on?
strength of the promoter
- if weak, sends a few pols spaced out
- if strong, sends many pols close together
highly transcribed = strong promoter
3 steps of assembly of the Pol 1 Preinitiation complex (PIC)
- upstream binding factor (UBF) binds upstream control elements (UCE) and core element within rDNA promoter
- Stably bound UBF recruits SL-1 initiation factor which is composed of many TBP associated factors (TAFs)
- Stable UBF-SL-1 complex recruits initiation competent RNA pol I in an interaction mediated by RRN3
How does positioning of elements affect rate of PIC assembly?
if way upstream, will be slower
if closer to start, will be faster
what is the purpose of rRNA gene promoter-looping
compensates for the large distances between mammalian upstream control elements and core elements
- if UCE and core elements are too far apart to form a dimer, looping the DNA in between them brings them close enough to form dimer
What is SL1
- a multi-subunit protein
- consists of TBP (TATA binding protein) and TAFs (TBP associated factors)
- TBP + TAF complexes are used by all 3 eukaryotic RNA pols, differing only in type of TAFs and number included
how are RNA pol I molecules continuously engaged in transcription
- linking termination and re-initiation on tandem gene repeats
- 2 mechanisms depending on species
- xenopus (frog): RNA pol I terminates only 180 bps away from the next promoter, next promoter “grabs” RNA pol I and re-engages it in transcription of next tandem gene. Tandem array facilitates active re-engagement of RNA pol I
- mammals: long spacer region can be up to 30,000 bp in length, so looping spacer regions brings termination site and next promoter into close proximity and the next promoter can grab RNA pol I
What is the function of RNA pol III?
transcribe genes coding for all tRNAs, 5S rRNA, many snRNAs through polygenic sequence
What are the 3 types of RNA pol III PICs?
Type I: (5S rRNA) promoter occurs within gene
Type II: (tRNA) promoter occurs within gene
Type III: (U6 RNA component for splicing) promoter occurs upstream of gene
what are the 3 promoters recognized by RNA pol III?
- tRNA genes (transfer RNA; protein translation)
- 5S rRNA genes (ribosome biogenesis)
- U6 snRNA genes (intron splicing)
structure of tRNA gene promoter
- internal promoter downstream of the TSS
- consists of boxes A and B
- ordered sequential binding to form docking site
- TFIIIC binds boxes A and B within transcript (not within promoter) then recruits TFIIIB (made up of TBP and 2 TAFs: BDP1 and BRF1) which recruits RNA pol III
- no TATA box
Structure of 5S rRNA gene promoter
- contains internal promoter site downstream of the TSS consisting of boxes A and C
- TFIIIA binds box C, which recruits TFIIIC at box A
- leads to recruitment of TFIIIB and pol III
- no TATA box
structure of U6 snRNA gene promoter
- contains canonical promoter site upstream of the TSS consisting of TATA box and proximal sequence elements (PSEs)
- TBP binds TATA box while snRNA-activating protein complex (SNAPC) binds PSEs which recruits dfifferent form of TFIIIB (With BRF2 instead of BRF1) to transcribe U6 snRNA
what is the function of RNA pol II?
- responsible for transcribing all protein-coding genes and many microRNA (miRNA) genes
why is pol II most complicated?
- transcribes a different transcriptome in each cell in order to have cellular differentiation
- mRNA product also undergoes significant processing in nucleus before being transported to cytosol and translated into protein
What are the 4 steps of the eukaryotic transcription cycle?
- start with DNA and RNA pol
- PIC is built in DNA to start initiation
- RNA pol II binds and begins transcription until it reaches termination site
- RNA is released and pol is released to start a new cycle
6 steps of PIC assembly on a TATA box containing promoter
- TBP (found in TFIIB) is recruited to TATA box
- TFIIA gets recruited
- TFIIB gets recruited (rate limiting event)
- RNA pol II is recruited with TFIIF
- The unphosphorylated CTD tail on pol II is incorporated into PIC
- TFIIE gets recruited (rate limiting event) and TFIIH gets recruited and phosphorylates the CTD tail which releases it from the PIC
what does TBP do when bound to TATA box?
- binds within minor groove which bends DNA and causes it to open up, allowing access to the other factors
- brings proteins bound within the promoter into closer proximity
What does recruiting TFIIB do to DNA?
- causes zig zag distortion of the DNA helix
forms a pocket in TFIIA
How is TBP part of the TFIID complex?
- TFIID binds to TATA box through TBP subunit
- TBP binds to the minor groove of DNA
3 major roles of TAFs (TBP associated factor)
- activation (histone acetyltransferase (HAT) is associated with gene transcription)
- TFIID tethering to non-TATA promoters
- Promoter security
What are the two general types of promoters recognized by RNA pol II?
Type 1 (25%): contain TATA boxes at position -25
Type 2 (75%): do not contain TATA box
Characteristics of type 1 RNA pol II promoters
- contain TATA box at -25
- have initiator sequence around TSS
- tend to be highly expressed genes
characteristics of Type 2 RNA pol II promoters
- do not contain TATA box
- tend to be housekeeping genes
- subtype A contains initiator sequence around the transcription start site YYANA/TTY, and a downstream sequence
- subtype B has no initiator sequence but has a GC rich promoter with no distinct transcription start site
How does TBP interact with a TATA-less promoter?
- TATA-less promoter with initiator motif (Inr) and downstream promoter element (DPE): TFIID uses other factors to bind to Inr and DPE, TBP is recruited with the complex
- TATA-less promoter with GC boxes: TFIID uses Sp1 to recognize GC sequence, TBP is recruited with the complex
What is a unique feature of the large subunit of RNA pol II that is critical to gene expression and regulation?
- carboxy terminal domain (CTD)
- consists of repeating amino acid sequence forming heptad repeats:
(Tyr/Ser/Pro/Thr/Ser/Pro/Ser)n aka (Y/S/P/T/S/P/S)n - humans n= 52; drosophilia n = 44; yeast n = 26
- Tyr/ser/thr can be phosphorylated at various times and place
- proline provides bends and kinks to polypeptide chain
what is important about heptad repeats?
- they are all similar and highly conserved, but not identical
- any protein that interacts with DNA will be conserved from yeast to humans (even though the CTD doesn’t directly interact with DNA, it interacts with transcription factors that do bind DNA and are highly conserved, so CTD will be conserved as a consequence)
- different heptad repeats serve distinct functions (N-terminal half supports only capping of mRNA; C-terminal half supports capping, splicing, 3’ end processing of mRNA)
what is function of CTD?
- binding scaffold for proteins involved in mRNA processing
- strategically located next to exit channel for mRNA
- when phosphorylated it links stage of transcription with RNA processing, elongation, and termination
- phosphorylation affects recruitment of chromatin re-modifying enzymes
What are 2 possible modifications of CTD?
- Tyr and Ser all have OH groups than can be modified by phosphorylation which changes structure and function of amino acids (on/off)
- Isomerases change isomer of amino acids - depending on cis or trans conformation they will have different functions
4 important facts about the CTD
- mutant RNA pol II that lack CTD transcribe genes at a slow but unregulated rate and cannot be activated
- CTD is highly de/phosphorylatable (only unphosphorylated form can enter PIC, phosphorylated form transcribes the gene)
- unphosphorylated form physically binds TBP, keeps RNA pol II locked at promoter
- phosphorylation pattern of CTD can differ from gene to gene and changes as RNA pol II proceeds down the gene
what 3 things known about dynamic CTD phosphorylation pattern?
- Ser5 phosphorylation predominates near beginning of genes
- Ser2 phosphorylation predominates near end of genes
- Tyr1 phosphorylation increases downstream of initiation site, decreases before poly-A site, impairs recruitment of termination factors, and recruits elongation factors
Principle of CTD patterns
- modifications both trigger and block recruitment to generate CTD code
- explains transcription cycle coordination based on differential phosphorylation of Tyr1/Ser2/Ser5
- different motifs are recruited in different proteins
What are possibilities of CTD code?
- S2 phosphorylated first (and only)
- S5 phosphorylated alone
- S2 and S5 phosphorylated
- since CTD can have up to 52 heptads there are many different combos/codes
Structure and function of TFIIE and TFIIH
- TFIIH consists of five subunits, one has helicase activity which helps melt DNA and allow formation of transcription bubble, one has kinase activity and can phosphorylate the CTD of RNA pol II at Ser 5
- TFIIE stimulates the kinase and helicase activities of TFIIH
What is the trigger hypothesis?
- RNA pol II CTD wraps around the the entire complex - assembled but not active
- TFIIH kinase domain phosphorylates the CTD tail
- CTD tail releases the entire complex and is actively transcribing
TBP is important to create a bend in DNA that allowsTFIIB to bind PIC which causes another bend that brings in TFIIA
How do transcription factors mediate pol II transcription?
- long distance effects of enhancers occur by DNA bending
- silencers repress transcription
- insulators restrict transcription to a single loop in a chromosome
What is Mediator?
multi-subunit complex recruited to PIC that links enhancers with the PIC
what two things do most transcription factors have?
DNA binding domain/motif (DBD) and transcriptional activation motif (ACT)
How does the mediator work?
- links an activator protein (bound to enhancer sequence) to RNA pol II
- can link multiple transcription factors at once by looping the DNA, effect is the same gene can be transcribed in different ways depending on which TFs bind to the mediator
- different subunits of Mediator function in specific biologic processes, so if missing subunits, it will not function in certain processes
- has many subunits (might not all be used all the time) so that in cases of stress where an immediate response is needed, they can start right away instead of waiting for the TFs to get recruited
important notes on mediator
- enhances recruitment of RNA pol II and other initiation factors
- acts as a bridge between DNA-bound TFs and PIC
- can enhance basal transcription in absence of bound activators
- individual components can have gene-specific and tissue specific functions
- several forms exist that can activate or repress transcription depending on which factors are present in mediator
what are characteristics of transcription factors?
- can be activators or repressors
- directly bind to specific DNA sequence or response element
- specific TFs have at least 2 functional domains (DNA-binding domain DBD; Transcription activating domain ACT) that are independently folded and often function separately which often bind DNA as dimers
what are the three classes of transcription activating domains?
- Acid domains (regions rich in Glu and Asp)
- Glutamine-rich domains
- Proline-rich romains
specific transcription factor: cAMP response element binding protein (CREB)
- found in genes whose transcription is induced by cAMP
- common conserved sequence in promoters called cAMP response element (CRE): 5’ TGACGTCA 3’
1. when glucose levels drop, induces release of glucagon which increases cAMP
2. cAMP activates protein kinase A (PKA)
3. PKA finds and phosphorylates CREB which is bound to CRE in nucleus
4. Phosphorylated CREB recruits CBP
5. CBP increases rate of transcription of genes required to make glucose
specific transcription factor: SREBP - regulation of transcription by cholesterol
- cholesterol is necessary for membrane synthesis and is precursor for steroid hormones/bile acids
- decreased cholesterol causes genes whose protein products increase cholesterol to be turned on
- these genes all possess a sterol response element (SRE) in their promoters, the protein that binds to this sequence is SREBP
-when cholesterol is low, SREBP is transported to the golgi where it is processed by two proteases into active form which turns on target genes
- when cholesterol is high, SREBP is kept in the ER, not processed, and does not activate genes
- SREBP cleavage activating domain (SCAP) is a sterol sensor that escorts SREBP to the golgi
specific transcription factor: steroid hormones
- cortisone reduces inflammation by altering transcription
1. TNFα induces inflammation, causes binding of NFₖB to IₖB
2. IₖB is phosphorylated and degrades which frees NFₖB to move to nucleus
3. NFₖB binds target gene promoters that release cytokines
4. Cytokines drive inflammation and trigger the release of cortisone
5. cortisone interacts with glucocorticoid receptor to be able to bind to IₖB gene promoter which increases transcription of IₖB
6. Increased IₖB binds to NFₖB which inhibits NFₖB from acting