gene expression + chromatin structure Flashcards
why do we regulate gene expression?
so different cells can express different genes and perform different functions
differentiation depends on gene expression changes
what are the 5 ways in which we can regulate gene expression? …… TBC?
- transcriptional control
- RNA processing control and mRNA degradation control
- RNA transport control
- translational control
- chromatin structure (eukaryotes)
why does it make the most sense to regulate transcription to regulate gene expression?
it is the top of the regulatory hierarchy
you don’t make mRNA unless you’re going to make protein - a waste of energy if you regulate after this point
reduces superfluous intermediates
how is the eukaryotic gene organised?
has a regulatory and transcribed region, with the coding region (that is translated) appearing within the transcribed region
what is contained in the regulatory region of the eukaryotic gene?
enhancers + promoters (TATA box)
simple switches + complicated micropressors
what is contained in the transcribed region of the eukaryotic gene?
transcription start site (TSS)
5’ UTR
the coding region:
[exons
introns
the stop codon for translation]
3’ UTR
cleavage + polyA signal
what is the promoter?
the TATA box
where RNA polymerase makes contact with DNA
what do enhancers do?
regulate what is going on at the promoter
what are transcription factors?
proteins that bind to regulatory DNA sequences at specific recognition sequences + distort DNA structure
they make it easier/harder for the RNA Pol complex to bind + transcribe RNA
what does ‘reading’ a DNA sequence mean?
DNA binding proteins have a 3D architecture that recognises the specific 3D structure of the the outside of the DNA strand
DNA bps form specific interactions with nucleotides on major groove of B-DNA strand
what are the key DNA binding protein domains?
helix-turn-helix motif
zinc finger
leucine zipper
what is a helix-turn-helix motif?
2 helices held at a fixed angle
COO- helix = recognition helix: fits into the major groove of DNA
amino acid side chains recognise specific DNA binding sequences
other helix = residues contact the DNA backbone
what is a zinc finger?
what is a leucine zipper?
how do we measure DNA-protein interactions to ensure DNA binding proteins have bound?
electrophoretic mobility shift assay (EMSA)
gel electrophoresis followed by autoradiography
why are genes switched on/off in prokaryotic organisms?
its how bacteria adapt to the changes in composition of their growth medium
what is an example of a repressor in bacteria?
tryptophan repressor
(h-t-h motif)
used when tryptophan is available in the medium as there is then no point in making it in the cell
how is the code for making tryptophan in E coli encoded?
trp operon is a polycistronic mRNA molecule
coordinately controlled + on unless repressed
5 genes are needed to produce 3 enzymes for the 3 step synthesis
5 genes close together on bacterial chromosome, under control of 1 promoter that produces one massive mRNA that produces all 5 enzymes when translated
operator sits between promoter and 5 genes: controls whether RNA polymerase can bind or not
gene is normally on but tryptophan repressor can block access : genes not transcribed + enzymes not produced on
what gene controls production of tryp repressor?
trpR
tryp repressor produced frequently in inactive state, activated by tryptophan itself binding (indicating it is present in high enough levels int he cell) – repressor activation is concentration-dependent
activated tryp repressor can bind to operator to inhibit further production of tryptophan
what is an example of a gene that is typically off ?
lac - the lactose-processing gene - encoding beta-galactosidase
typically a repressor bound to the operator that controls lac genes
how do you activate the lac genes?
allolactose binds to lac repressor (that is bound to operator) and causes it to conformationally change and unbind from DNA, allowing for transcription + higher levels of lactose metabolism
what is negative gene regulation?
signalling molecules do not interact directly with DNA
they interact with repressor proteins
what is positive gene regulation?
a signalling molecule binds with an activator to generate a complex that interacts with DNA
directly stimulates gene expression by increasing affinity of RNA polymerase for the promoter
what is constitutive vs regulated expression?
C = expressed all the time
R = expressed under a particular set of physiological conditions
why does E. coli need to be able to metabolise lactose?
mainly rely on glucose as source of carbon and energy but can use lactose if this is scarce
what is the action of beta-galactosidase?
hydrolyses lactose into galactose and glucose
what 2 other proteins are synthesised together with beta-galactosidase? what does this tell us?
galactoside permease + thiogalactoside transacetylase
expression levels of a set of enzymes that all contribute to the adaptation to a given change in the environment, change together
coordinated unit = operon
what is the structure of a prokaryotic operon?
regulator gene : encodes regulatory (repressor) protein
operator site: regulatory DNA sequence
set of structural genes
what binds to the operator site in prokaryotes to prevent transcription of the structural genes?
the repressor protein coded for earlier in the operon
what is the structure of the lac repressor?
tetramer of 37kDa subunits
2 pairs of subunits come together to form the DNA binding unit
bound tightly to operator in absence of lactose
how does the lac repressor block transcription?
binds to operator site which is adjacent to and downstream of promoter site, blocking the progress of RNA polymerase
where does RNA polymerase bind to an operon?
promoter
how does lac repressor locate operator site?
it binds 4 x10^6 times as strongly to the operator DNA as to random sites in the genome
rate constant is very high, repressor finds operator by diffusing along DNA molecule (1D search)
when 1 DNA-binding unit of lac repressor binds to operator site, the other unit can bind to 1 of a few sites with similar sequences and the DNA in between forms a loop
what is allolactose?
galactose and glucose bound with an a-1,6 linkage
the inducer of the lac operon
a side product of the beta-galactosidase reaction
how does the inducer allolactose work?
binds to lac repressor and greatly reduces repressor’s affinity for the operator
binding leads to 2 subtle conformational changes so 2 DNA binding domains can no longer contact DNA at same time
what is a corepressor?
a small molecule that binds a repressor (such as pur), allowing it to bind DNA specifically, blocking transcription
what is catabolite repression?
when the presence of an abundant energy source represses the production of enzymes used for metabolising other sugars
what does glucose do to levels of cAMP in E. coli? what does it mean for other enzymes when glucose is removed?
it lowers it
allows transcription of other catabolic enzymes by acting through the identical-subunit-dimer: catabolite activator protein (CAP)
where does CAP bind in the lac operon?
an inverted repeat at approx -61 from TSS
bends DNA in a manner favouring interactions with RNA pol
what activates CAP?
cAMP
how does CAP-cAMP complex increase rate of transcription?
creates energetically favourable contacts between CAP and RNA pol that increase likelihood of transcription being initiated at CAP-bound sites
increases 50x
outline the mechanism of turning on and off the lac operon, depending on both mechanisms
requires there to be no glucose in the cell and also requires the presence of lactose
allolactose causes lac repressor to unbind from DNA
low glucose causes high cAMP -> CAP activated -> higher levels of transcription of lac genes
what is a general transcription factor?
assembles the promoter machinery (basal transcriptional machinery)
what is a regulatory transcription factor?
acts at the enhancer to slow down/speed up transcription
(activator protein)
what is the key aim of promoter machinery assembly?
binding of TATA box binding protein (TBBP) to TATA box
what is the mechanism of promoter machinery assembly?
- TBBP subunit of TFIID binds the TATA box
- TFIIB joins complex + assembles at promoter
- TFIIF brings RNA polymerase II into complex
- TFIIE and TFIIH join complex
- TFIIH has kinase activity and phosphorylates C terminal tail (CTD) of RNA polymerase II
- RNA polymerase II is released from tight binding of DNA so it can move along to gene to catalyse transcription
(all general TFs)
what needs to happen after promoter machinery assembly?
regulatory TFs need to bind to enhancers to cause transcription
where are enhancers and how do they work?
can be up to 25 kB up or downstream
reg TFs bind to them and the intervening DNA loops out so the TF and enhancer combo can directly communicate with the basal transcriptional machinery at the promoter
favour or discourage transcription
what are the necessary things that have to occur for regulators to have their effect?
promoter bound to basal machinery
enhancer bound to TF
what is the structure of a transcription factor?
TFs have modular structure:
a DNA binding domain (h-t-h)
activation/repression domain (acidic, glutamate/proline rich)
which organisms tend to use enhancer sequences?
eukaryotes
how does communication between basal machinery and enhancer activate the BTM?
what is the structure of a nucleosome, assuming structure of a histone is known?
repeating units of histone octamers with stretches of linker DNA
supplementary 146 bp of DNA
shortens DNA from 680 to 100 angstroms
main structural units of chromatin
how is DNA wrapped around histones?
200 bp of -ve DNA wraps histone proteins due to high conc of +ve arginine + lysine
core = two H2A-H2B dimers and a H3(_2)H4(_2) tetramer
wrapped DNA enclosed by superficial histone H1 at the entry/exit point of linker
amino acid tails protrude from each histone protein
what does the compaction of chromatin mean for susceptibility to cleavage by DNase1?
what is the experiment elucidating this?
less susceptible
chicken embryos and Hb synthesis sites
in 20 hour old embryos, globin genes in precursor erythrocytes are insensitive to DNase1
as matures + Hb production begins, adjacent genes become susceptible to digestion
in tissues that don’t produce Hb, genes remain resistance to DNase1
what are areas of high susceptibility to cleavage known as? why are they susceptible?
hypersensitive sites
have a scarcity of nucleosomes
what is loosely packed chromatin called? what is its property?
euchromatin
have high transcriptional activity
what group does histone acetyltransferase modify on the amino acid tails of histones? what is the effect?
adds acetyl groups from acetyl CoA to lysine residues on amino-terminal tails
lysine has NH4+ at physiological pH: acetylation = uncharged amide, reduces affinity of tail to DNA
histone complex loosens
what does the now-acetylated lysine tail interact with? what is the effect?
bromodomains: acetyl lysine binding sites of 110 amino acids on regulatory transcription proteins e.g. TATA box bp and TAF1.
bind at H4
what are chromatin remodelling complexes? what do they do?
contain bromodomains with helicase-resembling domains
use energy from hydrolysis of ATP to trigger conform changes in histone through shifting nucleosome position
renders genes more accessible and promotes binding of transcription complexes through protein-protein interactions
promotes gene expression
multi-methylation of lysines on what histone is involved in compaction?
H3K9
multi-methylation of lysines on what histone is involved in activation?
H3K4
what can you covalently modify in the a to affect gene expression?
specific example?
lysine and arginine residues
trimethylation of lysine 27 on histone H3 = gene repression
what covalent modification of DNA occurs? how?
methylation of cytosine and adenosine in the DNA
methyltransferase catalyse methyl group from (S-adenosyl methionine) SAM to C5 of cytosine in DNA
what are the 3 types of methyltransferases? what are their roles?
DNMT1: maintains methylation from parent strand
DNMT3a, DNMT3b: drive de novo methylation of A and C
how does presence of a methyl group impair gene expression?
impedes recruitment of transcriptional machinery thus prevents activation of genic regions (A. Dhar)
what is the function of regulation of gene expression?
transcription of some genes over others allows for variation in protein synthesis across cells
repressor proteins prevent excess cell growth and suppress tumours
what are the methods of gene expression regulation?
- acetylation of lysine on histones
- methylation of amino acids and DNA bases
- nuclear hormone receptors e.g. oestrogen
- DNA binding by transcription factors: homeodomains (zippers, h-t-h, cys-hys-zinc finger)
- activation domains on TFs
how are the nuclear hormone receptors structured?
contain ligand-binding domain receptive to hydrophobic molecules e.g. oestrogen, thyroid hormone, steroid hormone
contains central DNA binding site w zinc domain
molecule to receptor binding = complex formed: transport to nucleoplasm then binds to 5’ - AGGTCAN-NNTGACCT - 3’ sequence in DNA
recruits coactivators (p160 family): SRC-1, GRIP-1
p160 family act as enzymes that upregulate chromatin remodelling: affecting gene expression
what are the DNA-binding unit homeodomains?
h-t-h: helix inserted into major groove of DNA
leucine zipper: 2 alpha helices stabilised by leucine, involved in DNA site recognition
cys-hys-zinc-finger: tandems of small domains w 2 cysteine + 2 histidine bound to zinc + alpha helix which binds bps in major groove
what do gene activator proteins do?
displace promoters from nucleosomes
as direct effect of TF or effect of interaction between regulatory proteins + general TFs
what are the properties of active chromatin?
increased core histone acetylation: alters non-histone factors + all interaction types
phosphorylation of HI-type linker histones: causes histone to not associate with chromatin: creates binding site for reg. factors on DNA
increases incorporation of specific histone variants: affects nucleosome structure + packing
what does the post-translational histone code determine?
gene expression levels
what are the properties of inactive chromatin?
desphosph core histones
dephosph HI histones
diminished histone variants
subnuclear localisation of genes w silent genes near periphery