MT Cell Bio Flashcards
structure of chromatin?
- always condensed in mammal/human cells
- most condensed (extremely condensed) during M phase, very visible chromosomes
- less condensed in interphase
- chromatin are in nucleosomes: they consist of DNA, histones, and non-histone proteins
- each nucleosome looks like ‘a bead’ on a string on beads, when chromatin unfolds/decondensed.
- Heterochromatin and Euchromatin:
how condensed is chromatin at each phase?
- more condensed during M phase (mitosis, condensed into distinct chromosomes)
- less condense, but still tightly packed even during S phase/interphase
structure of nucleosome
- nucleosome is the single basic unit of chromatin
- consists of DNA, histone, non-histone chromosomal protein (each contributing same mass)
- experiment to uncover chromatin/nucleosome structure: nucelase digestes the linker DNA (used to linke nucleosomes for more condensed chromatin)
- each nucleosome: complex of 8 histone proteins, 2 molecules each of H2A, H2B, H3 and H4. A double stranded DNA of 147bp and non-histone proteins which vary
- structural motif on histones: histone fold
structure of histones
- 2 copies of each: H2A, H2B, H3, H4
- each protein has a structural motif which allows them to form the octamer core: alpha helices called histone fold
- H2A-H2B dimer first combines with a tetramer H3-H4, then binds with another H2A-H2B dimer to form the histone core octomer. Formed through handshake interaction
- DNA then wraps around this core through extensive H-bonds, between backbone of amino asid and the sugar phosphate backbone of DNA. Also lots of salt linkages and hydorphobic bonds hold the 2 together
- DNA is v negatively charged (backbone), hence 1/5th of the aa in histones are lysine or arg, which have a very basic charge to neutralize and form stronger attractions
- highly flecible structure, serve as binding sites to lots of other sets of proteins: gene expression modification
How does nucleosomes have a dynamic structure, why?
- if not dynamic the genetic mechanisms such as gene expression, DNA replication, protein synthesis would be hindered or effected
- loosening DNA-histone to allow translation…etc, DNA partially unwraps the histone for exposure to other binding proteins
- the histone itself also binds to other porteins and is very flexible in conformation to allow dynamic changes of the nucleosome
- chromatin remodelling complexes help change the nucleosome structure: uses ATP to loosen the DNA
- ATP hydrolysis of that allows histone to be pulled along the DNA, to allow nucleosome sliding and loosening of DNA
- other non-histone proteins such as other histone chaperons can bind to histones to allow replacements and exchanges of histone domains
what are the types of non-histone proteins that may bind to nucleosomes?
- can bind to histones or DNA to make modificantion
- chromatin remodelling complexes: uses ATP hydrolysis to pull histone core along DNA to expose certain parts of DNA through nucleosome sliding
- histone chaperones: bind to histone to exhcange or replace histone or histone domains
- writers, readers, erasers.
- DNMT for DNA methylation…etc
how are nucelsomes in chromatines normally packed?
- very tightly and stacked ontop of each other in a zigzag model
- form a tetranucleosome normally w 30nm fibre
- linked tightly through histone tails, esp the H4 histone tail, histone tails of each nucleosome interact with each other forming linkages which bring nucleosomes together
- histone H1 (linker histone) less well conserved, binds to both histone and DNA, and changes the exit path of DNA from the nucleosome which allows closer packing of nucleosomes
- less well conserved because eukaryotic organisms have multiple H1s but they all binds to diff aa sequences on the histone.
how do nucleosomes remain dynamic
- histones w many binding sites available
- chromatin remodelling complexes: ATP hydrolysis driven
- histone chaperons: ATP driven
- h1 linker histones
- spontaneous wrapping adn unwrapping of DNA
- cells also contain lost of different complexes to target diff parts of DNA
what is the most important feature discovered about chromatin structure?
- certain types of chromatin structure can be inherited Iepigenetic inheritence)
- structure of chromatin in a cell can be directly passed down to its descendants
- cell memory inhertied in based on a chromosome structure rather than a DNA change
what is heterochromatin and its importance
- heterochromatin is a highly condensed form, the rest is less condensed euchromatin
- highly concentrated and compact in telomeres and centromeres
- DNA of heterochromatin contain few genes, and when a gene enters heterochromatin state it becomes switched off
- 10% genes are heterochromatins
- vary in which genes are heterochromatin, determined by other factors
- heterochromatin DNA restricts gene expression
what is Euchromatin
- majority of genes are euchromatin
- less tightly bound during interphase
what is the position effect?
- when a euchromatic gene/DNA is translocated into a heterochromatic area
- causes certain genes to become silenced
what is an example of positional effect
- in drosophila, translocation of a gene for normal eyes (white+ or wt)
- during early stage embryo, when this positional effect occurs, the white+ gene is in heterchromatin state, it becomes silenced and no pigment produced (loss of red pigment, and eyes turn white)
- this then becomes inherited by all of the flies progeny called: positional effect variegation.
Another example:
Barr bodies in X chromosomes of female mammals. Where one of the X chromosomes are inactivated randomely through heterochromtin, and this state is inherited by all cell’s progeny.
Lecture 2: epigenetic regulation of chromatin
Why have chromatin?
- package DNA to be inside cell
- protect DNA from breakage
- easier control of genes that need to be active in certain cell types
what is the strategy of heterochromatin
- positive feedback
- heterochromatin structures begets more heterochromatid
- which is then inherited by daughter cells, and expands further
- histone modification by enzymes
- histones can be modified and is reversable
- addition of acetyl groups to lysine tail by HATs, and HDACs -> activation bcuz removes the positive charge on lysine, hence less attraction
- methyl groups added to lysine by histone methyl transferase and histone demethylases -> methyl = silencing
- the recruitment of these enzymes are dependent on TFs and is very specific to sequences
- some can exceed longer than the TF binding and impact cell memory, and inherited by cell progeny.
what is the effect of modifying histones? Example?
- recruitment of more proteins to modify chromatin
- Example: trimethylation of lysine on H3 tail attracts HP1 protein which helps with the spread of heterochromatin
- modified histone acts with recruited proteins to determine gene expression, hence governs structure and function of cell
- histone varients in regulating chromatin and gene expression
- apart from the 4 highly conserved core histones (H2A, H2B, H3 and H4)
- also variant of histones that can assemble into nucleosomes: i.e H3.3 variant of H3 allos for transcriptional activation functions, H3 variant CENP-A allows centromere function and kinetochore assembly; H2AX allows DNA repair and recombination, H2AZ allows gene expression and chromosome segregation, and H2A macro allows transcriptional repression and X chromosome inactivation
- H4 has no variants yet
- these histone variants are incorporated into already formed histones by chromatin remodelling complexes through ATP hydrolysis in the histone exchange process
- variants are not highly conserved
- Most histones are still the standard 4, but there are a few variants
- major histones are synthesised during the S phase of interphase and assembled into nucelosome of the daughter DNA right after the replication fork
- variant histones are synthesised throughout interphase
- variants are incorporated during exchange histone process in a VERY HIGHLY SELECTVE MANNER
How are covalent modification of histone variants used to control chromosome function?
- using the histone code, the distinct marking on nucleosome through histone variants and histone modifications (such as methylation…etc)
- some combinations have specific meaning to determine chromosome function such as to makr a single chromatin is newly replicated, to mark what gene expression needs to take place.
- various other regulatory proteins which contain small and specific domains to these histone code can bind and recognize these marks
- marks are dynamic, added or removed dependent on environment and signals
- very accessible and dynamic because histone tails extends outwards, so can even be modified when condensed
- SUMMARY: combination of histone modification/histone code, come together to be recognized and attract binding of regulatory protein!
what is a reader complex?
- a protein complex linked together on the histone which has a combination of histone modeifications/codes to mark DNA, which attract additional proteins to regulatr or initiate biological functions at the right time
mechanism of histone modification -> gene expression changes
- E.g spread of heterochromatin along chromatin (positional effect variegation)
- A regulatory protein recruits a protein which causes histone modification/marks, and hence recruits a reader protein which recognizes the histone modification and binds to it
- covalent modification on histone tails act as a mark, a nd in combination attracts other components such as a protein complex w catalytic activities and binding sites to allow for biological functions: like gene silencing, gene expression
what are some examples of histone modification and its meaning
- many modification on H3 tail
- i.e trimehylation of the 9th lysine causes heterochromatin formation and gene silencing by attracting HP1 protein which induces heterochromatin and spreads it
- trimethylation of 27th lysine causes gene silencing through polycomb repressive complex
what organizes chromatin structure and prevents confusion of adjacent chromatin structures?
- certain DNA sequences act as domains to mark boundaries of chromatin domains, and separate them to prevent confusion
- E.g in precursor cells of red blood cells, HS4 sequence separates active chromatin domaun with human B globin locus vital for red blood cell function, from a region of silences chromatin. If this seqeunce was deleted, the beta globin locus is invaded by condensed chromatin, and causes its silencing (position effect variegation) causes severe anemia
- in experiments where they explored the addition of HS4 sequence in mammalian genome, if protects gene from silencing by preventing spread of heterochromatin This is because barrier sequence has clustor of binding site for histone acetylase enzyme. This is because, lysine methylation is required for heterochromatin spread, however the methylated lysine is incompatible w acetylated lysine on close side chains. Hence acetylsing lysine before heterochromatin region prevents the spread of heterochromatin, by prevent methylated lysines from forming.
- othe rmethods are also present to prevent heterochromatin formation
How are chromatin structures in centromere different, why?
- histone variant CENP-A in centromere can form special structures,
- histone variants often have more long lasting marks on chromatin
- centromeres are regions of chromosomes required for attachment of spindle fibres during metaphase and anaphase to allow correct segregation.
- centromere = heterochromatin
- in complex organisms like humans, the centromere is embedded ina a stretch of centromeric chromatin which persists throughout interphase even though the mitotic fibre only attaches during M phase.
- this chromatin (where centromere is in) has special H3 variant CENP-A and additional proteins which allow a very dense nucleosome arrangement to from kinetochore, which is a special feature required for mitotic spindle attachment
- researches and studies have shown that formation of centromere requires an assembly of proteins and histone variants such as CENP-A, but might not require specific centromere DNA sequences like alpha satelites (which were prev. though tb essential)
examples of centromere formation, and comparison of requirements for centromere formation
- In yeaste s.cerevisiae, require 125bp for centromere, require a dozen of proteins and CENP-A variant.
- in a contrasting example in more complex organisms like humans, we require thousands of bp for centromere, and do not contain centromere specific DNA sequences, instead we have alpha satelites. However, alpha satelites are also found in other non-centromere chromatins, which suggests that they are not essential for centromere formation. We also have CENP-A variants
- extreme example: neocentromeres have been observed to form spontaneously on fragmented chromosomes, some of these positions were originally euchromatin and lacked alpha satelite DNA.
- shows the dynamic and that centromere is not defined by one factor
variation and abnormality of centromeres across tree of life
- linking to genome lectures
- during evolution, joining of chromosomes have sometimes caused a chromosome to have many centromeres or non-centromeres
- using genetic comparison to identify evolutionary history by identifying inactivated multiple centromere regions on one chromosome to show joining of chromosomes
how is centromere chromatin structure inherited?
- centromere activity like above need to be inherited to all cell types for correct segregation of cells
- de novo centromere formation requires initial seeding event: which is the incorporation of H3-H4 tetramer with the CENP-A variant to form a protein complex
- the DNA sequence for centromere often contains alpha satelites, and seeding event happens more often there
- the H3-H4 tetramers from each nucleosome are directly inherited by sister DNA helices at replication fork
- the presence of one CENP-A recruits more all the way along the chromatin - a propogation event
- once a tetramer H3-H4 has CENP-A in it, in the next cell division, it gets inherited again.
how are chromatin structure inherited generally?
- some of the specialized chromatin components are distributed to each sister chromosome after DNA duplication, along with the specially marked nucleosomes that they bind.
- After DNA replication, the inherited nucleosomes that are specially modified, acting in concert with the inherited chromatin components, change the pattern of histone modification on the newly formed nucleosomes nearby.
- This creates new binding sites for the
same chromatin components, which then assemble to complete the structure. The latter process is likely to involve reader– writer–remodeling complexes operating in a manner similar to that previously illustrated
what is an example experiment which suggests that actiavting and repressing chromatin structures can be inherited epigenetically
- in enoculated egg of Xenopus, insert nculeus of donor cell which expresses MyoD gene (master transcription regulator for muscles), which is switched off during early embryo stage
- when nuclear transferred into an enucleated egg, it starts to produce MyoD (which is not meant to happen, hence showing epigenetic memory)
- we know it is epigenetic memory because the mechanism for producing MyoD is due to the activation of its promoter. Promoter is activated if the histones around it is H3.3 has a 4th lysine methylated
- when using a mutant MyoD which prevents 4th lysine from being methylated, no histone modification leads to low MyoD production
the importance of chromatin structures
- crucial to evolution as condensed chromatin packaging allows evolution of eukaryotes
- chromatin structure allows for gene expression and hence specialisation of cell types depending on which genes they express
- complexed cell memory in comparison to bacteria
- local variations of chromatin allow gene expression regulation
- chromatin structure/modifications allow for structures like centromere unqiue to eukaryotes allow for inheritence
- polycomb group allow for short-term silencing of genes dependent on environment
- whereas heterchromatin with HP1 protein and histone variants allow for more long-term persistent effects
- some chromatin structures are short lasting, some are more persistent
- very dynamic and flexible
what are polycombs, lampbrush and polytene?
- polytene: giant chromosomes found in drosophila:
- lampbrush: oocytes, for them to build supplies
- polycombs: short change of gene expression
hwo do chromosome loops/chromatin loops effect gene expression?
- looped on the top is more highly expressed
what are the 3 ways of chromosome modification
- histone variants
- post- transcriptional modifications: ‘markers’ to histone tails/to DNA
- methylation of DNA
what are the 4 common ways of post translational histone modification and its effects?
- Acetylation: acetylation of lysine residues on histone proteins: catalyzed by HATs. reduces postive charge on histones, decreases interaction w DNA, hence more relaxed chromatin structure more accessible DNA to TFs and RNA pol
- methylation: Methylation to lysine or arginine, by HMTs. can be mono-di- , tri. Methylation of histone can lead to diff things. H3 at lys 4 trimethylated leads to transcriptional activation, in other places like tri-me at H3 K27 leads to transcriptional repression
- phosphorylation: phosphate group to ser, threonine, or tyrosine on histones executred by kinases. Causes chromatin condensation and segregation during mitosis and meisosis. Also play roles in activation of immediate early genes such as response to stress/growth factors. E.g phosphorylation of H3 serine 10 leads to chromatin relaxation and gene activation
- ubiquitylation: adds ubiquitin protein to lysine residues on histones by a E1, 2, 3 cascade enzymes. Signal histone removal or serve as marker. i.e H2B ubiquitylation of K12 associated w transcriotional initiation. Very depends of type of signal.
give a overview of how histone modification can impact gene expression (include writes, erasers, and readers)
- writers: modify amino acid tags, i.e acetylation…etc
- readers: interact w the modified histone modification (inclusing combinations of histone marks), to change local properties of chromatin
- reader/writer complxes can recruit more regulatory proteins such as TFs….etc, modification ‘read’ by specific binding domain modification, and recruit enzymes which alter local chromatin properties
- the reader/write complexes and changed chromatin structures can effect biological function
- erasers: move modifications
how are genes kept active?
H3K4me3 keeps genes active by recruiting chromatine remodeller NURF.
NURF binds to H3K4me3 and remodels chromatin by promoting nucleosome sliding allowing TFs access to binding sequence
how is H3K9me3 maintained?
- H3K9me3 represses gene activity
- this maintenance requires RNA through RNA induced transcriptional silencing as a part of RNAi to silence genes useing RNA molecules and hsitonbe modification
1. RITSC (silencing complex) is directed to the new transcript mRNA by the antisense RNA
2. then a regulator complex CLRC interacts w RITSC, which then causes the Clr4 subunit of the regulator complex to methylate H3K9. Through their interaction of antisense RNA, the CLRC and the RITSC, it targets Clr4 to specific genome site where silencing is required
3. HP1 then recognizes the methylated H3K4me and binds to it, contributing to the formation of heterochromatin
what is methylation of DNA, and how does it effect chromatin function?
- vertebrates: cytosine can be methylated
- usually methylation of DNA causes inactivity (but exeptions present)
- DNMT establishes AND maintains cytosine methylation
- erasing DNA methylation uses TET enzyme, which oxidizes 5meC
How does DNA methylation inhibit transcription?
- CH3 blocks the binding of TFs
- DNMT recruits trancription inhibiting proteins like HDACs
- methyl binding domains recruit HDACs too
mechanism for DNA methylation + histone modification
- when a gene is active, its histone modified is H3K4me3
- this marks the gene and allows it to inactive DNMT, hence stops the DNA methylation. The active gene mark recruits and binds to ADD domain of DNMT which inactivates it
- when a gene is inactive, it doesnt have marks which inactivated DNMT
- DNMT hence is active to methylate cytosines.
how are epigenetic marks maintained and inherited?
- Maintained and inherited both histone modification and DNA methylations
- Histone modification is conserved.
* histones are recylcles and distributed during DNA replication. Old histones then stay with one of the daughter strands and provides a ‘template’ for new DNA strands to recruit same histone pattern
* histone chaperones help guide and promote modification of histones onto new histones during exchanges
* positive feedback: after the initial seeding event, positive feedback mechanism allows them to recruit more histones and add same modifications by enzymes. - DNA methylation is conserved
* during semi-conserved DNA rep, the mother strand retains DNA methylated patterns. DNMT1 recognizes this and adds same methylation to daughter strand
what are limitations of epigenetic markers in terms of cell memory
- DNA replication dilutes histone markers
- initially during replications these epigenetic markers may be lost as it uses a recycle of new and old histones. A un modified new histone may be incorporated
- however, in post-replication, enzymes which recognize old histones will re-establish this epigenetic pattern
- allows gene expression patterns to be maintained in cell line through many divisions
what is the mechanism to restore repressive histone marks?
- repression due to H3K37me3, applied by the polycomb repressive complex 2 (PRC2) are diluted because the parental histones are randomely distributed to new strands and combined w new unmodified histones
- post- replication: PRC2 complex recognizes the remaining H3K27me3 on the parental histones and binds to them
- allosteric activation and spreading: binding of PRC2 to H3K27me3 can allosterically activate the complex. It starts adding more methyls to H3 histones through positive feedback
- cohesin holds the 2 sister chromatids tgt during this
what is genomic imprinting?
- in mammalians, when a parental inherited copy of a gene is activated and the other is inactivated (vice versa)
- can unmask mutations that would normally be covered by the other functional copy
- in embryo: genes subject to imprinting are methylated
- i.e Angelman syndrome, a disease in nervous system that causes speech impairment and mental disability, results due to the expressed mutated gene on one chromosomal homolog and the silencing (imprinting) of the intact gene on the other homolog
what is an example of genomic imprinting?
- insulin growth factor 2 (igf2) in mouse
- cis-regulated
- in maternal inherited chromosome, the cis-regulated sequence is blocked due to an insulator element which is activated by CTCF binding and blocks communication between cis-regulatory sequence and Igf2 gene, stopping it from expressed
- Because of imprinting, which means that the male gene is methylated. The gene for the insulator protein is methylated, which prevents it from binding to CTCF. which allows cis-regulatory seq to communicate and transcribe the igf2 in male mice, allowing normal growth
how can a whole chromosome’s chromatin structure be altered and inherited
- in mammals, to balance out gene product ratio on sex chromosomes, one of the X chromosome of females (XX) is inactivated (X-inactivation)
- this allows 2 X chromosomes to be in one cell nucleus without too much gene product, hence careful gene expression
how can epigenetic mechanisms ensure stable patterns of gene expression in inheritance to daughter cells
- positive feedback loops allow more stability by buffering against fluctuations of any one Transcription regulator
- self-propogation occurs in 2 ways.
- cis epigenetic mechanisms: affects only one chromosomal copy
- trans epigenetic mechanisms: affects both chromosomes
- NOTE: not all epigenetic modifications are long lasting, MOST don’t have cell memory, butthose who do likely self propogate and have a positive feedback loop
definition of epigenetics
'’the study of changes in gene or chromosome function that are mitotically and/or meiotically heritable and that do not entail a change in DNA sequence”
central dogma of molecular biology
DNA-RNA-Protein
basic facts about eukaryotic DNa replication?
- semi-conservative
- both strands synthesised from a 5’ -3’ direction
- template strand reads 3’-5’
- leading strand is continuous from 5’-3’, lagging rads from 3’-5’, but still synthesises in small DNA fragments (OKazaki fragments) from 5’-3’
comparison of eukaryotic DNA replication with prokaryotic DNA replication
- euk: multiple replication forks, Initiated by ORC
- proks: 1 replication fork, initiated by DnaA DnaB,
What is the mechanism of replication in eukaryotes? simple overview
- origins bound by ORC (initiation)
- DNA helicase (Mcm2-7) loaded onto ORC inactive form
- DNA helicase is activated generatingssDNA that can be replicated by polymerases
importance of maintaining DNA sequences
- short term survival of a cell depends on DNA sequences being conserved, long term survival of a species requires DNA sequences to be changeable over generation to allow adaptation
- mutation rates are extremely low in DNA to allow the functions of genes to be preserved.
- 1/10^10 nucleotides per cell division
- important to conserve genes in somatic cells but more importantly in germ line cells
formation of pre-replicative complex?
- late M phase/early G1 phase
- known as lincensing
- The ORC binds to the origin of the replication form, and cdc6 then combines with the ORC on the DNA
- this allows loading of 2 Cdt1/Mcm2-7 complexes
pre-initiation complex formation?
- in late G1 phase
- more proteins are recruited and bounf to Mcm2-7 complex including polymerase ε, DDK, CDK, Cdc45
mechanism for activation of DNA replication? transition from pre-replicative complex to initiation
- activation of DNA replication is controlled by 2 protein kinases CDK and DDK
- in late G1/S phase:
- CDK is recruited by the pre-RC and phosphorylated the pre-loading complex (which contains pol ε) and the sld3 subunit of mcm2-7. This causes the pre-RC to bind to the pre-loading complex
- DDK then bounds to the complex and together activated DNA replication by forming the CMG complex/CMG helicase in S phase
initiation mechanism
- cdc45, GINS and mcm2-7 bound tgt activates helicase
- activated helicase generates ssDNA, and opens the replication fork by breaking H bond
- allows priming by DNA pol a (primers) and pol ε
replication process, division of labor at the replication fork
- replication fork: 2 ss DNA exposed for pairing
- DNA pol only capable of synthesising 5’-3’. Which works for the leading strand in a continuous motion
- Pol alpha forms a primase complex, and forms a short RNA primer on both leading and lagging strand, to form a RNA primer extended by a short stretch of DNA nucleotides.
- leading strand: DNA pol ε takes over alpha and extends a continuous 5’-3’ leading strand of DNA. PCNA anchors and allows DNA pol ε to slide along replication fork and follows the helicase as it exposes for ssDNA
- lagging strand: DNA pol delta with protein ctf4 synthesizes lagging strand. Since helicase unwinds in a direction away from pol delta, lagging strand is synthesized in small 5’-3’ Okazaki fragments. DNA pol delta also guided by PCNA
- Lagging strand: primer reomoval and gap filling: As pol delta synthesizes lagging strand, it will end up encoutering the RNA primer from previosu Okazaki fragment and it will displace the primer creating a flap like structure
- the flap is removed by FEN1 leaving a nick, which isthen filled in by Pol delta.
replicative polymerase characteristics
DNA pol 3’ exonucleases can only add nucleotides on to a 3’OH, which is why Pol a and RNA primase need to add an initial 3’OH end
formation of okazaki fragments
- lagging strand: DNA pol delta with protein ctf4 synthesizes lagging strand. Since helicase unwinds in a direction away from pol delta, lagging strand is synthesized in small 5’-3’ Okazaki fragments. DNA pol delta also guided by PCNA
- Lagging strand: primer reomoval and gap filling: As pol delta synthesizes lagging strand, it will end up encoutering the RNA primer from previosu Okazaki fragment and it will displace the primer creating a flap like structure
- the flap is removed by FEN1 leaving a nick, which isthen filled in by Pol delta.
termination process
- The helicases on each strand pass each other
- and replication up to primer on what lagging strand
- helicase then switch to encompass both strands
- Nicks are filled in by DNA ligase I,
- Mcm7 is then ubiquitylated (but not degraded) and removed from chromatin by p97 ATPase
how are replication origins specified
- ORC bind to specific sites, but what these sites are varies among species
- In S.cerevisiae: ORC binds to a specific autonomously replicating sequence (ARS) Contain A/T rich element for identification of beginning of replication. Protein Abf1 enhances origin recognition and binding of ORC in s.cerevisiae
- In S.pombe, ORC doesnt bind to a strict consensus seq, but binds to a non-specific A/T roch sequence through a domain of its’s subunit Orc4. which contains an AT-hook that recognizes these regions
- In metazoans replication origin also not determined bys trict sequence. A combination is involved. Epigenetic markers: like H4K20me2 and Acetylation of H4 contribute to this replication origin. Chromatin structure: G4 structures also help determine ORC positionng.
why have origins in specific locations?
- avoiding conflict between transcription and replication. Ensure that these 2 mechanisms dont clash, as both replication and transcription involve unwinding DNA
- Ensuring intervals between each origin is not too larger, so the entire genome can be replicated efficiently and correctly
why are metazoan origins not specified by specific sequences?
- complex development, means that at different developmental stages, the replication may need to be modified in terms of length and duration. To allow for more flexibility, not a specific sequence is used.
- for example, in embryonic cells, there is a short S phase and reduced transcription. Hence more origin sites are required to replicate short genome in short S phase time. reduced transcription means more origin sites for more replication to occur simultaneously as no risk of clashing into transcription
- In adult cells however, there is much more transcription going on, and more regulation. Therefore the origin site needs to be more carefully moderated to prevent clashing into transcription, as well as responding to epigenetics.
- Epgenetic specification gives flexibility to allow variations of S phase length. Epigenetic markers allow better timing adn location of replication origin. This allows fore better response to cellular conditions, and complex regulation depending on cell type and developmental stages
regulation of replication? How is S phase activated?
- during cell division (S phase), each origin on the chromosome is fired once only
- as cells exit G1 phase and enter S phase, licensing triggers a surge of CDK activity, which activated initiation stage of DNA replication
- the CDKs phosphorylate substrates including proteins which form the pre-Replication Complex, which signals the initiation of replication
replication timing, how to avoid early/incorrect timing of replication?
- In G0 phase (quiescent cells) lack CDKs and DDKs and are not active. Which prevents accidental initiation of DNA rep in wrong timing
- Transition from G1 to S phase is a crucial checkpoint where licensing and activation of kinases are required before next processes
why is there a timing programme?
- prevention of early replication or replication when the cell is not ready (i.e not all proteins are synthesized for cell division)
- E.g in mammalian oocytes, remain in non-dividing stage for extensive periods and dont replicate DNA until developmental cues trigger cell cycle
DNA repair system
- In eukaryotes:
- Base excision repair (for small base lesions) - DNA glycolusase
- Mismatch repair: during DNa rep. MMR proteins recognize and remove mismatch, then resynthesize correct sequence
- DNA replication and transcription use topoisomerases: when supercoiling occurs ahead of replication fork, topo breaks DNA strand to relieve supercoiling
- Topo I: single stran break, relav negative and positive supercoils
- Topo II: DNA gyrase (ONLY FOR BACTERIA VERY important). causes dsbreak in DNA. can manage more complex topological issues
similarities of euk and prok DNA rep
- on a macro scale v.similar mechanism: semi conservative
- have the same aim: to ensure DNA is correctly copied and copy passed down to daughter cells
- proks and euks have similar mutation rates (proks are slightly higher due to faster replication/not so specific proofreading mechanism), but still both mutation rates are low
how is the Pre-Replicative complex activated
- Once CDKs phoshphorylate the proteins in the activation of Mcm2-7 helicase, replication at origin begins and replication fork starts forming
how to prevent re-replication
- checkpoints, licensing. Only licensed origins can start replication
- after each origin fires once during cell cycle, the origins are not re-licensed, hence preventing it from firing again
- After licensing and after pre-RC is formed, the factors forming Pre-RC are degraded to prevent re-replication
- Before initiation: all the initiation factors (CDK, DKK) are lacking/inactive, and only when initiation happens they are active
How to prevent each origin from firing more than once?
- after licensing, proteins needed to form a new pre-RC such as cdt1 and cdc6 are degraded or inactivated.
- in eukaryotes: cdt1 is inhibited by geminin to prevent re-licensing. Level of geminin increases as cell progresses through S phase, ensuring cdt1 cannote re-bind and re-license origins
- Moreover, the formation of the replication fork leads to cdt1 proteolysis, as it targets cdt1 for ubiquitin mediated degradation, This ensures cdt1 is no longer available to license another origin
- moreover, the initiation and replication fork formation changes chromatin struicture making it unfavourable for licensing again
- PCNA also plays signalling role to prevent re-licensing
what is the bifunctional role of CDK to prevent re-replication?
- proteolysis of cdc6: low CDK level in G1 permits pre-RC forming, but S phase and G2 phase has high CDK levels which blocks pre-RC to prevent re-replication. This is because high CDK levels phosphorylates Cdc6 and causes it to be recognised and ubiquitylated by SCF for degradation, so no pre-RC formed and no re-replication
- 2nd function is that it causes nuclear exclusion of mcm2-7: outside S phase, mcm2-7 complex are removed from the nucleusm hence prevents replication of cell in the wrong phase of cell cycle Controls replication timing
Why are replication origins not all fired at the same time?
- euchromatin regions of origins are replicated first during Early S phase, and heterochromatin origins are replicated in late S phase
1. Resource management: fewer copies of replication proteins required, staggered replication forks allow proteins to be reused.
2. Reduce demand of dNTPs (low dNTPs can lead to genomic instability)
3. timing of replication may influence chromatin modification: epigenetic marks need to be copied to caughter strands, and hence require correct timing to avoid incorrect epigenome
what is meiosis?
- in diploids during sexual reproudction
- produces haploid cells (gametes) carrying a single copy of each chromosome
- during sexual reproduction: these gametes join together to form a diploid zygote, which has the potential to form a new individual
baisc features of meiosis
- two rounds of chromosome segregation: first round segregates the homologs, and 2nd round
- meiosis 2: separates the sister chromatids producing haploids
- crossing over and random segregation of homologs, generating genetically different haploid cells
- creates genetic variability
stages of meiosis
- pre-meiotic D phaseL replicated sister chromatids held together throughout the chromosome by cohesin (differs in mitosis in the type of CDKs and cohesin used)
- these homologs then randomely line up
- meiosis I: homologs recombine in miosis I prophase
- homologs separate in through random segregation during meiosis I anaphase
- No S phase between meiotic nuclear division
- sister chromatids separate in 2nd meiotic anaphase, producing 4 haploids
differences compared to mitosis
- in mitosis: the homologs don’t pair up, only sister chromatids segregate
- in mitosis: only 1 round of division giving 2 daughter cells (vs 2 rounds of segregation giving 4 daughter cells
- mitosis produces genetically identical clones of daughter cells, (meiosis produces haploid and genetically diff daughter cells)
- meiosis has no S phase between meiotic divisions
- meiosis has homologous recombination and random assortment of chromosomes
- meiosis: separation of homologs in meiosis I
generation of gamete diveristy in meiosis calculation?
2^n (where n = number of chromosome pairs
* in humans, 23 pairs of chromosomes
* hence 2^23 = 8.4 * 10^6 geneticaly diff gametes
* and hence 8.4 * 10^6 x2 = 7*10^13 genetically different zygotes
* assuming no recombination, recombination will cause even more genetically different children
what happens if errors occur in chromosome segregation
- in females, oocytes are arrested at meiotic prophase I, and progresses during puberty once ovulation happens
- errors in meiotic chromosome segregation can cause trisomy
- common one is trisomy 21 causing downe’s syndrome, and this error increases with age after 35 more likely to have errors in segregatting chromosome causing trisomy
how do homologous chromososmes pair and recombine?
- prophase I
- in order for homologs to recognize each other and pair up and bi-orient on the first meiotic spindle
- during prophase I (long period - takes hours in yeatss and weeks in complexed plants)
- meiosis cohesin keep the sister chromatids and bivalent homologs closely associated
1. duplicated meiotic prophase chromosomes are long threadlike and closely associated w their sister chromatids
2. Alignment: homologs pair and associate in pairing process: through complementary DNA sequences (pairing sites) and causes the homologs to be closely juxtaposed to form a 4 chromatid structure called bivalent
3. Recombination: homolog pairs are locked tgt by homologous recombination (crossing over): double strand DNA breaks at sister chromatid. large number of DNA recombination event and cross over of chromatid DNA
mechanism for homolog pairing?
- Recombination:
- Pre-synaptic alignment: recobination complex (which also assembles double strand break in a chromatids) binds matching sequences on homolog and brings it closer
- Synapsis: axial core of a homolog becomes tightly linked to its partner homolog by transverse filamengts to create synaptonemal complex which bridges the gap ebtween the homologs to 100nm in gap
- crossing over begins before synaptonemal complex assembles but final steps of crossing over occurs when DNA is held in a synaptonemal complex
describe formation of a synaptonemal complex
- transverse filaments surround the axial cores of the homologs (which have matching DNA seq recognized and brought tgt by pre-synapsis recombination complex)
- axial cores of each homolog also interact w cohesin to hold together sister chromatids
- 100nm gap
- SC = formed during synapsis
what are the 5 stages of meitoic prophase, explain each stage
- leptotene: homologs condense and pair and genetic recombination begins
- zygotene: synaptonemal complex begins to assemble at sites where homologs are closely associated (pre-synaptic alignment) and where recombination occuring
- pachytene: assembly of synaptonemal complex complete and homologs are synapsed along entire lengths. (long periods for days - until desynapsis)
- diplotene: desynapsis occurs: disassembly of synaptonemal complexes along w shortening and condensing of chromosomes. (Dissassemble is complete). This allows an observation of inter-homolog connections: chiasmata (which show sites of recombination by being attached to another cohesin)
- diakinesis:
what happens if defects in Synaptonemal Complex formation in mammals
- infertility
- birth defect
what are 2 types of synapsis in meiosis prophase I
- some species dont need Synaptonemal complex formation for formatyion of chiasmata and recombination
- double stran break dependent synapsis: DSB occurs and then this leads to crossing-over, recombination and chiasmata formation
what is the mechanism for synapsis if organisms dont form synaptonemal complex?
- S.pombe doesnt form SC, but chromosomal movement: ** horsetail nuclear movement**, help align chromosome for recombination
- chromosome movement via horsetail nuclear movement and clustering of telomere: causes homologous sequences to come into proximity
*
how do DNA strands break during crossing over
- nucleasecauses double strand break with 3’ overhanging
- hence causing either crossingover or joining of DNA strands again
how do chromosomes attach correctly to the meiotic spindle in meiosis I during homolog separation
- homolog separation unique to meiosis, does not occur in mitosis and is avoided
1. in metaphase I: sister kinetochores of one homolog must attach to same spindle pole (this is AVOIDED in mitosis). Cohesin links at recombination sites (chiasmata) between sister chromatids of homolog and all along the sister chromatids to allow the homologs to be close together
2. Metaphase I: the 2 sister kinetochores of 1 homolog are fused into a single microtubule binding unit (syntelic attachment) and is attached to a single pole
3. fusion of sister kinetochore: achieved by monopolis that is localized at kinetochores and is removed after meiosis I to allow sister chromatid separation (allow bioriented on spindle during meiosis II - similar to mitosis) - at this stage (metaphase I) the spindles have attached correctly to sister chromatid kinetocore, but not yet pulled apart
what is the function of chiasmata
- connect homologs and keep all 4 sister chromatids together in close proximity through cohesin links
what is amphitelic attachement
- meiosis I: 4 kinetochores
- when incorrect attachment of meiotic spindle to sister kinetochores
importance of cohesin in meiosis I and mitosis
- meiosis I (before anaphase I): cohesin throughout the sister chromatids and hold bivalent structure of homolog pair
- meiosis I (anaphase I): cohesin lost in sister chromatid arms to allow segregation of homologs
- mitosis: kept until anaphase to allow division of sister chromatids
how is reductional chromosome segregation achieved
- cannot lose cohesion completely as this would also split the sister chromatids in meiosis I
- anaphase I: cohesin retained around centromere and lost from arms. The shugoshin protein protect cohesin around centromere from being cleaved off by separase. Separase is actiavted in APC/C activation, which leads to destruction of securin, and activation of separase
- Shugoshin recruite PP2A phosphatase to remove phosphates from cetromeric cohesins.
- separases can only cleave off phosphorylated cohesins, hence cannot cleave non-phosphorylated centromeric cohesins
- this allows sister chromatids to be linked for bi-orientation at equator for meiosis II.
- shugoshin is inactivated after meiosis I.
what is shugoshin and what is its function?
- meiosis I: anaphase I: needs to keep centromeric cohesin, but remove arm cohesin in sister chromatids
- The shugoshin protein protect cohesin around centromere from being cleaved off by separase.
1. Shugoshin recruite PP2A phosphatase to remove phosphates from cetromeric cohesins.
2. separases can only cleave off phosphorylated cohesins, hence cannot cleave non-phosphorylated centromeric cohesins
3. this allows sister chromatids to be linked for bi-orientation at equator for meiosis II.
4. shugoshin is inactivated after meiosis I.
how is equational chromosome segregation achieved in meiosis II (anaphase II)
- Anaphase II: sister chromatids are aligned bi-oriented at the equator and are to be segregated by spindle fibres.
- APC/C activation once again triggers Separase to cleave off cohesin
- without shugoshin, sister chromatids are separated due to lack of cohesin and spindle fibres attach to each sister’chromatids kinetochore and splits the sister chromatids to opposite poles
- in subsequent cytokinese they are split off to form 4 haploid daughter cells (gametes)
what is the APC/C activation
- securin in bound to separase and inhibits its function
- when APOC present, securin is degraded and separase becomes activated
- active separase cleaves cohesin off sister chromatids
what prevents DNA replication between meiosis I and II
- low CDK required to load ORC (mcm2-7), once loaded, CDK level increases and BLOCKS ORC loading/mcm2-7 loading
- low CDK activity which initiates licensing in DNA replication is initiated by cyclin B proteolysis
- in Meiosis I and II, there is limited cyclin B proteolysis, hence CDK level is not low, and this stops the licensing and loading of ORC and prevents formation of Pre-RC. Hence blocks DNA replication
what is an experiment which shows that low CDK levels prevent DNA replication in Meiosis I and II intervals
- Inject Frog oocyte with mRNA of WeeI (CDK inhibitor) during Meiosis I and II interval
- this causes CDK to be inactivated in these oocytes, and this led to DNA replication and loading of mcm helicase (licensing)
- In normal conditions, CDK present and activel and no DNA rep in interval of MI and MII
what are the 4 steps of enzymatic RNA synthesis
- Nucleotide addition cycle:
1. Translocation: RNAP moves along DNA template in 3’-5’ direction, moving the growing RNA chain from pre-translocated to post-translocated state
2. NTP binding/releasing: A NTP conplementary to DNA template base pairs w DNA in active site of RNAP
3. Catalysis: RNAP catalyzes formation of phosphodiester bond of correct NTP. this adds a nucleotide to the 3’ end of RNA molecule. Energy used for this comes from hydrolysis of NTP (releasing PPi)
4. PPi Release/binding: PPi released from active site of RNAP and completes addition
what are some features of RNA Addition cycle?
- RNA Pol is dual-functionality, it catalyzes the addition of nucleotides and the excision of incorrect nucleotides thru a backtrack reaction via hydrolysis
structure of RNA polymerase
- RNAPs in eukaryotes and prokaryotes have similar structure, and hence thought to all derive from a common ancestor
- for eukaryotes:
1. Initiation (recruitment of RNAP): TFB recruits TBP binds to TATA box and recruits RNAP to form a complex
2. Initiation (closed complex): DNA helix still in tact
3. Initiation: open complex: RNAP unwraps DNA exposing template strand
4. Abortive Initiation: RNAP synthesizes abortive RNA fragments, before successfully escaping the promoter to transition into elongation phase
5. Elongation: RNAP associated with elongation factors and adds nucleotides to RNA transcript
6. Termination: RNAP dissociates from DNA template and releases the full length RNA molecule. U-rich tracts signal RNAP to stop transcription
Evolution of RNA polymerase
- relating back to the RNA world hypothesis
- likely that RNA Polymerases in euks and proks originated from same common ancestor
- RNAPs are diverse in tree of life, from more simple structure in prokaryotes, to more complex and more diverse RNAPs in eukaryotes
- Fe-S cluster is common in all, and is an important catalytic structure that allows transcription to occur
- Mg+ active site
- function of RNAPs are conserved and is universal, despite their different structures
- Single subunit phage RNAPs can carry out transcription w/o additional factors, showing that the other factors are not essential for transcription, and that RNAP is essential
- as RNA features of all 3 domains are conserved, suggests that transcription (or RNA repliacation mechanism) had occured prior to division of three domains
structure of bacterial RNA Polymerase complex
- 6 units
- sigma factor binds specifically to the promoter region on bacterial DNA
- addition of sigma factor tgt is called the DNA polymerase holoenzyme
- sigma unit only binds with the rest of the RNA polymerase when in comes into contact w DNA (binds weakly)
what are the 3 steps of transcription
- in both euks + proks
1. initiation
2. elongation
3. termination
prokaryotic transcription
- sigma factor binds to RNAP to form RNAP holoenzyme and quickly slides through DNA until it reaches a promoter, which then the polymerase binds tightly (because sigma factor makes specific contacts)
- RNAP opens up the DNA helix and expose short stretch of DNA forming transcription bubble. bubble is stabilized as RNAP binds to non-template DNA strand
- Template strand exposed and forms mRNA by scrunching mechanism (RNAP still bound to promoter and pulls upstream DNA into the active site and expands transcription bubble) causes stress and causes release of abortive RNA fragments (abortive initiation)
- then scrunching causes core-enzyme of RNAP to break its interaction w promote and sigma factor is disgarded
- RNAP moves down DNA elongating it - expanding transcription bubble
- chain elongation continues at 20nucleotides/sec for bacterial RNAP
- RNAP encounters terminator signal. Template strand has string of ATs when transcribed forms into a hairpin strcuture and causes RNA transcript to dissociate and causes release of the DNA template strand.
initiation in bacteria transcription
- very complicated steps
- requires holoenzyme (RNAP + sigma factor)
- abortive initiation through creating stress abortive RNAs
- if RNAP released prematurely mist start over again
what is facillitated diffusion/target search mechanism in RNA transcription?
- how RNA searches a specific site for initiation (finding the promoter)
- includes 1D hopping, 1D diffusion (sliding), and intersegmental transfer
what are some ways of facilitated diffusion
- 1D Hopping: RNAP binds to DNA and moves by dissociating and reassociating with the DNA, effectively “hopping” along it. This allows RNAP to cover short distances rapidly, but it’s a local mode of movement.
- 1D Diffusion (Sliding): RNAP binds to DNA and then slides along it in a linear fashion. This “sliding” or “1D diffusion” means the enzyme stays in contact with the DNA while moving along its length. It’s an effective way to search for promoter sites without fully detaching from the DNA molecule.
- Intersegmental Transfer: This is when RNAP transfers from one segment of DNA to another non-contiguous segment. DNA looping can bring distant regions into close proximity, allowing RNAP to jump or transfer between these regions. It’s a way for RNAP to efficiently sample different areas of the genome.
- 3D Diffusion (Jumping): Apart from moving along the DNA, RNAP can also completely dissociate from the DNA and diffuse through the nucleoplasm in three dimensions, before reassociating with the DNA elsewhere. This allows RNAP to search for promoter regions that are not in its immediate vicinity.
what are consensus sequences?
A consensus sequence is a way of representing the most frequent nucleotides at each position in a collection of aligned sequences.
- after aligning multiple sequences, finding the most common base for each position
what are some common concensus sequences during RNA transcription, why are they significant?
-35 Region: This hexameric sequence is located approximately 35 nucleotides upstream of the transcription start site. It’s recognized by the σ factor, which is a part of the RNA polymerase holoenzyme, and is essential for the binding of the enzyme to the DNA.
-10 Region (Pribnow box): Another hexameric sequence found about 10 nucleotides upstream of the transcription start site. It is also critical for RNA polymerase binding and initiation of transcription.
The consensus sequences for these regions in E. coli are typically:
-35 Region: TTGACA
-10 Region: TATAAT
mechanism for sigma-RNAP-DNA complex
- sigma factor has Helix-Turn-Helix (HTH) motifs which are structural motifs for the sigma factor to bind to DNA at specific sequences. These motifs insert into the major groove of DNA helix and make sequence specific contact that determine where RNAP bind
- Once a sigma factor engages w the DNA, the N-terminal fo the sigma factor initially blocks the DNA binding domains. Only when the core RNAP binds, it causes conformational change which allows sigma factor to bind to the promoter seq
- once sigma factor’s DNA-binding domains recogniza and bind to the DNA at -35 and -10 regions of promoter, it positions RNAP to be at the correct site for transcription initiation
what are sigma factors and antisigma factors?
- sigma and anti-sigma factors are involved in gene expression in bacteria
- sigma factors bind to RNAP holoenzyme and direct it to specific promoter regions on DNA where transcription begins
- diff sigma factor recognize fdiff promoter sequences, hence regulates gene expression
- For example, Sigma 70 recognizes the consensus sequence TTGACA followed by 16-18 base pairs and then TATAAT. This sequence is commonly found in promoters targeted by Sigma 70 in E. coli.
- antisigma factors are proteins that bind to sigma factors to inhibit their function.
- switches of genes/regulatory mechanism
what are the ways for transcription termination in prokaryptes vs eukaryotes
- prokaryotes: rho-dependent, rho-independant (intrinsic termination method)
- eukaryotes: cleavage and polyadenylation + RNA pol II
rho-dependent and rho-independent mechanism
- both bacteria transcription termination mechanism
- rho-dependent termination: rho protein (ATP-dependent helicase). Rho-protein recognizes specific sequences (rut sequence, C-rich) on the new RNA transcript and binds to it. Moves along the RNA towards RNA polymerase, unwinds RNA-DNA hybrid within transcription bubble, leading to release of RNA transcript. Rho has 2 domains, N-terminal RNA binding domain, and C-terminal ATPase domain, uses ATP to dissociate
- Intrinsic termination: Relies on specific sequence in RNA transcript or DNA sequence. A GC-rich region of the RNA forms a hairpi loop followed by a series of U bases. Hairpinstructure causes RNAP to pause and week bonds between U bases and A in DNA template strand causes detachment of RNA and DNA. HOWEVER: in some conditions, mRNA can also form anti-terminator hairpins to prevent termination of transcription
RNA polymerases for eukaryotes and prokaryotes
- eukaryotes have 3 RNA polymerase, share similar seq and structural domains: RNA polymerase I transcribes 18S/28S rRNA.
- RNA polymerase II transcribes mRNA and some small RNAs.
- RNA polymerase III transcribes tRNAs, 5S ribosomal RNA, and also some other small RNA
- prokaryotes only have 1 RNAP:
differences between eukaryotic and prokaryotic transcription
- Initiation: euks have 3 diff types of RNAP, proks only has 1 RNAP
- euks’ TBF +PolII recognize TATA box promoter sequence, proks require -10 and -35 promoter sequence recognized by sigma factor
- termination: proks: rho dependent/independent, euks: polyA + cleavage + splicing
plant specific RNAP
Plants also evolved two additional types of RNA polymerases from RNA polymerase II: RNAP IV and V. They are involved in the production of non-coding RNAs that regulate gene expression
- evolved latest from Pol II
Transcription initiation of eukaryotes RNA Pol II
Eukaryotic transcription initiation must take place on DNA that is pack- aged into nucleosomes and higher-order forms of chromatin structure features that are absent from bacterial chromo- somes.
In GTF, the TFIID and TFIIH are very important.
1. TFIID binds to TATA binding protein
2. Then the TFIID w/ TATA binds to DNA TATA box, and TFIIA binds to this complex to stabilize it.
3. TFIIB then binds and recruits RNA pol 2
4. TFIIF and TFIIE recruits the important TFIIH which completes the large complex and can act as a DNA helicase/kinase, by phosphorylating polII, as it recruitrs the capping enzyme
5. Pol II then goes on to transcribe nucleotides (72 nucleotides/sec)
6. Capped polymerase stops it from getting degraded.
d. Promoter region can increase of decrease rate of transcription for a gene
i. Or repressors can bind to DNA motifs which prevent DNA transcription
eukaryotic transcription termination RNA Pol II
a. PAS (polyadenykation sequence) is a termination sequence
b. And pol II transcribes the PAS and continues
c. CAP complex binds to pol II in RNA
d. This leaves the RNA end after the PASwhich is naked and not capped/protected, hence CAP binds to it and stops its from being hydrolysed
e. Poly A tail is then added to the 3’ end to prevent endonucleation
Transcription Initiation by RNA pol I
RNA polymerase I is recruited by a basal transcription factor SL1 that is placed at a TATA element by factors that bind the upstream promoter element .
Transcription Initiation by RNA Pol III
RNA polymerase III can bind three different types of promoters, which consist of conserved elements arranged in alternative configurations.
how does multiple rounds of transcription occur for one DNA sequence
Escape from the Initiating Complex: Once the RNA polymerase (same for I, II, III) is bound to the promoter, forming the pre-initiation complex, it must transition to the elongation phase of transcription. This is often referred to as “promoter escape,” where the RNA polymerase leaves the promoter region to begin synthesizing RNA along the DNA template.
Recruitment of New Complexes: After the RNA polymerase has cleared the promoter, the transcription initiation complex that remains bound to the promoter can then recruit another RNA polymerase molecule. This allows for multiple rounds of transcription initiation, enabling the gene to be transcribed repeatedly, which is essential for producing the required amounts of RNA.
Termination process for RNA Pol II Transcription Mechanism
- Cleavage and Polyadenylation:
As RNA polymerase II transcribes the DNA into RNA, it eventually transcribes a sequence that signals the end of the gene, known as the polyadenylation signal (typically AAUAAA in humans).
Two key multi-protein complexes, Cleavage and Polyadenylation Specificity Factor (CPSF) and Cleavage stimulation factor (CstF), recognize this signal and bind to the RNA. - Pre-mRNA Cleavage:
CPSF and CstF together direct the endonucleolytic cleavage of the pre-mRNA about 10-30 nucleotides downstream of the polyadenylation signal. - Poly(A) Tail Addition:
Poly(A) Polymerase (PAP) adds a string of adenine nucleotides (the poly(A) tail) to the newly created 3’ end of the RNA. This polyadenylation is important for mRNA stability and nuclear export. - RNA Processing and Export:
Guanylyl transferase adds a 7-methylguanosine cap to the 5’ end of the pre-mRNA, which occurs co-transcriptionally (as the RNA emerges from RNA polymerase II). This capping is crucial for RNA stability and later for initiation of translation.
Methyltransferase then adds a methyl group to the guanine cap, forming the 7-methylguanylate cap (m7G). - Binding of Poly(A) Binding Protein (PABP):
PABP binds to the poly(A) tail, contributing to the stability and regulation of the mRNA. - Transcription Termination:
Tafter cleavage, the RNA polymerase II continues to transcribe for a short distance before it dissociates from the DNA. This could be facilitated by the still-transcribing polymerase encountering the cleaved, polyadenylated RNA, or by other factors that signal the polymerase to disengage.
Endoribonuclease may be involved in degrading the RNA downstream of the cleavage site, which may help to dislodge RNA Pol II from the DNA. - Torpedo Model:
A proposed mechanism for the actual termination step is the “torpedo model,” in which an exonuclease (like the 5’ to 3’ exoribonuclease Xrn2 in humans) degrades the downstream, uncapped RNA following cleavage and chases down the RNA polymerase. Once it catches up to the polymerase, the degradation of the RNA induces the polymerase to terminate transcription.
How is RNA Transcription and processing in eukaryotes viewed as a ‘factory’ ?
- RNA pol II tail (aka C-terminal domain) gets phosphorylated to be detatched from initiation proteins and start transcription elongation
- As it detaches from initiation proteins it also allows new proteins to become attached to the CTD which function in RNA elongation and RNA processing steps
- the CTD is essentially a scaffold which holds a variety of proteins close by the RNA pol II, so that it can be transferred and used on the nascent RNA when required
- This allows the RNA pol II to not only elongate and transcribe nascent RNA, but also allow its proteins to process the nascent RNA as it is formed
- allows overall rate to be sped up
what are the 3 processes to process nascent RNA in eukaryotes
- 5’ Cap (7-mehylguanosine)
- 3’ Poly A tail
- splicing
what is one MAJOR DIFFERENCE between bacterial and eukaryotic mRNA (tip: function)
- prokaryotic mRNA can be transcribed in an operon (multiple related genes controlled by one promoter i.e lac Operon)
- Eukaryotic mRNA is always 1 protein controlled by 1 promoter (however alternative splicing can create more that one protein)
what is polycistronic transcription?
- Polycistronic transcription is specific to prokaryotes and some viruses, where a single mRNA transcript can encode multiple proteins, because it contains several open reading frames. Each of these frames corresponds to a different protein, which is a common feature in bacterial DNA.
- i guess it is sort of bacteria’s way to get one gene to code for multiple proteins (like alternative splicing in eukaryotes)
give a brief overview in 4 steps of RNA processing in euakryotes
- capping proteins are attached onto ser5 of CTD of RNA pol II once it has been phosphorylated by TFIIF during transcription initiation
- Once RNA pol II starts transcription and produces aroun 25 nucleotides on nascent RNA, the RNA capping proteins add a 5’ cap to nascent RNA prevent it from being hydrolyzed by exonucleases
- CTD phosphorylated at Ser2 positions, and eventually dephosphorylated at Ser5, anddissociates w DNA to allow reinitiation
- this eventually leads to the attraction of 3’-processing proteins and splicing proteins
what happens if during RNA transcription of eukaryotes, there is no CTD?
- we can compare RNA pol II with RNA pol I and III transcripts
- RNA pol I and III also produced RNA transcript, but they are uncapped at 5’ because of a lack of CTD
- The 5ʹ-methyl cap also has important role in the translation of mRNAs in the cytosol, and helps w mRNA further processing and exporting
- hence the 5’ cap mRNA helops distinguish it from other types of RNA
RNA capping
- RNA pol II’s CTD during initiation is phosphorylated by TFIIF at ser5, and hence dissociates from initiation proteins
- 3 capping enzymes acting is succession is bound to CTD (as CTD is phosphorylated at Ser5): a phosphatase, a guanyl transferase, a methyl transferase
- once RNA pol II produces about 25 nulceotides of nascent RNA transcript, these 3 enzymes start capping process in succession
- phosphatase removes a phosphate from 5’ end of nascent RNA
- Guanyl transferase adds a GMP to the 5’ end via reverse linkage (5’-5’ link instead of 5’-3’ link)
- then methyl transferase adds a methyl group to the Guanosine
give a very simple overview of splicing
- RNA pol II CTD contains some components which later form the spliceosome
- during elongation, the nascent RNA is marked by components of spliceosome or additional SR proteins to mark downstream and upstream positions of where a splicing event should take place. This reduces exon skipping and cryptic splicing errors during splicing, and increases accuracy, to allow splicing to be made at correct location
- spliceosome in a snRNP containing RNA and proteins to capture, splice and release RNA through forming a lariat and 2 phosphoryl-transferase reactions
- 3 locations on the intron must be defined for spliceosome to recognize and splice: the 5’ splice site, the 3’ splice site and the branch pount in the intron. (recognized by spliceosome by consensus sequences, and other additional steps)
- after splicing and joining of the 2 exon sequences, the exonjunction complex (EJC) will be bound to the location of the splicing event to mark location of splicing event and to determine mRNA fate
- IMPORTANT: splicing labelling and marking occurs during elongation, but the actual splicing reaction doesnt take place until the transcription is complete!
what is splicing
- each splicing event removes 1 intron
- 2 sequential phosphoryl-transfer reactions (transesterification)
- joins 2 exons together well removing th eintron between as a lariat
structure of spliceosome and function of spliceosome (7)
- catalyzes pre-mRNA splicing
- is a complex of 5 additional RNA molecules and several hundred proteins
- hydrolyzes many ATPs while being flexible enough to deal with a huge avriety of introns
- key steps in RNA splicing performed by RNA
- these RNA recognize consensus sequences in the introns
- spliceosome contains the RNA molecules (snRNA): U1, U2, U4, U5, U6, each is complexed with 7+ protein subunits to form snRNP (which is the core of the spliceosome)
- recognizes 5’ splice junction, 3’ splice junction, branch point site, consensus sequence in introns (due to base pairing)
what is an important function of having a complexed splicing complex?
- ensure accuracy (last step of transcription) acts as a checkpoint
- allows flexibility to deal with huge variety of introns found in euakryotic cell
what is variability in introns?
- differ hugely in size (some introns are 100 nucleotides some are 10,000)
- although they all have consensus sequences but still may be difficult to determine (maybe due to cryptic splicing)
- majority of a gene is intron (up to 95%+)
It is wasteful to cut out so many introns, what is the point then?
- suggest: exon-intron arrangement may facilitate emergence on new proteins over evolutionary time scales
- presence of numerous introns in DNA allow genetic recombination to allow combining of exons from diff genes to emerge ne proteins
- many cells have composed a common set of protein domains
- alternative splicing increases coding potential of genomes
what are the 2 reactions taken place during splicing
- 2 phosphoryl-transfer reactions (transesterification)
1. A specific adenine nucleotide in the intron sequence (the branch point) attacks the 5’ splice site and cuts the sugar phosphate backbone of RNA at 5’ splice point (first phosphoryl transfer reaction)
2. the cut 5’ site become covalently linked to adenine nucleotide there by creating a loop in RNA
3. the released 3’OH end of exon sequence then reacts w the start of the nect exon sequence 3’ splice site
4. that joins the 2 exons tgt and releases lariat (2nd phosphoryl transfer)
5. releases lariat is broken down to single nucleotides and recycles
what are the consensus sequences on RNA transcript for splicing
- 5’ splice site: starting with GU—-
- branch point: the Adenine
- 3’ splice site: ends with AG
* Apart from starting GU and ending AG, the rets of the consensus sequence is variable
* distance along RNA between 3 splicing sites are highly variable depending on length of intron
* base pairing between RNA transcript and spliceosome help identify these regions
how is trancription and splicing coordinated?
- few components of the spliceosome assemble on pre-mRNA as it emerges to prevent error in splicing as the RNA transcript emerges (marks the positions to ensure correct splicing but the actual splicing doesnt occur until transcript completed)
- snRNA that is carried by phosphorylated CTD on RNA Pol II. As RNA transcript emerges, the 5’ splice site will only be labelled with the 3’ splice site that emerged form the polymerase by snRNP. Sites are labelled as they emergy, to prevent exon skipping
- Exon definition; as RNA synthesis occurs, group of SR proteins assemble on 5’ and 3’ splice sites, then recruit U1 snRNA which marke the downstream and U2 to mark upstream of exons (prevents cryptic sites and exon skipping)
full mechanism of splicing and snRNPs
- the U1 snRNP forms base pairs with 5’spice junction (due to marking of SR proteins). The BBP (branch point binding protein) and U2 auxilliary factor recognize and bind to branch point site
- U2 snRNP recruited and replaces BBP and U2 auxiliary factor and forms base pairs with branch point consensus sequence
- the U4/U6* U5 triple snRNP brings together the U1 and U2, and sebsequent rearrangements abd reactions break apart U4/U6, causing U6 to siplace U1 at 5’ junction. This change in arrangement in spliceosome allows creation of active site which catalyzes the first phosphoryl transfer reaction
- additional RNA-RNA rearrangements create the active site for 2nd phosphoryl transferase reaction, and completes the splice. This 3’ splice site cleavage, and joins the 2 exon sequences, while putting a EJC protein to mark splice event
what is the use of ATP in splicing?
- ATP not required during the 2 transesterification reactions as the breaking of bonds during this releases high energy phosphate bonds
- however, ATP hydrolysis required for assembly and reassembly of spliceosome; to break RNA-RNA interactions between U4/U6 to allow formation of new ones and new rearrangements
what is the purpose of RNA rearrngements in the spliceosome? (3)
- allow splicing signals on the pre-RNA to be examined and checked by snRNPs multiple times prior splicing. E.g U1 snRNP recognizes 5’ splice thru base pairing and SR proteins, and as splicing proceeds U1 is replaced by U6 which also requires base pairing. This ensures base-pairing signals are correct and not mis-paired Increases overall accuracy
- rearrangements create active sites for the 2 transesterifications. Only after splicing signals are checked, the 2 active sites are sequetially created. Orderly progression, with checks between each active site creation, along with specificity of active site ensures rare errors
- Once splicing complete, snRNPS remain bound to lariat, disassembly of snRNP from RNA and each other require RNA-RNA rearrangements that also require ATP hydrolysis (return to original configuration to be used again)
most important feature of spilceosome?
- caalytic sites formed both by protein and RNA
how to prevent errors in splicing? what are the possible errors?
- the rearrangements through base pairing and creation of active sites prevent errors
- 2 types of errors:
1. exon skipping: skipped an exon
2. Cryptic splice site selection: Cryptic splicing signals
are nucleotide sequences of RNA that closely resemble true splicing signals and are sometimes mistakenly used by the spliceosome. - there are 2 ways to prevent this
1. Coupling transcription with splicing: as transcription proceeds, the phosphorylated CTD of RNA Pol II carries several components of spliceosome (U1 and U2), which are transferred to RNA transcript as it emerged. The snRNPs are only bound to 5’ splice site once the 3’ splice site emerged. Coordination of labelling splice sites as they evolve prevent exon skipping
2. exon definition: exon sizes are much uniform than introns (avg 150 across eukaryotes) hence allows better labelling. SR proteins assemble on exon sequence and makr teh 5’ 3’ splice sites. SR proteins then recruit U1 (downstream mark) and U2 (upstream mark). SR proteins bind to specific RNA seqs on EXONS (splicing enhancers). These sites can be created without affecting amino acid sequence due to wobbling base (redundancy in amino acid code)
how does chromatin structure affect RNA splicing
- Nucleosomes tend to be positioned over exons, hence can determine the speed of RNA transcript emergence and hence allow proteins responsible for exon definitions to assemble on RNA transcript as it forms
- bcuz splicing and transcription is coupled, hence the rate of transcription will effect rate and accuracy of splicing. If transcription occurs slowly, it prevents exon skipping, as the spliceosome will formbefore the next splice site emerged. Nucleosome in condensed chromatin will cause RNA pol II to pause or take longer to transcribe
- mechanism unknown: chromatin structure/histone modification can attract components of spileosome and these components can then be transferred to nascent RNA
how does RNA splicing show plasticity?
- adapting to mutations: Mutations in splicing sequence often lead to new splicing patterns.Common outcomes include exon skipping or the use of cryptic splice sites.This flexibility suggests that RNA splicing’s adaptability has been crucial in evolutionary processes.
Mutations affecting splicing can lead to severe diseases, such as β-thalassemia, cystic fibrosis - alternative splicing: Alternative splicing allows for different proteins to be produced from the same gene, enhancing the genome’s coding potential.
How does splicing provide evidence for RNA world theory?
- lariat mechanism involving snRNPs is used rather than a simpler nuclease cleavage
- evidence for RNA world theory: Early cells likely used RNA as both genetic storage and as catalysts, predating the use of DNA and proteins for these roles.
RNA-catalyzed splicing was crucial in these early cells, supporting the evolutionary precedence of RNA in cellular functions. - Certain RNA introns can splice themselves without proteins, evident in organisms like the ciliate Tetrahymena, some bacteriophage T4 genes, and in mitochondrial and chloroplast genes, hence eukaryotic splicing probably evolved from that
3 steps of RNA processing
- 5’ capping occurs as soon as 25 nucleotides are made
- splicing components are transferred from CTD to RNA transcript to mark splicing sites, as RNA nascent transcript emerges
- as RNA polymerase reaches a gene, 3’ poly A tail added
3’ poly-A tail processing of pre-mRNA
- transcription termination signal encoded in genome
- these signals are transcribed into RNA and are recognized by 2 enzymes: CstF (cleavage stimulation fcator) and CPSF (cleavage and polyadenylation specificicty factor); these enzymes travel on CTD RNA Pol II tail and transferred on to the 3’end and the signal is transcribes
- CstF and CPSF bind to termination signal on nascent RNA and create the 3’ end of mRNA by first cleaving it from RNA pol II. Then 3 additional processing steps occur
* Poly-A Polymerase (PAP) adds one at a time 200 A. nucleotides to 3’ produced by cleavage (added via norma 5’-3’ linkage), These nucloeitde precursor in ATP and doesnt have a template, hence poly-A-tail is not encoded in genome
* Then, poly-A binding proteins assemble on to it and determine final length of Poly-A -tail - after mRNA being cleaved, the RNA Pol II continues transcribing (as it is still bound to DNA), it produces hundred of nucleotides but as these nucleotides lack 5’ cap, they are quickly degraded. Continued RNA degradation causes RNA pol to dissociate and terminate transcription
2 functions of 5’ cap
- enable protection from exonuclease
- form complex that facilitate ribosome assembly to initiate translation
what is backsplicing? what does it do?
- a splicing mechanism that forms circRNAs
- Black splicing mechanism: downstream of one exon is linked to the upstream of the same exon (or another upstream exon), this creates a loop containing the introns to be removed
- this process is catalyzed by exon definition complexes and in backsplicing they cause a circular linkage forming circRNAs
- circRNAs dont have 5’ or 3’ ends as they r a closed loop
- circular property makes them resistant to degradation by exonuclear, and becomes more stable
- circular RNAS have functional roles within cell due to stability: act as molecular sponges for microRNAs and affetc they gene expression, regulate protein synthesis
what ius an alternative spliceosome
- an alternative splicing mechanism involving U11/U12 instead of U1/U2, which also forms a lariat and has the U4,U5,U6 triple complex
- dedicated to minor class of introns
what are prokaryote type introns
- bacterial and organelle pre-mRNA contail self splicing introns
- this catalytic property is conseved by secondary intron structures, which are absent in nulcear introns
what is the non-sense mediated mRNA decay?
- a first quality check control system prior to translation
1. Initiation of mRNA Transport: - The process begins as the mRNA molecule moves from the nucleus to the cytosol.
- As the 5ʹ end of the mRNA exits the nuclear pore, it encounters a ribosome that starts translating it.
- EJCs are bound to the mRNA at each splice site.
* As the ribosome translates the mRNA, it displaces these EJCs as it moves along. - Normal Termination of Translation:
* In a properly spliced mRNA, the normal stop codon is located in the last exon.
* By the time the ribosome reaches this stop codon, all EJCs should be displaced.
* If no EJCs remain when the ribosome stalls at the stop codon, the mRNA is considered correct and continues to full translation in the cytosol. - Nonsense-Mediated Decay:
If a stop codon is encountered prematurely while EJCs are still bound to the mRNA, it indicates a potential error in the mRNA.
* This early stop (nonsense codon) triggers the rapid degradation of the mRNA.
* This mechanism allows the cell to “inspect” each mRNA for errors during its first round of translation as it exits the nucleus.
mechanism for transporting mRNA into cytosol
- properly processed mRNA lack certain proteins like snRNP, if snRNP present in mRNA after processing then aberrant splicing occured, and this signals nuclear exosome to degrade it
- other RNA debris, improperly processed mRNAs is not exported into cytosol and remain in nucleus and eventually degraded
- hnRNPs (heterogeneous nuclear ribonuclear proteins) are abundant on emerging pre-mRNA molecules whichhelp unwind hairpin RNA to make splicing signals readable
- mRNA export through nuclear pore complexes (NPCs) through nuclear transporters, and macromolecules like mRNA protein complexes require active transport
- export - ready mRNA hovers at pore entrance and goes through final checks like nonsense mediated mRNA decay before being transported through NPC
- REF protein is a part of EJC protein of readily transported mRNAs
- TF TAP/Mex bind to REF and actively transports mRNA through nuclear pore
- TAP/Mex dissociates from mRNA once in cytoplasm
3 key features of translation
- redundancy
- universal (except mitochondria and candida albicans)
- non-overlapping (with the exception of some polycistronic RNAs which have multiple ORFs)