The regulatory Genome - Week 3 Flashcards
regulatory genome
non-coding genome
in humans the non-coding genome represents
99% of the genome
only 1% of the genome is
coding
Evidence for function in the non-coding genome
biochemical evidence, genetic evidence, evolutionary evidence,
Evidence for function in the non-coding genome: biochemical evidence
Genome wide studies i.e. ENCODE - identifying molecular activity (transcription, transcription factor binding), up to 80% of the genome functional
Evidence for function in the non-coding genome: genetic evidence
Function defined by - phenotypic consequence of mutation, up to 15% of the genome ‘functional’
Evidence for function in the non-coding genome: evolutionary evidence
conservation of sequence across evolution, up to 10% of the genome ‘functional’
What does the non-coding genome contain?
Promoters, enhancers, silencers, insulators, noncoding RNA agents (microRNAs, piRNAs, structural RNAs, regulatory RNAs)
Non-coding functional elements associated with
distinctive chromatin structures that display signature patterns of histone modifications, DNA methylation, DNase accessibility and transcription factor occupancy.
Enhancers - how are they defined
A set of clustered short sequence elements that stimulate the transcription of a gene and whose function is not critically dependent on their precise position or orientation
Enhancers - mechanism of action
serve as sites for pre-initiation complex (PIC) formation (complex of dozens of proteins including RNA polymerase and general transcription factors necessary for transcription.)
By looping of the intervening DNA to make contact with the target gene promoter the PIC is delivered to the required site and transcription proceeds.
This loop formation might increase the concentration of PIC at the target gene and thus transcription. This is one model whereas there is also evidence for other models:
• Looping results in relocation of target gene within nucleus to a ‘neighbourhood’ within the nucleus more favourable for transcription (so called transcription factories).
• Looping results in recruitment of transcription elongation factors to promoters that can increase rates of transcription elongation (the step after transcription initiation)
Enhancers - what evidence is there that they are important to human health?
- Genetic evidence can be provided by finding mutations causing disease. For Mendelian disease mutations in a limb enhancer for SHH is a prototypical enhancer.
- Single nucleotide polymorphisms associated with risk of developing complex, polygenic disease have also been shown to be enriched in disease-relevant cell type enhancers (e.g. type 2 diabetes and pancreatic islet enhancers)
- Evolutionary conservation of enhancer sequences is also good evidence of importance to human health
- As is evidence of paucity of rare variation across human populations in human accelerated regions (regions with evidence of accelerate divergence from other primates and thus likely important for human-specific traits) Most HARs are believed to be enhancers.
Enhancers - What methods are used to identify them?
- Chromatin Conformation Capture (to capture looping interactions)
- Reporter Gene assays (e.g. luciferase) to show ability of sequence to increase expression of reporter gene transfected into cells
- ChIP-Seq (identify DNA binding sites for transcription factors and DNA associated with histone modifications associated with enhancer function (e.g. H3K4me1)
- DNase-Seq (treat DNA with DNase I that cleaves DNA in open chromatin (e.g. enhancer) and sequence these fragments)
Enhancers - What are the current and future directions for research? (ie what remains to be found out and why is this important?)
- Cell sorting to identify spatially and temporally restricted enhancers (e.g. Human Cell Atlas)
- Is there an enhancer code? (the sequences of enhancers does not give good discrimination but could other parameters (ChIP-Seq binding) provide better discrimination?
- To what extent can human disease (Mendelian and complex, polygenic) be explained by genetic variation in enhancers?
Insulators - how are they defined?
DNA element that acts as a barrier to the spread of chromatin changes or the influence of cis¬-acting elements
Insulators - What is/are there mechanism of action?
- Insulator sequences have been identified by their ability to block the activity of enhancers to increase gene expression when placed between promoter and enhancer. This may be by preventing the spreading of heterochromatin or the looping and contact between enhancer and promoter.
- Insulators have been shown, in vertebrates, to be enriched in CCCTC sequence and bind CCCTC-binding factor (CTCF). CTCF plays a key role in establishing topologically associating domains (TADs). Interactions within TADs are enabled whilst between TAD interactions are prevented
Insulators - What evidence is there that they are important to human health?
- CTCF-binding sites that define insulator sites are frequently mutated in cancer
- Microdeletions in the an insulator sequence that regulates IGF2 and H19 imprinting results Beckwith-Wiedemann syndrome
Insulators - What methods are used to identify them?
- Reporter genes inserted either flanked by insulator sequences or not. If flanked sequences are insulating then the reporter gene will be active whether inserted into heterochromatin or euchromatin.
- The ability of an insulator sequence to block promoter: enhancer interactions can be assayed by placing the tested sequence in between promoter and enhancer.
Insulators - What are the current and future directions for research? (ie what remains to be found out and why is this important?)
• In species such as Drosophila many more proteins that bind insulators are known than for humans. Are there more insulator proteins in humans and will this explain the lack of CTCF-binding sites at some topologically associating domains (TADs)?
Non-coding RNAs - How are they defined?
Mature RNA transcript that is not translated to make a polypeptide
Non-coding RNAs - What is/are there mechanism of action? List the types of non-coding RNA
MicroRNAs (miRNAs), Transfer RNAs (tRNAs), Ribosomal RNAs (rRNAs), small nuclear RNAs (snRNAs), long non-coding RNAs (lncRNAs), Circular RNAs (circRNAs), Piwi-interacting RNAs (piRNAs)
Non-coding RNAs - What is/are there mechanism of action? MicroRNAs (miRNAs)
small (~20nt) single-stranded RNAs that bind, via complementary base pairing, to transcripts and cause translational repression and/or mRNA degradation
Non-coding RNAs - What is/are there mechanism of action? Transfer RNAs (tRNAs)
adaptor RNAs with a classic cloverleaf structure that carry amino acids covalently linked to their 3’end and also containing an anticodon sequence that base pair with the codons on mRNAs. Hence crucial for translation.
Non-coding RNAs - What is/are there mechanism of action? Ribosomal RNAs (rRNAs)
Ribosomes are protein:RNA macromolecular complexes that catalyse the reaction of joining amino acids together. rRNA makes up ~80% of RNA (by mass) in a human cell.