Lecture #3 (Transcription Factors and Chromatin) Flashcards
Chromatin (Overall)
Chromatin = Things co-associated with DNA
- When you spin down DNA here are proteins and RNA that come with it (weight of proteins = weight of DNA)
Chromatin = DNA + RNA + Histoen proteins + Non-histone proteins
- Chromatin = Nucleosomes + Histone proteins
Two types of proteins that come with DNAT6
- Histones - Form the fundamental units of chromatin organization
- Basic positivley charged proteins
- There are 5 histone proteins - Non-histone proteins
- Acidic proteins
- There are thousands on non-histone proteins (DNA/RNA polymerase, DNA binding proteins etc.)
Have as much histone and non-histone proteins by weight
Epigenome
Really just another term for chromatin
Two components of epigenome:
1. Transcription factors/Regulatory DNA elements
2. Chromatin
Genome + Epigenome
Genome + epigenome = work together to make organisms
Genome + Epigenome = olecular blue print for everything (Ex. Growth + adaptation + differentiation)
- Also affects hormonal signals between organs
There is NOTHING in biology that is not affected by gene expression via epigenome
Mutations in epigenome
Mutations = affect DNA and RNA in epigenome –> Leads to diseases
Have many pathologies related to dyfunction of the epigenome
Nucleosome
Nucleosome = fundemental unit of chromatin
Conservation of Histone proteins
Histone proteins = most conserved proteins on planet
- H4 is the most conserved of the histones
Histones = small –> 100-120 Amino Acids –> H4 in mammal vs. plants has 1 Amino aciid difference (VERY conserved)
- Other histoes have more divergene but stil consevred
Conservation = allows you to study histones in model organsims –> the principles will apply to other organsims inclduing humans evcause have similar proteins
Underlying prinsiples between organisms
Underlying principles of cell and tissue differentiation in Eukaryotes
Example - principals of turning on genes in humans is model organisms is not veyr different bery turning on genes in humans
- Can pick which organism is easiest to study in and traslate to humans
Human Genotype Expression prohect (GTEx)
Mapped expression of gene gene in every cell in human body
Take homes:
1. Human genome has 20,000 protein coding genes
2. Human genome has 10,000-13,000 non-coding RNAs
3. In any cell type only HALF the genome is expressed (a little abive half) –> ONLY 10,000-13,000 of protein coding genes is expressed ; 7,000-10,000 of the protein coding genes are silenced
- Need to make sur ethe right genes are silences or expression( (Different in different cell types)
4. 8,000 of the expressed protein coding genes are core acitivies (ubiquitus) - what every cell needs to survive and replicate
5. 2,000-5,000 of teh PCGs are preferntailly expressed in certain cell types (expressed ebeyrwhere but enhanced in some cells and not others)
6. 200 of the PCGs are tissue specific
- Want to look at the 200 master proteins to look at cell type and cell fate
Ubiqitous genes in genome
House keeping genes = ubiqitous genes = ALWAYS ON
How do they get truned on?
- Example 0 Insulin gene –> HAVE Transcription factors that are specific to be expressed in that cell type
What regulates Eukaryotic Transcription Factors
Eukaryotic trasnscription is regulated by proximal and distal promoters enhancer DNA elements
- Instruction for intiating transcription are near transcripton strat site AND can be far away
Enhancers = located away from the promoters (1kb-1MB)
- Enhancers are genetically validating as transcriptional control elements –> muttaions od enhancers affects gene expression
- Enhancers = can be upstream or downstream of the promoter –> feed and give infomration to trasncrtion start site where RNA polymerase binds
- Enhancers = orientation-independent (doesn’t matter if at 5’ or 3’ end)
- Ehnacers = regulate a target that is far away
- Enhancers = enhance trasmcription
- Enhancers = just as important as promoters
- Can be affected by envirnmnetal signal (give complexity to gene regulation)
Regulatory DNA Elements
Regulatpry DNA elements = short DNA sequences (10-20 BP) –> Elements will be recognized by transcrition factors
- Transcription factor protens recognize 10-20 BP
- More BP in sequence = more specific because fewer probaility those sequence will exist genome wide) –> TF have evoloved to recognize specfic sequences by making them longer (Ex. recognition sites are longer than RE)
MOst Transcription factors recognzie 100-1,000 sites but have some that only recognize 1 site
Heat shock protein Transcription factors
Heat shock proteins = encodes protein chaparones to help proteome not denature at a high temperture –> have a set of proteins and each portein coding –> each protein has an upsteram regualtpry elements (15 nucleotde lement) that is unique that binds to the transcriotion fcator and swicthes on the sets of genes because they have the same sequence on the upstrea regulatory
WATCH VIDEO
Promoters
Promoters = where transcription begins
Promoters and enhancers have TF binding site
Discovery of promoters/ehnacers
Discover promoters and enhancers done by knocking out promoters/enhacers
Can do systemic delation –> detect enhnacer –> see if teh gene stops working or decrease in expression
- Find enhmacers/promoters by knockout
Enhancer DNA elememts
Enahcer = DNA elements that are similar to elemenst at promoter –> bind to TF that may or may not be shared with TF that bind to the promoter (may or may not be same TF)
IN IMAGE - Ehnacer fartheer awya have different TF
Seqwunece specific TF are different from geenratal trascription fcators
- Gentral transction factors = found at the promoter
DNA and protein DNA intercatioons that need to talk to get RNA polymerase to promoter and commucate that it is time to move
Model for enhancer-promoter interaction
Model for how enhancers can work depsite being far away from the promoter = enhancer-promoter intercation by looping faciliated by cohesin ring (physical proximity)
Before - thought the single oozes down the chromatin fibers
NOW - think DNA can bend allowing the enhancer to contact the promoter lock together with cohesin ring
- IF mutate cohesin protein –> destroys the ring strcuture = impairs enhnacer-promoter communication (Doesn’t REALLY prove that the model is right)
- Promoter recruits RNA pilymerase –> Polymerase can lod onto the promoter
STILL DEBATE - people think it could be indirect protein cluster (condestae) betwen the enhancer and promoters instead of direct contact
Evidence that things are DNA elements are close to each other
Evidenece that DNA sequences that are far away from each other linearly are close spatially - Uses chromain cpature
Chromatin cpature - take cells –> cross link DNA using formadheye (cross links teh lysine residues) –> digest uncessary things away; cross linkages keeps the two fragments connected –> ligate DNA (if two things are close then they will ligate) –> Sequence –> Can see DNA is ligated to something that should be far away to know thas omething brought them together to be able to ligate
Issue with Chromatin Conformation Capture
Issue = measuring ligation NOT contact
- Looking at genome that shows TADs (TADs = regions that are ligatable)
In Chart - Peak of the trainge shows ligation –> Means that the places at teh two bottom corners of he trainge (regions far away on chrosmome) are interacting at the top point of traingle
- Interpre this a the DNA sequences are close BUT really they are just ligatable
THe sequences COULD be far apart and still be ligatable –> because they could be brought together by a cytoskeata protein (If thinsg are far away and moving randomly then you get a certain low ligation frequencey BUT of there is a cytosklatal cable between the two then they will be connected and can have a higehr ligation frequnecey
- Connected far apart but still have impored ligation (Flow in HiC)
2nd way to see 3D chromosomal interactions
Use Genome Archtecture Mapping (GAM) –> high resolution sectioning of the cell and part of the nucleus
Microdisect individual slides –> Put slices in well for genome analsyis –> measure co-segregation frequencey of two parts of the genome
GAM often condirems HiC results (main results between GAM and HiC are similar)
GAM = has single cell sensitivity (1 nucleus at a time) Vs. popultion cell in test tube for ligation reaction in HiC
Can detect mltiple interactions in 1 section (Ex. 10 things comthing togetehr) vs. HiC only see intercation between 2 things
Can detect interaction of super ehnacers and actived genes as triplets across Mbp distance (see enhnaer and proxbmity genes)
Issue with GAM
Resolution - issue with how thin you can slice
Have a proxmity limit of 220 nm BUT a nucleosome if 10nm –> Means you could have 22 nucleosomes in each slice
Super enhancers
A subset of human genes are regulated by super-enhancers (Common in pluripotencey genes and oncogenes)
Idea of what a super enhancer is:
1. Super-enhancer is its own thing
- Ex. Cell cylcle assocated genes + tumor supresser genes + oncogemes = regulated by 10-20 kb super enhnacer
- Ehnacer = only a few 100 BP (Super enhnacer is longer)
2. Super-enhancers are clusters of stnadrad enhnacers
What type of genes use superenhnacers
Very important genes use super-enhnacers to regulate genetic ectivity (Ex. oncogenes)
- Super-enhnacer = has signators of chromatin + have RNA pol there + enriched for mediators
Example 2 - Glbulin or insulin genes that are more simple (might not have super enhnacer?)
THINGS that recevive more signla s= need exrtra regulatpru circut = have many things that affect one promoyer = have kb (super-ehnacer) regulating that 1 promoter
- Regulation is not just 1-3 elements BUT it is a large cluster of elments over 10kb (ALL 10 kb is important)
- Ex. pluri=potencey gene = repsonding to a lot of signals = needs a lot of enhnacers
Promters + Enhancer DNA elements
Promoter and Enhancer DNA elements interact with sequence specifc Transcrtion factors to recruit general trnascription fcators + mediators + RNA polymerase 2