TFBS Prediction Flashcards
Week 2 Lecture 2
Why do we want to predict TFBS?
- They are key elements in the regulation of gene expression.
- They help in the general understanding of gene expression
- Can help with gene finding
- Mutations in TFBS can lead to disease - Experimentally validated sites are limited in number
- Most experimental methods have poor resolution - need to find the actual site within the experimental site
Transcription initiation in prokaryotes
RNA polymerase has a strong affinity for the promoter and basal transcription rate is high
Transcription initiation in eukaryotes
RNA polymerase II and RNA polymerase III pre-initiation complexes don’t assemble efficiently, the basal transcription rate is low, and other transcription factors are needed for effective initiation
What are transcription factors?
Sequence-specific DNA binding factors that activate the initiation of eukaryotic transcription
Types of transcription factors
- Constitutive: Work for many different genes and don’t respond to external signals
- Regulatory: Limited number of genes and respond to external signals
What do transcription factors recognise?
- Upstream promoter elements: they influence initiation at the promoter to which they are attached
- Targets within enhancers: influence several genes at once
Traditional view of how transcription factors work
They activate the formation of the pre-initiation complex by:
1. Making direct contact
2. Making indirect contact
3. Inducing a DNA bend
New view on how transcription factors work
- Some can modify histone proteins affecting nucleosome positioning
- Some can bend DNA into a specific shape bringing other TFs into contact with the pre-initiation complex (enhanceosome)
- Some do not bind to DNA but form protein-protein contacts with the pre-initiation complex
Examples of transcription factors
Up to 2600 TFs
e.g. Oct-1, Oct-2, Heat shock factor, Serum response factor, GATA-I
Size of consensus sequences
Most are 9-15 bp, mean 12.2 bp
How do we predict TFBS?
- Encode the patterns that describe a binding site
- Scan these patterns against DNA (needs to be more intelligent than naive scanning)
Techniques to identify TFBS
- Electro-Mobility Shift Assay (EMSA)
- DNase I footprinting/protection
- Systematic Evolution of Ligands by Exponential enrichment (SELEX)
- SELEX-Seq
- Chromatin ImmunoPrecipitation (ChIP)
- ChIP-chip
- ChIP-seq
- ChIP-exo
Electro-Mobility Shift Assay (EMSA)
in vitro
- The mobility on a gel is different for DNA bound to protein
- Control (DNA only), DNA and protein that does not bind, DNA and protein that does bind
DNase I footprinting/protection
in vitro
- Combines DNase I cleavage with electrophoresis
- The bound protein shields the DNA from cleavage
SELEX
in vitro
- Large DNA library
- Select for binders by affinity chromatography or by EMSA
- Amplify by PCR potentially with low stringency copying to allow mutations
- Subsequent rounds use higher stringency elution
- Sequence binders