ENCODE Project Flashcards

1
Q

Aims

A

aims to identify all genetic regulatory regions

- DNA regions regulated and factors regulating them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

ENCODE project

A

Encyclopedia of DNA Elements

- wanted to identify all functional elements in genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Complexity of Expression

A
  • TF, enhancers, silencers, methylation patterns
  • splicing factors regulate transcripts
  • control and interaction of factor not well understood
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Identification of Regulatory Features

A
  1. RNA sequences: regions transcribed
  2. ChIP-seq: reveals where protein binds
  3. open chromatin: accessible areas to regulatory proteins
  4. chromatin interaction: disparate regions brought together to regulate a gene
  5. DNA methylation: methyl groups
  6. RNA binding: positions for regulatory proteins
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

RNA-seq

A
  • RNA fragmented
  • sequenced
  • mapped to reference gene
  • exons identified
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

ChIP-seq

A
  • DNA with interacting proteins sheared
  • immunoprecipitation with TF antibodies
  • purify without TF
  • map to reference genome
  • find TF binding site location
  • only able to identify sites on known TF
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

DNA Hypersensitive Sites

A
  • highly sensitive chromatin regions to DNase 1
  • nucleosomal structure less compacted allowing DNA to bind gproteins
  • mapping these sites identifies locations of regulatory elements
  • tells us novel binding sites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

DNase-seq

A
  • DNase digestion
  • library preparation
  • PCR amplification
  • sequencing
  • map back to show open regions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

DNAase Footprinting

A
  • number of fragments that map to a sequence is a measure to regulatory activity
  • sites bound by TF show highly specific patterns of DNase1 cleavage
  • genome wide footprinting method
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

FAIRE-seq

A
  • alternative DNase seq
  • formaldehyde cross linking (more efficient in nucleosome bound DNA)
  • phenol extraction
  • identifies open regions
  • higher coverage at enhancers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ATAC-seq

A
  • assay for transposase accessible chromatin
  • alternative to DNA-seq using mutated hyperactive transposase instead
  • cuts exposed DNA
  • isolated, sequenced, mapped
  • small sample size and fast
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Chromatin Interaction

A
  • genes regulated by regions distant from promoter

- need a way to identify long range interactions involving protein factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

ChIA-PET

A
  • isolates chromatin complexes
  • identify DNA sequence
  • PET sequences mapped back
  • shows self ligation: one fragment (TF binding site)
  • interligation: two fragments (DNA coming together)
  • clusters of overlapping PET sequences identifies enriched protein binding sites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

ChIA-PET method

A
  1. separate out bound chromatin
  2. ChIP enrichment via antibodies
  3. links added to DNA ends
  4. proximity ligation: linkers of same chromatin will use the same linker (closer together = more likely same type will link)
  5. digest to make tags
  6. separate same and different linkers based on types
  7. sequencing and analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Long Range ChIA-PET

A
  • chromatin cross linked and fragmented
  • isolated with immunoprecipitation
  • DNA ligated (self and inter strand)
  • fragmentation and sequencing adaptors added
  • linker DNA isolated
  • mapping/sequencing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Reduced Representation Bisulphite Sequencing

A
  • reduces amount of nucleotides needed to sequence to 1% of genome
  • doesn’t identify all CG sites
  • uses RE to cut CG to make CG end fragments
17
Q

Spliceosome

A

U1 snRNP complex recognises 5’ splice site
U2 AF proteins recognise 3’/polypyrimidine tract
U2 snRNP complex recognises the branch site
- SNPs can have significant effect

18
Q

RIP-seq

A
  • immunoprecipitation of RNA binding protein of interest

- RNA bound to RBP isolated for sequencing

19
Q

CLIP-Seq

A
  • additional crosslinking with UV
  • crosslinking causes tighter binding and allows identification of binding sites when cDNA mapped back
    1. immunoprecipitation
    2. digestion
    3. reverse transcription
    4. sequencing/mapping
20
Q

-Seq Key Features

A
  1. identify region of interest
  2. isolate sequence
  3. sequence fragment and map back to genome
21
Q

ENCODE Controversy

A
  • states most of genome is function
  • considers anything transcribed is functional (many such as pseudogenes are not)
  • emphasized sensitivity over specificity (false positives)
  • lack of appropriate controls
  • abritrary choice of cell lines
  • doesn’t detract from data *