Sudbery Flashcards

1
Q

Why do we care about transcriptomics?

A
  • 98.5% of protein coding seq is the same human to mouse
  • 1-2% of the genome is coding (hence why we are so diff from mice)
  • metazoan genomes are not selected for size → much is repetitive seq for decaying pseudogenes
  • every cell has the same DNA, but cells are diff –> dep on what genes are active/exp levels
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What did ENCODE find about ‘junk’ DNA?

A
  • most of what was thought to be junk has a function in controlling something
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How much of the genome did ENCODE claim was functional?

A
  • 80% (CRMs)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are cis-regulatory modules?

A
  • inc promoters, enhancers, silencers and insulators

- regions of DNA that bind DNA BPs (eg. TFs) and reg gene exp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What seq motifs do DNA BPs bind, and what is the result of this?

A
  • bind degenerate sequence motifs
  • binding sites vary, but certain seqs are more likely
  • but just because seq is present doesn’t mean it will bind
  • eg. 8 mil GATA1 binding sites in the genome, only 0.2% bound by GATA1 (ChIP-seq)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Does all DNA exist as heterochromatin or euchromatin?

A
  • no, sliding scale
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is hetero and euchromatin?

A
  • heterochromatin = tightly packed

- euchromatin = loosely packed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What regions tend to be nucleosome free, or have v few nucleosomes?

A
  • CRMs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How tightly packed is chromatin in transcribing genes?

A
  • intermediate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can nucleosome free regions of genome be mapped?

A
  • map w/ DNase-seq
  • DNase only cuts where there are no nucleosomes, can use to build up genome wide pic of where nucleosome free regions are
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What did ENCODE measure and how?

A
  • RNA expression –> RNA-seq, CAGE-seq and RNA-PET
  • DNA/protein interactions –> ChIP-seq
  • chromatin accessibility –> DNase-seq and FAIRE-seq
  • 3D structure –> ChIA-PET and 5C
  • methylation –> RRBS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does ChIP-seq work?

A
  • prots bind to DNA, and use crosslinking to see where binds to DNA, chop up DNA and use Ab to select for DNA which is crosslinked to a prot, so can separate this DNA, seq it and work out where in genome prot binds
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What assays were carried out on what cells in ENCODE?

A
  • tier 1 = all assays
  • tier 2 = a selected subset of assays
  • tier 3 = everything else, eg. a specific assay or combination
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What did ENCODE prod?

A
  • lots of data sets and continues to gen new data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What did ENCODE claim?

A
  • vast majority (80.4%) of human genome participates in at least 1 biochemical RNA and/or chromatin-assoc event in at least 1 cell type
  • 19.4% covered by at least 1 DHS or TF ChIP-seq peak across all cell lines
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What was wrong w/ ENCODEs claim that 80% of the genome is functional?

A
  • 100% of genome participates in replication
  • about 60% of this 80% is transcription and about half of this is introns (which are NOT coding, and these are much bigger than exons so account for a signif proportion of the genome), and we would not necessarily say introns have a function
17
Q

Was ENCODEs claim that 19.4% of the genome covered by at least 1 DHS or TF ChIP-seq peak more sensible, why?

A
  • yes
  • assuming half the elements from TF and cell-type diversity sampled, could estimate a min of 20% of genome participates in these specific functions, w/ the likely figure signif higher
18
Q

What is the question at the debate over ENCODEs finding?

A
  • what do we mean by functional?
19
Q

What are the definitions of function?

A
  • causal role = a seq has a function if seq causes the function
  • selected effect = a seq has a function is that seq exists because of this function
  • also the genetic role = a seq has a function if it is req for that function (doesn’t have to be visible to natural selection)
20
Q

How is biological function created?

A
  • evolution and selection for that function
21
Q

What definition of function did ENCODE use, and what is the problem w/ this?

A
  • causal definition
  • but surely if a seq is important it will be selected for and if it has no effect on function then it is not important as far as evolution is concerned
  • but ENCODE estimates of functionality were way above the amount of DNA known to be selected (at time best estimate was around 5% under -ve selection)
22
Q

How can the amount of the genome under selection be investigated?

A
  • compare seqs from 2 species and find regions w/ fewer diffs than expected
23
Q

What is the problem w/ comparing seqs of 2 species to see how much of the genome is under selection?

A
  • what is expected?
  • if distant species used, only find function conserved across long time
  • if close species used, not enough mutations to find conserved regions
24
Q

How can problems in investigating how much of the genome is under selection by comparing seqs be overcome?

A
  • use indels –> compare 2 species, look at the gaps in alignment, are there regions w/ fewer gaps than expected?
  • had to work out how much seq conserved now in humans, can extrapolate this to plot human mouse, human horse, human chimp etc., then find fit line and extrapolate to 0
25
Q

If just look at -ve selection to see how much of the genome is under selection, then what’s missing?

A
  • +ve selection
  • non coding seqs
  • compensatory evolution (applies to non-coding seq)
26
Q

How does +ve selection affect genome selection?

A
  • seq changes because of new function
  • coding seq: dN/dS (comp synonymous to nonsynonymous changes and if lots of nonsynonymous then prob selecting for new function)
27
Q

How can selection of noncoding seqs in genome be investigated?

A
  • intra-species diversity comp to inter-species diversity

- using 1000 genomes, approx 4%

28
Q

What is the effect of compensatory evolution on genome selection?

A
  • TF sites control exp of gene, if lose a binding site then fitness reduced from 1 to eg. 0.8 (80% exp)
  • isn’t fatal so allows time to get another mutation to gain binding site and fitness again increased to 1
  • over years of evolution get many diff seqs that perform same function and give same fitness, but look quite diff
29
Q

What evidence if there for compensatory evolution?

A
  • mainly anecdotal, eg. from fly embryos
  • systematic evidence –> took 4 species of yeast, counted amount of TF binding in regulatory regions of genes, and looked at how much seq changed between 2 species, T is like evolutionary time, get much more changes in seq than binding energy, so conservation of function w/o conservation of sequence
30
Q

Are ENCODE elements conserved?

A
  • some signal of selection, but quite weak
  • melanocyte DHSs are depleted in somatic mutations in whole cancer genomes –> didn’t find somatic mutations in ENCODE elements
  • cancer function = unselected function?
  • so cancer needs seq but body doesn’t, does this make it functional?
31
Q

What are some eg.s of function w/o conservation?

A
  • eye colour genes –> not under selection, but is genetic
  • disease causing mutations such as AD –> no evolutionary advantage to stopping these mutations, as affects after reproductive age
32
Q

What did an experiment looking at enhancer and promoter evolution do and find?

A
  • experiment took livers from 20 mammals and mapped where enhancers and promoters are
  • promoters generally in conserved location
  • enhancers move between species, not v conserved between species
  • are they under seq constraint?
    promoters and enhancers have some conservation but much less than exons
33
Q

What evidence is there for function of promoters and enhancers?

A
  • 98% of DHSs are linked to a promoter in ChIA-pet experiments
  • genes closer to predicted enhancers tend to have higher expression levels in correct cell type
  • ENCODE tested a no. of elements in enhancer reporter assays “over half of the elements showing activity, often in the corresponding tissue type”
  • 65% of predicted human heart enhancers drove heart expression in mice
34
Q

What is the significance of evidence for function os enhancers and promoters being TF binding sites?

A
  • does not imply TF binding
  • does not imply enhancer state
  • does not imply contact w/ promoter
  • does not imply regulation of a promoter
  • does not imply phenotypic consequence for the cell
  • does not imply phenotypic consequence for the organism