Tonon: Lecture V Flashcards

Functional Elements of the Genome

1
Q

What was the main focus of the ENCODE Project Consortium (ENCyclopedia Of DNA Elements)?

A

to develop and apply high-throughput approaches to detect all sequence elements that have a biological function

focused on 30 megabases (∽1%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What tools did ENCODE use to have a full perspective of the “junk” DNA?

A

DNAse digestion
microarray hybridization
DNA replication sites
Chip-chip
cis-regulatory elements (promoters, transcription factors binding sites)
long-range regulatory elements (enhancer, repressors/silencers, insulators)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name the steps scientists took to identify “junk” DNA and what they found:

A
  1. chose 44 genomic regions
  2. chose 30 megabases (15 known, 15 unknown)

they found “functional elements”, which is a genome segment that actually encoded for a defined product

DNA is no longer thought to be composed of mainly “junk”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are the important elements of a genome identified?

A

by comparing the homology of organisms

signatures are important, whether alone or in combination, in order to identify sequences with important functions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why did it take scientists so long to detect “junk” DNA? List the 4 reasons:

A
  1. genes are not as conserved
  2. functional elements were made of small or fragmented sequences that can be interrupted with other unspecific sequences
  3. can lie in repetitive regions of the genome, making their recognition difficult
  4. evolve very rapidly and can be nearly neutral to evolutionary processes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What were the experimental targets of ENCODE in relation to the 2012 experiment?

A

assign functions to as many DNA regions as possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

DNA methylation

A

regions layered with chemical methyl groups, which regulate gene expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

open chromatin

A

areas in which the DNA and proteins that make up chromatin are accessible to regulatory proteins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

RNA binding

A

positions where regulatory proteins attach to RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

RNA sequences

A

regions that transcribed into RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

CHIP-SEQ experiment

A

technique that reveals where proteins bind to DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Modified Histone

A

histone proteins, which package DNA into chromosomes, modified by chemical marks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Transcription Factors

A

proteins that bind to DNA and regulate transcription

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What was a limitation to the ENCODE approach?

A

some cells could be open and others closed, so the cells would be replicating at different points resulting in the analyzation of the average

also the preparation across different countries was not uniform even if the protocol was in place due to the slight variations that could occur in materials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is an observation based on the image in regards to H3K27ac?

A

H3K27ac is a marker used for enhancer expression

what can be observed is that the enhancers undergo silencing, so they are not a stabilizing feature of the genome as they can undergo epigenetic changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are 2 limitations to this study?

A
  1. transcription factors were not taken into consideration
  2. they overlooked the complexity of the organism and simply focused on definite cells and definite cell lines
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What did ENCODE find?

A

80.4% of the human genome participates in at least 1 biochemical RNA and/or chromatin-associated event in at least one cell type

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Explain the Manhattan plot and why it is used:

A

it represents the position of the gene in the genome in relation to its p-value

the red line is the threshold and everything above the red line is significant

used to see the association between a certiain region of the genome with a certain disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Explain the Manhattan plot and why it is used:

A

it represents the position of the gene in the genome in relation to its p-value

the red line is the threshold and everything above the red line is significant

used to see the association between a certain region of the genome with a certain disease

used to show Genome Wide Association Studies (GWAS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is GWAS?

A

https://www.youtube.com/watch?v=b7VTJv77f24

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What should be used as an enhancer for a manhattan plot studying GWAS?

A

microarray to visualize intergenic regions (Regions outside of the genes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What does GWAS identify?

A

association between disease trait and surrogate marker (tag SNP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is interesting about the findings related to GWAS?

A

only 3% of the genomic variants are located within genes

most are found in intronic or intergenic genes

24
Q

How do we analyze Crohn’s patients?

A

find associations with SNPs that are overlapping GATA2-binding site

25
What was found out about alternative splicing with ENCODE?
most genes undergo alternative splicing, during transcription with a local effect when the RNA is still bound to DNA
26
What is known about Poly(A)?
there are many poly(A) for certain genes
27
How many human genes have a Poly(A)?
at least 50%
28
In the example with CD47, what happens if the poly(A) tail is on a short 3'UTR?
CD47 protein doesn’t translocate to the membrane because the tail has nothing to bind to RNA is driven to the axon
29
In the example with CD47, what happens if the poly(A) tail is on a long 3'UTR?
binds the RNA to a few components and brings the whole set to the surface RNA remains in the soma
30
What do 3'UTRs act as?
a scaffold to regulate membrane protein localization
31
Highlight 3'UTR to cancer:
short 3'UTR → increased proliferation → alternative cleavage and polyadenylation → oncogenes activated in cancer cells
32
Why is the shorter 3'UTR associated with proliferation?
ncRNAs will have the ability to bind to it and prevent translation or induce elimination
33
What are 2 important factors in the discovery of transcription factors with ENCODE?
transcription factors work together found in both promoter regions and in intergenic regions, where apparently there are no genes (probably bound to regulatory regions, exerting some activities)
34
Describe the hierarchy of chromatin modifications in epigenetics:
DNA sequence & DNA methylation → nucleosomes → histone modification (compactin of chromatin) → nuclear organization and transcription factors
35
What is the nomenclature of histones?
first letter: histone (with number) second letter: residue number: number of histone type of modification number of residues
36
What are some exceptions for the nomenclature of histones?
acetylation does not have a number for the number of residues because there can only be 1
37
What is H3K27ac a typical modification of?
enhancers
38
What are the 3 types of chromatin modifiers and what do they do?
writers: put residue on lysine 4 erasers: readers
39
What is the most important histone?
H3
40
What are the most important residues?
K4, K9, K27, and K36
41
Are writers and erasers specific in their activity?
no, they add many different residues to different amino acids or remove many different histone modifications
42
What composes histone modifying genes?
writers erasers readers
43
What is the most frequent histone modification of promoters?
it is the 3 methylation of lysine residue number 4 in histone 3 (H3K4me3)
44
Which lysine residue number is almost always associated with active transcription of genes?
K4
45
What is on the promoters of active genes?
H3K4me3 (trimethylation)
46
What is typical of active enhancers?
H3K4me1 (monomethylation)
47
What is a classical heterochromatin marker that represses chromatin?
H3K9me3
48
What is associated with repressed genes?
H3K27me3
49
Review the chart of active and repressed transcription in a normal cell:
50
active gene promoter →
H3K4me3
51
gene promoter not expressed →
H3K27me3
52
What is the function of chromatin in regards to histones?
it is an accessibility barrier, so if there are no histones near the promoter, there is no H3K4me3, so it's marker for active promoters are "on their side"
53
What is a technique that is used to find all chromatin that is opened?
ATAC-seq, which is a DNA-seq
54
Why is ATAC-seq important?
if chromatin is open, it almost always implies there is active transcription in that region
55
What happens when you look at CHIP-seq for H3K4me3?
since there are no histones, signals are not present
56
What does it mean if important genes have bivalent promoters?
they have both H3K27me3 and H3K4me3 on the same region in order to go both directions (about 20% in ES cells)