Tonon: Lecture V Flashcards
Functional Elements of the Genome
What was the main focus of the ENCODE Project Consortium (ENCyclopedia Of DNA Elements)?
to develop and apply high-throughput approaches to detect all sequence elements that have a biological function
focused on 30 megabases (∽1%)
What tools did ENCODE use to have a full perspective of the “junk” DNA?
DNAse digestion
microarray hybridization
DNA replication sites
Chip-chip
cis-regulatory elements (promoters, transcription factors binding sites)
long-range regulatory elements (enhancer, repressors/silencers, insulators)
Name the steps scientists took to identify “junk” DNA and what they found:
- chose 44 genomic regions
- chose 30 megabases (15 known, 15 unknown)
they found “functional elements”, which is a genome segment that actually encoded for a defined product
DNA is no longer thought to be composed of mainly “junk”
How are the important elements of a genome identified?
by comparing the homology of organisms
signatures are important, whether alone or in combination, in order to identify sequences with important functions
Why did it take scientists so long to detect “junk” DNA? List the 4 reasons:
- genes are not as conserved
- functional elements were made of small or fragmented sequences that can be interrupted with other unspecific sequences
- can lie in repetitive regions of the genome, making their recognition difficult
- evolve very rapidly and can be nearly neutral to evolutionary processes
What were the experimental targets of ENCODE in relation to the 2012 experiment?
assign functions to as many DNA regions as possible
DNA methylation
regions layered with chemical methyl groups, which regulate gene expression
open chromatin
areas in which the DNA and proteins that make up chromatin are accessible to regulatory proteins
RNA binding
positions where regulatory proteins attach to RNA
RNA sequences
regions that transcribed into RNA
CHIP-SEQ experiment
technique that reveals where proteins bind to DNA
Modified Histone
histone proteins, which package DNA into chromosomes, modified by chemical marks
Transcription Factors
proteins that bind to DNA and regulate transcription
What was a limitation to the ENCODE approach?
some cells could be open and others closed, so the cells would be replicating at different points resulting in the analyzation of the average
also the preparation across different countries was not uniform even if the protocol was in place due to the slight variations that could occur in materials
What is an observation based on the image in regards to H3K27ac?
H3K27ac is a marker used for enhancer expression
what can be observed is that the enhancers undergo silencing, so they are not a stabilizing feature of the genome as they can undergo epigenetic changes
What are 2 limitations to this study?
- transcription factors were not taken into consideration
- they overlooked the complexity of the organism and simply focused on definite cells and definite cell lines
What did ENCODE find?
80.4% of the human genome participates in at least 1 biochemical RNA and/or chromatin-associated event in at least one cell type
Explain the Manhattan plot and why it is used:
it represents the position of the gene in the genome in relation to its p-value
the red line is the threshold and everything above the red line is significant
used to see the association between a certiain region of the genome with a certain disease
Explain the Manhattan plot and why it is used:
it represents the position of the gene in the genome in relation to its p-value
the red line is the threshold and everything above the red line is significant
used to see the association between a certain region of the genome with a certain disease
used to show Genome Wide Association Studies (GWAS)
What is GWAS?
https://www.youtube.com/watch?v=b7VTJv77f24
What should be used as an enhancer for a manhattan plot studying GWAS?
microarray to visualize intergenic regions (Regions outside of the genes)
What does GWAS identify?
association between disease trait and surrogate marker (tag SNP)