Lecture 2 FInal Flashcards

1
Q

The goal of _________ is subdivided into a set of items (in our case, genes) in such a way that similar items fall not the same cluster, whereas dissimilar items fall into different clusters

A

clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why do we care about genes with similar expression?

A

genes with similar expression levels are often times regulated by similar mechanisms. An example of this transcription factor which regulate many genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How would clustering be used to restore clustering?

A

The painting is first muted in its colors and sliced into randomized rows. Computer clustering algorithms are able to reorder the rows by similarities off the intensities between the,. Rows with comparable intensities are arranged adjacent to one another, and the painting can be restored with reasonable precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are 3 clustering algorithms

A
  • k means or partitioning methods divided in a set of N objects into K clusters
  • hierachical methods
  • set organism feature maps
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

_________ divides a set of N object into K clusters

A

K means or portioning methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

____________ produces a set of nested clusters n which each pair of objects is progressively nested into a larger cluster until only one cluster set through iterative training

A

hierarchical methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

__________ produces a cluster set through interactive “training”

A

set organizing feature MAP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

___________ aims to partition N observation into K clusters in which each observations belongs to the cluster with the

A

K means clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

________ are displayed in N dimensional space (n=2 if two samples)

A

Expression profiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

first ________ is picked at random among all data points

A

cluster center

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

other cluster centers are___________

A

picked as far away as possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Associate each data point to __________

A

the closest cluster center

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

recompute cluster centers based on new clusters

A

*itterated until the clusters remain unchanged

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the downsides of K means

A
  • if data is spare, clustering is not robust (releasing the procedure will likely give different clusters)
  • The best # of closers is usually not know upfront- the data is forced into K clusters
  • Outlirs throw off centroids
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

__________ refers to the study of changes in the regulation of gene activity and expression that are not dependent on DNA sequence

A

Epigenetic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

While epigenetic often refers to the study of

A

single genes or sets of genes epigenomc refers to more global analysis of epigenetic changes across the entire genome

17
Q

____________ in addition to or above the genome

18
Q

The concepts of Epigenetic:

A

-heritable but also reversible
-doesn’t involve changes to the DNA code
Can persist through or nine generations
-regulate gene expression
-main types: DNA methylation and histone modification

19
Q

DNA is not your destiny

A

epigenome changes are involved in the above phenomia

20
Q

Epigentic “marks” can tell your genes to switch on or off to speak loudly or whisper

A

it is through epigenetic marks that environmental factors like diet, stress, and prenatal nutrition can make an imprint on genes that is passed from one generation to the next

21
Q

__________ determines which genes are turned on or off in a cell type specific manner

A

epigenetics

22
Q

Maintenance of:

A
  • gene repression/activation

- repeat silencing

23
Q

Epigentics can be used to study delevelopment and differentiation; whereas, clones and identical twins determined ____________

A

differences between individuals

24
Q

Genomic imprinting is a genetic phenomenon by which certain genes are ________________

A

expressed in a parent of origin specific manner

25
_____________ 30 nm chromatin fiber
nucleosome
26
_____________ allows large amounts of DNA to be compacted into a small space very efficiently
chromatin
27
Histone modification includes ____________________
methylation, phosphorylation, and sumoylation
28
___________ a process catalyzed by the addition of a methyl group to a cytosine reside modifying is to a 5 methyl cytosine residue modifying is to methyl cytosine 5
DNA methylation
29
DNA methylation occurs largely in
CpG islands which is found in greater frequency upstream of the gene
30
DNA methylation of the transcription factors thus ________________
suppress gene expression
31
DNA methylation in vertebrates
1. DNA methylation occurs in 5th position of cytosine in CpG dinucleotides and is the major modification found vertebrate genomes 2. catalyzed by DA cytosine methyltransferases 3. Methylation patterns are established during development and differentiation heritable in somatic lineage cells 4. DNA methylation associates with gene repression in vitro and in vivo (imprinted genes, X chromosome inactivation, and silencing of tumor suppressing genes in cancer)
32
__________ DNA methylation is powerful mechanism for the suppression of gene activity
gene silencing
33
There is a reciprocal relationship between the
density of methylated cytosine residues and the transactional activity of a gene
34
Technology for the study of DNA methylation
- Bisulfide DNA treatment, followed by PCR or direct hybridization to CPG island arrays - Methylated DNA immunoprecipitatiiion followed by array or deep sequencing - restricted enzyme digestion using mthylcytosine sensitive enzymes