Lecture 2 FInal Flashcards
The goal of _________ is subdivided into a set of items (in our case, genes) in such a way that similar items fall not the same cluster, whereas dissimilar items fall into different clusters
clustering
Why do we care about genes with similar expression?
genes with similar expression levels are often times regulated by similar mechanisms. An example of this transcription factor which regulate many genes
How would clustering be used to restore clustering?
The painting is first muted in its colors and sliced into randomized rows. Computer clustering algorithms are able to reorder the rows by similarities off the intensities between the,. Rows with comparable intensities are arranged adjacent to one another, and the painting can be restored with reasonable precision
What are 3 clustering algorithms
- k means or partitioning methods divided in a set of N objects into K clusters
- hierachical methods
- set organism feature maps
_________ divides a set of N object into K clusters
K means or portioning methods
____________ produces a set of nested clusters n which each pair of objects is progressively nested into a larger cluster until only one cluster set through iterative training
hierarchical methods
__________ produces a cluster set through interactive “training”
set organizing feature MAP
___________ aims to partition N observation into K clusters in which each observations belongs to the cluster with the
K means clustering
________ are displayed in N dimensional space (n=2 if two samples)
Expression profiles
first ________ is picked at random among all data points
cluster center
other cluster centers are___________
picked as far away as possible
Associate each data point to __________
the closest cluster center
recompute cluster centers based on new clusters
*itterated until the clusters remain unchanged
What are the downsides of K means
- if data is spare, clustering is not robust (releasing the procedure will likely give different clusters)
- The best # of closers is usually not know upfront- the data is forced into K clusters
- Outlirs throw off centroids
__________ refers to the study of changes in the regulation of gene activity and expression that are not dependent on DNA sequence
Epigenetic
While epigenetic often refers to the study of
single genes or sets of genes epigenomc refers to more global analysis of epigenetic changes across the entire genome
____________ in addition to or above the genome
epigenome
The concepts of Epigenetic:
-heritable but also reversible
-doesn’t involve changes to the DNA code
Can persist through or nine generations
-regulate gene expression
-main types: DNA methylation and histone modification
DNA is not your destiny
epigenome changes are involved in the above phenomia
Epigentic “marks” can tell your genes to switch on or off to speak loudly or whisper
it is through epigenetic marks that environmental factors like diet, stress, and prenatal nutrition can make an imprint on genes that is passed from one generation to the next
__________ determines which genes are turned on or off in a cell type specific manner
epigenetics
Maintenance of:
- gene repression/activation
- repeat silencing
Epigentics can be used to study delevelopment and differentiation; whereas, clones and identical twins determined ____________
differences between individuals
Genomic imprinting is a genetic phenomenon by which certain genes are ________________
expressed in a parent of origin specific manner