(Week 3) [T6] Precision Medicine Omics Data Analysis Flashcards
What can we learn from omics data in terms of genomics?
What are the genome variants related with disease or drug.
What can we learn from omics data in terms of epigenomics?
Which genes are active/silenced. Modifications correlated with diseases or response to drugs.
What can we learn from omics data in terms of metabolomics?
Which metabolites are present and how much.
What can we learn from omics data in terms of interctomics?
How genes and RNA molecules interact with each other.
What can we learn from omics data in terms of microbiomics?
Which microbes are present in our organism.
Give an example of data processing technique, statistics and dimensionality reduction.
- Normalization; aggregation.
- Fisher’s exact test; pearson chi-square.
- PCA.
Which are the 3 types of machine learning?
Unsupervised
- Clustering. Divide data in groups, in a way that intracluster distances are minimized and intercluster distances are maximized.
- Pattern mining and subspace clustering. Find meaningful correlations on a subset of the feature space.
Supervised
- Classification. Classification could be binary or multi-class.
- Regression. Learn a mapping to estimate the outcome of a new observation.
Semi-supervised have few labelled individuals due to label acquisition costs and uncertainty
Which are the evaluation techniques for classification models?
- Train-test split. Learn model from train data and assess it against test data.
- Cross validation. Divide data into k subsets/folds and use 1 fold for test and remaining (k-1) subsets for train. Repeat for all k folds.
- Prevalence. Percentage of the population with the condition.
- F-measure = 2 Precision x Recall / (Precision + Recall).
- Confusion matrix.