Chapter 11 + 12 Flashcards
Corresponding to the HC's of week 1 of the course's second part
Similarities between data analysis pipelines between different omics approaches
-All technologies yield many measurements for each sample
-Same way of handling dimensionality
-yields hundreds or thousands of variables per sample like different genes, proteins or metabolites
Samples organised in matrix
Rows: the samples
Columns: the variables (like genes)
Four components of the generalized data analysis pipeline
- Experimental design and data collection
- Data preprocessing and quality control
- Data analysis
- Biological interpretation
The experimental design can have large impact on the statistical power and therefore the …
Conclusions that are reached
First step in experimental design
Frame a biological question
What is the aim of the biological question?
Determine the hypothesis that will be tested and the statistical test that will be executed.
What does the biological question determine which is needed for an interpretable and successful outcome?
The experimental preconditions
Three types of main objectives which require a different type of experimental design
- Detection of responsive features under controlled experimental conditions (perturbation study)
- detection of biomarkers
- identification of regulatory or mechanistic relationships between variables
Experimental designing after biological question
Identify noise factors and design the experiment
Noise factors
Factors that can disturb a proper measurement (from the biological experiment up to and including the measurement)
Noise factors can lead to …
bias
Three basic principles to deal with noise factors.
- Replication
- Randomization
- Blocking
What is the aim of the experimental design?
Ensure reliable measurements free from bias
Replication
Duplicate, repeat or perform the same measurement more than once
> obtain an estimate of the experimental error
On what factor is the type of error which is estimated with replication dependent?
On how the replication is done
> For estimating and controlling biological variability: different organisms or batches of cells samples should be processed in the same manner.