8 Oct - Processing (Luca) Flashcards
What is data transformation?
Everything that has to do with cleaning, reshaping, transforming, and augmenting data.
What can frequency distributions show us?
- We can 1. spot outliers, 2. see the range, and 3. see the shape of the curve.
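A small sketch of how those three things show up in practice, assuming NumPy is available. The sample values are made up for illustration, and the bin count of 5 is an arbitrary choice.

```python
import numpy as np

# Hypothetical sample: values clustered around 10, plus one suspicious value (42).
values = np.array([9, 10, 10, 11, 10, 9, 11, 10, 12, 42])

# Split the value range into 5 equal-width bins; counts[i] is the frequency in bin i.
counts, edges = np.histogram(values, bins=5)

# The range is simply the distance between the smallest and largest value.
value_range = values.max() - values.min()

# Text histogram: one '#' per observation (the last bin also includes its right edge).
for count, left, right in zip(counts, edges[:-1], edges[1:]):
    print(f"{left:5.1f} to {right:5.1f} | {'#' * int(count)}")
print("range:", value_range)
```

The isolated bar far from the main cluster is the outlier, the first and last bin edges give the range, and the pattern of bar lengths gives the shape.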
What is correlation?
It measures the strength and direction of the association between two random
variables (a normalized covariance). An association is not necessarily linear! Also, the steepness of the slope does not equal the strength of the correlation.
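The two warnings above can be checked numerically. This is a minimal sketch with made-up data, assuming NumPy. The "noise" is a deterministic alternating pattern, chosen only so the example is reproducible.

```python
import numpy as np

x = np.arange(-5, 6, dtype=float)  # -5, -4, ..., 5

# Shallow slope, no noise: the points lie exactly on a line.
y_shallow = 0.1 * x

# Steep slope, but with large alternating noise (+40, -40, +40, ...).
noise = 40.0 * (-1.0) ** np.arange(len(x))
y_steep = 10.0 * x + noise

# Perfect but nonlinear (quadratic) dependence on x.
y_curve = x ** 2

r_shallow = np.corrcoef(x, y_shallow)[0, 1]
r_steep = np.corrcoef(x, y_steep)[0, 1]
r_curve = np.corrcoef(x, y_curve)[0, 1]

print(f"shallow slope, no noise: r = {r_shallow:.2f}")
print(f"steep slope, noisy:      r = {r_steep:.2f}")
print(f"quadratic dependence:    r = {r_curve:.2f}")
```

The shallow line has the stronger correlation despite the smaller slope, and the quadratic case has a linear correlation of about zero even though y is fully determined by x.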
What defines a strong or a weak correlation?
Trick question: there is no minimum value that defines a strong correlation; it depends on the problem at hand.
What is the Pearson correlation test?
It measures the linear association between two variables. For a simple linear fit, the goodness of fit (R^2) is exactly the square of the Pearson correlation coefficient r.
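The link between Pearson's r and R^2 can be verified by hand. The sketch below assumes NumPy and uses made-up x and y values. It computes r from its definition, fits a least-squares line, and compares r squared with the R^2 of that fit.

```python
import numpy as np

# Hypothetical paired measurements, roughly linear.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1, 11.7])

# Pearson r from its definition: covariance divided by both standard deviations
# (the 1/n factors cancel, so sums of deviations are enough).
dx, dy = x - x.mean(), y - y.mean()
r = (dx * dy).sum() / np.sqrt((dx ** 2).sum() * (dy ** 2).sum())

# Least-squares straight line y = slope * x + intercept.
slope, intercept = np.polyfit(x, y, deg=1)
residuals = y - (slope * x + intercept)

# R^2 = 1 - (residual sum of squares) / (total sum of squares).
r_squared = 1.0 - (residuals ** 2).sum() / (dy ** 2).sum()

print(f"r = {r:.4f}, r^2 = {r ** 2:.4f}, R^2 of the fit = {r_squared:.4f}")
```

This identity holds for a straight-line fit with an intercept. It does not carry over to other models in general.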
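What is the Bonferroni correction?
The more tests are performed, the higher the chance of finding a false
positive result. The Bonferroni correction compensates by dividing the significance level by the number of tests (equivalently, multiplying each p-value by the number of tests). It is very conservative.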
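A minimal sketch of the idea in plain Python. The p-values and the significance level of 0.05 are made up for illustration, and the family-wise error rate formula assumes the tests are independent.

```python
alpha = 0.05  # significance level for a single test (assumed)
p_values = [0.001, 0.012, 0.030, 0.047, 0.200]  # hypothetical results of 5 tests
m = len(p_values)

# Chance of at least one false positive across m independent tests at level alpha.
fwer_uncorrected = 1 - (1 - alpha) ** m

# Bonferroni: each individual test must pass the stricter threshold alpha / m.
threshold = alpha / m

significant_raw = [p < alpha for p in p_values]
significant_bonf = [p < threshold for p in p_values]

print(f"uncorrected family-wise error rate: {fwer_uncorrected:.3f}")
print(f"per-test threshold after correction: {threshold:.3f}")
print("significant without correction:", significant_raw)
print("significant with Bonferroni:   ", significant_bonf)
```

Four of the five hypothetical tests pass at 0.05, but only one survives the corrected threshold, which illustrates why the method is called conservative.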
What are the Pearson correlation's assumptions?
- Continuous variables
- Paired samples (x and y values are available for all samples)
- Independent observations
- Linear relationship between the two variables
- Roughly Gaussian-distributed variables
- Absence of outliers
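To see why the absence of outliers matters, the sketch below (assuming NumPy, with made-up data) compares Pearson's r for ten points on an exact line with the same points plus one extreme, hypothetical outlier.

```python
import numpy as np

# Ten points lying exactly on the line y = 2x + 1.
x = np.arange(1, 11, dtype=float)
y = 2.0 * x + 1.0
r_without = np.corrcoef(x, y)[0, 1]

# Same data with a single extreme observation appended (hypothetical outlier).
x_out = np.append(x, 11.0)
y_out = np.append(y, -20.0)
r_with = np.corrcoef(x_out, y_out)[0, 1]

print(f"r without the outlier: {r_without:.2f}")
print(f"r with the outlier:    {r_with:.2f}")
```

A single point is enough to wipe out an otherwise perfect linear correlation, so inspecting the data before running the test is worthwhile.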
What is the Spearman correlation?
Measures the monotonic association between two ranked variables
and tests its statistical significance
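One way to picture this: Spearman's coefficient is Pearson's coefficient computed on the ranks of the data. The sketch below assumes NumPy and uses made-up data. The `ranks` helper is a hypothetical illustration that only handles data without ties, and the significance test is left out.

```python
import numpy as np

def ranks(values):
    """Return ranks 1..n for the values (illustrative helper, assumes no ties)."""
    order = np.argsort(values)
    result = np.empty(len(values))
    result[order] = np.arange(1, len(values) + 1)
    return result

# Monotonic but strongly nonlinear relationship.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.exp(x)

pearson = np.corrcoef(x, y)[0, 1]
spearman = np.corrcoef(ranks(x), ranks(y))[0, 1]

print(f"Pearson:  {pearson:.2f}")
print(f"Spearman: {spearman:.2f}")
```

Because y always increases when x increases, the ranks agree perfectly and Spearman's coefficient is 1, while Pearson's coefficient is lower because the relationship is not a straight line.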
What are the Spearman correlation's assumptions?
- Variables that are at least ordinal (they can be ranked)
- Paired samples (x and y values are available for all samples)
- Independent observations
- Monotonic relationship between the two variables