Definitions Flashcards
Expectation E(x) and variance Var(x) for N(mu, sigma)
E(x) = mu Var(x) = sigma
Expectation E(x) and variance Var(x) for N(mu, sigma)
E(x) = mu Var(x) = sigma
Expectation E(x) and variance Var(x) for N(mu, sigma)
E(x) = mu Var(x) = sigma
Expectation E(x) and variance Var(x) for N(mu, sigma)
E(x) = mu Var(x) = sigma
Expectation E(x) and variance Var(x) for N(mu, sigma)
E(x) = mu Var(x) = sigma
Sample
Set of values (randomly (assumption)) selected from a population
Mean
Average value of a set of values
Median
The value in the middle of your set of sorted data
For odd numbered set this is the middle value
For even numbered sets this is the average of two middle values
Variance
This is how much the values in the set deviate from their mean.
Calculated through the sum of all values in your set subtracted by the mean, square this to eliminate negative values and divide the outcome by n-1, where n is the size of your set
Standard deviation
The square root of the variance, which was squared to eliminate the negative values.
Which numerical summaries are location and which are scale?
Location: mean and median
Scale: variance and standard deviation
Histogram
A barplot of a set X where the AREA of the bar over a cell (called a bin, see it as a little interval) is equal to the number of observations in that cell divided by the size of X.
The sum of bars is therefore equal to 1 (100%)
Correlation
The correlation between two variables quantifies the linear relation between them.
It is calculated by taken the sum of the product of the subtraction of the mean of all the variables in both sets, and dividing it by the square root of the product of both set’s variances.
Correlation values and corresponding relations between the two sets
+1: perfect LINEAR relation with positive slope
- 1: perfect LINEAR relation with negative slope
0: No linear relation
Normal QQ-plot
A plot that matches the quantiles of two sets, to see whether it is normally distributed