Equations And Stats etc Flashcards
How does standard deviation relate to variance
Sd is square root of variance
Mean mode median
Mean is sum of values / total points
Median is value in series and middle point
Mode is most frequent occurring
How are mean median mode effected in normal date, positive and negatively skewed data
Normal distribution mean = mode = median
Positive skewed mean > median > mode
Negative skewed mean < median
What is variance and how does it relate to standard deviation
Variance is the average of squared differences from the mean
Standard deviation is the square root of variance
Parametric tests
2 groups
Paired t
Unpaired t
More than 2
Anova one way
Anova two way
Correlation
Pearson’s
Non parametric tests
Don’t assume normal distribution
2 groups
Paired wilcoxin rank
Unpaired Mann Whitney
More than 2
Paire freidman
Unpaired Krushall Wallis
Correlation spearman’s rank
Qualitative data tests
Qualitative are all non parametric by default
Up to 2x2
Fishers exact
More than 2x2
Chi squared
Standard deviations include how much of the distribution
1 sad 68%
2 sd 96%
3 sd 99%
Qualitative data
Qualitative data is categorical data
Ie ordinal (order ie asa)
Nominal (no natural order ie gender)
Wha5 is statistical variability and what are measures of variability
Statistical variability is the spread of the data
Measure of variability usually accompanies measure of central tendency
Small variability = clustered around central tendency
Types
Range
IQ range (ie 25th to 75th centiles gets rid of outliers and is used with median)
Variance (average of the square difference from the mean- all data used)
Standard deviation is square root of variance (mean, v sensitive to outliers)
Type 1 error
If the null hypothesis is rejected when it is true (ie a false positive the tests shows a difference when there is non)
The risk of this happening is the a risk so sometimes used interchangeably
Type 2 error
Is the null hypothesis accepted when it is false (ie false negative the tests shows no difference when there is one)
The risk of this happening is the B risk
Way to reduce type 1 and 2 error
Increase sample size
Power analysis determines the smallest size required
Power = 1-b
Sensitivity
Sensitivity of a test is the proportion of true positives of everyone who has the disease
Ie true positives / everyone with disease
Ie true positives / true positives +false negatives
High sensitivity is ulseful for ruling OUT a disease
Specificity
Specificity is the number of true negatives of everyone who don’t have the disease
Ie true negatives / everyone without the disease
Ie true negatives / true negatives + false positives
Specificity useful for ruling IN a disease