Descriptive Statistics Flashcards
Define central tedency
the tendency for the values of a random variable to cluster round its mean, mode, or median.
Define mean, median and mode
mean - average
median - middle value of data set
mode - most common number
what are the 4 measures of variability
- Standard deviation
- Interquartile range
- Confidence intervals
- Z - scores
Define standard deviation
The dispersion of values around the mean
Define interquartile range
which is the difference between the first and third quartiles.
Define confidence intervals
a range of values so defined that there is a specified probability that the value of a parameter lies within it.
Define Z - scores
A z-score describes the position of a raw score in terms of its distance from the mean when measured in standard deviation units
What is a high and low standard deviation
Low standard deviation means data are clustered around the mean, and
high standard deviation indicates data are more spread out.
Define correlation
Correlation is a statistical measure that expresses the extent to which two variables are linearly related
Define regression
a measure of the relation between the mean value of one variable (e.g. output) and corresponding values of other variables
Defije multiple regression
explains the relationship between multiple independent or predictor variables and one dependent or criterion variable
Define p value
- P-value is the probability that a random chance generated the data or something else that is equal or rarer
- P value is a number between 0 and 1
What is the P value threshold for statistical significance
- Threshold for statistical significance is most commonly <0.05
Lower the P value = what
Greater amount of statistical significance
What does a P value of 0.05 denote
5% probability that the results happened by chance
Define linear regression
Linear regression expresses the relationship of two variables by fitting a linear equation to observed data