Fundamentals of statistical testing Flashcards
How is the mean calculated?
The sum of all the numbers in a data set, divided by how many numbers there are
Standard deviation measures the ____ of data around the ____, on ____.
It is calculated as the average ____ from the ____.
spread
mean
average
difference
mean
Numerically speaking, the distribution of data is the number of ____ per each ____ of a ____.
Shows us which values occur ____ often and which ____ often.
Shape formed by the bars of a ____ ____/____
observations
value
variable
more
less
bar chart/histogram
Some shapes are ‘algebraically tractable” (there is a maths formula to draw the line). We can use them for statistics because they have particular known properties.
What are five known distributions?
- Normal
- Chi-squared
- t
- Beta
- Uniform
What are two other names for the normal distribution?
Gaussian distribution
Bell curve
What are the four key properties of normal distributions?
Continuous, unimodal, symmetrical, bell-shaped
What are the proportions of a normal distribution with respect to SD?
~68% of the area below the curve is within plus or minus 1 SD from the mean
95% of the are below the curve is within plus or minus 1.96 SD from the mean
99% of the area below the curve is within plus or minus 2.58 SD from the mean
A critical value is a value that ____ off a specific ____ of a ____
cuts
proportion
distribution
A sample is a (usually ____) selected ____ of values of a particular ____ taken from a ____ pool of values, often the ____
random
subset
size
larger
population
The sampling distribution of the mean is the ____ of the ____ of many ____ of a particular ____.
The distribution is ____ and centred around the ____ population mean, 𝜇
Every ____ has its own sampling distribution
distribution
means
samples
size
normal
true
statistic
The central limit theorem states that as N gets ____, the sampling distribution of 𝑥‾ tends ____ a ____ distribution with ____ = 𝜇
larger
towards
normal
mean = 𝜇