Tutorial 1 Flashcards
How to measure the mean?
How to measure the deviations?
How to measure sample variance?
How to measure variance?
How to measure standard deviation?
What does an ππ tell us about a set of data?
The value of the SS indicates the spread of the data in terms of how far observations are located away from the mean. However, it is not an incredibly useful measure since it is
unlimited and, depending on the amount of observations can approach infinity. (Each added
observation will add to the SS as long as its score is not exactly equal to the mean).
So, it does relate to variance, the standard deviation and is a measure of how far people tend to be away from the mean, but it is not interpretable without knowing the number of
observations, and tends not to be considered in research output.
Under what circumstances will the value of ππ equals 0?
The distance between each observation and the mean must equal 0. If it does not equal the
mean, even by a little, the square of the distance is also not zero.
That means that if all scores must be equal to the mean, all scores must also be equal to each
other. I.e. if all scores are identical, since that would then be the mean, and all scores would have a distance to the mean of 0.
Can SS ever be negative?
No
What is a sampling distribution?
The sampling distribution of a statistic (such as the mean, or a certain test statistic) is the
(theoretical) distribution of a set of samples that we draw from the population.
E.g. if we were to measure average length in thousands (or infinitely many) samples by drawing
a sample, measuring everybody, computing the average, drawing a new sample, again
compute the average, draw a new sample, etc. it would be a distribution that shows how likely
it is to get a particular average sample value. Thus, it describes how the mean, the value of π
varies across samples of size π.
What do we know about the shape and characteristics of the sampling distribution for π, the sample mean?
We know the distribution for the sample mean to be approximately normally distributed in
reasonably large samples. This is related to what we call the central limit theorem, which
roughly implies that the distribution of the sample mean becomes a normal distribution if we
look at enough large samples, regardless of what the population distribution of the individual
observations looks like. (In small samples, the variable itself needs to be normally distributed
for this to be true). It is centred (the peak is at) the population mean and has a standard deviation equal to the π/βπ, which is called the Standard Error when talking about the sampling distribution.
What is ππΈπ?
It is the Sampling Error of the Mean. This Sampling Error is related to the size of the sample π and the variance of individual scores π.
What does the value of ππΈπ tell you about the typical magnitude of sampling error?
As π increases, how does the size of ππΈπ change (assuming π stays the same)?
As the variance increases there is more βroomβ to have sample means that differ a lot from the population mean. So, the sampling error becomes larger.
In the formula ππΈπ = π / βπ this is also clear to see: If we keep N the same and s becomes
larger as we divide a larger number by an equal number.
As π increases, how does the size of ππΈπ change (assuming π stays the same)?
As the number of observations increases it becomes less likely that we accidentally sample predominantly extreme values, and more likely that we sample people
equally from above and below the mean score. (Imagine sampling, by accident, the 10 tallest people from a population. Our mean then is way too high. If we instead sample 11 people, the added person must be shorter than the other 10 as those were the 10 tallest, so the mean will
become lower and closer to the population mean. Moreover, it becomes increasingly unlikely that we sample only very tall or small people if our sample would be of size 40).
In the formula ππΈπ = π / βπ this is also clear to see: If we increase N and keep s the same, we divide by a larger number and the sampling error becomes smaller.
What research decisions influence the magnitude of risk of Type I error?