Ch.6 Field Flashcards
Population and Samples
Population and Samples
What are some points on samples you have to remember?
- The form of our model is similar for all samples and for the population (The model we get for one sample is similar to all the other models for all the other samples, and similar to the model for the population)
- Parameter estimates vary across samples -> Samples don’t have parameter values that match the true values exactly
- Spread of scores around pouplation model is consistent -> lines representing limits are parallel to the model itself. In other words, at all values of the IV the spread of scores around the model is assumed to be similar (See image 1)
Population and Samples
What is error?
The difference between the predicted value of the DV at a certain level of the IV vs the observed value of the DV at the same value of the IV
error = observed - model
- When referring to error in sample models, we use e
- When referring to error in population models, we use ε
!!! e and ε are the same concept (error = observed - model), they’re just used in different circumstances !!!
(See image 2, notive the difference in hats as well, because with a sample model we want to estimate the population parameters (we can’t get the population values from our data) we add hats. In the population model because our data give us the actual numbers in the population, we don’t estimate parameters, so there are no hats. Also note the difference in e & ε as mentioned above)
Population and Samples
What are some general notes on error?
- most errors in prediction will be close to 0 (this is what the book said, seems like bullshit to me, nonetheless I think just remember it as a note if it’s asked in a multiple choice question)
- As magnitude of errors increase, frequency decreases.
~ !!! The opposite isn’t necessarily the case. Remember it just in this direction only !!! - The distribution of the errors is also normally distributed with a mean of 0 and a variance of s^2
Errors vs Residuals
Errors vs Residuals
What is a Residual
Residual = observed value - value predicted by the model = error, BUT SPECIFICALLY FOR A SAMPLE MODEL (simply, residual is error for a sample model)
- Since we use a sample model to estimate the population model, it’s likely that the residuals from the sample model are a good approximation of the population errors
- If we make a plot distribution of all the residuals, it’s normal with a mean of 0
Errors vs Residuals
What do we use Residuals for?
We use Residuals to inder different stuff about the errors in the population model
Errors vs Residuals
What is the equation for Total error?
See image 3
Errors vs Residuals
What is the ordinary least squares (OLS) regression?
It’s a method that uses the method of least squares to estimate the parameters (b-values) for which the total error is at it’s minimum (method to minimize total error)
Errors vs Residuals
How do you estimate the variance of the model errors?
See image 4
Confidence Intervals and Significance Testing
CI & Significance Testing
General notes on Sampling Distribution
- It’s the distribution of parameter estimates across samples
- The width reflects variability in sampling error
~ Also called th standard deviation -> in a sampling distribution the sd is called standard error
CI & Significance Testing
What is the general equation for any test-statistic?
effect/error
This also equals to = size of parameter/sampling variation in the parameter
- Sampling variation in the parameter is equal to the difference between the means for each sample
CI & Significance Testing
What is the Central Limit Theorem?
!!! If the model errors are normally distributed, the sampling distribution of b^ is also normal.
Therefore, we can estimate the se of b^ and construct the CI and the test-statistic
CI & Significance Testing
What is true about the relationship between sample size and sampling distribution?
As sample size increases, the sampling distribution approximates a normal distribution with a mean equal to the population mean and a variance equal to σ^2/n
(specifically, when model errors are normally distributed, the sampling distribution for b^ is normal)