Personalized Questions (Might not be relevant to you all) Flashcards
Do all estimators work on assumptions. Hence? What does robustness of a confidence interval mean?
Estimators
All estimators work on assumptions. Point estimates can be different too, not just interval estimates
Robust
- Coverage of the interval remains unaffected by violation of the statistical assumptions underlying its construction
- Captures the unknown population parameter value in repeated samplings by the same percentage of times as defined by the size of the interval when one or more assumptions are being violated
What do we need to calculate CIs. As level of confidence of the CIs increase, what happens to the precision
Things for CI
Sample Statistic, Standard Eror, Alpha Value
All else equal, as level of confidence increases, it is less precise (Wider area akin to being more confidence the true population parameter will be captured)
What are summary characteristics? What are the 2 common summary characterstics
Statistics and parameters!
- A summary characteristic is some kind of aggregation undertaken on the individual values in one or more variables to produce a single quantity that is
informative about the value- Mean
- Standard Deviation
- Varaince
- Correlation
- Bla bla
What happens to the sampling distribution as (a) Number of repeated sampling increases; (b) As number of samples increases
(a) Number of repeated sampling increases
* Unbiased sampling distirubtion will get increasingly closer to the populaton distribution
(b) As number of samples increases
- Mean of sampling distribution will get increasingly closer to the mean of populaton distribution
- Sampling distribution will become increasingly normal
- Central Limit Theorem
Don’t mix up the two…
What are some effect size. What are effect sizes
Quantitative measure of the strength of a relationship between construct measures.
- Mean
- Mean DIfference
- R2
- Coefficients (Pearson and Regression)
- Odds Ratio
- Cramer’s V
- Anything you can put a CI over
What is the population correlation coefficeint. Can we calculate it
p (Rho).
It can only be estimated
What do associations of categorical variables aim to do
Measure strength and direction of 2 variables
- Note: It is just like a continous one. There’s both strength and direction!
OLS Estimator - Which is bias/unbias
Unbias
- Unbiased in maximising SSreg if assumptions hold (like residuals)
- Unbiased in estimating sample regression coefficients
Bias
- Bias, but Consistent, in R2
- Adjusted R2 is only LESS bias (not no bias)
What is the model equation for simple linear regression?
Y_hat = a + bxi
Note: b does not have a hat
In regression diagnostics, what do we look for in
(a) Linearity
(b) Hetereoscadescity
(c) Outliers / Influential Cases (From normality)
Linearity
- See misfit between the 2 lines (probability different colours)
Heteroscadescity
- See fanning out of residual
- nCV and Residual plots MAY NOT be consistent
Outliers & Influential Cases
- Influential
- Change regresion coefficients and R2
- Cook’s D >1 is definitely a problem
- Outliers
- Look for +- 3
- Though for smaller samples, might be 3.5 to 4
- Large studentized residuals is maybe a problem
In t-tests, what if the design is unbalanced and violated homogenity?
We must use adjustment separate variance estimates
If Levene and Flinger comes out p < .05 and p > .05
What should we do?
Assume hetereogeneity. Be conservative.
When are standardized mean differences useful
Hedges g and Bonnet’s d
- Useful if it has an arbitary scaling and can’t be meaningfully intepreted
- stop mixing up arbitrary. arbitrary = sucky!!!!
In observed mean differences, will the mean differene estimates be the same?
Yup.
There is only one mean difference. However, all the other statistics will be different! (e.g. SE, t , etc)
Does homogeneity of variance matter at all in dependent group t-test
Yes
- While it is not an assumption
- g and d will differ and they still consider homogenity of group variances