Personalized Questions (Might not be relevant to you all) Flashcards by andy Sitoh

Do all estimators work on assumptions. Hence? What does robustness of a confidence interval mean?

Estimators

All estimators work on assumptions. Point estimates can be different too, not just interval estimates

Robust

Coverage of the interval remains unaffected by violation of the statistical assumptions underlying its construction
Captures the unknown population parameter value in repeated samplings by the same percentage of times as defined by the size of the interval when one or more assumptions are being violated

How well did you know this?

Not at all

Perfectly

What do we need to calculate CIs. As level of confidence of the CIs increase, what happens to the precision

Things for CI

Sample Statistic, Standard Eror, Alpha Value

All else equal, as level of confidence increases, it is less precise (Wider area akin to being more confidence the true population parameter will be captured)

How well did you know this?

Not at all

Perfectly

What are summary characteristics? What are the 2 common summary characterstics

Statistics and parameters!

A summary characteristic is some kind of aggregation undertaken on the individual values in one or more variables to produce a single quantity that is
informative about the value
- Mean
- Standard Deviation
- Varaince
- Correlation
- Bla bla

How well did you know this?

Not at all

Perfectly

What happens to the sampling distribution as (a) Number of repeated sampling increases; (b) As number of samples increases

(a) Number of repeated sampling increases
* Unbiased sampling distirubtion will get increasingly closer to the populaton distribution
(b) As number of samples increases

Mean of sampling distribution will get increasingly closer to the mean of populaton distribution
Sampling distribution will become increasingly normal
- Central Limit Theorem

Don’t mix up the two…

How well did you know this?

Not at all

Perfectly

What are some effect size. What are effect sizes

Quantitative measure of the strength of a relationship between construct measures.

Mean
Mean DIfference
R²
Coefficients (Pearson and Regression)
Odds Ratio
Cramer’s V
- Anything you can put a CI over

How well did you know this?

Not at all

Perfectly

What is the population correlation coefficeint. Can we calculate it

p (Rho).

It can only be estimated

How well did you know this?

Not at all

Perfectly

What do associations of categorical variables aim to do

Measure strength and direction of 2 variables

Note: It is just like a continous one. There’s both strength and direction!

How well did you know this?

Not at all

Perfectly

OLS Estimator - Which is bias/unbias

Unbias

Unbiased in maximising SS_regif assumptions hold (like residuals)
Unbiased in estimating sample regression coefficients

Bias

Bias, but Consistent, in R²
Adjusted R² is only LESS bias (not no bias)

How well did you know this?

Not at all

Perfectly

What is the model equation for simple linear regression?

Y_hat = a + bx_i

Note: b does not have a hat

How well did you know this?

Not at all

Perfectly

In regression diagnostics, what do we look for in

(a) Linearity
(b) Hetereoscadescity
(c) Outliers / Influential Cases (From normality)

Linearity

See misfit between the 2 lines (probability different colours)

Heteroscadescity

See fanning out of residual
nCV and Residual plots MAY NOT be consistent

Outliers & Influential Cases

Influential
- Change regresion coefficients and R²
- Cook’s D >1 is definitely a problem
Outliers
- Look for +- 3
- Though for smaller samples, might be 3.5 to 4
- Large studentized residuals is maybe a problem

How well did you know this?

Not at all

Perfectly

In t-tests, what if the design is unbalanced and violated homogenity?

We must use adjustment separate variance estimates

How well did you know this?

Not at all

Perfectly

If Levene and Flinger comes out p < .05 and p > .05

What should we do?

Assume hetereogeneity. Be conservative.

How well did you know this?

Not at all

Perfectly

When are standardized mean differences useful

Hedges g and Bonnet’s d

Useful if it has an arbitary scaling and can’t be meaningfully intepreted
- stop mixing up arbitrary. arbitrary = sucky!!!!

How well did you know this?

Not at all

Perfectly

In observed mean differences, will the mean differene estimates be the same?

Yup.

There is only one mean difference. However, all the other statistics will be different! (e.g. SE, t , etc)

How well did you know this?

Not at all

Perfectly

Does homogeneity of variance matter at all in dependent group t-test

Yes

While it is not an assumption
g and d will differ and they still consider homogenity of group variances

How well did you know this?

Not at all

Perfectly

How many standard errors are there in a dependent group?

Study These Flashcards

One.

Standard Error of the Mean Difference

How are dummy and ANOVA similar?

Study These Flashcards

Almost everything

F-Statistic for R²
T-Statistic for Coefficients
Proportion of DV explained
df
P-values

How many contrasts do we need to establish orthogonality? What is the maximum number of linear contrasts

Study These Flashcards

Two.

Don’t mix this up with number of linear contrasts.

The maximum number of linear contrasts to account for differences among means is k - 1

In a one-way within-subject design, does the interval spacing have to be the same?

Study These Flashcards

The interval spacing must be the same for all participants, but the distance between each interval can vary, as long as all same

What does g and delta estimate based on (uni/multi). When is uni/ multi used

Study These Flashcards

Hedges’ g and Bonett’s delta estimate the effect size and associated confidence interval using a univariate method.
The multivariate method is only used in the omnibus test of the null hypothesis.

Does hedges g and bonnet’s d require spherecity?

Study These Flashcards

Only g requires sphericity

HOWEVER

Both Hedges’ g and Bonett’s delta may be biased due to violation of the sphericity assumption

Wtf does Pillai examine

Study These Flashcards

Multivariate

Test omnibus hypothesis of no difference between all levels of the factor
Test polynomial orthogonal linear contrasts
- Only tell significance
- No effect size

What is the defitnition of sphericity. Hence?

Study These Flashcards

Sphericity

Homogenity of variances of all possible difference scores between pairs of three or more within-subject conditions (or levels) at a population level

Hence

Epislon

Estimated from sample covariance matrix
Calculated from population covariance matrix

What does the epislon estimators do?

Study These Flashcards

Estimate epislon in sample data

Greenhouse-Geisser estimator - E_hat
Huynh-Feldt estimator E-wave
- Adjustments to null hypothesis tests under the univariate approach.

Define main effect contrast. And what is the scaling

_Main effect contrasts_ * Compare cell means of the two-way table to investigate contrasts of each factor in the two design. _Order-0 Scaling_ * Scaling for a difference in means * Absolute sum to 0

Define interaction contrast. And what is the scaling

_Interaction Contrast_ * Compare the cell means of the two-way table using the cross-product of contrast weights from the linear comparisons for the main effect linear contrasts _Order-1 Scaling_ * Scaling for the difference two sets of differences between means

What happens to CI in multiple NHSTs

The CI coverage for capturing the true population parameter may be lower than nominal rate requested.

How many NHSTs is the mark where probability of type 1 error is high

By 25 NHSTs, High probability of at least one type 1/false rejection error due to chance alone

What is the SE_m. Why do we need it? And what is the Formula?

Average Error Score. r_xx does not tell us what is typical 𝑆𝐸𝑚= 𝑠𝑥 √1−rxx * Sx * Standard Deviation of Observed Scores

What is the first classical theory equation

X_i = T + E_i E_i is *unsystematic* error variance

What is the third classical theory equation. Can we calculate it?

p²_xt= True Score Variance / Observed Score Variance _Note_ p²_xt * Theoratical reliability coefficient * It can only be estimated using the sample reliability coefficient r_xx

What are the equations for reliability using variances

(a) * True Score Variance / Observed Score Variance (b) * 1 - (Error Score Variance/Observed Score Variance) Note * Observed is always the denomintor.

What is criterion validity in theory. What are the types?

Extent to which test scores _predict_ scores on an relevant criterion variables. * Concurrent * Evaluated against a criterion measured at the same time * Predictive: * Evaluated against a criterion measured later

What is criterion validity in practice

_In practice:_ * Validity Coefficient * Exclusive to criterion validity * Will be attenuated due to measurement error * Pearson Correlation

If estimator A is more efficient than estimator B, what is the observed test statistic

Observed test statistic derived from A will be larger than the one from B Efficient = Smaller S.E. = Larger Test Statistic Value

What do we need to calculate an observed test statistic?

* Known sample statistic value * aka. "Assumed population parameter value" * Can use a known population parameter value if we wish * Standard Error

What do we need to calculate a critical test statistic?

* Alpha * Relevant probability distribution stuff * *t-distirubtion*: one-dfs * F: two-dfs

Personalized Questions (Might not be relevant to you all) Flashcards

(37 cards)