Personalized Questions (Might not be relevant to you all) Flashcards

1
Q

Do all estimators work on assumptions. Hence? What does robustness of a confidence interval mean?

A

Estimators

All estimators work on assumptions. Point estimates can be different too, not just interval estimates

Robust

  • Coverage of the interval remains unaffected by violation of the statistical assumptions underlying its construction
  • Captures the unknown population parameter value in repeated samplings by the same percentage of times as defined by the size of the interval when one or more assumptions are being violated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do we need to calculate CIs. As level of confidence of the CIs increase, what happens to the precision

A

Things for CI

Sample Statistic, Standard Eror, Alpha Value

All else equal, as level of confidence increases, it is less precise (Wider area akin to being more confidence the true population parameter will be captured)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are summary characteristics? What are the 2 common summary characterstics

A

Statistics and parameters!

  • A summary characteristic is some kind of aggregation undertaken on the individual values in one or more variables to produce a single quantity that is
    informative about the value
    • Mean
    • Standard Deviation
    • Varaince
    • Correlation
    • Bla bla
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What happens to the sampling distribution as (a) Number of repeated sampling increases; (b) As number of samples increases

A

(a) Number of repeated sampling increases
* Unbiased sampling distirubtion will get increasingly closer to the populaton distribution
(b) As number of samples increases

  • Mean of sampling distribution will get increasingly closer to the mean of populaton distribution
  • Sampling distribution will become increasingly normal
    • Central Limit Theorem

Don’t mix up the two…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some effect size. What are effect sizes

A

Quantitative measure of the strength of a relationship between construct measures.

  • Mean
  • Mean DIfference
  • R2
  • Coefficients (Pearson and Regression)
  • Odds Ratio
  • Cramer’s V
    • Anything you can put a CI over
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the population correlation coefficeint. Can we calculate it

A

p (Rho).

It can only be estimated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do associations of categorical variables aim to do

A

Measure strength and direction of 2 variables

  • Note: It is just like a continous one. There’s both strength and direction!
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

OLS Estimator - Which is bias/unbias

A

Unbias

  • Unbiased in maximising SSreg if assumptions hold (like residuals)
  • Unbiased in estimating sample regression coefficients

Bias

  • Bias, but Consistent, in R2
  • Adjusted R2 is only LESS bias (not no bias)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the model equation for simple linear regression?

A

Y_hat = a + bxi

Note: b does not have a hat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In regression diagnostics, what do we look for in

(a) Linearity
(b) Hetereoscadescity
(c) Outliers / Influential Cases (From normality)

A

Linearity

  • See misfit between the 2 lines (probability different colours)

Heteroscadescity

  • See fanning out of residual
  • nCV and Residual plots MAY NOT be consistent

Outliers & Influential Cases

  • Influential
    • Change regresion coefficients and R2
    • Cook’s D >1 is definitely a problem
  • Outliers
    • Look for +- 3
    • Though for smaller samples, might be 3.5 to 4
    • Large studentized residuals is maybe a problem
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In t-tests, what if the design is unbalanced and violated homogenity?

A

We must use adjustment separate variance estimates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

If Levene and Flinger comes out p < .05 and p > .05

What should we do?

A

Assume hetereogeneity. Be conservative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When are standardized mean differences useful

A

Hedges g and Bonnet’s d

  • Useful if it has an arbitary scaling and can’t be meaningfully intepreted
    • stop mixing up arbitrary. arbitrary = sucky!!!!
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In observed mean differences, will the mean differene estimates be the same?

A

Yup.

There is only one mean difference. However, all the other statistics will be different! (e.g. SE, t , etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Does homogeneity of variance matter at all in dependent group t-test

A

Yes

  • While it is not an assumption
  • g and d will differ and they still consider homogenity of group variances
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How many standard errors are there in a dependent group?

A

One.

Standard Error of the Mean Difference

17
Q

How are dummy and ANOVA similar?

A

Almost everything

  • F-Statistic for R2
  • T-Statistic for Coefficients
  • Proportion of DV explained
  • df
  • P-values
18
Q

How many contrasts do we need to establish orthogonality? What is the maximum number of linear contrasts

A

Two.

Don’t mix this up with number of linear contrasts.

  • The maximum number of linear contrasts to account for differences among means is k - 1
19
Q

In a one-way within-subject design, does the interval spacing have to be the same?

A

The interval spacing must be the same for all participants, but the distance between each interval can vary, as long as all same

20
Q

What does g and delta estimate based on (uni/multi). When is uni/ multi used

A
  • Hedges’ g and Bonett’s delta estimate the effect size and associated confidence interval using a univariate method.
  • The multivariate method is only used in the omnibus test of the null hypothesis.
21
Q

Does hedges g and bonnet’s d require spherecity?

A

Only g requires sphericity

HOWEVER

Both Hedges’ g and Bonett’s delta may be biased due to violation of the sphericity assumption

22
Q

Wtf does Pillai examine

A

Multivariate

  • Test omnibus hypothesis of no difference between all levels of the factor
  • Test polynomial orthogonal linear contrasts
    • Only tell significance
    • No effect size
23
Q

What is the defitnition of sphericity. Hence?

A

Sphericity

Homogenity of variances of all possible difference scores between pairs of three or more within-subject conditions (or levels) at a population level

Hence

Epislon

  • Estimated from sample covariance matrix
  • Calculated from population covariance matrix
24
Q

What does the epislon estimators do?

A

Estimate epislon in sample data

  • Greenhouse-Geisser estimator - E_hat
  • Huynh-Feldt estimator E-wave
    • Adjustments to null hypothesis tests under the univariate approach.
25
Q

Define main effect contrast. And what is the scaling

A

Main effect contrasts

  • Compare cell means of the two-way table to investigate contrasts of each factor in the two design.

Order-0 Scaling

  • Scaling for a difference in means
  • Absolute sum to 0
26
Q

Define interaction contrast. And what is the scaling

A

Interaction Contrast

  • Compare the cell means of the two-way table using
    the cross-product of contrast weights from the linear comparisons for the main effect linear contrasts

Order-1 Scaling

  • Scaling for the difference two sets of differences between
    means
27
Q

What happens to CI in multiple NHSTs

A

The CI coverage for capturing the true population parameter may be lower than nominal rate requested.

28
Q

How many NHSTs is the mark where probability of type 1 error is high

A

By 25 NHSTs,

High probability of at least one type 1/false rejection error due to chance alone

29
Q

What is the SEm. Why do we need it? And what is the Formula?

A

Average Error Score.

rxx does not tell us what is typical

𝑆𝐸𝑚= 𝑠𝑥 √1−rxx

  • Sx
    • Standard Deviation of Observed Scores
30
Q

What is the first classical theory equation

A

Xi = T + Ei

Ei is unsystematic error variance

31
Q

What is the third classical theory equation. Can we calculate it?

A

p2xt = True Score Variance / Observed Score Variance

Note

p2xt

  • Theoratical reliability coefficient
  • It can only be estimated using the sample reliability coefficient rxx
32
Q

What are the equations for reliability using variances

A

(a)

  • True Score Variance / Observed Score Variance

(b)

  • 1 - (Error Score Variance/Observed Score Variance)

Note

  • Observed is always the denomintor.
33
Q

What is criterion validity in theory. What are the types?

A

Extent to which test scores predict scores on an relevant criterion variables.

  • Concurrent
    • Evaluated against a criterion measured at the
      same time
  • Predictive:
    • Evaluated against a criterion measured later
34
Q

What is criterion validity in practice

A

In practice:

  • Validity Coefficient
  • Exclusive to criterion validity
  • Will be attenuated due to measurement error
  • Pearson Correlation
35
Q

If estimator A is more efficient than estimator B, what is the observed test statistic

A

Observed test statistic derived from A will be larger than the one from B

Efficient = Smaller S.E. = Larger Test Statistic Value

36
Q

What do we need to calculate an observed test statistic?

A
  • Known sample statistic value
    • aka. “Assumed population parameter value”
    • Can use a known population parameter value if we wish
  • Standard Error
37
Q

What do we need to calculate a critical test statistic?

A
  • Alpha
  • Relevant probability distribution stuff
    • t-distirubtion: one-dfs
    • F: two-dfs