L2: Classical test theory Flashcards
Ch 5, 6, 7
what is the central statistic of classical test theory?
definition & synonyms
summed item score (sum of the scores on the items)
synonyms: sum score, test score, score on the test
what is the central idea behind classical test theory?
- every test taker has a true score on a test, which is underlying the summed item score
- true score: score that you would get using a perfect measurement instrument
- observed score will generally not equal the true score due to measurement error
define measurement error
- influences other than the true score that cause random noise in the observed score
- goal is to minimize this error to improve reliability
what are the 2 core assumptions underlying classical test theory?
assumptions & what follows
- observed scores are true scores plus measurement error: Xo = Xt + Xe
- measurement error is random
what follows:
- mean of the error = 0, because a nonzero mean would make the measurement error systematic
- correlation between true score and error = 0 (Rte = 0), because the error is random
- observed score variance = true score variance + error variance (So^2 = St^2 + Se^2) (see the simulation sketch below)
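Not from the course materials, just a minimal numpy sketch of these assumptions: simulate true scores plus random error and check that the error is uncorrelated with the true scores and that the variances add up.

```python
# minimal simulation sketch (my own illustration) of Xo = Xt + Xe with random error
import numpy as np

rng = np.random.default_rng(0)
x_true = rng.normal(loc=25, scale=5, size=100_000)  # true scores, St = 5
x_err = rng.normal(loc=0, scale=3, size=100_000)    # random error, mean 0, Se = 3
x_obs = x_true + x_err                              # Xo = Xt + Xe

print(np.corrcoef(x_true, x_err)[0, 1])             # ~0: Rte = 0
print(x_obs.var(), x_true.var() + x_err.var())      # So^2 ~= St^2 + Se^2
```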
what are the 4 ways of thinking about reliability?
as a proportion of variance:
- ratio of true score variance to observed score variance
- lack of error variance (reliable tests have minimal error variance)
as shared variance:
- correlation between observed scores & true scores (reliability is the squared correlation between these 2)
- lack of correlation between observed scores & error scores (a highly reliable test shows little correlation between observed scores & error)
how can you define reliability as a proportion of variance?
comes from So^2 = St^2 (signal) + Se^2 (noise) assumption
high reliability when most of So^2 is St^2
low reliability when most of So^2 is Se^2
reliability = signal / (signal + noise) = St^2 / (St^2 + Se^2) = St^2 / So^2
and
reliability = 1 - noise / (signal + noise) = 1 - (Se^2 / (St^2 + Se^2)) = 1 - Se^2 / So^2
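A small arithmetic sketch (made-up variances, matching the simulated St = 5 and Se = 3 above) showing that the two expressions give the same number:

```python
# reliability as a proportion of variance, with example variances St^2 = 25, Se^2 = 9
st2, se2 = 5.0**2, 3.0**2     # signal (true score variance) and noise (error variance)
so2 = st2 + se2               # observed score variance
print(st2 / so2)              # St^2 / So^2        ~0.735
print(1 - se2 / so2)          # 1 - Se^2 / So^2    same value
```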
how can you define reliability as shared variance?
low reliability if Xt (true score) shares little variance with Xo (observed score)
high reliability if Xt shares a lot of variance with Xo
reliability = correlation (Xo, Xt)^2 = Rot^2 aka the amount of variance shared by observed score and true score
and
reliability = 1- correlation (Xo, Xe)^2 = 1- Roe^2 aka 1 - the amount of variance shared by observed score and error score
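In the same made-up simulation (repeated here so the snippet runs on its own), the squared correlation between observed and true scores lands on the same value as St^2 / So^2, and 1 - Roe^2 matches it too:

```python
# reliability as shared variance, checked in a simulation (my own sketch)
import numpy as np

rng = np.random.default_rng(0)
x_true = rng.normal(0, 5, 100_000)   # true scores
x_err = rng.normal(0, 3, 100_000)    # random error
x_obs = x_true + x_err               # observed scores

print(np.corrcoef(x_obs, x_true)[0, 1] ** 2)      # Rot^2 ~ 25/34 ~ 0.735
print(1 - np.corrcoef(x_obs, x_err)[0, 1] ** 2)   # 1 - Roe^2, same value
```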
what are the 4 models to test reliability from most restrictive to least restrictive?
- parallel test (most restrictive)
- tau equivalent test
- essentially tau equivalent test
- congeneric test (least restrictive)
what are the restrictions of parallel test?
restriction on Xt1 (first test's true score): needs to = Xt2
restriction on Se1^2 and Se2^2: need to be equal to each other (the variances of the measurement errors)
implication:
- mean of Xt1 and Xt2 need to be equal
- variance of Xt1 and Xt2 need to be equal
- mean of Xo1 and Xo2 need to be equal CAN BE TESTED
- correlation between Xt1 and Xt2 = 1 (Rt1t2 = 1)
- reliability of test 1 = reliability of test 2, so also Rt1o1 = Rt2o2 (correlation between observed and true score)
- variance of observed scores on test 1 and test 2 are equal (So1^2 = So2^2) CAN BE TESTED
what is the model of observed score of test 1 and 2 according to parallel test? and of the true score?
model observed score on test 1: Xo1 = Xt1 + Xe1 and Se1^2 = Se2^2
model observed score on test 2:
Xo2 = Xt1 + Xe2 and Se2^2 = Se1^2
model for the true score:
Xt2 = Xt1
what are the 2 types of reliability based on the parallel test model?
- test retest reliability
- split halves reliability
what are the restrictions on the tau equivalent test?
restriction on Xt1 (first test's true score): needs to = Xt2
no restriction on measurement error variances
implications
- mean of Xt1 = mean of Xt2
- variance of Xt1 = variance of Xt2 (St1^2 = St2^2)
- mean of Xo1 = mean of Xo2 CAN BE TESTED
- correlation between true scores on test 1 and test 2 = 1 (Rt1t2 = 1)
what type of reliability is based on essential tau equivalent test model?
cronbachs alpha
what is the model of observed score of test 1 and 2 according to essential tau equivalent test model? and of the true score?
model for true score:
Xt2 = a + Xt1
model observed score of test 1: Xo1 = Xt1 + Xe1
model observed score of test 2: Xo2 = a + Xt1 + Xe2
what is the model of observed score of test 1 and 2 according to tau equivalent test model? and of the true score?
model for true score: Xt2 = Xt1
model observed score of test 1: Xo1 = Xt1 + Xe1
model observed score of test 2: Xo2 = Xt1 + Xe2
what are the restrictions on the essentially tau equivalent test?
restriction on Xt2: = a + Xt1 (true scores on the second test equal the true scores on the first plus a constant a)
no restriction on measurement error variances
implications:
- means of the true scores are different
- variance of true scores are equal (St1^2 = St2^2)
- correlation between true scores on test 1 and test 2 is 1 (Rt1t2 = 1)
what is the model of observed score of test 1 and 2 according to congeneric test model? and of the true score?
model for true score:
Xt2 = a + bXt1
model observed score test 1:
Xo1 = Xt1 + Xe1
model observed score test 2:
Xo2 = a + bXt1 + Xe2
what are the restrictions on the congeneric test?
Xt2 = a + bXt1
no restriction on measurement error variances
implications:
- means of the true scores are different
- variances of the true scores are different
- correlation between true scores on test 1 and 2 = 1 (Rt1t2 = 1)
what reliability measure is based on congeneric model?
omega
what are 3 methods of estimating reliability?
- alternate forms reliability
- test retest reliability
- internal consistency reliability
what is the alternate forms reliability estimation technique?
- assumes parallel test model (meaning the two forms measure the same trait with the same amount of error variance)
- apply 2 versions of the same test
- correlation between the 2 forms is the reliability
what are the main challenges with the alternate forms reliability estimation technique?
- constructing the alternate forms of the same test is hard
- carry over effects (lack of motivation, fatigue etc)
what is the test retest reliability estimation technique?
- assumes parallel test model
- apply same test twice to same group but at different times
- correlation is the reliability
- assumes the trait being measured remains stable over time (which isnt always the case)
what are the main challenges with test retest reliability technique?
- carry over effects
- change in the true score: for constructs that fluctuate, like mood, the true score might change between the 2 tests
what is the internal consistency reliability estimation technique?
- looks at how consistent the items within a single test are with each other; if the items on a test measure the same trait, the test should be internally consistent
- assumes parallel or essentially tau equivalent test model (there are multiple internal consistency techniques)
- consider (blocks of) items as separate tests
- formula will give the reliability
what are the main challenges with the internal consistency reliability technique?
carry over effects (by the end of the test you might be much better at answering, or more tired, so different parts of the test might not be as consistent with each other)
what are the 3 types of internal consistency reliability techniques?
- split half:
  - assumes parallel test model
  - split the test into 2 parts
  - formula gives reliability (see table 6.2)
- cronbachs alpha (or KR20 for binary items, or standardized alpha):
  - assumes essentially tau equivalent test model
  - each item is considered a separate part
  - formula gives reliability (see table 6.2; sketch below)
- omega:
  - assumes congeneric test model (or stricter)
  - not applied in practice yet
  - estimate the true score variance using unidimensional factor analysis
  - reliability = true score variance / observed score variance
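Cronbach's alpha itself is just a formula over the item variances and the test score variance; here is a minimal sketch with made-up data (my own code, see table 6.2 for the book's formula):

```python
# Cronbach's alpha from a respondents-by-items score matrix (illustrative sketch)
import numpy as np

def cronbach_alpha(items):
    """items: 2D array, rows = respondents, columns = items."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)       # variance of each item
    total_var = items.sum(axis=1).var(ddof=1)   # variance of the summed test score
    return k / (k - 1) * (1 - item_vars.sum() / total_var)

# made-up data: 5 respondents, 3 Likert items
scores = np.array([[4, 5, 4],
                   [2, 2, 3],
                   [5, 4, 5],
                   [3, 3, 3],
                   [1, 2, 2]])
print(cronbach_alpha(scores))
```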
evaluate the 4 main ways of estimating reliability: alternate forms, test retest, split half, cronbachs alpha
- alternate forms: hardly feasible in practice, only in specific situations
- test retest: important to establish reliability for new test, in research only rarely used
- split half: depends highly on the split used (undesirable), still used frequently
- cronbachs alpha/KR20: very popular in research due to its ease, though its assumption (essential tau equivalence) is rarely met; alpha is a lower bound of the reliability, so the actual reliability will be equal to or higher than cronbachs alpha
what are the COTAN guidelines about reliability for psych tests?
- tests used for high impact inferences at the individual level (ex: personnel selection, diagnosis of learning disabilities etc):
  - good: 0.9 or larger
  - sufficient: 0.8-0.9
  - insufficient: smaller than 0.8
- tests used for less impactful inferences at the individual level (descriptive use, ex: study/therapy progress, career choice tests etc):
  - good: 0.8 or larger
  - sufficient: 0.7-0.8
  - insufficient: smaller than 0.7
- tests used at the group level (ex: customer/team satisfaction, student evaluations, comparing groups etc):
  - good: 0.7 or larger
  - sufficient: 0.6-0.7
  - insufficient: smaller than 0.6
what is item discrimination?
differences in the item scores reflect differences in the construct (indicates how good the items are)
- item total correlation: correlation between item scores & sum scores (how well does an item predict the total score on the test?)
- corrected item total correlation: correlation between item scores & rest scores (how well does an item predict the other item scores, excluding the item you are looking at)
what is the problem with the item-total correlation? what resolves this?
it is biased upwards because you are (partly) correlating an item with itself
-> this problem is solved with the corrected item total correlation, as it excludes the item itself (see the sketch below)
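A small made-up example (my own sketch, not from the book) of the two correlations for a single item:

```python
# item-total vs corrected item-total correlation for item 1 of a made-up 4-item test
import numpy as np

scores = np.array([[4, 5, 4, 3],
                   [2, 2, 3, 2],
                   [5, 4, 5, 4],
                   [3, 3, 3, 5],
                   [1, 2, 2, 1]], dtype=float)

item = scores[:, 0]                     # the item being evaluated
total = scores.sum(axis=1)              # sum score, including the item
rest = total - item                     # rest score, excluding the item

print(np.corrcoef(item, total)[0, 1])   # item-total correlation (biased upwards)
print(np.corrcoef(item, rest)[0, 1])    # corrected item-total correlation (lower)
```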
what are the factors affecting reliability?
- test length
- sample heterogeneity
- the correlation between pretest and posttest scores
how does a tests reliability get affected by test length?
lengthening a test will generally increase reliability
what is the equation for a test's reliability if its length has been changed?
Rnew = (n x Roriginal) / (1 + (n-1) x Roriginal)
where n = new number of items / original number of items (see the sketch below)
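The lengthening (Spearman-Brown) formula in code, with made-up example numbers:

```python
# Spearman-Brown formula: predicted reliability after changing test length by factor n
def spearman_brown(r_original, n):
    return n * r_original / (1 + (n - 1) * r_original)

print(spearman_brown(0.70, 2))    # doubling a .70 test -> ~.82
print(spearman_brown(0.70, 0.5))  # halving it -> ~.54
```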
how does a test reliability get affected by sample heterogeneity?
in homogeneous samples, reliability will be smaller than in heterogeneous samples
R = St^2 / (St^2 + Se^2)
in homogeneous samples, St^2 will be smaller because people are relatively similar to each other
while in heterogeneous samples St^2 will be larger because people are relatively dissimilar to each other
how does a tests reliability get affected by the correlation between pretest and posttest scores?
difference score: Di = Xi (post test) - Yi (pretest): difference between post and pretest scores
difference score reliability:
Rd = (Sxo^2 x Rxx + Syo^2 x Ryy - 2 x Rxoyo x Sxo x Syo) / (Sxo^2 + Syo^2 - 2 x Rxoyo x Sxo x Syo)
in words: (posttest variance x posttest reliability + pretest variance x pretest reliability - 2 x pre-post correlation x sd of posttest x sd of pretest) / (posttest variance + pretest variance - 2 x pre-post correlation x sd of posttest x sd of pretest)
important properties:
- if the correlation between pretest and posttest is large, the difference score reliability will be small
- the difference score reliability also depends on the reliability of the pretest and the posttest
- sensitive to difference in variance between Xi and Yi
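A sketch with made-up numbers, showing how a high pretest-posttest correlation drags the difference score reliability down even when both tests are reliable:

```python
# difference score reliability (X = posttest, Y = pretest), illustrative numbers only
def diff_score_reliability(s_x, s_y, r_xx, r_yy, r_xy):
    """s_x, s_y: observed sd's; r_xx, r_yy: reliabilities; r_xy: pre-post correlation."""
    num = s_x**2 * r_xx + s_y**2 * r_yy - 2 * r_xy * s_x * s_y
    den = s_x**2 + s_y**2 - 2 * r_xy * s_x * s_y
    return num / den

print(diff_score_reliability(5, 5, 0.9, 0.9, 0.3))  # low pre-post correlation -> ~.86
print(diff_score_reliability(5, 5, 0.9, 0.9, 0.8))  # high pre-post correlation -> .50
```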
how can you estimate true scores?
- true score estimate = summed item score
- true score estimate: Xest = mean of Xo (observed scores) + Reliability x (Xo - mean of Xo), where Xo is the observed score of the person you're interested in
- based on regression to the mean
- due to unreliability, high scoring persons will likely score lower on a next test
-> the lower the reliability, the more the true score estimate is pulled toward the mean
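Sketch with made-up numbers (test mean 100, observed score 130) of how the estimate is pulled toward the mean:

```python
# regression-to-the-mean true score estimate: Xest = mean + reliability * (Xo - mean)
def true_score_estimate(x_obs, mean_obs, reliability):
    return mean_obs + reliability * (x_obs - mean_obs)

print(true_score_estimate(130, 100, 0.90))  # 127.0: barely pulled toward the mean
print(true_score_estimate(130, 100, 0.50))  # 115.0: pulled strongly toward the mean
```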
what is the standard error aka standard error of measurement?
amount of error present in an individual's score
Sem = So (sd of observed scores) * sqrt(1 - Reliability)
so higher reliability = smaller sem
lower reliability = higher sem
can be used to construct 95% confidence interval around true score estimate
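Sketch (made-up IQ-style numbers, and the usual +/- 1.96 for a 95% interval) of the SEM and the confidence interval around the true score estimate:

```python
# standard error of measurement and a 95% CI around the true score estimate
import math

def sem(sd_obs, reliability):
    return sd_obs * math.sqrt(1 - reliability)

s = sem(sd_obs=15, reliability=0.90)              # ~4.74; higher reliability -> smaller SEM
estimate = 100 + 0.90 * (130 - 100)               # true score estimate from the previous card
print(estimate - 1.96 * s, estimate + 1.96 * s)   # 95% confidence interval
```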
what is attenuation?
effect size/correlations observed will be SMALLER than the effect sizes / correlations of the true scores (because observed scores are diluted by error)
+ a correlation is smaller & less likely to be significant for a less reliable test anyway (so always consider reliability)
in other words, when measurements aren't reliable (because of measurement error), the relationships observed between variables are weakened
what is wrong w corrections for attenuation?
the corrections applied to the observed correlations in order to remove the error from them can themselves be wrong! (see the sketch below)
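The usual correction divides the observed correlation by the square root of the product of the two reliabilities, so if the reliability estimates are off, the "corrected" correlation is off too. A sketch with made-up numbers (check the book for its exact formula):

```python
# standard correction for attenuation (illustrative sketch)
import math

def disattenuate(r_observed, rel_x, rel_y):
    return r_observed / math.sqrt(rel_x * rel_y)

print(disattenuate(0.40, 0.70, 0.80))  # ~0.53
# if the reliability estimates are too low, the corrected value overshoots
# (it can even exceed 1), which is exactly how the correction goes wrong
```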
When is reliability high?
- when there is little error score variance relative to the true score variance
- when the sum of the true score variance & error variance comes close to the true score variance
- when the proportion of error variance in the observed variance is small
what is the relationship between standard error of measurement & reliability?
A smaller standard error of measurement means that there is less deviation of observed scores from true scores, so a more reliable test
Consider two tests that purport to measure the same construct. In a pilot study, a researcher finds their observed test score means to be the same, but their test score variances to not be the same. Which of the test models do these data follow?
tau equivalent test
Which test model does Cronbach’s alpha assume?
essentially tau equivalent test model or stricter
which model does the test-retest reliability assume?
parallel test model
which model does the split half reliability assume?
parallel test model
In a hypothetical dataset that contains the test scores on two tests, the true score mean and true score variance differ across the two tests. Which test model does this dataset follow?
the congeneric test model
If you find the reliability of two tests measuring the same construct to be the same, what test model do these tests follow?
parallel test model
In a hypothetical dataset that contains the test scores on two tests, the true score mean and true score variance are equal across the two tests. Which test model does this dataset follow?
parallel test model & tau equivalent test model
Say you want to assess the consistency between the observed scores of one test and those of another test. Which method for estimating reliability do you use?
alternate forms reliability
which criteria do 2 test forms need to meet, in order to legitimately use the alternate forms method of estimating reliability?
tests need to have identical true scores & identical error variance
Jimmy conducts a study into aggression, for which he uses the Aggression Questionnaire (AGQ; Buss & Perry, 1992). He wants to know how reliable the AGQ is. Therefore, he lets his respondents fill in the questionnaire again.
Which method of estimating reliability does Jimmy intend to use here?
test retest
When someone calculates Cronbach’s alpha to estimate the reliability of a test, what general method of estimating reliability is that person using?
internal consistency
For which reliability methods is it problematic if the true scores differ across the two tests?
- alternative forms
- test retest
for which reliability methods is it problematic if there are carry over effects?
- test retest
- internal consistency
- alternative forms
What is a problem that arises from using the internal consistency method for estimating reliability?
A correlation between the item’s error scores caused by carry-over effects
for which items are the following reliability measures suitable? raw alpha, KR20, standardized alpha
raw alpha: Likert scale items that do not differ in variance too much
KR20: binary items
standardized alpha: Likert scale items that differ substantially in their item variance
what is a difference between raw alpha, KR20, and standardized alpha? not concerning which items they are used on
Raw alpha and KR20 are based on the item covariances and item variances; standardized alpha only uses item correlations
In what situation is it good to use standardized alpha?
When the item variances differ a lot from each other and thus the test score mostly reflects items with high variances
What can we do to improve the reliability of a test?
Add more items to the test that are perfectly parallel to the original items
is the reliability of the test smaller in a heterogeneous sample or a homogeneous sample?
homogeneous sample
If the pretest and posttest are both reliable, the reliability of the difference scores can still be relatively small if the pretest and posttest are…
highly correlated
where can you find the split halves reliability? what about the reliability of one half?
split halves reliability: Spearman-Brown coefficient
reliability of one half: correlation between the forms
Which statistic do we use when we want to know the consistency between one item and the other items of a test?
Corrected item-total correlation
define reliability
consistency or stability of test scores across repeated applications. It’s a crucial aspect of psychometrics because it determines how much trust can be placed in test results.
what is the main assumption of the parallel test?
that 2 tests measure the same trait w equal true scores and error variances
what is the main assumption of the tau equivalent test?
that 2 tests have equal true scores (and thus equal true score variances) but can differ in error variances
What is the main assumption of the congeneric tests?
that a linear relationship between the true scores of the 2 tests exists, allowing for flexibility in error & true score variances
what is the domain sampling theory?
treats test items as a sample from a larger domain (bucket) of possible items
- the reliability of a test is the average correlation between all possible pairs of tests drawn from that domain: basically how consistent the test results would be if you made different tests by pulling out different sets of items from the bucket of all possible questions
basically this theory is saying "we want to know whether the questions we randomly picked give a reliable picture of the person's true ability, even if we swapped them out for different questions from the same big bucket"