Lecture 2 + Wa Flashcards
What are observe scores, true scores and measurement error?
The scores on a test to measure a certain ability or characteristic
The actual levels of a certain ability or characteristic people have
The effect by random factors
(…) minus the error score equals (…) what should be on the dots (…)?
observed score and true score, respectively
Johns couch is 200 centimeters wide. He measures it with a measuring tape and finds it to be 205 centimeters.
What is the observed score, true score and error score here?
observed = 205, true = 200 and error = 5
What is the central idea in the classical test theory (ctt)?
Every test taker has a true score on a test
Why do observed scores not match true scores in practice?
Because measurement error exists and changes the observed score
What are two core assumptions of classical test theory?
- Observed scores are true scores plus measurement error, 𝑋𝑜 = 𝑋𝑡 + 𝑋𝑒
- Measurement error is random
you do not need to know the formulas, they function as a support for basic understanding
What follows from the two core assumptions of CTT?
- 𝑋𝑒 = 0 (mean of the measurement error is equal to 0) this is because error is random (nonzero = systematic)
- 𝑟𝑡𝑒 = 0 (the correlation between true score and error is equal to 0) I think because the true score contains no error (and also because the mean is 0 for all true scores?)
- observed score variance = true scores variance + error variance (note that variance is s^2)
Reliability can be defined in two ways, which?
Proportion of variance. This basically means that if there is a high proportion of noise (error) the reliability will be low, if there is a high proportion of signal (true score variance), the reliability will be high. Aka proportion/ratio of true score variance to observed score variance (which is also the formula)
Shared variance. Reliability defined as the correlation^2 btwn the observed and the true scores variance. A high correlation^2 equals a high reliability and vice versa
There are three types of tests regarding the methods of multiple tests, what are these and explain?
Split-halves = you have two tests bc you split one in half
Test-retest = retest the same test
Cronbach and Omega = each item counts as their own test
There are four models regarding the reliability of a test, which are these (do not explain)
Parallel, tau-equivalent, essentially tau-equivalent and congeneric test
What are the restrictions/implications of the parallel test model?
Restrictions = True mean scores of both tests are equal and error of both tests are equal
Implications = Mean of true scores and variance of true scores equal, as well as for the observed scores. Correlation between true scores =1 and reliability is equal
> also most restrictive
Which tests’ reliability are based on the parallel test model?
Split-halves and test-retest
What are the restrictions/implications of the tau-equivalent test model?
restriction = true mean scores equal across both tests
implication = mean and variance of true scores are equal, but only the mean of the observed score is equal. Correlation btwn the true scores is equal
What are the restrictions/implications of the Essential tau-equivalent test model?
restriction = mean true scores are not equal
implications = variance of true scores ARE equal and the correlation between the true scores = 1
Which tests’ reliability are based on the Essential tau-equivalent test model?
Cronbach’s alpha
Which tests’ reliability are based on the congeneric test model?
basically everything is different (except ofc like mean error score or smth) and correlation btwn true scores is still 1
> least restrictive
Which tests’ reliability are based on the congeneric test model?
Omega
What are three methods of reliability estimation (CTT)? Explain themmm
Alternate forms (parallel model, apply two versions of same test, correlation = reliability)
test-retest (parallel, same test twice, correlation = reliability)
internal consistency (parallel or essential tau, blocks of items = test, some complicated formula = reliability)