Lecture 1: Classic Test Theory Flashcards
When psychologists assess the quality of a test, what two metrics do they typically refer to?
Validity and reliability
What is test variance and how do you calculate it? (2)
Item variance is the measure of dispersion of the scores on item i. The test variance is the measure of the dispersion of the test scores. A covariance matrix is constructed in which the variance of each item is along the diagonal and the covariance between each item is displayed. The test variance is the sum of all these values in the matrix or the variance of the final test scores, its the same value.
Whats the difference between covariance and correlation if there is one?
Covariance is an unscaled measure of association between variables, correlation is a scaled measure of association between variables between -1 and 1
What can be used to infer the dimensionality of a test in CTT?
Principle Component analysis (PCA)
What is meant by Principal component analysis (PCA)?
Principal component analysis (PCA) is a technique for reducing the dimensionality of such datasets, increasing interpretability but at the same time minimising information loss. It does so by creating new uncorrelated variables that successively maximise variance. E.g reducing something (e.g tumour) with 30 dimensions (smoothness, volume) to two principle components
Summarise the main steps of how PCA is calculated
We calculate the covariance matrix of our data, we calculate the eigenvectors of the covariance matrix, and this gives us our principal components. The eigenvector with the largest eigenvalue is the first principal component, and the eigenvector with the smallest eigenvalue is the last principal component.
What does Xgp represet in CTT?
πππ is a random variable denoting the repeatedly sampled measurements of test g on subject p.
What two fundamental equations can be derived from CTT?
- πΈ (πππ) =πππ
The expected value of Xgp is equal to the true value - πΈππ =πππ βπππ (for a fixed subject)
(error = observed score - true score)
What three assumptions are there within CTT?
(a) the measurement is on an interval scale;
(b) the variance of observed scores π2 ππ is finite;
(c) the measurements are repeatedly sampled in a linear, experimentally independent way.
What 8 properties are derived from CTT?
- The expected error score is zero;
- The correlation between true and error scores is zero;
- The correlation between the error score on one measurement and the true score on another measurement is zero;
- The correlation between errors on linearly experimentally independent measurements is zero;
- The expected value of πππ over persons is equal to the expected value of the true score random variable over persons;
- The variance of πΈππ over persons is equal to the expected value, over persons, of π2 πππ (variance within persons);
- Sampling over persons with any πππ, the expected value of the error score random variable is zero;
- The variance of observed scores is the sum of the variance of true scores and the variance of error scores;
Give proof that the expected error score is 0
*Not required but gives an idea of how CTT is derived
- πΈ (πππ) =πππ (fundamental Eq. 1)
- πΈππ =πππ βπππ (fundamental Eq. 2)
πΈ (πΈππ) = πΈ (πππ βπππ) = πΈ (πππ) βπΈ (πππ)* = πππ - πππ = 0
*For one person πππ is fixed
Give the proof for the following:
- The correlation between true and error scores is zero;
- The correlation between the error score on one measurement and the true score on another measurement is zero;
- The correlation between errors on linearly experimentally independent measurements is zero;
*Not needed to reproduce exact theorems
- The correlation between true and error scores is zero;
ππ =ππ +πΈπ
or π =π+πΈ
πΈ πΈππ =0 (property 1) β πΈ (πΈπ |ππ =πππ) =πΈ (πΈππ) =0 for all πππ β π (πΈπ,ππ )=0
If you know the error is 0 for each person, the expected value of the error is also 0. Therefore there cannot be a correlation between the error and the true score.
- The correlation between the error score on one measurement and the true score on another measurement is zero;
πΈ πΈπ =0 (property 1) β πΈ (πΈπ |πβ =πβπ) =0 for all πβπ β π (πΈπ,πβ) = 0
Same logic; if error is zero, it cannot be correlated with true score
- The correlation between errors on linearly experimentally independent measurements is zero;
πΈ (πΈπ) =0 (property 1) β πΈ (πΈπ |πΈβ =πΈβπ) =0 for all πΈβπ β π (πΈπ,πΈβ) =0
Same logic; if error is zero, it cannot be correlated with other errors
Give the proof of property 8: The variance of observed scores is the sum of the variance of true scores and the variance of error scores;
- π πΈπ,ππ =0 (property 2)
- ππ =ππ +πΈπ (population model)
π2 (ππ) = π2 (ππ +πΈπ )*= π2(ππ)+π2 (πΈπ) +2π( ππ,πΈπ)
βπ2 ππ =π2(ππ)+π2 πΈπ
or π2/π =π2/π +π2/πΈ
*Covariance matrix
How can reliability be defined according to these terms (conceptually, and proof shown)
Conceptually: That it is the squared correlation between the test score and the true score for a participant
Using fundamental Equations 1 and 2, and property 2, reliability can be defined as:
π (ππ,ππ) = π(ππ,ππ) / π (ππ) π(ππ) = π(ππ +πΈπ,ππ) / π (ππ) π(ππ) = π (ππ,ππ) +π(πΈπ,ππ) / π (ππ) π(ππ) = π2 (ππ) +0 /π (ππ) π(ππ) = π (ππ)/ π (ππ)
βππ =π^2π,π =
= π(ππ,ππ) ^2
= (π(ππ)/π (ππ))^2 =
= π2 (ππ)/π2 (ππ)
corr between test score and true score
= formula for corr
= Xg = Tg + Eg
=covar of T+E & T can be written as covar of T & T + covar of E & T (rule)
=covar of T = the var, covar between E and T is 0 as explained before
= the π (ππ) cancel, leaving one on top
=Not there yet, reliability of x = corr between x and t squared (how much, in %, var of the total score variance is due to the true score)
=the corr squared = what we derived before
= the var of t / the var of x
How insightful is this definition of reliability?
insightful as π^2 (ππ) =π^2(ππ)+π^2 (πΈπ) (property 8)
These are theoretical equations, we cannot calculate them without the variance of true scores. How do we try to do this?
The concept of parallel tests: if you have test h with a parallel test form g.
What are the assumptions of parallel tests
You assume the true scores are identical on the two tests as well as the variance of the error.
How are parallel tests g and h defined mathematically?
πβπ =πππ β πβ =ππ =π
The true score on one test is the same as the true score of another for one subject
π2(πΈβπ)=π2(πΈππ) βπ^2 (πβ) =π^2 (ππ) =π^2(π)
If you have the same error variance, you have the same test score variance.