test construction Flashcards
What is Classical Test Theory?
A theory of measurement used for developing and evaluating tests, also known as true score test theory
What is the formula representing the relationship between obtained test scores, true score variability, and measurement error?
X = T + E
What does true score variability (T) represent?
Actual differences among examinees regarding what the test measures
What is measurement error (E)?
Random factors affecting test performance in unpredictable ways
What are some examples of measurement error?
- Distractions during testing
- Ambiguously worded test items
- Examinee fatigue
What does test reliability refer to?
The extent to which a test provides consistent information
What is a reliability coefficient?
A type of correlation coefficient that ranges from 0 to 1.0
How is a reliability coefficient interpreted?
As the amount of variability in obtained test scores due to true score variability
What reliability coefficient is considered minimally acceptable for many tests?
0.70 or higher
What reliability coefficient is usually required for high-stakes tests?
0.90 or higher
What are the four main methods for assessing a test’s reliability?
- Test-retest
- Alternate forms
- Internal consistency
- Inter-rater
What does test-retest reliability measure?
The consistency of scores over time
How is alternate forms reliability assessed?
By correlating scores from different forms of the test administered to the same examinees
What does internal consistency reliability measure?
The consistency of scores over different test items
Why is internal consistency reliability not useful for speed tests?
It tends to overestimate their reliability
What is coefficient alpha also known as?
Cronbach’s alpha
What is Kuder-Richardson 20 (KR-20) used for?
Evaluating internal consistency reliability for dichotomously scored items
What is the split-half reliability method?
Correlating scores from two halves of a test
What is a drawback of split-half reliability?
It underestimates a test’s reliability
What formula is used to correct split-half reliability?
Spearman-Brown prophecy formula
What does inter-rater reliability assess?
The consistency of scores or ratings assigned by different raters
What methods are used to evaluate inter-rater reliability?
- Percent agreement
- Cohen’s kappa coefficient