Theories Flashcards
True ability of the testtaker (true score) + error
Classical test score theory
Center of distribution should represent the true score and dispersion around the mean of the distribution should display the distribution of sampling errors
Classical test score theory
Seek to estimate the extent to which specific sources of variation (error) under defined conditions are contributing to test score
Domain sampling theory
Using a limited number of items to represent a larger and more complicated construct. Also states that as reliability of a test increases as the number of items increases
Domain sampling theory
Based on the idea that a person’s test scores vary from testing to testing because of variables in the testing situation; analogous to true score theory
Generalizability theory
Influence of particular facets on the test score nga bisag unsa pa na factor (test admin, no. of items) maka lead ra ug same score in different situations
Coefficient of generalizability
Used to focus on the range of item difficulty that helps assess an individual’s ability level
Item-response theory (aka latent-trait theory)
An IRT model with very specific assumptions about underlying distribution
Rasch model
Assumptions in using IRT
- unidimensionality
- local independence (only one construct)
- monotonicity (item mutugma sa construct kay mu increase if piliun sya)
Probabilistic relationship between a testtaker’s response to a test item and that testaker’s level on the latent construct being measured expressed in graphic form
Item characteristic curve
Understand the range over construct for which an item is most useful in discriminating groups of testtakers
information curve
Estimate which an observed score deviates from a true score
Standard error of measurement (the higher the SEM, the lower the reliability)
Estimates how repeated measures of a person on the same instrument tend to be distributed around his or her “true” score.
The standard error of measurement (SEM
A range or band of test scores that is likely to contain the true score; set or derived from SEM; ideal is 95%
Confidence interval
Determine how large a difference should be before it is considered statistically significant
Standard error of difference