Chapter 8 Flashcards
What is an ideal item on a norm-referenced test?
What about on a criterion-referenced test?
On a norm-referenced test, high scorers on the test should answer the item correctly, while low scorers should answer it incorrectly.
That pattern doesn’t matter for a criterion-referenced test. There, an ideal item is judged by how well it assesses mastery.
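The norm-referenced idea above can be put into numbers with a rough sketch: compare the proportion of high scorers who got an item right with the proportion of low scorers who did. The function name and the way the two groups are passed in are assumptions made for illustration, not something from the flashcards.

```python
def discrimination_index(upper_correct, lower_correct):
    """Difference in proportion correct between high and low scorers.

    upper_correct / lower_correct: lists of booleans, one per test taker
    in the high-scoring and low-scoring groups (hypothetical inputs).
    """
    p_upper = sum(upper_correct) / len(upper_correct)
    p_lower = sum(lower_correct) / len(lower_correct)
    return p_upper - p_lower
```

A value near 1 means the item separates high and low scorers well; a value near 0 means it does not separate them at all.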
Scaling definition
The process of setting rules for assigning numbers in measurement.
Stanine scale
Raw scores are transformed into scores ranging from 1 to 9.
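A minimal sketch of one way to do this transformation, assuming the raw scores are roughly normal. It uses the common approximation that stanines have a mean of 5 and a standard deviation of about 2, so stanine is roughly 2z + 5, rounded and clipped to 1 through 9. The function name is hypothetical, and percentile bands are another common way to assign stanines.

```python
import math
from statistics import mean, pstdev

def to_stanines(raw_scores):
    """Map raw scores to stanines (1-9) using z-scores (assumed method)."""
    m = mean(raw_scores)
    sd = pstdev(raw_scores)
    if sd == 0:
        # Every score is identical, so everyone sits at the middle stanine.
        return [5 for _ in raw_scores]
    stanines = []
    for x in raw_scores:
        z = (x - m) / sd
        s = math.floor(2 * z + 5 + 0.5)    # round half up
        stanines.append(max(1, min(9, s)))  # clip to the 1-9 range
    return stanines
```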
Rating scale
Records judgments about oneself, others, experiences, or objects.
Summative scale
The final test score is the sum of the ratings across all items.
Method of paired comparisons
Test takers are shown pairs of options and asked to choose one option from each pair.
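As an illustration only, the sketch below presents every possible pair and tallies how often each option is chosen. The names `tally_paired_choices` and `choose` are hypothetical, and tallying wins is just one simple way to summarize the choices.

```python
from collections import Counter
from itertools import combinations

def tally_paired_choices(options, choose):
    """Present every pair of options and count how often each is chosen.

    choose(a, b) stands in for the test taker and must return a or b.
    """
    wins = Counter({option: 0 for option in options})
    for a, b in combinations(options, 2):
        picked = choose(a, b)
        if picked not in (a, b):
            raise ValueError("choice must be one of the two options shown")
        wins[picked] += 1
    return wins
```

With three options there are three pairs, so each option can be chosen at most twice.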
Comparative scaling
Sort options by judging them relative to one another (e.g., rank-ordering cards).
Categorical scaling
Sort objects into categories (e.g., sorting cards into “never justified,” “sometimes justified,” and “always justified”).
Guttman scale
Items range from weaker to stronger expressions of the attitude being measured.
Respondents who agree with the stronger statements will also agree with the milder ones.
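The cumulative pattern described above can be checked with a short sketch: if items are ordered from mildest to strongest, a consistent respondent never agrees with an item after disagreeing with a milder one. The function name and the boolean encoding of responses are assumptions made for illustration.

```python
def is_guttman_consistent(responses):
    """Check whether agreements form an unbroken run from the mildest item.

    responses: list of booleans ordered from mildest to strongest item,
    True meaning the respondent agreed.
    """
    seen_disagreement = False
    for agreed in responses:
        if not agreed:
            seen_disagreement = True
        elif seen_disagreement:
            # Agreed with a stronger item after rejecting a milder one.
            return False
    return True
```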
Direct vs Indirect estimation
Direct estimation (like the method of equal-appearing intervals) requires no transformation of responses to another scale.
Indirect estimation requires transforming responses to another scale.
Selected-response vs. Constructed-response formats
Two item formats. Selected-response items require choosing an answer from a set of options; constructed-response items require generating your own answer.
3 types of selected-response item formats
Multiple-choice, matching, and true/false.
What are the names of the two columns in a matching item?
Premises and responses
Completion item
Fill-in-the-blank item.
Computerized adaptive testing
What are the advantages of CAT?
The items presented are selected based on the test taker’s performance on previous items.
CAT can reduce the number of items needed and reduce measurement error (each by as much as 50%).
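A deliberately simplified sketch of the adaptive idea: pick the next item based on whether the previous one was answered correctly, narrowing in on the test taker's level much like a binary search. Operational CAT systems typically rely on item response theory instead; the function name, the numeric difficulty values, and the `answers_correctly` callback are all assumptions made for illustration.

```python
def run_adaptive_test(difficulties, answers_correctly, max_items=10):
    """Administer items adaptively and return (difficulty, correct) pairs.

    difficulties: numeric item difficulties (hypothetical item bank).
    answers_correctly(difficulty): stands in for the test taker's response.
    """
    items = sorted(difficulties)
    low, high = 0, len(items) - 1
    administered = []
    while low <= high and len(administered) < max_items:
        mid = (low + high) // 2
        correct = answers_correctly(items[mid])
        administered.append((items[mid], correct))
        if correct:
            low = mid + 1   # next item is harder
        else:
            high = mid - 1  # next item is easier
    return administered
```

In this sketch a test taker who can handle items up to difficulty 6 in a nine-item bank is located after three items rather than nine, which shows how adapting to earlier responses can shorten a test.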