Chapter 6: Writing and Evaluating Test Items Flashcards
What are the stages of test development?
Conceptualization, Construction, Test Tryout, Item Analysis, Test Revision
Define incremental validity
incremental validity: the extent to which a proposed test provides unique information about a construct relative to that which is offered by existing tests of the same construct (kidney test example)
Advantages and Disadvantages of Polytomous Format
Advantages
Easy to administer
Probability of correct answer is lower than true/false (have to know more)
Disadvantages
Hard to write good distractors
Still based on recognition
Reliability - requires excellent distractors
Advantages and Disadvantages of Dichotomous Format
Advantages
Simple
Easy to administer
Flexible
Absolute
Disadvantages
Encourage memorization
Truth is sometimes gray
50% probability of guessing correctly
Reliability: requires many items
Define dichotomous and polytomous format. Common examples?
Dichotomous format: Belonging to the closed-ended family of questions, dichotomous questions are ones that only offer two possible answers, which are typically presented to survey takers in the following format
Polytomous format: Items that are scored in multiple-ordered categories are referred to as polytomously scored items.
Which types of questions are “selected-response format”?
multiple-choice that’s either dichotomous or polytomous
What are the two major formats of summative scales, as given in lecture? What type of data do they create?
Likert (nominal data) and Category (On a scale of 1 to 10… that creates interval data)
In creating a category format, the use of what will reduce error variance?
Anchors (what are the endpoints, the low and the high)
When does the category format begin to reduce reliability?
When less than four categories are used
What are the four questions that should be asked when generating a pool of candidate test items?
Cumlative scoring: summing them up
Subscale Scoring: total test scores is divided into groups that is individual summed (think act)
Class or category scaling: Pass or fail, you have it or you don’t
Ipsative Scoring: Forced choice (you have two choices to pick from)
Define item analysis. What two methods are closely associated with item analysis?
process which examines student responses to individual test items (questions) in order to assess the quality of those items and of the test as a whole
Item difficulty: The higher the score on the difficulty index, it means that it is easy
item discriminately: Methods to discriminate between high and low scorers
Define item difficulty. What does the proportion of people getting the item correct indicate?
a form of item analysis used to assess how difficult items are
the higher the score on the difficult index, the easier the question is
Define item discriminability. What is good discrimination? What are two ways to test item discriminability?
how well an item performs in relation to some criterion, high discriminability means that it separates the smart from the dumb
Extreme group method and point biserial method
Will guessing help you on an exam?
Depends on how the test is graded. If it is corrected for guessing, than it will do you no good
Know and be able to identify examples of a double-barreled item.
You can do this