Writing and Evaluating Test Items
How do you choose the format of items?
Choice of format comes from objectives and purpose of the test
Item writing guidelines
- define clearly what you want to measure
- generate an item pool
- avoid long items (tedious to read)
- keep reading difficulty appropriate (education level)
- use clear and concise wording (avoid double-barrelled and double-negative items)
- mix positively and negatively worded items in the same test
- make items as culturally neutral as possible
- make content relevant to the purpose of the test
How to write MCQ items
vary position of correct answer
all distractors must be plausible
True/false Qs
Keep true and false statements the same length
Include roughly equal numbers of true and false statements
5 types of item format
- Dichotomous format
- Polytomous format
- Likert format
- Category format
- Checklists and Q-sorts
Dichotomous format
- 2 alternatives
- True/False
- Yes/No
Dichotomous format advantages
- ease of administration
- quick scoring
- requires absolute judgement
Dichotomous format disadvantages
- less reliable (a 50% chance of guessing an item correct; a smaller range of scores for analyses)
- encourages memorisation
- often truth is not black/white
Polytomous format
more than 2 alternatives
e.g., multiple-choice questions (MCQs)
Polytomous format- distractors
- incorrect alternatives
- ideal to have 3-4 distractors to retain psychometric properties
- must be as plausible as the correct answer
- no 'cute' (joke) distractors
- make the test more reliable
- but difficult to find good distractors
Polytomous format- advantages
- easy to administer and score
- requires absolute judgement
- less likely to guess correctly than a dichotomous test
Correction for guessing
Corrected score = R - W/(n - 1)
(the number of right answers minus the number of wrong answers divided by the number of choices per item minus 1)
R = number of right answers
W = number of wrong answers
n = number of alternatives per item
Omitted answers are excluded in this calculation
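A minimal sketch of this correction in Python (the function name and example numbers are my own, not from the card):

```python
def corrected_score(n_right: int, n_wrong: int, n_alternatives: int) -> float:
    """Correction for guessing: R - W/(n - 1).
    Omitted items count as neither right nor wrong, so they are
    excluded automatically."""
    return n_right - n_wrong / (n_alternatives - 1)

# e.g., 30 right, 12 wrong, 8 omitted on a 4-alternative MCQ test
print(corrected_score(30, 12, 4))  # 30 - 12/3 = 26.0
```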
Likert Format
Named after Rensis Likert, who first used it for attitude scales
- indicates degree of agreement
- a 6-point scale (or another even number of options) is used to avoid the neutral response
- reverse-score negatively worded items
- use statements
- popular for attitude and personality scales
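A short Python sketch of reverse scoring on a 6-point scale (the responses are made up; the formula min + max - score is the standard one):

```python
def reverse_score(score: int, scale_min: int = 1, scale_max: int = 6) -> int:
    """Reverse-score a negatively worded item: 1<->6, 2<->5, 3<->4."""
    return scale_min + scale_max - score

responses = [1, 2, 6, 4]  # raw responses to a negatively worded item
print([reverse_score(r) for r in responses])  # [6, 5, 1, 3]
```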
Category Format
On a scale of 1 to 10…
Research suggests 7 categories work best
Category Format- disadvantages
- Tendency to spread responses across all categories
- Susceptible to the groupings of things being rated (context)
- Element of randomness
When is Category Format used?
- when people are highly involved with a subject, e.g., asking people in townships to rate service delivery (they are more motivated to make finer discriminations)
- when you want to measure the amount of something, e.g., road rage experienced in a given situation
- make sure your endpoints are clearly defined
Visual analogue scale
- respondents mark a point on a line between two clearly defined endpoints
Checklists
- Common in personality measures
- A list of adjectives, check which ones describe you best
Q-sorts
- place statements into piles
- piles indicate degree to which you think a statement describes a person/yourself
- category format implicit here
Item analysis
Item analysis is a general term used to describe a set of methods used to evaluate test items. Item difficulty and item discriminability are the most basic of these methods.
Item Difficulty
- the proportion of people who get a particular item correct
- the higher the value, the easier the item
- p = number answered item correctly / number taking the measure
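In Python, item difficulty is a one-liner over the 1/0 (correct/incorrect) responses to a single item (the data here are invented):

```python
def item_difficulty(responses: list[int]) -> float:
    """Proportion of test takers answering the item correctly
    (1 = correct, 0 = incorrect)."""
    return sum(responses) / len(responses)

# 10 test takers, 7 answered correctly -> p = 0.7 (a fairly easy item)
print(item_difficulty([1, 1, 1, 0, 1, 1, 0, 1, 1, 0]))
```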
Optimum difficulty level (ODL)
-between 0.30 and 0.70
Example: MCQ test with 4 alternatives
-4 answer options, therefore chance = 0.25
-Halfway between 100% and chance: (1.00 - 0.25)/2 = 0.375
-Add chance: 0.375 + 0.25 = 0.625
(Add chance because we require a difficulty level of at least chance)
-ODL = 0.625
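The worked example generalises to any number of alternatives; a small sketch (the function name is mine):

```python
def optimum_difficulty(n_alternatives: int) -> float:
    """Halfway between a perfect score (1.0) and chance (1/n),
    shifted up by chance."""
    chance = 1 / n_alternatives
    return (1.0 - chance) / 2 + chance

print(optimum_difficulty(4))  # 0.625, matching the worked example above
print(optimum_difficulty(2))  # 0.75 for true/false items
```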
exceptions to optimum difficulty level
- At times we need more difficult items e.g., selection process
- At times we need easier items e.g., special education
- At times we need to consider other factors e.g., boost morale
Item discriminability
Have those who did well on particular items also done well on the overall test?
Good item discriminability when:
People who do well on test overall get the item correct (and vice versa)
Discrimination Index (di)
Higher values indicate better discriminability
Item discriminability- extreme groups method
-Calculated as the proportion of people in the upper quartile (on total test score) who got the item correct, minus the proportion of people in the lower quartile who got it correct
-Essentially the difference in item difficulty between the top and bottom 25%
di = U/NU - L/NL
(U = upper-quartile test takers answering correctly, NU = number in the upper quartile; L = lower-quartile test takers answering correctly, NL = number in the lower quartile)
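A sketch of the extreme groups method in Python, assuming each test taker is a (total score, item correct) pair and quartiles are simple count-based splits:

```python
def discrimination_index(data):
    """Extreme groups method: item difficulty in the top quartile
    (by total score) minus item difficulty in the bottom quartile."""
    ranked = sorted(data, key=lambda pair: pair[0])
    q = len(ranked) // 4                       # size of each extreme group
    lower, upper = ranked[:q], ranked[-q:]
    p_upper = sum(correct for _, correct in upper) / q
    p_lower = sum(correct for _, correct in lower) / q
    return p_upper - p_lower

# 8 test takers as (total score, 1/0 on this item): top 2 vs bottom 2
data = [(55, 0), (60, 0), (70, 1), (75, 0), (80, 1), (85, 1), (90, 1), (95, 1)]
print(discrimination_index(data))  # 1.0 - 0.0 = 1.0, a highly discriminating item
```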
Item discriminability: The point-biserial method
-Also known as item-total correlation
- Item correlations can also be used for Likert-type items, category format items, etc.
Again, good items should be those that have a positive item-total correlation
For example:
If an item on a questionnaire measuring schizophrenia symptoms has a high correlation with total scores on the overall questionnaire, then the item is good at measuring schizophrenia symptoms
This correlation can be used as an indicator of whether to include or exclude an item from the test/questionnaire in future:
include items with higher item-total correlations; exclude those with lower ones
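Since the point-biserial is just Pearson's r with a dichotomous variable, the standard-library version (Python 3.10+) is enough for a sketch (data invented):

```python
from statistics import correlation

item   = [1, 1, 0, 1, 0, 0]        # 1/0 on the item for 6 test takers
totals = [88, 75, 60, 70, 55, 62]  # their total test scores

r_pb = correlation(item, totals)   # point-biserial item-total correlation
print(round(r_pb, 2))  # positive: people who get the item right score higher overall
```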
Item characteristic curves (ICCs)
The relationship between performance on an item and performance on the overall test tells us how well the item is tapping into what we want to measure.
-A graphical display of item functioning
Total test score plotted on X-axis
Proportion (i.e., 0.23, 0.50, etc.) getting the item correct plotted on Y-axis
-need discrete categories for scores.
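A minimal sketch of building ICC points: bin the total scores into discrete categories, then compute the proportion correct per bin (the bin width and data are my own choices):

```python
from collections import defaultdict

def icc_points(totals, item_correct, bin_width=10):
    """Group total scores into discrete categories (x-axis) and return
    the proportion getting the item correct in each (y-axis)."""
    bins = defaultdict(list)
    for total, correct in zip(totals, item_correct):
        bins[total // bin_width * bin_width].append(correct)
    return {b: sum(v) / len(v) for b, v in sorted(bins.items())}

totals = [45, 52, 58, 63, 67, 74, 78, 85]
item   = [0, 0, 1, 0, 1, 1, 1, 1]
print(icc_points(totals, item))  # rising proportions = the item discriminates well
```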
Item response theory (IRT)
A different model of psychological testing
-Makes extensive use of item analysis
-Computer generates items
Each of these items has a particular difficulty level
-Computer gives you an item
-If you answer it correctly, the next item will be of increased difficulty, if incorrectly, the next item will be of decreased difficulty
-Looks at what you can do and only gives you what it thinks you can handle
-Essentially, the test is ‘tailored’ to the individual
Example:
This person can answer most items correctly at the 0.30 (or 0.45 or 0.70, etc.) level of difficulty…
Rather than: This person got 30% or 45% or 70% on this test.
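A toy staircase sketch of the adaptive idea (not a real IRT ability estimator; the step size, bounds, and simulated test taker are all assumptions):

```python
def adaptive_session(answers_correctly, start=0.50, step=0.05, n_items=10):
    """Each correct answer raises the difficulty level of the next item;
    each error lowers it. The final level describes what the test taker
    can handle, rather than a percentage score."""
    level = start
    for _ in range(n_items):
        if answers_correctly(level):
            level = min(level + step, 0.95)
        else:
            level = max(level - step, 0.05)
    return level

# Simulated test taker who can handle items up to the 0.70 level
print(adaptive_session(lambda level: level <= 0.70))  # settles around 0.70
```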
Test performance in IRT- advantages
- Tests based on IRT can easily be adapted for computer administration
- Quicker tests
- Morale of test-taker is not broken down
- Reduces chances of cheating
Measurement precision: peaked conventional
- tests individuals at average ability best
- doesn’t assess high or low levels well
- high precision for average ability levels, low precision at either end
Measurement precision: rectangular conventional
- equal number of items assessing all ability levels
- relatively low precision across the board
Measurement precision: adaptive
- the test focuses on the range that challenges each individual test taker
- precision therefore high at every ability level
Criterion-referenced tests
-Compares performance with some objectively defined criterion
E.g., the extent to which performance on the QLT predicts success at stats in psychology
Develop tests based on learning outcomes
What is it that the student should be able to do?
E.g., At the end of this lecture you should be able to:
Describe an ICC
Calculate item discriminability
Calculate a point-biserial correlation
Evaluating items in Criterion-referenced tests
- 2 groups: 1 given the learning ‘unit’. 1 not given the learning ‘unit’
- collect scores and plot them on a graph; the distribution should form a V or U shape
- the bottom of the curve is the antimode (it can serve as the cutting score between the two groups)
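A toy sketch of finding the antimode, assuming integer scores and a clean two-peaked (V/U) distribution:

```python
from collections import Counter

def antimode(scores):
    """Least frequent score between the two peaks of a V/U-shaped
    distribution; can serve as the cutting score between groups."""
    counts = Counter(scores)
    peak_lo, peak_hi = sorted(s for s, _ in counts.most_common(2))
    between = [s for s in counts if peak_lo < s < peak_hi]
    return min(between, key=lambda s: counts[s])

# Uninstructed group clusters near 3, instructed group near 8
scores = [2, 3, 3, 3, 4, 4, 5, 7, 7, 8, 8, 8, 9]
print(antimode(scores))  # 5, the low point of the U between the peaks
```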
Limitations of Criterion-referenced tests
Tells you that you got something wrong, but not why
Emphasis on ranking students rather than identifying gaps in knowledge
‘Teaching to the test’
How does IRT differ from traditional testing methods?
Performance is defined by the level of difficulty of the items answered correctly,
instead of by the total test score, as in traditional methods