Lecture 4 Modern Test theory Flashcards

Question

Why is item information important for the psychometric properties of a test?

Answer 1

- If a test’s items have characteristics (e.g., item difficulty levels) that are more strongly represented at some trait levels than at others = test’s psychometric quality might differ by trait levels - A test provides 'good info' when it can accurately detect differences between individuals who have different trait levels - To reflect much smaller and more subtle differences between test takers, we need a test with stronger psychometric properties

Answer 2

We need the probability that a respondent with a particular trait level will answer the item correctly (Pi (θ)) I (θ) = Pi (θ)(1- Pi (θ)) | No need to remember the formula just to understand it

Answer 3

Quantifies item information thorugh computing information values at many more trait levels (picture 6) - Tells us how much information an item is giving about the latent variable at certain difficulty levels

Answer 4

The item provides the most information at trait levels around the item's difficulty (β), where the peak of the bell curve is located The amount of information decreases as θ moves away from β in either direction ↪ If the respondent's trait level is much higher or lower than the difficulty, the item provides less information about their trait The width of the curve relates to the discrimination parameter (α): a steeper ICC (higher α) leads to a narrower, taller IIF, meaning the item provides more precise information over a smaller range of trait levels

Answer 5

Together, the curves show how different items target different levels of the latent trait, providing a range of information across the scale

Answer 6

Evaluates the quality of the test as a whole The sum of the individual item information functions, indicating that the scale provides the most information at trait levels where the individual items overlap - Useful for illustrating the degree to which a test provides a different quality at different trait levels

Answer 7

The **peak** - where the items provide the most combined information (likely near the middle of the trait range) ↪ indicates that the test is most reliable at these trait levels The **width** - the range of θ values where the scale provides useful information ↪ The scale will provide less information at the extreme ends (very low or very high trait levels), where only one or two items contribute to measurement

Answer 8

From IRT perspective, a test's psychometric quality can vary across trait levels - Different from CTT, where a test has one reliability that can be estimated using an index such as a coefficient alpha

Answer 9

1. **Norm-referenced test** 2. **Criterion referenced test** Picture 8

Answer 10

Compare score to population (e.g. intelligence test) - We need broad range of information so that we have enough info for each individual in the population Picture 8 top graph

Answer 11

Determine if somone passes a certain cut-off (e.g. personal selection test) - we need the most information at the cut-off point so that we don't make mistakes at determining whether a person belongs there or not - above or below the cutoff we don't need that much info since we either take the person or not based on the selection test

Answer 12

Used to study whether an item is fair towards different groups - The groups should be equal on the latent variable so we would expect no mean difference on the item scores between the two groups

Answer 13

So that we can meaningfully compare the groups on an item - If a test includes another variable (besides the latent one) that plays a role in how well one of the groups is able to answer, it displays differentional item functioning (might be biased towards one group) ↪ the item functions differently in the different groups

Answer 14

Picture 9 After computing Item Characteristic Curve for both groups, we see that the item is more difficult (more to the right on the graph) for one group even though they have the same ability on the latent variable - E.g. the dutch student has higher probability of answering correctly even though both internationals and dutch students have the same statistical knowledge - Mean difference on the latent variable doesn't matter as we are considering each value of the variable - the students all have the same ability on the latent variable

Answer 15

- Detecting whether the item discriminates differently for one group - E.g. for international students it doesn't discriminate that well (much more false positives and false negatives) - Guessing is not commonly considered as a source of DIF Picture 10

Answer 16

An attempt to identify individuals whose response pattern doesn't seem to fit any of the expected patterns of responses to a set of items - E.g. Finding a person who responds correctly to a difficult items but doesn't respond correctly on easy items

Answer 17

- It could indicate cheating, random responding, low motivation, cultural bias of the test, intentional misrepresentation, or even scoring or administration error It might reveal that a person's personality is unique and that they don't fit the ''typically expected'' pattern of responses

Answer 18

- Provides an accurate and very efficient assessment of individuals' latent variable - It uses an item band (many items of which we know the difficulty and discrimination of - Computer selects the item which gives the most information for *you* (probability of a correct response of 50%) ↪ Could be completely different for each person - This could be different for each subject since they differ in the position on the latent variable

Answer 19

- If you answer correctly, you get a more difficult item - If you answer incorrectly, you get an easier item ↪ Through this, the computerised adaptive test algorithm is trying to find your latent value - It will start by first assuming that you are on an average value (theta = 0), then it continues giving you items based on the response you give and whether you're correct or not - It continues until the estimate for the latent trait doesn't change anymore - That is your latent variable

Answer 20

1. Population and test are independent - subject characteristics captured by the latent variable, test characteristics captured by the item discrimination and difficulty 2. Focus on items, not the test

Answer 21

1. Statistically more complex - need computer to calculate the latent variable and other characteristics 2. Needs large samples - to be able to estimate all the variables

Answer 22

**Classical test theory** * Sum score is taken as the construct score * Focus on test * Reliability of a test **Item Response Theory** * Score on latent variable is taken as the construct score * Focus on items ↪ Difficulty / discrimination * Item/Scale information - instead of reliability of a test * Computerized Adaptive Testing (CAT)

Lecture 4 Modern Test theory Flashcards

(46 cards)