Lecture 6: Essential Test Item Consideration Flashcards

1
Q

Test items

A

the units that make up a test and the means through which samples of test taker behaviors are gathered

2
Q

Item Analysis

A

term that refers to all techniques used to assess the characteristics of test items and evaluate their quality during the process of test development and test construction

3
Q

Qualitative item analysis

A

rely on judgments of reviewers concerning the substantive and stylistic characteristics of items, as well as their accuracy and fairness

4
Q

Qualitative criteria (how do you evaluate items qualitatively?)

A
  • appropriateness of item content and format
  • clarity of expression
  • grammatical correctness
  • adherence to “some basic rules for writing items that have evolved over time”
5
Q

Quantitative item analysis

A

involves a variety of statistical procedures designed to ascertain the psychometric characteristics of items, based on the responses obtained from the samples used in the process of test development

6
Q

Bias

A

any systematic error that enters into scores and affects their meaning in relation to what scores are designed to measure or predict

7
Q

The context of item analysis

A
  • usage of simple statistical procedures
  • information on item behavior
  • practical features of interest
8
Q

When is the decision to create a test developed?

A

when the developer realizes either that no test exists for a particular purpose or that the existing tests for that purpose are not adequate for some reason

9
Q

Planning a test entails specifying:

A
  • the construct or knowledge domains that the test will assess
  • the type of population with which the test will be used
  • the objectives of the items to be developed
  • the concrete means through which the behavior samples will be gathered and scored
10
Q

Steps in the Test Development Process

A
  1. Item generation
  2. Qualitative item analysis
  3. Revision/replacement of items
  4. Pilot study
  5. Evaluation of pilot study results
  6. Potential modification of items
  7. Additional pilot studies
  8. Determination and fixation of test length
  9. Test norming
  10. Test publication
11
Q

Item generation

A

writing (or otherwise creating) the test items, along with the administration and scoring procedures

12
Q

Qualitative item analysis

A
  • submitting the item pool to experts
  • to identify items that may disadvantage, or be offensive to, any particular demographic group for which the test is intended
13
Q

Revision/Replacement of items

A

revising or replacing items that reviewers identify as inadequate or problematic from the point of view of subject matter, offensiveness, or bias

14
Q

Pilot study

A

trying out the items that have been gathered and reviewed on samples of test takers

15
Q

Evaluation of pilot study results

A

through quantitative item analysis and additional qualitative analysis

16
Q

Potential modification of items

A

adding, deleting or modifying test items as needed

17
Q

Additional pilot studies

A

cross-validation: checking whether item statistics remain stable across different groups, until a satisfactory set of items is obtained

18
Q

Determination and fixing of test length

A
  • and the sequencing of items
  • the administration and scoring procedures that will be standard in the final form of the test, on the basis of the foregoing analyses
19
Q

Test norming

A

administering the test to a new sample of individuals in order to develop normative data or performance criteria, etc.

20
Q

Test publication

A

publishing the final form, along with the administration and scoring manual (accompanying documentation of the uses for which the test is intended)

21
Q

computer adaptive testing (CAT)

A

relies on banked pools of items that have been carefully calibrated with respect to the information they convey
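
A minimal sketch of the item-selection idea behind this: assuming a Rasch (1PL) calibration in which each banked item is described only by a difficulty parameter b, the next item administered is the one that conveys the most information at the current ability estimate. The bank and the function names are illustrative, not from the lecture.

```python
import math

def prob_correct(theta, b):
    """Rasch (1PL) probability of a correct response at ability theta, item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def item_information(theta, b):
    """Fisher information of a Rasch item at theta: P * (1 - P)."""
    p = prob_correct(theta, b)
    return p * (1.0 - p)

def pick_next_item(theta, item_bank, administered):
    """Choose the unadministered item with maximum information at the current theta."""
    candidates = [(item_information(theta, b), item_id)
                  for item_id, b in item_bank.items()
                  if item_id not in administered]
    return max(candidates)[1]

# Hypothetical calibrated bank: item id -> difficulty (b) on the theta scale.
bank = {"A": -1.2, "B": -0.4, "C": 0.0, "D": 0.7, "E": 1.5}
print(pick_next_item(theta=0.3, item_bank=bank, administered={"C"}))  # -> "D", closest b to 0.3
```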

22
Q

Types of responses items require from test takers (two types)

A
  1. Selected-response items
  2. Constructed-response items

23
Q

Selected-response items

A
  • objective or fixed-response items
  • present a limited number of response alternatives (from which test takers must choose)
  • Pass/fail items (dichotomous), ranking scale items (polytomous)
24
Q

Forced choice

A

objective items that require test takers to choose which of two or more alternatives is most or least characteristic of themselves
  • mainly used in multidimensional personality inventories

25
Q

Ipsative scores

A

ordinal numbers that simply reflect the test taker's ranking of the constructs assessed by the scales within a forced-choice test

26
Q

Advantages of selected-response items

A
  • easy to administer and easy to score
  • high degree of objectivity
  • enhance test score reliability
  • allow group administration
27
Q

Disadvantages of selected-response items

A
  • restrict the potential responses to a selection chosen by the test developers
  • susceptible to guessing (pass/fail items)
  • response styles (tendency toward the middle or the extremes)
  • responses can be distorted by social desirability, self-monitoring, etc.
28
Q

Constructed-response items

A
  • "free" response format
  • open-ended (the variety of possible responses is limitless)
  • but usually some constraints on response behavior in the instructions (time limit, length of response, use of materials, etc.)
29
Q

In personality testing

A

the use of constructed responses is limited mainly to projective techniques, also known as performance-based measures of personality: test takers respond as freely as possible and thereby reveal aspects of their personality

30
Q

Advantages of constructed-response items

A
  • do not restrict response behavior to pre-selected options
  • may elicit greater acceptance of a test
  • provide richer samples of the behavior of examinees (unique characteristics can emerge)
31
Q

Disadvantages of constructed-response items

A
  • less objective scoring (responses might be evaluated differently by different scorers)
  • highly diverse responses, which calls their comparability into question
  • test length: responses require more time for administration and scoring
32
Q

What is the most important aspect in qualitative item analysis?

A
  • item validity: does a specific item carry its own weight within a test by eliciting information that advances the purpose of the test? (also called item discrimination)
33
Q

Classical methods of item analysis

A
  1. Item difficulty
  2. Item discrimination (item validity)
34
Q

Qualitative evaluation (inspect content regarding...)

A
  • appropriateness
  • difficulty level
  • possible bias or offensiveness toward any group
35
Q

Quantitative evaluation

A

of item difficulty is carried out through statistics that assess whether items perform the way they were intended to perform

36
Q

Item difficulty

A
  • the difficulty level of a test as a whole is a function of the difficulty levels of the individual items that make up the test (easy items = easy test)
  • item difficulty is sample dependent (it depends on the ability of the test takers)
37
Q

Proportion (or percentage) passing (p)

A
  • the higher the percentage passing, the easier the item
  • when the underlying trait is normally distributed, p values can be transformed into z-values
38
Q

Z-values

A

relative difficulty levels of items can be compared across various groups by administering a common set of items (anchor items); see the sketch below

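A minimal sketch of both indexes, assuming 0/1 item scores; the inverse-normal transform and its sign convention (easier items get negative z-values) are one common choice, assumed here rather than prescribed by the lecture:

```python
import numpy as np
from scipy.stats import norm

def p_value(item_responses):
    """Proportion passing: share of test takers who answered the item correctly (1 = pass)."""
    return np.asarray(item_responses).mean()

def z_difficulty(p):
    """Normal-deviate difficulty: inverse-normal of the proportion failing (1 - p).
    Easier items (high p) get negative z; harder items get positive z."""
    return norm.ppf(1.0 - p)

# Hypothetical 0/1 responses of ten test takers to one item.
responses = [1, 1, 1, 0, 1, 1, 0, 1, 1, 1]
p = p_value(responses)        # 0.9 -> an easy item
print(p, z_difficulty(p))     # z is approximately -1.28
```
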
39
Q

How can item difficulty be gauged?

A
  • for words, by the frequency with which they are used in the language
  • by quantitative indexes: the percentage of test takers who answer an item correctly (the p-value)
40
Q

Absolute scaling

A

allows the difficulty of items to be placed on a uniform numerical scale for samples of test takers at different ability levels

41
Q

Item difficulty levels, test difficulty levels, and test purpose

A
  • the average score on a test is the same as the average difficulty of its items
  • if the average percentage passing (p) for the items in a test is 80%, the average score on the test will be 80% as well (see the sketch below)
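
A short numerical check of the relationship above, with hypothetical p-values for a five-item test:

```python
# Illustrative p-values for a five-item test (hypothetical).
p_values = [0.9, 0.85, 0.8, 0.75, 0.7]

mean_p = sum(p_values) / len(p_values)                            # average item difficulty = 0.80
expected_mean_raw_score = sum(p_values)                           # 4.0 items correct on average
expected_mean_percent = expected_mean_raw_score / len(p_values)   # 0.80 = 80%

print(mean_p, expected_mean_raw_score, expected_mean_percent)
```
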
42
Q

Insufficient ceiling

A
  • when the test items are too easy for a certain group
  • the score distribution will be negatively skewed
43
Q

Inadequate floor

A
  • when the test items are too difficult for a certain group
  • the score distribution is positively skewed
44
Q

Distractors

A
  • have a great deal of influence on item difficulty
  • the number of distractors directly affects indexes of item difficulty, because the probability of guessing correctly is higher when the number of choices is smaller (see the sketch below)
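
A small sketch of the guessing effect, under the simplifying blind-guessing assumption that test takers who do not know the answer pick an option at random; the function names and numbers are illustrative:

```python
def chance_level(n_options):
    """Probability of guessing a multiple-choice item correctly by blind guessing."""
    return 1.0 / n_options

def expected_p(p_known, n_options):
    """Observed p under a blind-guessing assumption: knowers pass, the rest guess."""
    return p_known + (1.0 - p_known) * chance_level(n_options)

for k in (5, 4, 3, 2):   # fewer distractors -> fewer options -> easier to guess correctly
    print(k, round(chance_level(k), 2), round(expected_p(0.5, k), 2))
```
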
45
Q

Item validity (item discrimination)

A

the extent to which items elicit responses that accurately differentiate test takers in terms of the behavior, knowledge, or other characteristics that the test is designed to evaluate

46
Q

Item validation criteria

A
  • internal criteria: the total test score is used to validate items (the homogeneity of the test increases) -> the reliability indexes based on inter-item consistency are enhanced (see the sketch below)
  • external criteria: measures outside the test are used to validate items -> the validity of scores on the test as a whole is enhanced
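
A minimal sketch of validating an item against an internal criterion: the corrected item-total correlation, i.e., the correlation between an item and the total of the remaining items (the data matrix here is hypothetical):

```python
import numpy as np

def corrected_item_total(scores, item_index):
    """Correlate one item with the total of the remaining items (internal criterion)."""
    scores = np.asarray(scores, dtype=float)
    item = scores[:, item_index]
    rest_total = scores.sum(axis=1) - item   # total score with the item itself removed
    return np.corrcoef(item, rest_total)[0, 1]

# Hypothetical 0/1 responses: rows = test takers, columns = items.
X = np.array([
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [0, 1, 0, 0],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
])
print([round(corrected_item_total(X, j), 2) for j in range(X.shape[1])])
```
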
47
Q

Unidimensional traits

A
  • the total score may be used to validate items
  • all test items should correlate highly with the total score and with each other
48
Q

Complex and multifaceted constructs

A
  • items are validated against external criteria that are also more global
  • items do not necessarily have to correlate highly with one another (the test is not necessarily homogeneous)
49
Q

Item validity statistics

A

all statistical procedures used to gauge the degree to which items discriminate in terms of a criterion require information about:
  1. item performance
  2. criterion standing for the individuals in the samples from which the item discrimination statistics are derived

50
Q

Index of discrimination (D)

A
  • the difference in the percentage or proportion of test takers in the upper and lower criterion groups who pass a given item or answer in the keyed direction
  • positive discrimination indexes: more individuals in the upper criterion group pass the item
51
Q

Computation of D

A

test takers must be classified into distinct criterion groups, based either on their total scores on the test or on some external indicator of their standing on the constructs assessed; once the groups are created, the percentage (p) of individuals within each group who pass the item is calculated (see the sketch below)

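A minimal sketch of computing D from total scores, assuming an upper/lower 27% split (a common convention, not specified in the lecture) and hypothetical data:

```python
import numpy as np

def discrimination_index(item_scores, total_scores, fraction=0.27):
    """D = proportion passing in the upper criterion group minus the lower group.
    Groups are the top and bottom `fraction` of test takers by total score."""
    item_scores = np.asarray(item_scores)
    total_scores = np.asarray(total_scores)
    k = max(1, int(round(len(total_scores) * fraction)))
    order = np.argsort(total_scores)           # ascending by total score
    lower, upper = order[:k], order[-k:]
    return item_scores[upper].mean() - item_scores[lower].mean()

# Hypothetical data: ten test takers, their 0/1 score on one item, and their total scores.
item = np.array([1, 1, 1, 0, 1, 0, 0, 1, 0, 0])
total = np.array([48, 45, 44, 40, 39, 33, 30, 28, 25, 22])
print(discrimination_index(item, total))       # positive D -> the item favors the upper group
```
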
52
Q

Correlation coefficients

A

a test theory method used for expressing the relationship between performance on an item and a criterion

53
Q

Point-biserial (rpb) correlation coefficient

A

used when item scores are dichotomous (pass/fail) and the criterion measure is continuous

54
Q

Phi coefficient

A
  • used when the item scores and the criterion measure are both dichotomous
  • both coefficients can range from -1 to +1 (see the sketch below)
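
A minimal sketch of both coefficients with hypothetical data, using scipy's point-biserial function and treating phi as the Pearson correlation between two 0/1 variables:

```python
import numpy as np
from scipy.stats import pointbiserialr

# Hypothetical data for one item and eight test takers.
item = np.array([1, 1, 0, 1, 0, 1, 0, 0])               # dichotomous item score
criterion = np.array([52, 47, 33, 45, 30, 49, 35, 28])  # continuous criterion measure

r_pb, _ = pointbiserialr(item, criterion)                # point-biserial correlation
print(round(r_pb, 2))

# Phi: Pearson correlation between two dichotomous variables (e.g., item vs. pass/fail criterion).
criterion_pass = (criterion >= 40).astype(int)           # illustrative cut score of 40
phi = np.corrcoef(item, criterion_pass)[0, 1]
print(round(phi, 2))
```
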
55
Q

Tests can be classified into three categories

A
  1. Pure speed tests
  2. Pure power tests
  3. A blend of speed and power
56
Q

Pure speed test

A
  • difficulty is manipulated mainly through timing; the time limits are so short that most test takers cannot complete all the items
  • when test takers do finish all the items, their actual capacity has not been determined
57
Q

Pure power test

A
  • has no time limits
  • difficulty is manipulated mainly by increasing or decreasing the complexity of the items
58
Q

Blend of speed and power

A

most ability tests fall between the extremes of the pure-speed versus pure-power continuum (time limits allow test takers to attempt all the items)

59
Q

Item-test regression

A

it is necessary to calculate the proportion of individuals at each total score who passed a given item (this combines information about item difficulty and item discrimination); see the sketch below

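A minimal sketch of an item-test regression table: the proportion passing the item at each total score, computed from hypothetical data (plotting these proportions against total score gives the regression curve):

```python
import numpy as np
from collections import defaultdict

def item_test_regression(item_scores, total_scores):
    """Proportion of test takers at each total score who passed the item."""
    passed = defaultdict(list)
    for item, total in zip(item_scores, total_scores):
        passed[total].append(item)
    return {total: float(np.mean(scores)) for total, scores in sorted(passed.items())}

# Hypothetical data: 0/1 item scores and total test scores for twelve test takers.
item = [0, 0, 1, 0, 1, 1, 0, 1, 1, 1, 1, 1]
total = [2, 3, 3, 4, 4, 4, 5, 5, 6, 6, 7, 7]
print(item_test_regression(item, total))
# Rounded: {2: 0.0, 3: 0.5, 4: 0.67, 5: 0.5, 6: 1.0, 7: 1.0} - an upward trend indicates discrimination
```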