Test Construction Flashcards

Question 1

Q

Items with moderate difficulty levels are typically retained in classical test theory because

Answer

A

.5 = moderate
increases test score variability
helps ensure scores are normally distributed
provides maximum discrimination between examinees
maximizes the test’s reliability
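
One way to see why p = .5 is favoured: for a right/wrong item, the item variance is p(1 - p), which peaks at p = .5. The sketch below is only an illustration; the function names and the sample responses are made up here, not part of the original cards.

```python
def item_difficulty(responses):
    """Proportion of examinees who answered the item correctly (p)."""
    return sum(responses) / len(responses)


def item_variance(p):
    """Variance of a dichotomously scored (0/1) item with difficulty p."""
    return p * (1 - p)


# Hypothetical responses to one item: 1 = correct, 0 = incorrect
responses = [1, 0, 1, 1, 0, 0, 1, 0]
p = item_difficulty(responses)  # 4 of 8 examinees answered correctly

# Item variance is largest at p = .5 and shrinks toward either extreme
for candidate in (0.1, 0.3, 0.5, 0.7, 0.9):
    print(candidate, round(item_variance(candidate), 2))
```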

Question 2

Q

Item discrimination index measures

Answer

A

the extent to which a test item discriminates between examinees who obtain high versus low scores on the entire test

ranges from -1 to 1
.35 or above is acceptable
items with moderate difficulty most likely to differentiate
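
A minimal sketch of one common way to compute the index: D = (proportion correct in the high-scoring group) - (proportion correct in the low-scoring group). The 27% group size, the function name, and the data are assumptions chosen for illustration; other splits are also used.

```python
def discrimination_index(item_scores, total_scores, fraction=0.27):
    """D = p(upper group) - p(lower group) for one 0/1 item.

    Groups are the top and bottom `fraction` of examinees by total score.
    """
    n = len(total_scores)
    k = max(1, round(n * fraction))  # examinees per group
    order = sorted(range(n), key=lambda i: total_scores[i])
    lower, upper = order[:k], order[-k:]
    p_upper = sum(item_scores[i] for i in upper) / k
    p_lower = sum(item_scores[i] for i in lower) / k
    return p_upper - p_lower


# Hypothetical data for ten examinees
total_scores = [2, 4, 5, 6, 7, 8, 9, 10, 12, 14]
item_scores = [0, 0, 0, 1, 0, 1, 1, 1, 1, 1]
d = discrimination_index(item_scores, total_scores)
```

A negative D would mean low scorers answered the item correctly more often than high scorers.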

Question 3

Q

Benefits of Item Response Theory

Answer

A

item characteristics (parameters) are sample invariant (same across different samples)
possible to equate scores from different sets of items/tests
easier to develop computer-adaptive tests

Question 4

Q

Which theory of test construction uses an item characteristic curve?

Answer

A

item response theory

Question 5

Q

item characteristic curves provide information on ___

Answer

A

difficulty, discrimination, probability of guessing correctly
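
These three quantities can be illustrated with the three-parameter logistic (3PL) model, one common form of the ICC. This is a sketch under that assumption; the parameter names a (discrimination), b (difficulty), and c (guessing) follow the usual convention, and the function name is made up here. Note that when c > 0, the probability of a correct response at ability b is (1 + c) / 2 rather than exactly .50.

```python
import math


def icc(theta, a=1.0, b=0.0, c=0.0):
    """Probability of a correct response at ability level theta (3PL model).

    a: discrimination (slope), b: difficulty (location), c: guessing (lower asymptote).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# With no guessing, an examinee whose ability equals b has a 50% chance
p_at_b = icc(theta=0.0, a=1.0, b=0.0, c=0.0)

# A larger a gives a steeper curve, so probabilities change faster around b
steeper = icc(theta=1.0, a=2.0)
flatter = icc(theta=1.0, a=1.0)

# At very low ability the curve approaches the guessing parameter c
floor = icc(theta=-50.0, c=0.25)
```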

Question 6

Q

According to an item characteristic curve, an item’s ability to discriminate between high and low achievers is represented by the ____

Answer

A

slope of the curve

- the steeper the slope, the greater the discrimination

Question 7

Q

According to an item characteristic curve, the probability of guessing correctly is indicated by _____

Answer

A

the point at which the ICC intersects the vertical axis

Question 8

Q

According to an item characteristic curve, an item’s difficulty level is indicated by ____

Answer

A

the ability level at which 50% of examinees in the tryout sample provided a correct response

Question 9

Q

According to classical test theory, an examinee’s obtained test score (X) is composed of ____ and ____

Answer

A

their true score (T) and an error component (E)

Question 10

Q

A reliability coefficient of .84 indicates

Answer

A

that 84% of variability in scores is due to true score differences among examinees, while the remaining 16% is due to measurement error.
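
As a worked illustration, reliability can be written as true score variance divided by total observed variance, where observed variance = true variance + error variance (assuming true scores and errors are uncorrelated, as classical test theory does). The function name and the numbers are hypothetical.

```python
def reliability(true_variance, error_variance):
    """Proportion of observed-score variance that is true-score variance."""
    return true_variance / (true_variance + error_variance)


# Hypothetical variances: 84 units of true variance, 16 units of error variance
r_xx = reliability(true_variance=84, error_variance=16)
```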

Question 11

Q

Kuder-Richardson Formula 20

Answer

A

a variation of coefficient alpha for when test items are scored dichotomously (right/wrong)
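
A minimal sketch of the KR-20 calculation: KR-20 = (k / (k - 1)) * (1 - sum of p*q across items / variance of total scores), where k is the number of items, p is the proportion passing an item, and q = 1 - p. The function name and the data are made up for illustration, and the use of the population variance is an assumption (some texts use the sample variance).

```python
from statistics import pvariance


def kr20(score_matrix):
    """KR-20 for 0/1 item scores; one row per examinee, one column per item."""
    n_people = len(score_matrix)
    n_items = len(score_matrix[0])
    totals = [sum(row) for row in score_matrix]
    total_var = pvariance(totals)  # population variance of total scores
    pq_sum = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in score_matrix) / n_people
        pq_sum += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - pq_sum / total_var)


# Hypothetical responses: five examinees, four items
data = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
coefficient = kr20(data)
```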

Question 12

Q

internal consistency reliability is not appropriate for

Answer

A

speeded tests

Question 13

Q

the reliability coefficient is maximized when the range of scores is

Answer

A

unrestricted

Question 14

Q

standard error of measurement

Answer

A

used to construct a confidence interval around a measured (obtained) score.
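
A sketch of the usual calculation: SEM = SD * sqrt(1 - reliability), and the confidence interval is the obtained score plus or minus z * SEM. The function names, the z value of 1.96 for a 95% interval, and the numbers are assumptions for illustration.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """SEM = SD of test scores * sqrt(1 - reliability coefficient)."""
    return sd * math.sqrt(1 - reliability)


def confidence_interval(obtained_score, sem, z=1.96):
    """Interval around an obtained score; z = 1.96 gives roughly 95%."""
    return obtained_score - z * sem, obtained_score + z * sem


# Hypothetical test: SD = 15, reliability = .84, obtained score = 100
sem = standard_error_of_measurement(sd=15, reliability=0.84)
low, high = confidence_interval(obtained_score=100, sem=sem)
```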

Question 15

Q

Content Validity

Answer

A

the test will be used to obtain information about an examinee’s familiarity with a particular content or behavior domain. Determined by experts.

Question 16

Q

Construct validity

Answer

A

the test will be used to determine the extent to which an examinee possesses a particular hypothetical trait

Question 17

Q

Criterion-related Validity

Answer

A

the test will be used to estimate or predict an examinee’s standing or performance on an external criterion

Question 18

Q

Face validity

Answer

A

whether or not a test looks like it measures what it is intended to measure

Question 19

Q

convergent and discriminant validity are used to assess ___ validity

Answer

A

construct

Question 20

Q

a squared factor loading provides a measure of

Answer

A

shared variability (the proportion of variance in test scores explained by the factor)

Question 21

Q

when factors are orthogonal, a test’s communality can be calculated by ___

Answer

A

squaring and adding the test’s factor loadings
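
For example, a test with loadings of .60 and .30 on two orthogonal factors has a communality of .36 + .09 = .45. A short sketch (the function name and loadings are hypothetical):

```python
def communality(loadings):
    """Sum of squared factor loadings; valid only when factors are orthogonal."""
    return sum(loading ** 2 for loading in loadings)


# Hypothetical loadings of one test on two orthogonal factors
h2 = communality([0.60, 0.30])
```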

Question 22

Q

Two types of criterion-related validity

Answer

A

concurrent and predictive

Question 23

Q

standard error of the estimate

Answer

A

is used to construct a confidence interval around a predicted (estimated) criterion score.
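
A sketch of the usual calculation: SEE = SD of the criterion * sqrt(1 - validity coefficient squared), with the interval built as the predicted score plus or minus z * SEE. The function names, the z value of 1.96, and the numbers are assumptions for illustration.

```python
import math


def standard_error_of_estimate(sd_criterion, validity):
    """SEE = SD of criterion scores * sqrt(1 - r_xy ** 2)."""
    return sd_criterion * math.sqrt(1 - validity ** 2)


def prediction_interval(predicted_score, see, z=1.96):
    """Interval around a predicted criterion score; z = 1.96 gives roughly 95%."""
    return predicted_score - z * see, predicted_score + z * see


# Hypothetical values: criterion SD = 10, validity coefficient = .60
see = standard_error_of_estimate(sd_criterion=10, validity=0.60)
low_y, high_y = prediction_interval(predicted_score=50, see=see)
```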

Question 24

Q

Base rate

Answer

A

(true positives + false negatives) / total number of people

Question 25

Q

Sensitivity

Answer

A

percent of people in the validation sample who have the disorder and were accurately identified by the predictor as having the disorder. (true positives / (true positives + false negatives))

Question 26

Q

specificity

Answer

A

percent of people in the validation sample who do not have the disorder and were accurately identified by the predictor as not having the disorder. (true negatives / (true negatives + false positives))
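
Base rate, sensitivity, and specificity can all be computed from the same four cell counts. The sketch below uses a hypothetical validation sample of 200 people; the function name and the counts are made up for illustration.

```python
def classification_stats(tp, fp, tn, fn):
    """Base rate, sensitivity, and specificity from the four cell counts."""
    total = tp + fp + tn + fn
    return {
        "base_rate": (tp + fn) / total,   # everyone who has the disorder
        "sensitivity": tp / (tp + fn),    # correctly identified as having it
        "specificity": tn / (tn + fp),    # correctly identified as not having it
    }


# Hypothetical counts: 50 people have the disorder, 150 do not
stats = classification_stats(tp=40, fp=20, tn=130, fn=10)
```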