Test Construction Flashcards
Standard Error of Measurement/CI
The SEM is used to construct a CI around a specific obtained test score. Its magnitude depends on the test's SD and reliability coefficient: SEM = SD x sqrt(1 - reliability).
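A minimal Python sketch of the SEM and a 95% CI. The SD of 15 and reliability of .89 are illustrative assumptions (IQ-style test), not values from the cards:

```python
import math

def sem(sd, reliability):
    """Standard error of measurement: SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)

def ci95(obtained_score, sd, reliability):
    """95% confidence interval: obtained score +/- 1.96 * SEM."""
    e = sem(sd, reliability)
    return (obtained_score - 1.96 * e, obtained_score + 1.96 * e)

# Hypothetical test: SD = 15, reliability = .89
low, high = ci95(100, 15, 0.89)
```

Note that as reliability approaches 1.0 the SEM shrinks toward 0, so a more reliable test yields a narrower CI around the same score.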
Criterion Contamination
Bias introduced into a criterion score when the rater's evaluation is influenced by knowledge of the person's performance on the predictor
Item Difficulty
Determined by dividing the # of people who answered the item correctly by the total #. Ranges from 0 (very difficult) to 1.0 (very easy); a difficulty (p) of .50 is generally preferred.
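The calculation is a single ratio; a sketch with hypothetical counts:

```python
def item_difficulty(num_correct, num_examinees):
    """p = proportion of examinees answering the item correctly."""
    return num_correct / num_examinees

# Hypothetical item: 25 of 50 examinees answered correctly
p = item_difficulty(25, 50)  # p = .5, the preferred difficulty level
```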
Reliability
Consistency of test scores over time, across forms, or across items. Methods include test-retest, coefficient alpha, interrater, split-half, and alternate forms. A reliability of .80 means 80% of observed score variability is TRUE score variability.
Cross-validation and shrinkage
Re-assess criterion-related validity on a new sample to see how generalizable the validity coefficient is. The coefficient shrinks because the "chance factors" operating in the original sample aren't present in the new one.
Standard Error of Estimate/CI
Index of error when predicting criterion scores. Used to make a CI around a predicted score. Magnitude depends on criterion’s SD and validity coefficient.
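The SEE parallels the SEM but uses the criterion's SD and the validity coefficient. A sketch with assumed illustrative values:

```python
import math

def see(sd_criterion, validity):
    """Standard error of estimate: SDy * sqrt(1 - validity^2)."""
    return sd_criterion * math.sqrt(1 - validity ** 2)

# Hypothetical criterion SD = 10, validity coefficient = .60
error = see(10, 0.6)  # magnitude of error around a predicted criterion score
```

When validity is 0, the SEE equals the criterion SD (prediction is no better than guessing the mean); when validity is 1.0, the SEE is 0.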
Classical Test Theory
Observed variability in test scores reflects: 1) true differences between examinees on the attribute, and 2) effects of random error.
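The decomposition can be shown with hypothetical variance components (the numbers below are illustrative assumptions):

```python
# Classical test theory: observed-score variance is the sum of
# true-score variance and random-error variance.
var_true = 80.0   # hypothetical true-score variance
var_error = 20.0  # hypothetical random-error variance
var_observed = var_true + var_error

# Reliability = proportion of observed variance that is true variance
reliability = var_true / var_observed  # .80 here
```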
Factor Analysis
Stat technique used to determine how many factors are needed to account for intercorrelations among a set of tests, subtests, or test items.
Construct Validity
Extent to which a test measures a hypothetical trait it is intended to measure.
Incremental Validity
Extent to which a predictor increases decision-making accuracy. Calculated by subtracting the base rate from the positive hit rate. (Linked to true and false positives/negatives and criterion cutoff scores.)
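The subtraction can be sketched directly; the counts below are hypothetical:

```python
def incremental_validity(true_pos, false_pos, base_rate):
    """Positive hit rate minus the base rate.
    Positive hit rate = true positives / all positive predictions."""
    positive_hit_rate = true_pos / (true_pos + false_pos)
    return positive_hit_rate - base_rate

# Hypothetical: 40 true positives, 10 false positives, base rate .50
gain = incremental_validity(40, 10, 0.5)  # predictor adds .30 accuracy
```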
Kappa stat
Chance-corrected index of agreement used to assess interrater reliability for categorical (nominal) ratings
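A sketch of Cohen's kappa for two raters making yes/no judgments, computed from the four cells of an agreement table (cell counts below are hypothetical):

```python
def cohens_kappa(both_yes, a_only, b_only, both_no):
    """Cohen's kappa: observed agreement corrected for chance agreement.
    a_only = rater A yes / rater B no; b_only = the reverse."""
    n = both_yes + a_only + b_only + both_no
    observed = (both_yes + both_no) / n
    # Chance agreement from each rater's marginal proportions
    expected = ((both_yes + a_only) * (both_yes + b_only)
                + (b_only + both_no) * (a_only + both_no)) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical table: 20 yes/yes, 5 + 10 disagreements, 15 no/no
k = cohens_kappa(20, 5, 10, 15)
```

Kappa of 1.0 means perfect agreement; 0 means agreement no better than chance.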
Split half/ Spearman-Brown
Split-half: split the test in half and correlate the two halves. Tends to underestimate reliability because each half is shorter than the full test.
Spearman-Brown: corrects the split-half coefficient by estimating what the reliability would be if the test were full length.
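The general prophecy formula, sketched below; factor = 2 converts a half-test correlation to an estimated full-length reliability:

```python
def spearman_brown(r_half, factor=2):
    """Projected reliability when test length is multiplied by `factor`.
    With factor=2, corrects a split-half correlation to full length."""
    return factor * r_half / (1 + (factor - 1) * r_half)

# Hypothetical split-half correlation of .60
r_full = spearman_brown(0.6)  # corrected (higher) full-length estimate
```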
Item discrimination
The extent to which an item differentiates between examinees who obtain high versus low scores on the test or on an external criterion. Ranges from -1.0 to +1.0; if everyone in the upper-scoring group and no one in the lower-scoring group answers the item correctly, the index is +1.0.
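The index (often called D) is the difference between the two groups' proportions correct; counts below are hypothetical:

```python
def item_discrimination(correct_upper, n_upper, correct_lower, n_lower):
    """D = proportion correct in upper group minus proportion in lower group."""
    return correct_upper / n_upper - correct_lower / n_lower

# Hypothetical item: all 10 upper-group and 0 of 10 lower-group correct
d = item_discrimination(10, 10, 0, 10)  # D = +1.0, maximum discrimination
```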
Coefficient Alpha/KR-20
Both are used to assess internal consistency reliability (inter-item consistency). KR-20 used for test items that are scored dichotomously.
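A hand-rolled KR-20 sketch (uses population variance and assumes every examinee answered every dichotomous item; the sample data in the usage note is hypothetical):

```python
def kr20(scores):
    """KR-20 for dichotomously scored items.
    scores: one list of 0/1 item scores per examinee."""
    n = len(scores)        # number of examinees
    k = len(scores[0])     # number of items
    totals = [sum(row) for row in scores]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    # Sum of item-level p*q, where p = proportion correct on each item
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in scores) / n
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / var_total)
```

Usage with a tiny hypothetical data set: `kr20([[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]])` returns the internal consistency estimate for those 3 items and 4 examinees.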
Sensitivity and Specificity
Sensitivity = % of people in the sample who have the disorder and were accurately identified as having it
Specificity = % of people who do not have the disorder and were accurately identified as NOT having it
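Both are ratios from a classification table; a sketch with hypothetical counts:

```python
def sensitivity(true_pos, false_neg):
    """Proportion of people WITH the disorder correctly identified."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Proportion of people WITHOUT the disorder correctly identified."""
    return true_neg / (true_neg + false_pos)

# Hypothetical sample: 45 of 50 cases flagged; 40 of 50 non-cases cleared
sens = sensitivity(45, 5)   # .90
spec = specificity(40, 10)  # .80
```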
Factor loadings and communality
Factor loading = correlation between a test (or other variable) and a factor; when squared, it gives the amount of variability in the test accounted for by that factor.
Communality = total variability in a test's scores accounted for by all of the identified factors (the sum of its squared loadings).
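The squaring-and-summing can be sketched directly (loadings below are hypothetical):

```python
def communality(loadings):
    """Sum of a variable's squared loadings across all factors."""
    return sum(l ** 2 for l in loadings)

# Hypothetical test loading .60 on Factor I and .80 on Factor II:
# Factor I alone explains .36 of the test's variance (.60 squared),
# and the two factors together explain the full communality.
h2 = communality([0.6, 0.8])
```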
Oblique and Orthogonal Rotation
Oblique= rotation produces correlated factors
Orthogonal= rotation produces uncorrelated factors
Rotation is done to simplify interpretation of factors
Content validity
Extent to which test adequately samples the domain of info or skill (expert judgment)
Test length/ range of scores
Increasing test length by adding items of similar content and quality increases reliability. Alternatively, increase the heterogeneity of the sample in terms of the attribute measured, which increases the range of scores (and an unrestricted range raises the reliability coefficient).
Relationship between reliability and validity
Reliability is a necessary but insufficient condition for validity!
Item characteristic curve
Constructed in item response theory for each item. Provides info on the relationship between an examinee's level on the ability or trait measured and the probability of responding correctly.