Study Guide 12: Criterion Related Validity Flashcards

1
Q

Area under the Curve (AUC)

A

Graph of true positive rate (sensitivity) vs. False positive rate (1-specificity). Interpreted as an estimate of the probability that a randomly chosen depressed person will have a higher depression score than a randomly chosen non-depressed person. AUC of 0.5 = line of no information, recommend: AUC ≥ 0.8.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Base rate

A

Proportion of current employees who are successful, but have not been selected using any test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Binomial effect size display (BESD)

A

a tool for reporting the magnitude of effect size
Concurrent evidence: validity evidence demonstrating the degree to which criterion and test that are measured at the same time correlate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Criterion (or criterion variable):

A

a measure of some attribute or outcome that is of primary interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Criterion-related evidence

A

validity evidence demonstrating the correlation between performance on the test with performance on relevant criterion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Negative predictive value (NPV):

A

TN (TN + TP) - % of people who are truly not depressed (according to the criterion) out of those that the scale identified as non-depressed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Positive predictive value (PPV)

A

TP/(FP + TP) - % of individuals who are truly depressed (according to the criterion) of those that the scale identified as depressed

  • influenced by prevalence of X in population
  • As prevalence , PPV
  • small differences in  specificity levels can strongly influence  PPVs
  • changes in sensitivity have little impact on PPV
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Predictive evidence:

A

validity evidence demonstrating the degree to which criterion and test that are measured at different times correlate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Prevalence:

A

the proportion of the population or sample found to have the condition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Receiver operating characteristic (ROC) curve

A

graph sensitivity and specificity fro all of the possible scores of a scale by plotting true positives (sensitivity) versus false positives (1 – specificity).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Selection ratio:

A

of applicants hired/# of applicants selected. All else being equal, the smaller the selection ratio, the more useful the test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Sensitivity:

A

TP/(FN + TP). The proportion of actual positives which are correctly identified as such. How sensitive a measure is at detecting the condition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Specificity:

A

TN/(TN + FP). The proportion of actual negatives which are correctly identified. How specific a measure is at ruling out those without the condition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Standard error of estima

A

an estimate of the amount of error to be expected in the predicted criterion score, speaks to the validity of the measure. Whereas, SEM speaks to the reliability of the measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Taylor-Russell tables

A

table demonstrating the connection between the validity of a test and the likelihood of the test resulting in a successful selection of a candidate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Utility analysis

A

frames validity in terms of a cost versus benefit analysis of test use. Is a test worth using? Does it outweigh the costs?

17
Q

Validity generalization

A

Type of meta-analysis that seeks to evaluate validity coefficients across a group of studies. Can reveal a) average level of validity coefficient found across a number of studies, b) degree of variability among smaller studies, and c) source of variability across smaller studies.