Chapter 5: Validity Flashcards

1
Q

Validity

A

Appropriateness and accuracy of the interpretation of test scores
Can’t be established from the test alone (needs a comparison basis)

2
Q

Construct underrepresentation

A

Test doesn’t measure important aspects of the specified construct
Similar to content sampling error

3
Q

Construct-irrelevant variance

A

Test measures features that are unrelated to the specified construct

4
Q

External threats to validity

A

Examinee characteristics (e.g., anxiety, which hinders the examinee)
Deviation from standard test administration and scoring
Instruction and coaching
Standardization sample isn’t representative of population taking test

5
Q

3 types of validity

A

Content validity
Criterion-related validity
Construct validity

6
Q

Content validity

A

Degree to which the items on the test are representative of the behavior the test was designed to sample

7
Q

How content validity is determined

A

Expert judges systematically review the test content

Evaluate item relevance and content coverage

8
Q

Criterion-related validity

A

Degree to which the test is effective in estimating performance on an outcome measure

9
Q

Criterion

A

Comparison basis for a test

10
Q

Predictive validity

A

Test is given first; the criterion is measured after a time interval

Example: ACT and college performance
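
As a rough illustration, a test-criterion relationship like this is often summarized as a correlation between test scores and criterion scores (a validity coefficient). The sketch below assumes made-up admission-test scores and first-year GPAs; the data and the helper name pearson_r are hypothetical.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical data: test scores now, GPA measured a year later.
test_scores = [18, 21, 24, 27, 30, 33]
criterion_gpa = [2.1, 2.6, 2.4, 3.0, 3.4, 3.5]

validity_coefficient = pearson_r(test_scores, criterion_gpa)
print(f"validity coefficient r = {validity_coefficient:.2f}")
```

A coefficient near 1 would suggest the test tracks the criterion closely; a coefficient near 0 would suggest it does not.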

11
Q

Concurrent validity

A

Test and criterion are measured at same time

Example: language test and GPA

12
Q

Considerations in test-criterion studies

A

Selecting a criterion
Criterion contamination (knowledge of test scores influences the criterion measure)
Decision-theory models (account for the circumstances surrounding the decision, such as base rates and selection ratios)
Validity generalization (does the test-criterion relationship hold in other settings and samples?)

13
Q

Sensitive test

A

Identifies nearly everyone who has the characteristic of interest (few false negatives), but produces lots of false positives

14
Q

Specific test

A

Rarely flags people who lack the characteristic of interest (few false positives), but produces lots of false negatives
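
The trade-off between sensitive and specific tests can be sketched with the standard sensitivity and specificity ratios. The counts below are invented for illustration, and the function names are hypothetical.

```python
def sensitivity(true_pos, false_neg):
    """Share of people who have the condition that the test flags."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Share of people without the condition that the test clears."""
    return true_neg / (true_neg + false_pos)

# Hypothetical screening results for 100 examinees:
# 20 actually have the condition, 80 do not.
tp, fn = 18, 2    # condition present: flagged / missed
tn, fp = 60, 20   # condition absent: cleared / wrongly flagged

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

In this made-up case the test misses few true cases but wrongly flags a quarter of the people without the condition, which is the pattern described for a sensitive test.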

15
Q

Construct validity

A

Degree to which test measures what it is designed to measure

16
Q

Determining convergent validity

A

Correlate test scores with tests of same or similar construct

17
Q

Determining discriminant validity

A

Correlate test scores with tests of dissimilar construct
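
A minimal sketch of both checks, assuming a hypothetical new anxiety scale compared against an established anxiety scale (similar construct) and a vocabulary test (dissimilar construct). All scores are invented, and pearson_r is a hand-written helper.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical scores for the same six examinees.
new_anxiety_scale = [10, 14, 18, 22, 26, 30]
established_anxiety_scale = [12, 15, 17, 24, 25, 31]  # similar construct
vocabulary_test = [40, 35, 42, 38, 41, 37]            # dissimilar construct

convergent_r = pearson_r(new_anxiety_scale, established_anxiety_scale)
discriminant_r = pearson_r(new_anxiety_scale, vocabulary_test)
print(f"convergent r = {convergent_r:.2f}")
print(f"discriminant r = {discriminant_r:.2f}")
```

Convergent evidence would show up as a high correlation with the similar measure, and discriminant evidence as a correlation near zero with the dissimilar one.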

18
Q

Multitrait-multimethod approach to determining construct validity

A

Use multiple measures for same constructs to check for convergence as well as measures for other constructs to check for divergence

19
Q

Contrasted group study approach to determining construct validity

A

Identify 2 groups expected to differ on the construct: administer the test to both and look for differences between them
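
One way to look for such a difference is to compare group means and a standardized effect size. The sketch below uses Cohen's d with a pooled standard deviation, which is an assumption about how the comparison might be made; the groups and scores are invented.

```python
import math
import statistics

def cohens_d(group_a, group_b):
    """Standardized mean difference using the pooled sample SD."""
    n_a, n_b = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)  # sample variance (n - 1)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(
        ((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2)
    )
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd

# Hypothetical scores on a depression scale.
clinical_group = [28, 31, 35, 30, 33]  # expected to score high
control_group = [15, 18, 20, 17, 22]   # expected to score low

mean_diff = statistics.mean(clinical_group) - statistics.mean(control_group)
effect_size = cohens_d(clinical_group, control_group)
print(f"mean difference = {mean_diff:.1f}, Cohen's d = {effect_size:.2f}")
```

A large difference in the expected direction would count as supporting evidence; little or no difference would count against the test.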

20
Q

Factor analysis

A

Used to determine if the test is measuring factors related to the given construct
Produces factor loadings (similar to correlation coefficients): each variable should have a high loading on only 1 factor

21
Q

Evidence based on response processes

A

Is the manner of responses consistent with the construct being assessed?

22
Q

Evidence based on consequences of testing

A

If the test is thought to result in benefits, are those benefits being achieved?

23
Q

Incremental validity

A

Determines if the test improves prediction beyond what another test already provides
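
One common way to quantify this gain, offered here as an assumption rather than the only approach, is the increase in R squared when the new test is added to an existing predictor. The sketch uses the standard two-predictor formula for R squared from pairwise correlations; all data and names are hypothetical.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def r_squared_two_predictors(r_y1, r_y2, r_12):
    """R^2 for a criterion regressed on two predictors, from correlations."""
    return (r_y1**2 + r_y2**2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12**2)

# Hypothetical scores for eight examinees.
existing_test = [50, 55, 60, 65, 70, 75, 80, 85]
new_test = [3, 7, 4, 8, 5, 9, 6, 10]
criterion = [2.0, 2.6, 2.5, 3.1, 2.9, 3.6, 3.3, 3.9]

r_y1 = pearson_r(criterion, existing_test)
r_y2 = pearson_r(criterion, new_test)
r_12 = pearson_r(existing_test, new_test)

r2_existing = r_y1**2  # existing test alone
r2_both = r_squared_two_predictors(r_y1, r_y2, r_12)
incremental_gain = r2_both - r2_existing
print(f"R^2 existing = {r2_existing:.3f}, R^2 both = {r2_both:.3f}")
print(f"incremental gain = {incremental_gain:.3f}")
```

A gain close to zero would suggest the new test adds little beyond the existing one.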

24
Q

Face validity

A

Determines if the test appears to measure what it is designed to measure
Not a true form of validity
Problem with tests high in face validity: examinees can fake their responses

25
Q

Internal vs. external validity

A

Internal: Does the measure work under ideal (controlled) conditions?
External: Does it still work in real-world settings?