Lecture 10: More on Validity Flashcards
what is the most important evidence of validity?
associations with other variables
validity
the degree to which evidence and theory support the interpretations of test scores for proposed uses of a test; , all the content and all of the theory we think we’re measuring
What is associations with other variables? what is evidence of this?
does the test’s actual associations with other measures match the associations that it should have with those measures? We do this by assessing: convergence evidence (concurrent/predictive) & discriminant evidence
What are key issues in examining and interpreting convergent and discriminant evidence?
- Methods for evaluating “associations with other variables” 2. Factors affecting observed associations 3. Interpreting associations)
Methods for evaluating “associations with other variables”
- Focused examinations (few criterion variables) 2. Unsystematic examinations of sets of correlations 3. Multi-trait multi-method Matrix (MTMM) 4. Systematic examination of sets of correlations—“Quantifying Construct Validity”
Focused Examination
one (or very few) criterion has strong relevance for the implications/meaning of test scores (one or view criterion variables that have an association with the construct; SAT scores predict GPA); validity generalization: process of evaluating a test’s validity over a large set of studies (meta analysis idea)
validity generalization
process of evaluating a test’s validity over a large set of studies (meta analysis idea)
Unsystematic Examination of Sets of Correlations
several criterion variables (other measures) are examined; probably the most common; “eyeball” the pattern of correlations and draw conclusions regarding convergent and discriminant validity
Multitrait-Multimethod Matrix
several criterion variables (other measures) are examined; more systematically evaluated the pattern of correlations and draw conclusions regarding convergent and discriminant validity; not as common, but golden child of evidence
Multitrait-Multimethod Matrix steps
- Measure multiple traits/constructs 2. Use multiple methods of measurement 3. Use each method to measure each construct 4. Compute all correlations and evaluate their pattern); example: validity of measure adult playfulness: constructs 1. Adult playfulness 2. Spontaneity 3. Boredom multiple methods for assessing each construct 1. Self-report 2. Spouse report 3. Observation
What are the strongest MTMMM correlation? (ideally)
ideally mono-trait hetero-method correlations are stronger than hetero-trait correlations
Quantifying Construct Validity
: several criterion variables are examined; predict correlations to other construct ahead of time and compare to predictions (pretty new, Dr. Davis doesn’t even mention it); systematically evaluate the pattern of correlations and draw conclusions regarding convergent and discriminant validity
How to do Quantifying Construct Validity
- Make concrete expectations for pattern of correlations 2. Quantify match between expected pattern and actual pattern of correlations (How close is the match? Is there a significant degree of match?) **don’t need to know how to do it
key issues in examining and interpreting convergent and discriminant evidence
methods for evaluating “associations with other variables”; factors affecting observed associations (why would a validity correlation be high or low); interpreting associations: gauging the size/ meaningfulness of a validity coefficient
Factors Affecting Observed Associations
- Associations between constructs: for example, true score correlation from CTT 2. (Im)precision of measurement (essentially the reliably could impact validity measurement): for example, measurement error / reliability from CTT 3. Restricted Range: significant impacts the correlation of the validity coefficient 4. Relative proportions of differential skew (skewed distributions impact correlations, especially if the skews of two variables are different it results in a lower correlation); if your samples of your measured construct are not normally distributed, it will impact the correlation 5. Method variance: generally, variables measured by different methods are less strongly correlated than other variables measured by the same method 6. Time: generally, variables measured at different times are less strongly correlated than variables measured at the same time 7. Predictions of single event: generally, behavior/outcomes assessed at a single occasion are less predictable than behaviors/outcomes that are aggregated across occasions