Chapter 6: Validity Flashcards
Judgment or estimate of how well a test measures what it
purports to measure in a particular context.
Judgment based on evidence about the appropriateness of inferences drawn from test scores
Validity
Logical result or deduction
Inference
May yield insights
regarding a particular population of testtakers as compared to the norming sample described in a test manual.
Absolutely necessary when the test user plans to alter in some way the format, instructions, language, or content of the test.
Ex: a local
validation study would be necessary if the test user sought
to transform a nationally standardized test into Braille for
administration to blind and visually impaired testtakers.
Local validation studies
3 categories of validity
Content validity
Criterion-related validity
Construct validity
Based on an evaluation of the subjects, topics, or content covered by the items in the test.
Ex: Driver’s license exam
A test that covers all relevant topics about traffic rules and excludes irrelevant topics
Content validity
Obtained by evaluating the
relationship of scores obtained on the test to scores on other tests or measures
Ex: Compare a student’s college entrance exam score to their first-semester GPA
Criterion-related validity
Arrived at by executing a comprehensive
analysis of:
a. how scores on the test relate to other test scores and measures, and
b. how scores on the test can be understood within some theoretical framework
Ex: An interviewer can use a structured interview to assess an applicant’s competencies and ensure they are a good fit for the role.
Construct validity
Classic conception of validity which visualizes construct validity as being “umbrella validity” because every other variety of validity falls under it.
Trinitarian view
Different ways of approaching the process of test validation
Strategies
Concept which states that all validation is singly focused on providing evidence to support the interpretation or the inference.
Unitary view of validity
Refers to the in-the-moment and in-the-place evaluation of targeted variables (such as behaviors, cognitions, and emotions) in a natural, naturalistic, or real-life context.
Ex: Assessing physical activity, mood, and stress, or assessing alcohol us
Ecological momentary assessment (EMA)
How well findings from a test or study can be generalized to real-life situations and settings outside the controlled environment
Ex: Assessing driving skills using a real driving course rather than a simulated computer game.
Ecological validity
Relates more to what a test appears to measure to the person being tested than to what the test actually measures.
Ex: A test measuring anxiety that includes questions about sweating, trembling, and feeling worried, demonstrates high face validity because these items are directly related to the construct of anxiety.
Face validity
“structure” of the evaluation
Plan regarding the types of information to be covered by the items, the number of items tapping each area of coverage, the organization of the items in the test, and so forth
Test blueprint
2 types of criterion-related validity
Concurrent validity
Predictive validity
Index of the degree to which a test score is related to some criterion measure obtained at the same time.
Ex: therapist may use two separate depression scales with a patient to confirm a diagnosis
Concurrent validity
Index of the degree to which a test score predicts some criterion measure.
Ex: College entrance tests are often used to predict college succes
Predictive validity
3 characteristics of criterion
Relevant
Valid
Uncontaminated
Occurs when the criterion measure includes aspects unrelated to the intended construct, leading to inaccurate or misleading data
Ex: A manager’s evaluation of an employee’s work performance is influenced by how much they like or dislike that employee personally, leading to unfair assessments.
Criterion contamination
Extent to which a particular trait, behavior, characteristic, or attribute exists in the population
Ex: base rate of depression in a college student population
Base rate
Proportion of people a test accurately identifies as
possessing or exhibiting a particular trait, behavior, characteristic, or attribute.
Ex: proportion of neurological patients accurately identified
as having a brain tumor
Hit rate
Proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute.
Miss rate
2 subtypes of miss rate
False positive
False Negative
A miss wherein the test predicted that the testtaker did possess the particular
characteristic or attribute being measured when in fact the testtaker did not.
Ex: A psychological test designed to screen for depression incorrectly flags a person as depressed when they are not experiencing symptoms of depression.
False positive