Ch. 6 - Validity Flashcards
base rate
an index, usually expressed as a proportion, of the extent to which a particular trait, behavior, characteristic, or attribute exists in a population
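As a rough illustration, a base rate can be estimated as the share of cases in a sample that show the attribute. The function name `base_rate` and the sample values are hypothetical, chosen only for this sketch.

```python
def base_rate(has_attribute):
    """Return the proportion of cases in which the attribute is present."""
    if not has_attribute:
        raise ValueError("need at least one case")
    return sum(1 for present in has_attribute if present) / len(has_attribute)


# Hypothetical sample of ten people: True means the attribute is present.
sample = [True, False, False, True, False, False, False, False, False, False]
rate = base_rate(sample)  # 2 of the 10 cases show the attribute
print(f"base rate = {rate:.2f}")
```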
bias
as applied to tests, a factor inherent within a test that systematically prevents accurate, impartial measurement
central tendency error
a type of rating error wherein the rater exhibits a general reluctance to issue ratings at either the positive or negative extreme and so all or most ratings cluster in the middle of the rating continuum
concurrent validity
a form of criterion-related validity that is an index of the degree to which a test score is related to some criterion measure obtained at the same time
confirmatory factor analysis
a class of mathematical procedures employed when a factor structure that has been explicitly hypothesized is tested for its fit with the observed relationships between the variables
construct
an informed, scientific idea developed or generated to describe or explain behaviors…
e.g., intelligence, personality, anxiety, job satisfaction
construct validity
a judgement about the appropriateness of inferences drawn from test scores regarding individual standings on a variable called a construct
content validity
describes a judgement of how adequately a test samples behavior representative of the universe of behavior that the test was designed to sample
convergent evidence
with reference to construct validity…
data from other measurement instruments designed to measure the same or a similar construct as the test being construct-validated and that all point to the same judgement or conclusion with regard to a test or other tool of measurement
contrast with discriminant evidence
convergent validity
criterion
the standard against which a test or a test score is evaluated
may take many forms, including a specific behavior or set of behaviors
criterion contamination
a state in which a criterion measure is itself based, in whole or part, on a predictor measure
criterion-related validity
a judgement regarding how adequately a score or index on a test or other tool of measurement can be used to infer an individual’s most probable standing on some measure of interest
discriminant evidence
with reference to construct validity…
data from a test or other measurement instrument showing little relationship between test scores and other variables with which the scores on the test being construct-validated should not theoretically be correlated
contrast with convergent evidence
expectancy chart
expectancy data
exploratory factor analysis
a class of mathematical procedures employed to estimate factors, extract factors, or decide how many factors to retain
face validity
a judgement regarding how well a test or other tool of measurement measures what it purports to measure that is based solely on “appearances”, such as the content of the test’s items
factor analysis
a class of mathematical procedures, frequently employed as data reduction methods, designed to identify factors (the variables on which people may differ)
factor loading
in factor analysis, a metaphor suggesting that a test (or an individual test item) carries with it or “loads” on a certain amount of one or more abilities that, in turn, have a determining influence on the test score (or on the response to the individual test item)
fairness
as applied to tests, the extent to which a test is used in an impartial, just, and equitable way
false negative
a specific type of miss characterized by a tool of assessment indicating that the test-taker does not possess or exhibit a particular trait, ability, behavior, or attribute when in fact, the test-taker DOES possess or exhibit a particular trait, ability, behavior, or attribute
false positive
an error in measurement characterized by a tool of assessment indicating that the test-taker possesses or exhibits a particular trait, ability, behavior, or attribute when in fact the test-taker does not
generosity error
aka leniency error
a less than accurate rating or evaluation by a rater due to that rater’s general tendency to be lenient or insufficiently critical
contrast with severity error
halo effect
a type of rating error wherein the rater views the object of the rating with extreme favor and tends to bestow ratings inflated in a positive direction
a set of circumstances resulting in a rater’s tendency to be positively disposed and insufficiently critical
hit rate
the proportion of people who are accurately identified as possessing or not possessing a particular trait, ability, behavior, or attribute based on test scores
homogeneity
describes the degree to which a test measures a single trait
incremental validity
used in conjunction with predictive validity, an index of the explanatory power of additional predictors over and above the predictors already in use
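One common way to put a number on incremental validity is the gain in R-squared when a second predictor is added to the one already in use. The sketch below assumes exactly two predictors and uses the standard two-predictor formula for R-squared; the function names and the aptitude, interview, and performance figures are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def r_squared_two_predictors(x1, x2, y):
    """R-squared for y regressed on x1 and x2 (x1 and x2 must not be collinear)."""
    r_y1, r_y2, r_12 = pearson_r(x1, y), pearson_r(x2, y), pearson_r(x1, x2)
    return (r_y1**2 + r_y2**2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12**2)


def incremental_validity(existing, added, criterion):
    """Gain in R-squared from adding `added` to the predictor already in use."""
    baseline = pearson_r(existing, criterion) ** 2
    return r_squared_two_predictors(existing, added, criterion) - baseline


# Hypothetical data for eight job applicants.
aptitude = [52, 61, 47, 70, 66, 58, 75, 49]              # predictor already in use
interview = [3, 4, 2, 3, 5, 4, 4, 2]                     # candidate additional predictor
performance = [2.9, 3.6, 2.4, 3.8, 4.3, 3.5, 4.2, 2.5]   # criterion

gain = incremental_validity(aptitude, interview, performance)
print(f"R-squared gain from adding the interview: {gain:.3f}")
```

A gain near zero would suggest the added predictor explains little beyond what the existing one already explains.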
inference
a logical result or deduction in a reasoning process
intercept bias
occurs when the use of a predictor term results in a consistent under-prediction or over-prediction of a specific group’s performance or outcomes
leniency error
aka generosity error
local validation study
the process of gathering evidence, relevant to how well a test measures what it purports to measure, for the purpose of evaluating the validity of a test or other measurement tool
typically undertaken in conjunction with a population different from the population for whom the test was originally validated
method of contrasted groups
aka known groups method
a system of collecting data on a predictor of interest from groups known to possess (and not possess) a trait, attribute, or ability of interest
miss rate
the proportion of people a test or other measurement procedure fails to identify accurately with respect to the possession or exhibition of a particular trait, ability, behavior, or attribute
a “miss” in this context is an inaccurate classification or prediction; misses may be subdivided into false positives and false negatives
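The hit rate, miss rate, false positive, and false negative cards fit together as one tally. The sketch below counts each outcome from paired test decisions and true statuses; the function name `classification_rates` and the ten cases are hypothetical. Every value is expressed as a proportion of all test-takers, so the false positive and false negative proportions add up to the miss rate.

```python
def classification_rates(predicted, actual):
    """Tally hits and misses from paired test decisions and true statuses."""
    pairs = list(zip(predicted, actual))
    n = len(pairs)
    true_pos = sum(1 for p, a in pairs if p and a)
    true_neg = sum(1 for p, a in pairs if not p and not a)
    false_pos = sum(1 for p, a in pairs if p and not a)   # test says yes, truth is no
    false_neg = sum(1 for p, a in pairs if not p and a)   # test says no, truth is yes
    return {
        "hit_rate": (true_pos + true_neg) / n,
        "miss_rate": (false_pos + false_neg) / n,
        "false_positive_proportion": false_pos / n,
        "false_negative_proportion": false_neg / n,
    }


# Hypothetical decisions for ten test-takers (True = has the attribute).
predicted = [True, True, False, False, True, False, True, False, False, False]
actual = [True, False, False, False, True, True, True, False, False, False]

rates = classification_rates(predicted, actual)
for name, value in rates.items():
    print(f"{name}: {value:.2f}")
```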
multitrait-multimethod matrix
a method of evaluating construct validity by simultaneously examining both convergent and discriminant (divergent) evidence by means of a table of correlations between traits and methods
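A small sketch of such a table, assuming two traits each measured by two methods. The trait names, method names, and scores are all hypothetical. Pairs that share a trait but differ in method are labeled as convergent evidence, and pairs that differ in trait are labeled as discriminant evidence.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for six people, keyed by (trait, method).
measures = {
    ("anxiety", "self_report"): [10, 14, 8, 18, 12, 16],
    ("anxiety", "peer_rating"): [11, 13, 9, 17, 12, 15],
    ("extraversion", "self_report"): [15, 20, 12, 13, 18, 12],
    ("extraversion", "peer_rating"): [14, 19, 13, 12, 18, 13],
}


def mtmm_table(measures):
    """Correlate every pair of measures and label the kind of evidence."""
    rows = []
    for (key_a, xs), (key_b, ys) in combinations(measures.items(), 2):
        # Same trait (necessarily a different method here) gives convergent evidence.
        kind = "convergent" if key_a[0] == key_b[0] else "discriminant"
        rows.append((key_a, key_b, kind, pearson_r(xs, ys)))
    return rows


table = mtmm_table(measures)
for key_a, key_b, kind, r in table:
    print(f"{key_a} vs {key_b}: {kind:12s} r = {r:+.2f}")
```

In this made-up data the same-trait correlations are high and the different-trait correlations are low, which is the pattern that would support construct validity.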
predictive validity
a form of criterion-related validity that is an index of the degree to which a test score predicts some criterion measure
ranking
the ordinal ordering of persons, scores, or variables into relative positions or degrees of value
rating
a numerical or verbal judgement that places a person or attribute along a continuum identified by a scale of numerical or word descriptors called a rating scale
rating error
a judgment that results from the intentional or unintentional misuse of a rating scale
two types of rating error are leniency (aka generosity) error and severity error
rating scale
a system of ordered numerical or verbal descriptors on which judgments about the presence/absence or magnitude of a particular trait, ability, behavior, attribute, emotion, or other variable are indicated by raters, judges, examiners, or (when the rating scale reflects self-reports) the assessee
severity error
a less than accurate rating or error in evaluation due to the rater’s tendency to be overly critical
contrast with generosity error
slope bias
occurs when a predictor has a weaker correlation with an outcome for specific groups
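Slope bias and intercept bias can both be checked informally by fitting a regression line per group and comparing it with a line fitted to everyone. The sketch below assumes a single predictor and two groups; the function names and the scores are hypothetical. In this made-up data the two groups share a slope but differ in intercept, so the pooled line over-predicts one group and under-predicts the other, which is the intercept bias pattern. Differing group slopes would instead point toward slope bias.

```python
def fit_line(x, y):
    """Ordinary least-squares slope and intercept for one predictor."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_residual(x, y, slope, intercept):
    """Average of (actual - predicted); negative means over-prediction."""
    return sum(b - (intercept + slope * a) for a, b in zip(x, y)) / len(x)


# Hypothetical test scores (x) and criterion scores (y) for two groups.
x_a, y_a = [40, 50, 60, 70, 80], [2.0, 2.5, 3.0, 3.5, 4.0]
x_b, y_b = [40, 50, 60, 70, 80], [2.5, 3.0, 3.5, 4.0, 4.5]

slope_a, intercept_a = fit_line(x_a, y_a)
slope_b, intercept_b = fit_line(x_b, y_b)
pooled_slope, pooled_intercept = fit_line(x_a + x_b, y_a + y_b)

# Residuals from the pooled line show which group is mis-predicted.
resid_a = mean_residual(x_a, y_a, pooled_slope, pooled_intercept)
resid_b = mean_residual(x_b, y_b, pooled_slope, pooled_intercept)

print(f"slope gap (B - A): {slope_b - slope_a:+.3f}")
print(f"intercept gap (B - A): {intercept_b - intercept_a:+.3f}")
print(f"mean residual, group A: {resid_a:+.3f}; group B: {resid_b:+.3f}")
```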
test blueprint
a detailed plan of the content, organization, and quantity of the items that a test will contain
validation
the process of gathering and evaluating validity evidence
validation study
validity
a general term referring to a judgment regarding how well a test or other measurement tool measures what it purports to measure
this judgment has important implications regarding the appropriateness of inferences made and actions taken on the basis of measurements
validity coefficient
a correlation coefficient that provides a measure of the relationship between test scores and scores on a criterion measure
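A minimal sketch of a validity coefficient, assuming the Pearson correlation is the coefficient used (other correlation coefficients may suit other kinds of data). The function name and the scores are hypothetical. If the criterion is collected at about the same time as the test, the result bears on concurrent validity; if it is collected later, it bears on predictive validity.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical test scores and criterion scores for eight people.
test_scores = [52, 61, 47, 70, 66, 58, 75, 49]
criterion_scores = [2.9, 3.6, 2.4, 3.8, 4.3, 3.5, 4.2, 2.5]

validity_coefficient = pearson_r(test_scores, criterion_scores)
print(f"validity coefficient r = {validity_coefficient:.2f}")
```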