UNIT 5 NEW Flashcards
Could also be used to determine the number of items needed to attain a desired level of reliability that has the desired reliability
Spearman–Brown formula
sources of error variance of alternate forms
test construction or administration
If interested in looking in the truth independent of measurement, psychologists look for the
construct score
It provides an estimate of the amount of error inherent in an observed score or measurement
Standard Error of Measurement (SEM)
sources of error variance of test-retest
administration
A statistical measure that can aid a test user in determining how large a difference should be before it is considered statistically significant
standard error of difference
a reference to an IRT model with specific assumptions about the underlying distribution
rasch model
The procedures of this provide a way to model the probability that a person with X ability will be able to perform at a level of Y
Item Response Theory (IRT)
obtained by correlating two pairs of scores obtained from equivalent halves of a single test administered once
split-half reliability
its use is typically to evaluate the homogeneity of a measure (or, all items are tapping in a single construct)
internal consistency
If test developers or users wish to shorten a test, the __ may be used to estimate the effect of the shortening on the test’s reliability
Spearman–Brown formula
It is a useful measure of reliability when it is impractical or undesirable to assess reliability with two tests or to administer a test twice (because of factors such as time or expense)
split-half reliability
A __ of behavior, or the universe of items that could conceivably measure that behavior, can be thought of as a hypothetical construct
domain
exist when, for each form of the test, the means and the variances of observed test scores are equal
parallel forms
consists of unpredictable fluctuations and inconsistencies of other variables in the measurement process;
sometimes referred to as “noise”;
random error
trait, state, or ability presumed to be ever-changing as a function of situational and cognitive experiences
dynamic characteristic
a statistic that quantifies reliability, ranging from 0 (not at all reliable) to 1 (perfectly reliable)
reliability coefficient
In general, a primary objective in splitting a test in half for the purpose of obtaining a split-half reliability estimate is to create what might be called __
mini- parallel-forms
tied to the measurement instrument used
true score
often used when coding nonverbal behavior
inter-scorer reliability
The influence of particular facets on the test score is represented by _
coefficients of generalizability
to evaluate the relationship between different forms of a measure
alternate forms
terms that refer to variation among items within a test as well as to variation among items between tests
item sampling or content sampling
the inherent uncertainty associated with any measurement, even after care has been taken to minimize preventable mistake
measurement of error