Chapter 5: Reliability Flashcards
_____ is a synonym for dependability or consistency.
Reliability
A _____ is an index of reliability, a proportion that indicates the ratio between the true score variance on a test and the total variance.
reliability coefficient
Recall from our discussion of _____ that a score on an ability test is presumed to reflect not only the testtaker’s true score on the ability being measured but also error.
classical test theory
A statistic useful in describing sources of test score variability is the _____(σ2)—the standard deviation squared.
variance
Variance from true differences is true variance, and variance from irrelevant, random sources is _____.
error variance
The term _____ refers to the proportion of the total variance attributed to true variance. The greater the proportion of the total variance attributed to true variance, the more _____ the test.
reliability/reliable
1) Test construction
2) Test administration
3) Test scoring and interpretation
4) Other sources of error: Underreport and Overreport
Sources of Error Variance (4)
Sources of Error Variance:
One source of variance during test construction is item sampling or content sampling, terms that refer to variation among items within a test as well as to variation among items between tests.
Test construction
Sources of Error Variance:
test environment: the room temperature, the level of lighting, and the amount of ventilation and noise, for instance.
testtaker variables. Pressing emotional problems, physical discomfort, lack of sleep, and the effects of drugs or medication can all be sources of error variance.
Examiner-related variables: physical appearance, demeanor, presence, absence, oral exam emphasizing key words, nonverbal cues when correctness
Test administration
Sources of Error Variance:
The advent of computer scoring and a growing reliance on objective, computer-scorable items virtually have eliminated error variance caused by scorer differences in many tests. If subjectivity is involved in scoring, then the scorer (or rater) can be a source of error variance.
Test scoring and interpretation
Reliability Estimates (4)
1) Test-Retest Reliability Estimates
2) Parallel-Forms and Alternate-Forms Reliability Estimates
3) Split-Half Reliability Estimates
4) Other Methods of Estimating Internal Consistency:
a) Inter-item consistency
b) The Kuder-Richardson formulas
c) Coefficient alpha
Reliability Estimates:
using the same instrument to measure the same thing at two points in time.
is an estimate of reliability obtained by correlating pairs of scores from the same people on two different administrations of the same test.
**The passage of time can be a source of error variance. The longer the time that passes, the greater the likelihood that the reliability coefficient will be lower.
**even when the time period between the two administrations of the test is relatively small, various factors (such as experience, practice, memory, fatigue, and motivation) may intervene and confound an obtained measure of reliability
Test-Retest Reliability Estimates
Reliability Estimates:
it is referred to as an internal consistency estimate of reliability or as an estimate of inter-item consistency.
Ex. Both groups take both tests: group A takes test A first, and group B takes test B first. The results of the two tests are compared, and the results are almost identical, indicating high parallel forms reliability.
Put simply, you’re trying to find out if test A measures the same thing as test B.
source of error variance: item sampling
cons: time-consuming and expensive.
Parallel-Forms and Alternate-Forms Reliability Estimates
Reliability Estimates:
is obtained by correlating two pairs of scores obtained from
equivalent halves of a single test administered once.
One acceptable way to _____ is to randomly assign items to one or the other half of the test.
odd-even reliability
**The Spearman-Brown formula
Split-Half Reliability Estimates
Reliability Estimates: Other Methods of Estimating Internal Consistency
refers to the degree of correlation among all the items on a scale. A measure of inter-item consistency is calculated from a single administration of a single form of a test. An index of interitem consistency, in turn, is useful in assessing the homogeneity of the test.
Tests are said to be homogeneous if they contain items that measure a single trait.
The more homogeneous a test is, the more _____ it can be expected to have.
Inter-item consistency