Lecture 4: Essential of Reliability Flashcards
Reliability
- suggest trustworthiness
- quality of test that suggest they are sufficiently consistent and free from measurement error
- consistency and precision of the results of the measurement process
Measurement error
any fluctuation in scores that results from factors related to the measurement process that are irrelevant to what is being measured
- reliable scores should be free of measurement error
treu score
- hypothetical entities that would result from error-free measurement
- goal of reliability analysis: to estimate true scores
Individual’s true score
the average score in a hypothetical distribution of scores that would be obtained if the individual took the same test an infinite number of times
observed score
derived from tests (= scores that the individuals actually obtain)
any observed score (X0) is made up of two components
- the true score component
- the error score component
True score component (Xtrue)
is construed to be that portion of observed score that reflects whatever ability, trait, or characteristic the test assesses
error score component (Xerror)
difference between the observed score and the true score
- any other factor that may enter into the observed score as a conseqeunce of the measurement process
sample variance (true scores in group data)
the average amount of variability in a group of scores
sample variance consist of (two components)
- a portion that is true variance
- a portion that us error variance
True variance
differences among the scores of individuals within a group that reflect their standing or position in whatever characteristic the test assesses
error variance
differences among test scores that reflect factors irrelevant to what the test assesses
- reliability scores increases as the error component decreases
Reliability coefficient (reliability)
defined as the ration of true score variance to total test score variance
- if test score variance = true variance (reliability = 1)
Two-step process (Evaluation of reliability)
- What are possible sources of error?
2. What is the magnitude of those errors?
The relativity of reliability
- tests cannot be reliable, test scores are reliable!!
- score might be unreliable (due to test taker, testing situation)
3 sources of error which can enter the test score
- Context in which tésting takes place
- test taker
- specific characteristics of the test itself
Random measurement error vs. systematic measurement error
- some of the errors can be minimized (due to proper testing practice etc.)
- other cannot be eliminated but may be detected by variozs types of checks built into the test
Sources of error
- Interscorer difference
- Time sampling error
- Content sampling error
- Interim inconsistency
- Interim inconsistency and content homogeneity
- Time and content sampling error
Interscorer difference
- errors entering into scores whenever the element of subjectivity influences scoring
- refers to the variations in scores that stem from differences in the subjective judgements of the scorers
Scorer Reliability
- method for estimating error due to interscorer differences
- 2 independent scorers (two independent scores are generated)
- correlation between the set of scores
(for metric variables)
Time sampling error
variability in test scores as a function of the fact that they are obtained at one point in time rather than at another
Concept of time sampling error
- hinges on two related notion
- Construct/behavior is liable to fluctuate in time
2. Construct/behaviors chnage at different paces in time
Test-retest reliability
- test is administered twice on two different occasions to one or more groups of individuals
- correlation between the scores obtained from the two administrations
= test-retest reliability coefficient - crucial: length of time interval!
Content sampling error
term used to label the trait-irrelevant variability that can enter into test scores as a result for fortuitous factors related to the content of the specific items included in a test