Ch 5.2 Reliability Flashcards
Consistency Across Time: Test-Retest Reliability
- Correlating paires of scores from the same people on two different administrators of the same test
- Appropriate for relatively stable constructs (personality)
Consider
- Time between administrations
– Generally, as time increases reliability decreases
– Coefficient of stability (interval between testing > 6 mos.)
- Intervening factors
– Going through treatment?
– Learning occurring?
When a change is assumed we do not use test-retest reliability
Consistency Across Raters: Inter - Scorer (Rater) Reliability
- Degree of agreement of consistency between two or more scorers
- Commonly included in test manuals
- High consistency indicates that with sufficient training, scores may be derived in a systematic, consistent way by various scorers
Consider
- Clarity of scoring criteria
Summary: Different Approaches to Reliability [placeholder]
.
Self-Reports
- Test-Retest
- Alternate Form/Parallel Form
- Internal Consistency
Interviews/Behavioral Observations
- Inter-Rater Reliability
How reliable is reliable?
Depends on the purpose of the test & importance of results
- .65 - .70 (minimal_
- .80+ (good)
- .90+ (really good, sometimes too good)
- Possible item redundancy
Interpretation Considerations
The purpose of the reliability coefficient
- One time?
- Use repeatedly?
The nature of the test
- Is the construct homogeneous in nature? (life satisfaction)
- Or, heterogeneous in nature? (daily stress measure)
Standard Error of Measurement
Used to estimate the extent to which an observed score deviates from the true score
SEM
- SD of a theoretically normal distribution of test scores obtained by one person on equivalent tests
- The higher the standard error of measurement, the lower the reliability (inverse relationship w/ reliability)
- Provides a measure of precision of an observed score (allows us to estimate the range in which the true score is likely to exist)
- Related to confidence intervals
Confidence Interval
- Range of test scores likely to contain the true score
- Observed score + or - (z critical value) * SEM
Ex: WAIS: You score was = 100; SEM = 2.5
SEM & Confidence Intervals
- 90% confidence your true score is between 96.8 & 103.2
- 95% confidence your true score is between 95.1 & 104.9
- 99% confidence your true score is between 93.6 & 106.5
This is the case because 100 + or - the z score (1.28) x the SEM (2.5)
This equals 100 + or - 3.2z. You score is anywhere in the range of 100 + or - 3.2