5010: Reliability, Responsiveness Flashcards
Types of Measurement Error
Systematic
(consistent, unidirectional, biased, “constant”)
Random
(inconsistent, either direction equally likely, try to minimize, on average- will cancel out)
Sources of Measurement Error
Rater (stabilization, recording)
Meas. Instrument or Method (goni-faulty, consistency- interrater)
Subject (clothing, m. mass, gender, time of day, meds)
Types of Reliability
Intrarater (usu. MOST reliable)
Interrater
Test-retest (suggests no rater involvement, self-reported data)
Intraclass Correlation Coefficient (ICC)
True score variance b/w subjects
= ————————————————
Total variance
Reliability Coefficient
True score variance b/w subjects
= ————————————————
Total variance + error variance
Variance
Measure of avg. variability of sample data. Ideally for a clinical measure.
Interpretation of Reliability Stats
Range: 0-1
Statistical Measures of Reliability
ICC- for Continuous (sometimes Ordinal)
SEM for Continuous
Kappa for categorical
Cronbach’s alpha for Multiple items, one meas.
Bland-Altman Plots
Plot difference between test-retest
Shows repeatability and any bias over time
Validity
Are we really measuring what we think we are measuring?
4 Categories of Validity testing
Face Validity
Content Validity
Criterion-related validity
Construct Validity
Face Validity & How to Test
Does it appear to be valid for this measurement (subjective)
- Have clinicians look it over & give opinion
- Perform the test on patients & ask their opinion
- Clinician or Pt may reject it
Content Validity & How to Test
Does instrument address all aspects and only aspects of the attribute being measured?
- 1st author must give definition of what they intend to meas.
- Use a thorough, organized, comprehensive development process.
- May sample expert opinion
- Test for ceiling & floor effects
- Analyze data using factor analysis
What is Factor Analysis
Testing tool used to test CONTENT VALIDITY
- used for multi-item meas. tools
- complex statistical analysis based on correlation among items
- will identify # and type of underlying dimensions being meas
- may identify items that do not correlate/fit w/ other items
i.e: balance test –> might also identify strength –> but that is not what we are trying to measure.
Criterion-Delated Validity & Test
this test requires a “gold standard” to serve as criterion.
Goniometry meas- gold standard is w/ x-ray
We don’t always use them because of cost, time, practicality, and the may be uncomfortable
2 Ways to Test:
1. Concurrent- gold standard used @ same time
2. Predictive- gold standard is some future event
(GRE to predict PT success)
To measure, calculate the correlation b/w the measurement and the gold standard.