U8 Flashcards
reliability
refers to the extent to which a measure is consistent and free from error
a reliable scale isn’t always _____ but a valid scale is always _____
valid
reliable
variance
measure of variability in scores in a sample
the ____ the variance, the more homogeneous the scores
smaller
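A quick sketch of the variance cards above, using made-up scores (both lists are hypothetical). It uses the sample variance from Python's standard library, so the tightly clustered set should come out with the smaller value:

```python
from statistics import variance  # sample variance (divides by n - 1)

# Hypothetical score sets, invented for illustration only.
homogeneous = [70, 71, 69, 70, 70]     # scores cluster tightly
heterogeneous = [50, 90, 60, 85, 65]   # scores are spread out

var_homogeneous = variance(homogeneous)
var_heterogeneous = variance(heterogeneous)

# The more similar the scores, the smaller the variance.
print(var_homogeneous, var_heterogeneous)
```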
trimmed mean
computed by “trimming away” a certain portion of scores
- used in certain contexts to produce more stable/accurate outcomes
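A minimal sketch of a trimmed mean, assuming the same proportion is cut from each end of the sorted scores (the card does not say how the trimming is done, so this is one common convention). The function name and the scores are hypothetical:

```python
def trimmed_mean(scores, proportion):
    """Mean after dropping `proportion` of the scores from EACH tail."""
    if not 0 <= proportion < 0.5:
        raise ValueError("proportion must be in [0, 0.5)")
    ordered = sorted(scores)
    k = int(len(ordered) * proportion)    # scores cut from each tail
    kept = ordered[k:len(ordered) - k]
    return sum(kept) / len(kept)

# Hypothetical scores with one very low and one very high value.
scores = [5, 48, 50, 51, 52, 53, 54, 55, 56, 150]

plain = sum(scores) / len(scores)       # pulled upward by the 150
trimmed = trimmed_mean(scores, 0.10)    # drops the 5 and the 150

print(plain, trimmed)
```

The trimmed value sits closer to the bulk of the scores, which is the "more stable" outcome the card refers to.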
measurement error
random events can increase/decrease a score
bias error
systematic/constant
present every time consistently
classical test theory
X = T ± ME ± BE
X = observed score
T = true score
ME = measurement error
BE = bias error
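The formula on this card can be sketched with made-up numbers. Errors are written as signed values here, so the ± becomes a plain sum. The function name, the true score, and every error value are hypothetical:

```python
def observed_score(true_score, measurement_error, bias_error):
    """X = T + ME + BE, with both errors given as signed numbers."""
    return true_score + measurement_error + bias_error

true_score = 100                     # hypothetical T
bias_error = 3                       # constant, same direction every time
random_errors = [-2, 1, 0, 2, -1]    # hypothetical ME values, sum to zero

observed = [observed_score(true_score, me, bias_error) for me in random_errors]
mean_observed = sum(observed) / len(observed)

# The random errors cancel out across trials; the bias does not.
print(observed, mean_observed)
```

In this toy case the mean of the observed scores is off from the true score by exactly the bias.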
systematic error
(+problem)
- predictable errors of measurement (reflection of bias error)
- one direction
- threatens validity of the measurement
(solved by re-calibrating instrument)
random error
- due to chance
- not following procedures properly
(variability in participants)
sources of measurement error
- instrument
- participant
- researcher variability
- environment variability
correlation might be a measure of _____
reliability
the less error in a score, the closer the true score is to the ______
observed score
test-retest reliability
measures consistency of measuring instrument (equipment/person) over time/testing sessions
interclass correlation coefficient
- similar to PPMC
- bad test-retest reliability
- evaluates association and difference (from variance estimate)
- 2+ measures
why is the interclass correlation coefficient limited
- only bivariate
- cannot separate how much error can be expected across tests (variance components)
- scores may be poorly correlated but not significantly different
- scores may be highly correlated but significantly different
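The last point can be sketched with a toy retest where every participant scores exactly 5 points higher the second time. The data and the helper name `pearson_r` are hypothetical; the helper computes an ordinary Pearson correlation by hand:

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)

# Hypothetical test and retest scores: the retest is shifted up by 5.
test1 = [10, 12, 14, 16, 18]
test2 = [score + 5 for score in test1]

r = pearson_r(test1, test2)
mean_difference = sum(b - a for a, b in zip(test1, test2)) / len(test1)

# Perfect association, yet the two sessions disagree by 5 points on average.
print(r, mean_difference)
```

The correlation only reflects association, so it cannot flag the constant shift between sessions.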
standard error of measurement (SEM)
calculates range of expected score deviations from test to test
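The card gives no formula for the SEM. A commonly used one is SEM = SD × √(1 − r), where SD is the standard deviation of the scores and r is a reliability coefficient. The sketch below assumes that formula; the function name and the input values are hypothetical:

```python
from math import sqrt

def standard_error_of_measurement(sd, reliability):
    """SEM = SD * sqrt(1 - r); assumes a reliability coefficient in [0, 1]."""
    if not 0 <= reliability <= 1:
        raise ValueError("reliability must be between 0 and 1")
    return sd * sqrt(1 - reliability)

# Hypothetical values: SD of 10 points, reliability coefficient of 0.91.
sem = standard_error_of_measurement(sd=10, reliability=0.91)

# Higher reliability gives a smaller SEM, i.e. a narrower expected range.
print(round(sem, 2))
```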
factors affecting test-retest reliability
- learning effect
- rater bias
- test retest intervals
learning effect
influence of one test on a later test (practice/boredom)
rater bias
(+prevention)
tester desires a certain outcome
- prevention (masking the tester/strict set of methods)
test-retest intervals
period between assessments
- too short (fatigue/learning effect)
- too long (changes in participant)
intra rater reliability
stability of data within one individual between trials
inter rater reliability
stability of observer or instrument measuring same group (aiming to produce generalizable results)
alternate forms reliability
- establishing reliability of standardized tests by using 2 forms of a test then observing reliability