Chapter 5: Measurement* Flashcards
How do you evaluate the accuracy of your measurement tool?
Reliability and construct validity
Reliability
the degree to which a measure (of behavior/personality/intelligence/psychological construct) is consistent, providing a stable form of measurement
Construct validity
the degree to which your measure actually measures what you want it to measure
True score theory
If only measuring a variable, true score is a person’s real score. If conducting an experiment, true score is a person’s score affected by the condition they are in.
Errors
sources of variability in your measure caused by things other than your IV (if there is an IV)
Types of errors
random and systematic
True score
an individual’s actual level of the variable being measured, not the score they get on the measure of that variable
Measurement error
any contributor to the measure’s score that is not based on the actual level of the variable of interest (i.e. not the true score); responsible for the degree to which a measure’s score deviates from the true score
Random error
has no pattern, unavoidable, unpredictable, and can’t be replicated by repeating the experiment e.g. misreading or misunderstanding questions, time of day
Systematic error
has a pattern, produces consistent errors, and affects a participant’s scores in all conditions e.g. response biases, individual differences, incorrectly calibrated measuring instruments
Why is low reliability a problem?
Difference between conditions can be misleadingly inflated or deflated due to unreliable measurements
Types of reliability for a measure using the correlation coefficient
Test-retest reliability, internal consistency reliability, inter-rater reliability
Test-retest reliability
degree of reliability assessed by administering the same measure on two different occasions, then calculating the correlation between the two different scores obtained
Alternate forms reliability (solution for practice effects in test-retest)
two different forms of the same test are administered on two separate occasions
Challenges of test-retest reliability
practice effects and demand characteristics