Week 6: Reliability & Validity of Measurement (same same) Flashcards
Distinguish btn state characteristics and trait characteristics?
Some phenomena don’t change or do so only
gradually, height, gender (trait characteristics)
Some phenomena change within an individual over a relatively short time span (state characteristics)
why important? cos tools for state and trait
are different. E.g. if depression (how someone is
feeling at that moemnt youd use state, if more interested
in person over time and stable then use trait
what is the measurement problem?
The fact we cant measure some constructs that affect us directly.
• We can measure height, temperature and other physical phenomena directly.
– these are directly observable variables.
• We might like to measure happiness, sadness, depression, …
– but these are not directly observable
(LATENT variables)
What is a latent variable?
A construct we cant measure directly. e.g. happiness
What are the five steps involved in SCALE DEVELOPMENT if youre measuring a latent variable?
Technically at beginning youre stuck with an intuitive concept, then….
1) Have the construct DEFINED and agreed upon before we try to measure it!
Vague, intuitive definitions are a recipe for trouble !
2) OPERATIONAL definition
3) Measurement scales
4) Validity of the meausure
5) Reliability of measure
Does having reliability mean having validity?
NO theyre not linked.
On a target/dart throwing board:
Darts are all in the centre, all together on target.
Comment on reliability and val?
Good validity
Good reliability
On a target/dart throwing board:
Darts are all on the outer ring of board, bunched together closely.
Comment on reliability and val?
Good reliability, (consistency) poor validity (are they on target?)
On a target/dart throwing board:
Darts are all equally spread out on outer most ring
Comment on reliability and val?
Poor reliability, good validity (on average)
Tricky one-if dont know just memorise.
What is face validity?
Does the measure (instrument) appear to measure what it claims to measure ?
• What is it ?
– Does the instrument appear to measure the desired
construct
• How is it assessed ?
– Often assessed qualitatively
– In some cases an expert panel and/or patient/client
input may be sought
• Things to consider:
– Has little scientific rigor
– Might be thought of as a necessary but not sufficient
condition
e.g. if testing anxiety levels, you dont want to be asking their fave sex positions.
What do Initial development of measures involve?
Initial development of measures may involve expert opinion and/or individuals from the relevant population.
What is concurrent v?
• What is it ?
– The new measure should correlate (imperfectly) with an
established measure of the same or a related construct
– The construct underlying the established measure should be theoretically related to the construct being measured
• How is it assessed ? By correlation of the new and established measures
NOTE: if you decide to make a new scale of depression,
you would hope there is a positive correlation
between yours and established one.
So you want + correlation but you dont want it
to be .97 or something highly correlated cos then
theres no point in yours. nothing new
What is convergent v?
• What is it ?
• The new measure should correlate (imperfectly) with an another measure of the same construct
• In convergent validity we are using a 2nd ‘new’ measure as opposed to an established measure as used in concurrent validity
• How is it assessed ?
– Correlation between the measure being validated and another
measure of the same construct
Is there possibly some confusion here between convergent and concurrent validity ?
YES a lot of confusion. But what is our distinction?
WE make a distinction.
Concurrent: there is an existing scale and you correlate yours with that.
Convergent: there is no accepted scale and
you simply correlate all of the new attempts.
What is construct v?
• What is it ?
– Demonstrate that the measure being validated behaves as the construct ought to behave under varying conditions
• How is it assessed ?
– Through a triangulation of correlations such as the
excerpt from the D-HS article illustrates
– Confirmatory factor analysis (not covered here)
Methodological flaw from authors in screenshot example?
GO BACK TO THIS?
A methodological flaw here is that the authors are developing the measure and validating it in one set of data. This can lead to a self-fulfilling finding of validity.