Q2 reliability & validity Flashcards
what is true of a good measure?
it assesses behavioral variability accurately
observed score =
true score + measurement error
true score
systematic variance
measurement error
error variance
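The decomposition above can be illustrated with a quick simulation (hypothetical numbers: true scores drawn from N(100, 15), measurement error from N(0, 5)):

```python
import random
import statistics

random.seed(1)

# observed score = true score + measurement error
true_scores = [random.gauss(100, 15) for _ in range(1000)]  # systematic variance
errors = [random.gauss(0, 5) for _ in range(1000)]          # error variance
observed = [t + e for t, e in zip(true_scores, errors)]

# Measurement error inflates variability beyond the true-score variance
print(statistics.variance(true_scores))  # ~225
print(statistics.variance(observed))     # ~250
```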
where does measurement error come from? (2)
participant
not the participant
2 ways participants may introduce measurement error

transient factors
stable attributes
transient factors
anxiety, tiredness (not stable)
stable attributes
differ across participants but are stable within a given individual; IQ, personality traits, general motivation
3 ways (aside from the participant) that measurement error can be introduced
situation factors
measure factors
mistakes
situation factors
where are participants taking a questionnaire? lighting, temperature, etc.
measure factors
items are ambiguous or the questionnaire itself is poorly constructed
what kind of mistakes can contribute to error variance?
participant is given the wrong test, experimenter says something wrong, computer error, etc.
reliability
consistency and dependability in scores across time; undermined by measurement error
how do we estimate reliability?
via correlations between measures of the same attribute
satisfactory reliability coefficient
0.7 or higher
3 kinds of reliability
test-retest
inter-item
inter-rater
test-retest reliability
will you get the same score if you measure the same behavior at two different times? only appropriate for stable traits, not transient states
inter-item reliability
ensures consistency among scale items aiming to measure the same construct (internal consistency)
make sure there are high enough correlations between similar questions
number used to assess inter-item reliability and satisfactory threshold
Cronbach's alpha ≥ 0.7
what does Cronbach's alpha measure?
inter-item reliability
systematic variance
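Cronbach's alpha can be computed directly from item scores: alpha = (k/(k−1)) × (1 − sum of item variances / variance of total scores). A minimal sketch with a hypothetical 3-item scale answered by five participants:

```python
from statistics import variance

def cronbach_alpha(items):
    """items: one list of scores per scale item (same participant order)."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]   # per-person total score
    item_var = sum(variance(item) for item in items)   # sum of item variances
    return (k / (k - 1)) * (1 - item_var / variance(totals))

# Hypothetical 3-item Likert scale (1-5), five participants
item1 = [4, 5, 3, 5, 4]
item2 = [4, 4, 3, 5, 3]
item3 = [5, 5, 2, 5, 4]
alpha = cronbach_alpha([item1, item2, item3])  # ~0.90, above the 0.7 threshold
```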
how do we assess inter-rater reliability in nominal data?
Cohen's kappa coefficient
0.6+ is reliable
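Cohen's kappa corrects raw agreement for agreement expected by chance: kappa = (p_o − p_e) / (1 − p_e). A sketch with hypothetical nominal codings from two raters:

```python
from collections import Counter

def cohens_kappa(rater1, rater2):
    """Chance-corrected agreement between two raters on nominal codes."""
    n = len(rater1)
    p_o = sum(a == b for a, b in zip(rater1, rater2)) / n   # observed agreement
    c1, c2 = Counter(rater1), Counter(rater2)
    p_e = sum(c1[cat] * c2[cat] for cat in set(rater1) | set(rater2)) / n**2
    return (p_o - p_e) / (1 - p_e)

# Hypothetical codings of eight behaviors into categories A/B
coder1 = ["A", "A", "B", "B", "A", "B", "A", "A"]
coder2 = ["A", "A", "B", "B", "A", "B", "A", "B"]
kappa = cohens_kappa(coder1, coder2)  # 0.75, above the 0.6 threshold
```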
how do we assess inter-rater reliability in ordinal data?
Spearman's rho (0.7+)
Kendall's tau (0.45+)
how do we assess inter-rater reliability in interval/ratio data?
Pearson's r (0.7+)
how do we assess inter-rater reliability in 3+ raters’ data?
intraclass correlation coefficient (0.75+)
validity
are you actually measuring what you want to measure? accuracy
why is validity a concern in psychological research?
psychological constructs are not directly observable
construct validity
the degree to which a measure actually captures the intended construct; assessed through convergent and discriminant validity
convergent validity
finding measures that correlate with each other that should correlate with each other (happiness and positive affect)
discriminant validity
no correlation with unrelated measures (happiness and negative affect)
criterion-related validity
correlation between a measure and relevant behavior; the measure should predict behavior (e.g., SAT -> graduation rates, GPA)
concurrent criterion-related validity
correlation between measure and behavior at the current time
predictive criterion-related validity
correlation between measure and behavior at a future time
2 kinds of criterion-related validity
concurrent
predictive
how can we maximize reliability and validity?
provide specific operational definitions (precise, appropriate)
no ambiguity on how you measure your variable (backed up by the literature)
standardize procedures (scripts, etc.)
inclusion and assessment of subject variables (sex, age, etc.)
random and/or standardized sampling to wash out potential systematic individual differences between groups