lecture 17 Flashcards
What do measures attempt to quantify? Example?
Measures attempt to quantify the “true value” of a latent (or hidden) psychological construct
“How extroverted are you?”
the way the construct would be presented to us if we assessed the construct perfectly. Ex: the measurement would show exactly what it is.
Are humans stochastic?
yes
what does stochastic mean?
functionally stochastic (unpredictable in ways we don’t understand)
Will measures ever be perfectly reliable?
We humans are not perfectly reliable
Thus, measures will never be perfectly reliable
what is measurement error?
measurement error is the difference between what we see and the true value
What is the best we can do?
Best we can do is estimate psychological constructs:
combining a guess about the true value and measurement error. We will never know what mixture this is, i.e., how much is true value and how much is measurement error.
what is the goal around measurement error?
minimize measurement error on average and hopefully maximize true value on average. This limits what we can say about the individual
What example was provided for measurement error?
assume that a person knows for certain their true value is 7.5 out of ten, but the scale you provided them doesn't allow half numbers; already the measurement instrument is forcing the participant into measurement error. AKA we are already getting half a unit of measurement error.
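The half-unit example can be sketched numerically. This is a hypothetical illustration, not from the lecture; the scale and values are made up:

```python
# Hypothetical sketch: an integer-only 1-10 scale forces a respondent whose
# true value is 7.5 into at least half a unit of measurement error.
true_value = 7.5
response = round(true_value)   # the scale forces a whole number (here: 8)
error = response - true_value  # observed value = true value + error
print(error)                   # half a unit of measurement error
```

Whether the respondent picks 7 or 8, the observed score is off by 0.5 before any other source of error enters.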
give examples of sources of measurement error.
“Should I select 7 or 8?”
Response: 7
“Oops, flipped the scale options”
Response: 2 (but meant 8)
many people also flip the scale (thinking small numbers are better)
“Recently hanging out with gregarious friends; I’m comparatively less extroverted,”
Response: 6
“Does extroverted mean extraordinary?”
Response: 3
“I’m bored, choose middle”
Response: 5
What's the intelligence test example of measurement error?
Intelligence test (IQ)
True value: 100
Sources of measurement error:
Unusually stressful day (score will probably be lower than true value)
Oops, had too much coffee (score will probably be lower than true value)
Oops, had too little coffee (score will probably be lower than true value)
Had to guess on 4 items (any good IQ test requires that you guess on some questions, otherwise we would have a ceiling effect)
Guessed all 4 correct!! Score = 115
Guessed all 4 wrong!!! Score = 85
Guessed 2 correct, 2 wrong. Score = 100
Why is estimation fundamentally limited, and what can we do?
We cannot remove all measurement error
Thus, we never obtain a person’s true value
Best we can do is estimate a person’s true value
While minimizing measurement error
While trying to measure only the intended psychological construct and nothing else
(our intelligence test shouldn't correlate with cultural background, first language, gender, etc. If it correlates with things we don't think it should, this reflects measurement error.)
are measurement instruments created equal?
Measurement instruments are not created equal! We revise our instruments over time
Do you need reliability before you get validity?
yes
What is reliability?
What is Reliability: How consistent is a measurement tool?
Is my score similar each time?
Try the color test: www.colorquiz.com
If we can assume that the true value isn’t changing what should happen?
if we can assume that the true value isn't changing, your score should be similar over time.
What is validity?
Validity: Does the tool measure the psychological construct it claims to?
Is my score representative of something meaningful?
Color test: Does my color score reflect my personality?
What are the 2 measurement goals?
- be consistent (reliability)
- hit the target (validity)
What is the visual analogy of the true value
visual metaphor of a dart board; a lightning bolt is a measurement occasion.
tests can be reliable but still not hit the right spot.
if our darts are hitting all over, we know we're not at the true value most of the time. We need consistency; it needs to land in the same place multiple times.
What questions do we need to ask ourselves about measurement error?
this is derived from the theories we have about the characteristics and the true value. After we have an idea of what the true value should be, we need to make sure that the scale allows people to assess that true value.
What is the nature of the true value?
Does my measurement instrument allow people to express their true value?
If not, estimates will appear to fluctuate even though true value is stable
How does the Myers-Briggs Type Indicator (16 personality types test) encourage measurement error?
this introduces error by pushing people in the middle to either of 2 sides like introverted or extraverted.
Is the big five personality encouraging measurement error?
not so much. The Big Five derives theoretically from the idea that introversion and extraversion form a continuum, identified by a general trend. It predicts that most people are in between the two.
on the dart board analogy how would we know we are being reliable and valid?
Reliable: Hitting the same place
Valid: Hitting the target
on the dart board analogy how would we know we are being reliable and not valid?
Reliable: Hitting the same place
Invalid: Off-target
Estimates are biased
on the dart board analogy how would we know we are being not reliable and not valid?
Unreliable: Not hitting the same place
Low validity: No bias, but rarely on target
What is internal consistency?
- Internal consistency: Do the items in the measure correlate with each other?
Example: IQ test
if you fail the easier question you should fail the harder question
Question 3: What is the square root of pi?
this is probably measuring crystallized knowledge not intelligence. It probably won’t correlate with performance on pattern matching
Question 4: what is your favourite colour?
if this doesn't correlate with the pattern shown by the other measures, we would say it is measuring something different because it isn't in the same cluster.
what is Test-retest reliability
How consistent is the measure over time?
example: Raven's matrices
we would expect there to have been changes in the true value if this goes down
What is Interrater reliability
Do observers agree on ratings?
observers are like items on a scale. We want them to all agree on what they see.
the manual will bias the results: it increases the consistency of scores, so the process of choosing what the observers pay attention to is the validity process.
the measurement process often works backwards; we start out by assuming that children are attached to their caregivers in different ways
How is reliability often expressed?
Reliability is often expressed in terms of correlation: the Pearson r correlation coefficient (or reliability coefficient)
What are the ways that you can measure internal consistency?
Correlate estimates between items
Method 1: Split-half procedure
split the measurement instrument into 2 equal parts. You get the score from one half and compare it to the score on the other. Ex: take all even items, then calculate the score if you only did the odd items; the even and odd scores should be similar if they are measuring the same things.
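The odd/even version of the split-half procedure can be sketched as below. The 8-item scale and all responses are hypothetical:

```python
# Split-half reliability sketch: score the odd items and the even items
# separately for each participant, then correlate the two half-scores.
# All data below are hypothetical.

def pearson_r(x, y):
    """Pearson r between two equal-length lists of numbers."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

# rows = participants, columns = answers to an 8-item scale (1-5 ratings)
responses = [
    [5, 4, 5, 5, 4, 5, 4, 5],
    [2, 3, 2, 2, 3, 2, 3, 2],
    [4, 4, 3, 4, 4, 3, 4, 4],
    [1, 2, 1, 1, 2, 1, 2, 1],
]

odd_scores  = [sum(p[0::2]) for p in responses]  # items 1, 3, 5, 7
even_scores = [sum(p[1::2]) for p in responses]  # items 2, 4, 6, 8
print(pearson_r(odd_scores, even_scores))        # high r -> halves agree
```

If the two halves measure the same thing, r should be close to 1; the made-up data here are deliberately consistent, so the correlation comes out well above .9.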
Method 2: Cronbach’s alpha procedure
𝛂 or 𝛚
get each of the pairs (all different combinations) of items on the measurement instrument and correlate performance on each of these pairs, then take the average of all of those correlations. The nice thing about this procedure is that each item is important in establishing the reliability of the scale. It also tells you which items are most problematic, because a problematic item will have low correlations with all of the other items.
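The pair-and-average idea can be sketched as below. The data are hypothetical, and the last step uses the standardized-alpha formula (combining the average inter-item correlation with the number of items k), which goes slightly beyond the averaging described above:

```python
# Sketch of (standardized) Cronbach's alpha: correlate every pair of items,
# average those correlations, then combine with the item count.
# Hypothetical data: rows = participants, columns = 8 items.
from itertools import combinations

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

responses = [
    [5, 4, 5, 5, 4, 5, 4, 5],
    [2, 3, 2, 2, 3, 2, 3, 2],
    [4, 4, 3, 4, 4, 3, 4, 4],
    [1, 2, 1, 1, 2, 1, 2, 1],
]

items = list(zip(*responses))         # one tuple of responses per item
pair_rs = [pearson_r(items[i], items[j])
           for i, j in combinations(range(len(items)), 2)]
mean_r = sum(pair_rs) / len(pair_rs)  # average inter-item correlation
k = len(items)
alpha = k * mean_r / (1 + (k - 1) * mean_r)  # standardized alpha
print(round(alpha, 2))

# An item that correlates poorly with all the others drags mean_r (and
# alpha) down -- which is how problematic items are spotted.
```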
What is the benchmark for internal consistency using Cronbach's alpha?
Benchmark internal consistency:
r = .80 is a good starting point
when it comes to internal consistency we want to see a positive r of at least .80.
How do you measure test-retest reliability?
Test-retest reliability
Correlate estimates between measurement occasions
Participants complete same measure at Time 1 and Time 2
(Repeated-measures design)
repeated measures design where time is the independent variable.
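The same correlation logic applies across measurement occasions. A minimal sketch with hypothetical scores:

```python
# Test-retest reliability sketch: the same (hypothetical) participants
# complete the same measure at Time 1 and Time 2; correlate the scores.

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

time1 = [100, 85, 115, 95, 105]  # e.g., IQ-style scores at Time 1
time2 = [98, 88, 112, 97, 104]   # same people, measured again later

r = pearson_r(time1, time2)
print(r > 0.80)  # meets the r = .80 benchmark for a stable construct
```

Small score shifts between occasions still yield a high r, as long as each person keeps roughly the same rank.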
What are moderators of test-retest reliability?
Moderators of test-retest reliability:
How much time between time 1 and time 2?
How stable is the construct?
IQ vs. attitudes
we think IQ is stable but attitudes towards things are more changeable.
What is the benchmark for test-retest reliability?
Benchmark test-retest reliability: r = .80 is a good starting point
Assuming little time between measurements
Assuming construct should be relatively stable
how do you measure interrater reliability?
Interrater reliability
Correlate estimates between observers
Observers rate same behavior
Kappa = 𝛋
kappa is the correlation-like coefficient used for interrater reliability.
Often lower than is desirable!
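Cohen's kappa corrects raw observer agreement for the agreement expected by chance. A sketch with hypothetical attachment-style ratings from two observers:

```python
# Cohen's kappa sketch: agreement between two observers, corrected for
# the agreement they would reach by chance. Ratings are hypothetical.
from collections import Counter

def cohens_kappa(rater1, rater2):
    n = len(rater1)
    # proportion of cases where the two observers agree
    observed = sum(a == b for a, b in zip(rater1, rater2)) / n
    # chance agreement, from each observer's category base rates
    c1, c2 = Counter(rater1), Counter(rater2)
    expected = sum((c1[cat] / n) * (c2[cat] / n)
                   for cat in set(rater1) | set(rater2))
    return (observed - expected) / (1 - expected)

r1 = ["secure", "secure", "avoidant", "anxious", "secure", "avoidant"]
r2 = ["secure", "secure", "avoidant", "secure", "secure", "anxious"]
kappa = cohens_kappa(r1, r2)
print(round(kappa, 2))  # ~0.43: lower than we'd like, as the notes warn
```

The observers here agree on 4 of 6 cases, yet kappa lands well below .80 once chance agreement is subtracted, illustrating why interrater reliability is often disappointing.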
Do we have benchmarks for interrater reliability?
no
How do you increase interrater reliability?
Increasing interrater reliability:
Generate concrete, easily observable guidelines
Initiate dialogue between observers
Practice with feedback
compare your ratings with the expected ratings on sample material.
how would you look at reliability with the Raven's matrices example?
Internal consistency
Split half reliability – odd & even items correlated, r = .93-.96
Cronbach’s Alpha – average item correlations, r = .88-.90
Test-retest reliability
2 days, r = .97
4 weeks, r = .87
79 years, r = .54
this is not concerning because we expect the correlation to weaken over time
Interrater reliability Not applicable
What is construct validity?
An evaluation of whether the measurement instrument quantifies the psychological construct
What are the 7 types of validity?
Face validity
Content validity
Predictive validity
Concurrent validity
Convergent validity
Discriminant validity
Reactivity
What is face validity?
Face validity: Does the measure appear to assess the psychological construct?
Do Raven’s matrices seem like intelligence?
when I look at the measurement, does it look like it is capturing the construct? everyone can have their own judgement
this is a good place to start but it is possible that the person is wrong about the validity
researchers may intentionally obscure the face validity to prevent people from knowing what is being measured.
What is content validity?
Content validity: Does the measure appear to assess ‘the whole construct and nothing but the construct’?
Are Raven’s comprehensive?
is it contaminated by other variables like language background?
what is Predictive validity?
Predictive validity: Does the measure correlate with future behaviors relevant to the construct?
- If ‘no,’ why measure the construct?
- Raven’s correlates with job performance, r = .3-.5
if you have someone take the Raven's test and ask their boss to rate their job performance, you will see a correlation. The correlation gets higher for jobs that require more intellect. This shows predictive validity.
What is concurrent validity?
Concurrent validity: Does the measure correlate with current behavior?
in the same test session or on the same day.
What is convergent validity?
Convergent validity: Does the measure correlate with other defensible operationalizations of the construct?
Raven's matrices correlate with Wechsler's test, r = .85
if you score highly on one you should score highly on the other if they are both measuring the same construct.
what is Discriminant validity?
Discriminant validity: Does the measure not correlate with unrelated constructs?
IQ scores do not correlate with favorite colors, r = .05
it shouldn’t correlate with unrelated things.
What is reactivity?
Reactivity: Does awareness of the construct change the psychological construct that is measured?
If so, the measure cannot be valid