Reliability Flashcards

1
Q

What does reliability in a test mean?

A

Consistent results between administrations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Is validity essential for reliability?

A

No. Reliability is essential for validity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How are reliability and validity expressed? Also, what is the shorthand used in equations for both?

A

They are expressed as correlations.
Reliability = rxx
Validity = rxy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What two things does rxx stand for?

A

The correlation between scores (x) on two administrations of a test.
The proportion of obtained score variance explained by true score variance. (0.8=80% explained by true score variance).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does true score theory state?

A

If a test is given to an individual an infinite number of times, their results would create a normal distribution and the mean would be their true score. The standard deviation would be the standard error of the estimate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If rxx = 0.8 what amount of variance is due to true score variance and what amount is due to error score variance?

A

80% is due to true score variance and 20% is due to error score variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When will a persons raw score = their true score

A

When error = 0 or rxx = 1. There is a perfect correlation…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What two measures of variance do we need to add to get the total variance in obtained score

A

true score variance and error variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the five types of reliability ?

A
  1. Test-retest
  2. Alternate form (immediate)
  3. Alternate form (delayed)
  4. Internal consistency
    a. split-half
    b. Kuder Richardson and Cronbach’s alpha
  5. Scorer (inter-rater)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What occurs in test-retest reliability?

A

Two tests are given to the same group of people on two different occasions.
Goldy locks zone is 4 weeks apart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What type of correlation does test-retest give us and what type of error variance?

A

Coefficient of stability (how stable over two tests)

Time sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two ways Alternate forms reliability is administered?

A

Two versions of test constructed in an identical way, except they contain different content. These are then administered to the same people, in one condition one after another, in another at separate times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What type of error variance does alternate forms measure?

A
Immediate = Content sampling
Delayed = Time and content sampling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is Split-half administered and how is the correlation measured?

A

A single test given and then split into two equal halves.

The correlation is measured by comparing the scores on each half of the test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What type of error varaince does split half produce?

A

Content sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does Spearman-Brown formula allow us to do?

A

It lets us look at the effect of lengthening and shortening a test.

17
Q

What is the difference between Kuder-Richardson and Cronbach’s alpha? Also, what type of error variance is measured?

A

KR is used for dichotomous answers, while Cron’s is used for Likert-scales.
These both look at content sampling and content heterogeneity.

18
Q

What does scorer reliability measure?

A

Interscorer differences.

19
Q

How many test forms and on how many occasions do the 5 reliability tests need to be given?

A

test-retest = one form, on two occasions
Alternate forms immediate = two forms, one occasion
Alternate forms delayed = two forms two occasions
Split half = one form, one occasion
KR-20 & Cron’s = one form, one occasion
Scorer = one form, one occasion

20
Q

How high should reliability be for a good test?

A

Greater than 0.8

21
Q

How do you estimate a confidence interval around a true score?

A

First find the true score ti = M+rxx(xi-M)
Then calculate the standard error of measurement
SEM= SD square root of 1-rxx
Then calculate CI’s
ti+/- 1.96 X SEM

22
Q

What is the true score if the rxx equals 0 or 1

A

for 0, true score will equal the mean

for 1 true score will equal the raw score.

23
Q

Should we consider the difference significant between scores if their percentile bands overlap?

A

No

24
Q

How do we test if there is a significant difference between two scores?

A

We use the standard error of difference. This will calculate how large the difference needs to be for it to be significant. We then find the difference between the two scores and see if it is bigger than our SEdiff.