Reliability Flashcards
an index of reliability, a proportion that indicates the ratio between the true score variance on a test and the total variance
reliability coefficient
the prerequisite of validity
high reliability
reliability increases with [ ]
test length
standard deviation squared; it is useful because it can be broken down into components
variance
variance from true differences
true variance
variance from irrelevant, random sources
error variance
refers to the proportion of the total variance attributed to true variance
reliability
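The ratio of true variance to total variance can be sketched numerically; the variance figures below are made up purely for illustration:

```python
# Reliability as the proportion of total variance that is true variance.
# The variance values are hypothetical, chosen only to illustrate the ratio.
true_var = 16.0   # variance from true individual differences
error_var = 4.0   # variance from irrelevant, random sources

total_var = true_var + error_var        # observed-score variance
reliability = true_var / total_var      # 16 / 20 = 0.80
```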
sources of variance
test construction
administration
scoring
interpretation
variance in test construction
item sampling or content sampling
variance in test administration
test environment
test-taker variables
examiner-related variables
test scoring and interpretation
scorers and scoring system
an estimate of reliability obtained by correlating pairs of scores from the same people on two different administrations of the same test
test-retest reliability
how stable the construct or measure is over time
coefficient of stability
If the interval between test and retest is too short, there is a tendency for [ ]
carryover effect/practice effect
test-retest is not applicable for [ ]
states
how to measure test-retest reliability
pearson r or spearman rho
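A minimal sketch of the test-retest computation, assuming numpy is available; the scores for six test takers on two administrations are invented for illustration:

```python
import numpy as np

# Hypothetical scores for six test takers on two administrations of the same test.
time1 = np.array([12, 15, 11, 18, 14, 16])
time2 = np.array([13, 14, 12, 19, 15, 17])

# Coefficient of stability: Pearson r between the two administrations.
r = np.corrcoef(time1, time2)[0, 1]
```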
The consistency of test results between two different – but equivalent – forms of a test.
parallel forms and alternate-forms reliability
for each form of the test, the means and the variances of observed test scores are equal.
parallel forms
are simply different versions of a test that have been constructed so as to be parallel.
alternate forms
coefficient for parallel and alternate forms
coefficient of equivalence
the advantage of having another form
eliminates carryover/practice effects
how to measure parallel and alternate forms reliability
pearson r or spearman rho
Defines measurement error strictly in terms of consistency or inconsistency in the content of the test.
internal consistency reliability
obtained by correlating two sets of scores obtained from equivalent halves of a single test administered once.
split-half reliability estimate
three steps in split-half reliability estimate
Step 1. Divide the test into equivalent halves.
Step 2. Calculate a Pearson r between scores on the two halves of the test.
Step 3. Adjust the half-test reliability using the Spearman-Brown formula.
allows a test developer or user to estimate internal consistency reliability from a correlation of two halves of a test.
spearman-brown formula
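The three steps can be sketched as follows, assuming numpy; the 0/1 item-response matrix is invented, and an odd-even split is used as one common way of forming equivalent halves:

```python
import numpy as np

# Hypothetical 0/1 item responses: rows = test takers, columns = items.
X = np.array([
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
])

# Step 1: divide the test into equivalent halves (odd-even split).
odd = X[:, ::2].sum(axis=1)
even = X[:, 1::2].sum(axis=1)

# Step 2: Pearson r between scores on the two halves.
r_half = np.corrcoef(odd, even)[0, 1]

# Step 3: Spearman-Brown adjustment to estimate full-test reliability.
r_full = 2 * r_half / (1 + r_half)
```

Note that the adjusted coefficient is always at least as large as the half-test correlation when that correlation is positive, reflecting the gain in reliability from the longer (full-length) test.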
-Used with ratio or interval data.
-Mean of all possible split-half correlations.
-Preferred statistic for obtaining an estimate of internal consistency reliability.
-Typically ranges in value from 0 to 1.
cronbach’s coefficient alpha
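Coefficient alpha can be sketched directly from item and total-score variances; the data are made up, and sample variances (ddof=1) are used as one common convention:

```python
import numpy as np

def cronbach_alpha(X):
    """Cronbach's alpha: X has rows = test takers, columns = items."""
    k = X.shape[1]
    item_vars = X.var(axis=0, ddof=1)        # variance of each item
    total_var = X.sum(axis=1).var(ddof=1)    # variance of total scores
    return k / (k - 1) * (1 - item_vars.sum() / total_var)

# Hypothetical 0/1 item responses: rows = test takers, columns = items.
X = np.array([
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
])

alpha = cronbach_alpha(X)
```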
used for tests with dichotomous items, primarily those that can be scored right or wrong (such as multiple-choice items). useful for evaluating the internal consistency of highly homogeneous items
kuder-richardson formula
used for items that have varying difficulty (for example, some items might be very easy, others more challenging). it should only be used if there is a correct answer for each question
kr-20
used for a test where the items are all of approximately the same difficulty.
kr-21
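Both Kuder-Richardson formulas can be sketched on the same hypothetical right/wrong data, assuming numpy; population variance (ddof=0) is used as one common convention. Because these made-up items vary in difficulty, KR-21 comes out lower than KR-20:

```python
import numpy as np

# Hypothetical 0/1 (right/wrong) responses: rows = test takers, columns = items.
X = np.array([
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
])

n, k = X.shape
p = X.mean(axis=0)                  # proportion passing each item
q = 1 - p
total_var = X.sum(axis=1).var()     # variance of total scores (ddof=0)

# KR-20: item difficulties may differ (uses each item's p*q).
kr20 = k / (k - 1) * (1 - (p * q).sum() / total_var)

# KR-21: assumes all items are equally difficult (uses only the mean total score).
M = X.sum(axis=1).mean()
kr21 = k / (k - 1) * (1 - M * (k - M) / (k * total_var))
```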
refers to the degree of correlation among all the items on a scale.
inter-item consistency
Ideally, the average inter-item correlation for a set of items should be between [ ] and [ ], suggesting that while the items are reasonably homogenous, they do contain sufficiently unique variance so as to not be isomorphic with each other.
.20 and .40
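The average inter-item correlation can be sketched from an item-by-item correlation matrix, assuming numpy; the response matrix is invented, and its average (around .5) falls above the .20-.40 band, suggesting these hypothetical items are more homogeneous than is ideal:

```python
import numpy as np

# Hypothetical 0/1 item responses: rows = test takers, columns = items.
X = np.array([
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
])

R = np.corrcoef(X, rowvar=False)            # item-by-item correlation matrix
upper = R[np.triu_indices_from(R, k=1)]     # each item pair counted once
avg_inter_item_r = upper.mean()
```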
The degree of agreement or consistency between two or more scorers (or judges or raters) with regard to a particular measure.
inter-scorer reliability
how to measure inter-scorer reliability
pearson r or spearman rho
A reliability coefficient of .80 indicates that 20% of the variability in test scores is due to [ ].
measurement error
Coefficient of inter-rater reliability provides information about error as a result of [ ]
test-scoring
Coefficient of stability provides information on error as a result of [ ]
the length of time between administrations
Coefficient of equivalence provides information on error as a result of [ ]
instrument (items) itself
high degree of internal consistency
test homogeneity
low degree of internal consistency
test heterogeneity
a characteristic where the best estimate of reliability would be obtained from a measure of internal consistency.
dynamic characteristic
a characteristic where the test-retest or the alternate-forms method would be appropriate.
static characteristic
If the variance of either variable in a correlational analysis is [ ] by the sampling procedure used, then the resulting correlation coefficient tends to be lower.
restricted
If the variance of either variable in a correlational analysis is [ ] by the sampling procedure, then the resulting correlation coefficient tends to be higher.
inflated (increased)
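The effect of restriction of range can be sketched with a small simulation, assuming numpy; the data-generating model and the selection cutoff are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate two correlated variables for a full sample (hypothetical model).
x = rng.normal(size=2000)
y = 0.7 * x + rng.normal(scale=0.5, size=2000)

r_full = np.corrcoef(x, y)[0, 1]

# Restrict the range of x, e.g., keep only high scorers (as a selective
# sampling procedure would).
keep = x > 1.0
r_restricted = np.corrcoef(x[keep], y[keep])[0, 1]
```

With the restricted sample, the correlation drops noticeably even though the underlying relationship between the variables is unchanged.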
all the items are of the same degree of difficulty. there is a time limit within which the test taker is required to answer all the items.
speed test
assesses the underlying ability of individuals by allowing them sufficient time; no time limit.
power test
designed to provide an indication of where a test-taker stands with respect to some variable or criterion, such as an educational or a vocational objective.
criterion-referenced test