Chapter 5 Flashcards

Question 1

Q

a proportion that indicates the ratio between the true score variance on a test and the total
variance

Answer

A

reliability coefficient

Question 2

Q

A statistic useful in describing sources of test score variability is the _______

Question 3

Q

Variance from true differences is _______

Answer

A

true variance

Question 4

Q

variance from irrelevant, random sources
is ________________

Answer

A

error variance

Question 5

Q

refers to the
proportion of the total variance attributed to true variance.

Answer

A

reliability

Question 6

Q

refers to collectively all of the factors associated
with the process of measuring some variable, other than the variable being measured.

Answer

A

measurement error

Question 7

Q

is a source of error in measuring a targeted variable caused by
unpredictable fluctuations and inconsistencies of other variables in the measurement process.

Answer

A

Random error

Question 8

Q

refers to a source of error in measuring a
variable that is typically constant or proportionate to what is presumed to be the true value of
the variable being measured.

Answer

A

systematic error

Question 9

Q

terms that refer to variation among items within a test as well as to
variation among items between tests

Answer

A

item sampling or
content sampling

Question 10

Q

is an estimate of reliability obtained by correlating pairs of scores
from the same people on two different administrations of the same test

Answer

A

Test-retest reliability

Question 11

Q

the estimate of test-retest reliability is often referred to as the ______

Answer

A

coefficient of
stability.

Question 12

Q

The degree of the relationship between various forms of a test can be evaluated by means of an alternate-forms or parallel-forms
coefficient of reliability, which is often termed the ________

Answer

A

coefficient of equivalence.

Question 13

Q

refers to an estimate of the extent to which item sampling and other errors have affected test scores on versions of the same test when, for each form of the test, the means and variances of observed test scores are equal.

Answer

A

parallel forms
reliability

Question 14

Q

are simply different versions of a test that
have been constructed so as to be parallel.

Answer

A

Alternate forms

Question 15

Q

refers to an estimate of the extent to which these different forms of the same test have been affected by item sampling error, or other error.

Answer

A

alternate forms reliability

Question 16

Q

is obtained by correlating two pairs of scores obtained
from equivalent halves of a single test administered once.

Answer

A

split-half reliability

Question 17

Q

This method yields an estimate of split-half
reliability that is also referred to as _________

Answer

A

odd-even reliability.

Question 18

Q

refers to the degree of correlation among all the
items on a scale.

Answer

A

Inter-item consistency

Question 19

Q

allows a test developer or user to estimate internal consistency reliability from a correlation of two halves of a test.

Answer

A

Spearman–Brown formula

Question 20

Q

is the degree to which
a test measures a single factor.

Answer

A

homogeneity

Question 21

Q

may be thought of as the mean of all possible split-half correlations, corrected by the Spearman–Brown formula.

Answer

A

coefficient alpha

Question 22

Q

describes the degree to which a test
measures different factors.

Answer

A

heterogeneity

Question 23

Q

as a measure used to evaluate the internal consistency of a test that focuses on the degree of difference that exists between item scores.

Answer

A

average proportional distance method
(APD)

Question 24

Q

Homogeneity VS heterogeneity of test items (essay)

Answer

A

Recall that a test is said to be homogeneous
in items if it is functionally uniform throughout. Tests designed to measure one factor, such as one ability or one trait, are expected to be homogeneous in items. For such tests, it is reasonable to expect a high degree of internal consistency. By contrast, if the test is heterogeneous in items, an estimate of internal consistency might be low relative to a more appropriate estimate of test-retest reliability.

Question 25

Q

is the degree of agreement or consistency between two or
more scorers (or judges or raters) with regard to a particular measure.

Answer

A

inter-scorer reliability

Question 26

Q

is a trait, state, or ability presumed to be ever-changing as a function of situational and cognitive experiences.

Answer

A

dynamic characteristic

Question 27

Q

ability presumed to be relatively unchanging is ______ such as
intelligence.

Answer

A

static characteristic

Question 28

Q

if some items are so difficult that no test-taker is able to obtain a perfect score, then the test is a _____

Answer

A

power test

Question 29

Q

generally contains items of uniform level of difficulty (typically uniformly low) so that, when given generous time limits, all test-takers should be able to complete all the test items correctly

Answer

A

speed test

Question 30

Q

is designed to provide an indication of where a test-taker stands with respect to some variable or criterion, such as an educational
or a vocational objective

Answer

A

criterion-referenced test

Question 31

Q

a value that according to classical test theory genuinely reflects an individual’s ability (or trait) level as measured by a particular test.

Answer

A

true score

Question 32

Q

also referred to as the true score (or classical) model of measurement. _________ is the most widely used and accepted model in the psychometric literature today

Answer

A

classical test theory (CTT)

Question 33

Q

seek to estimate the extent to which specific sources of variation under defined conditions are contributing to the test score.

Answer

A

domain sampling theory

Question 34

Q

is based on the idea that a person’s test scores vary from testing to testing because of variables in the testing situation.

Answer

A

generalizability theory

Question 35

Q

Cronbach encouraged test developers and researchers to describe the details of the particular test situation or ______ leading to a specific test score.

Question 36

Q

examines how generalizable scores from a particular test are if
the test is administered in different situations.

Answer

A

generalizability study

Question 37

Q

include things like the number of items in the test, the amount of training the test scorers have had, and the purpose of the test administration.

Question 38

Q

These coefficients are similar to reliability coefficients
in the true score model.

Answer

A

coefficients of generalizability.

Question 39

Q

developers examine the usefulness of test
scores in helping the test user make decisions.

Answer

A

decision study

Question 40

Q

Another alternative to the true score model is ________

Answer

A

Item response theory (IRT)

Question 41

Q

a synonym for Item response theory (IRT) in the academic literature is _____

Answer

A

latent-trait theory.

Question 42

Q

is a categorical variable with two possible response values (Yes/No, Agree/Disagree, Success/Fail).

Answer

A

dichotomous item

Question 43

Q

is a categorical variable ordinal or nominal with more than two possible values (e.g. strongly disagree, disagree, agree, strongly agree).

Answer

A

polytomous item

Question 44

Q

is a reference to an IRT model with very specific assumptions about the underlying distribution

Answer

A

Rasch model

Question 45

Q

is the tool used to estimate or infer the extent to
which an observed score deviates from a true score.

Answer

A

standard error of measurement

Question 46

Q

a range or band of test scores that is likely to contain the true score.

Answer

A

confidence interval

Question 47

Q

Comparisons between scores are made using the _________

Answer

A

standard error of the difference

Question 48

Q

refers to a group of personality tests.

Answer

A

Personality test battery