Test Development Flashcards

Question 1

Q

is an umbrella term for all that goes into the process of creating a test

Answer

A

Test Development

Question 2

Q

The thought that “there ought to be a test for…” is the impetus for
developing a new test.

Question 3

Q

The stimulus could be knowledge of psychometric problems with other
tests, a new social phenomenon, or any number of things.

Question 4

Q

The process of setting rules for
assigning numbers in measurement.

Question 5

Q

are instruments
to measure some trait, state, or ability and may be categorized in many ways

Question 6

Q

was very influential in the
development of sound scaling methods

Answer

A

L. L. Thurstone

Question 7

Q

A grouping of words, statements, or symbols on which
judgments of the strength of a particular trait, attitude, or emotion are
indicated by the test taker.

Answer

A

Rating Scales

Question 8

Q

Developed to be “a practical means of assessing
what people believe, the strength of their convictions, as well as individual differences in moral
tolerance” (p

Answer

A

Morally Debatable Behaviors Scale-Revised (MDBS-R)

Question 9

Q

Each item presents the test taker with five alternative responses (sometimes seven), usually on an agree–disagree or
approve–disapprove continuum.

Answer

A

Likert Scale

Question 10

Q

Offers a continuum of responses that allow for measurements of
attitudes on various topics

Answer

A

Likert Scale
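
Likert items are usually combined into a single summative score. The sketch below assumes responses coded 1 to 5 (or 1 to 7) and reverse-scores the negatively worded items; the function name `score_likert` and its parameters are hypothetical, chosen only for illustration.

```python
def score_likert(responses, reverse_keyed=(), points=5):
    """Sum item responses, reverse-scoring the negatively worded items.

    responses: list of integers from 1 to `points`, one per item.
    reverse_keyed: positions (0-based) of items worded in the opposite direction.
    """
    total = 0
    for index, value in enumerate(responses):
        if not 1 <= value <= points:
            raise ValueError(f"response {value} is outside 1..{points}")
        # Reverse scoring maps 1 -> points, 2 -> points - 1, and so on.
        total += (points + 1 - value) if index in reverse_keyed else value
    return total


# Items 2 and 3 (0-based) are negatively worded in this made-up example.
print(score_likert([5, 4, 2, 1], reverse_keyed={2, 3}))  # prints 18
```

A higher total then indicates a stronger attitude in the keyed direction.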

Question 11

Q

Test takers must choose between two alternatives according to some rule.

Answer

A

Method of Paired Comparisons

Question 12

Q

For each pair of options, test takers receive a higher score for
selecting the option deemed more justifiable by the majority of a group
of judges.

Answer

A

Method of Paired Comparisons
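
The scoring rule in the two cards above can be sketched as follows. This is a minimal illustration, assuming one point per pair on which the test taker picks the option the judges' majority favored; `all_pairs` and `score_paired_comparisons` are hypothetical names, and the behaviors listed are made-up data.

```python
from itertools import combinations


def all_pairs(options):
    """Every option is paired once with every other option: n(n - 1) / 2 pairs."""
    return list(combinations(options, 2))


def score_paired_comparisons(choices, judge_key):
    """Count the pairs on which the test taker chose the judges' majority option."""
    if len(choices) != len(judge_key):
        raise ValueError("choices and judge_key must cover the same pairs")
    return sum(1 for chosen, keyed in zip(choices, judge_key) if chosen == keyed)


pairs = all_pairs(["cheating on taxes", "accepting a bribe", "littering"])
print(len(pairs))  # prints 3

# One entry per pair: the option the judges favored and the option chosen.
judge_key = ["cheating on taxes", "littering", "littering"]
choices = ["cheating on taxes", "cheating on taxes", "littering"]
print(score_paired_comparisons(choices, judge_key))  # prints 2
```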

Question 13

Q

Entails judgments of a stimulus in
comparison with every other stimulus on the scale

Answer

A

Comparative Scaling (Sorting Task)

Question 14

Q

Stimuli are placed into one of two or more
alternative categories that differ quantitatively with respect to some
continuum.

Answer

A

Categorical Scaling

Question 15

Q

Items range sequentially from weaker to stronger
expressions of the attitude, belief, or feeling being measured.

Answer

A

Guttman Scale
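
Because Guttman items run from weaker to stronger statements, a respondent who endorses a stronger item is expected to endorse every weaker one as well. The check below is a minimal sketch of that cumulative pattern; `is_guttman_consistent` is a hypothetical helper, and it assumes the responses are already listed from weakest to strongest item.

```python
def is_guttman_consistent(endorsed):
    """Return True if no item is endorsed after a weaker item was rejected.

    endorsed: list of booleans ordered from the weakest to the strongest item.
    """
    seen_rejection = False
    for agrees in endorsed:
        if not agrees:
            seen_rejection = True
        elif seen_rejection:
            # Agreeing with a stronger item after rejecting a weaker one
            # breaks the cumulative pattern.
            return False
    return True


print(is_guttman_consistent([True, True, False, False]))  # prints True
print(is_guttman_consistent([True, False, True, False]))  # prints False
```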

Question 16

Q

provide a list of terms
from which the individual
selects those most
characteristic of
herself or himself

Answer

A

Adjective checklist

Question 17

Q

provide a list of adjectives that must be sorted into nine piles of increasing similarity to the target person.

Answer

A

Q-Sorts

Question 18

Q

Guide for Item Writing

Answer

A

1. Define clearly what you wish to measure
2. Generate pool of items
3. Avoid items that are exceptionally long
4. Be aware of the reading level of those taking the scale and the
reading level of the items
5. Avoid items that convey two or more ideas at the same time
6. Consider using questions that mix positive and negative
wording

Question 19

Q

The reservoir or well from which items will or will not be
drawn for the final version of the test

Answer

A

Item Pool

Question 20

Q

Includes variables such as the form, plan, structure, arrangement, and layout of individual test items.

Answer

A

Item Format

Question 21

Q

Items require test takers to select a
response from a set of alternative responses

Answer

A

Selected-response format

Question 22

Q

Items require test takers to supply or to create the correct answer, not merely to select it.

Answer

A

Constructed-response format

Question 23

Q

Multiple-choice format has three elements:

Answer

A

(1) a stem, (2) a correct
alternative or option, and (3) several incorrect alternatives or options
variously referred to as distractors or foils.

Question 24

Q

Distractors

Answer

A

b: standardized behavioral
samples; c: reliable assessment instruments; and d: theory-linked measures

Question 25

Q

A relatively large and easily accessible collection of test
questions.

Answer

A

Item Bank

Question 26

Q

An interactive, computer-administered test-taking process wherein
items presented to the test taker are based in part on the test
taker’s performance on previous items.

Answer

A

Computerized Adaptive Testing
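
The idea that each item depends on performance on earlier items can be sketched with a simple halving rule: move to a harder item after a correct answer and to an easier one after a miss. This is a deliberate simplification and an assumption for illustration only; operational adaptive tests typically select items with more elaborate statistical models. `run_adaptive_test` and its parameters are hypothetical.

```python
def run_adaptive_test(difficulties, answers_correctly, max_items=5):
    """Return the (item_index, was_correct) pairs administered to one test taker.

    difficulties: one difficulty value per item (higher means harder).
    answers_correctly: callable taking an item index and returning True or False.
    """
    # Rank item indices from easiest to hardest.
    order = sorted(range(len(difficulties)), key=lambda i: difficulties[i])
    low, high = 0, len(order) - 1
    administered = []
    while low <= high and len(administered) < max_items:
        middle = (low + high) // 2
        item = order[middle]
        correct = answers_correctly(item)
        administered.append((item, correct))
        if correct:
            low = middle + 1   # next item comes from the harder half
        else:
            high = middle - 1  # next item comes from the easier half
    return administered


difficulties = [0.1, 0.3, 0.5, 0.7, 0.9]
# Simulated test taker who answers every item up to difficulty 0.6 correctly.
result = run_adaptive_test(difficulties, lambda item: difficulties[item] <= 0.6)
print(result)  # prints [(2, True), (3, False)]
```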

Question 27

Q

A discrepancy between scoring in an anchor protocol and the scoring
of another protocol is referred to as

Answer

A

Scoring Drift

Question 28

Q

refers to the revalidation of a test on a sample of
test takers other than those on whom test performance was originally
found to be a valid predictor of some criterion.

Answer

A

Cross-Validation

Question 29

Q

test validation process conducted on two or more
tests using the same sample of test takers.

Answer

A

Co-Validation

Question 30

Q

Allows test developers to evaluate the validity of
items in relation to a criterion measure.

Answer

A

Item-Validity Index
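
One common textbook formulation computes this index as the product of the item-score standard deviation and the correlation between the item score and the criterion score. The sketch below assumes a dichotomous item scored 0 or 1, so that the standard deviation is sqrt(p(1 - p)) with p the proportion passing; the function names and the data are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equally long lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / sqrt(spread_x * spread_y)


def item_validity_index(item_scores, criterion_scores):
    """Item standard deviation times the item-criterion correlation."""
    p = sum(item_scores) / len(item_scores)  # proportion passing the item
    item_sd = sqrt(p * (1 - p))              # SD of a 0/1 scored item
    return item_sd * pearson_r(item_scores, criterion_scores)


# Made-up data: four test takers, one 0/1 item, one criterion score each.
print(round(item_validity_index([1, 1, 0, 0], [4, 3, 2, 1]), 4))  # prints 0.4472
```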

Question 31

Q

Indicates how adequately an item separates
or discriminates between high scorers and low scorers on an entire test.

Answer

A

Item-Discrimination Index
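
A widely used form of this index is d = (U - L) / n, where U and L are the numbers of upper-group and lower-group test takers who answered the item correctly and n is the size of each group. The sketch below assumes the groups are the top and bottom 27% of total scores, a conventional but not mandatory cutoff; `item_discrimination_index` and the data are hypothetical.

```python
def item_discrimination_index(total_scores, item_correct, fraction=0.27):
    """Compute d = (U - L) / n for one item.

    total_scores: total test score per test taker.
    item_correct: 1 if that test taker answered the item correctly, else 0.
    fraction: share of test takers placed in each extreme group.
    """
    n_group = max(1, round(len(total_scores) * fraction))
    # Rank test takers from the highest to the lowest total score.
    ranked = sorted(range(len(total_scores)),
                    key=lambda i: total_scores[i], reverse=True)
    upper = ranked[:n_group]
    lower = ranked[-n_group:]
    u = sum(item_correct[i] for i in upper)
    l = sum(item_correct[i] for i in lower)
    return (u - l) / n_group


totals = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
correct = [1, 1, 1, 0, 1, 0, 0, 1, 0, 0]
print(round(item_discrimination_index(totals, correct), 2))  # prints 0.67
```

Values near +1 mean the item is passed mostly by high scorers; a negative value signals an item that low scorers pass more often than high scorers.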

Question 32

Q

The quality of each alternative within a
multiple-choice item can be readily assessed with reference to the
comparative performance of upper and lower scorers.

Answer

A

Analysis of Item Alternatives
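
In practice this comparison amounts to tallying how often each alternative was chosen by the upper and lower scorers. The sketch below is one way to build that table; `tally_alternatives` is a hypothetical helper and the responses are made-up data.

```python
from collections import Counter


def tally_alternatives(upper_choices, lower_choices, options):
    """Map each option to (times chosen by upper group, times chosen by lower group)."""
    upper = Counter(upper_choices)
    lower = Counter(lower_choices)
    # Counter returns 0 for options nobody in a group selected.
    return {option: (upper[option], lower[option]) for option in options}


# "a" is the keyed answer in this example; "b" and "c" are distractors.
table = tally_alternatives(list("aaab"), list("abcc"), "abc")
print(table)  # prints {'a': (3, 1), 'b': (1, 1), 'c': (0, 2)}
```

Here the keyed answer draws mostly upper scorers and distractor "c" draws only lower scorers, which is the pattern one hopes for; distractor "b" attracts both groups equally and might be worth revising.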

Question 33

Q

is an item that favors one particular group of
examinees in relation to another when differences in group ability are
controlled

Answer

A

Biased Test Item