Test Development Flashcards
It is the product of the thoughtful and sound application of established principles of test construction
Test development
1st step of Test development
Test conceptualization (what, how, who, when, should?)
Preliminary research surrounding the creation of a prototype of the test
Pilot study/research
2nd step of test development
Test construction
Process of setting rules for assigning numbers in measurement
scaling
Credited for being at the forefront of efforts to develop methodologically sound scaling methods
L. L. Thurstone
Type of scale that consists of groupings of words, statements, or symbols on which judgments of the strength of a particular trait, attitude, or emotion are indicated by the testtaker
Rating scale
A scale where the final score is obtained by summing the ratings across all items (e.g. Likert Scale)
Summative scale
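A minimal sketch of summative scoring, assuming five hypothetical items rated 1–5 (not from any real instrument):

```python
# Minimal sketch of summative (Likert-type) scoring.
# The ratings below are hypothetical: 5 items rated 1-5.
ratings = [4, 5, 3, 4, 2]

# Summative score: the sum of the ratings across all items.
total_score = sum(ratings)
print(total_score)  # 18
```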
A scale where testtakers are presented with pairs of stimuli that they are asked to compare
Method of paired comparison
Entails sorting tasks or judgments of a stimulus in comparison with every other stimulus on the scale (e.g., sort items from most justifiable to least justifiable)
Comparative scaling (ordinal)
Stimuli placed into one of two or more alternative categories that differ quantitatively with respect to some continuum
categorical scaling
Respondents who agree with stronger statements of the attitude will also agree with the milder statements
Guttman scale (ordinal)
Item analysis procedure and approach to test development that involves a graphic mapping of a testtaker’s responses
Scalogram analysis
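A minimal sketch of the cumulative logic a scalogram analysis looks for; the item ordering and response patterns are hypothetical:

```python
# Minimal sketch of a Guttman (cumulative) pattern check.
# Items are assumed ordered from mildest to strongest statement;
# 1 = endorsed, 0 = not endorsed. Data are hypothetical.
def is_guttman_pattern(responses):
    # A perfect Guttman pattern has all endorsements before all
    # refusals when items are ordered mild -> strong (e.g., 1,1,1,0,0).
    first_refusal = responses.index(0) if 0 in responses else len(responses)
    return all(r == 0 for r in responses[first_refusal:])

print(is_guttman_pattern([1, 1, 1, 0, 0]))  # True: consistent with the scale
print(is_guttman_pattern([1, 0, 1, 0, 0]))  # False: endorses a stronger item but not a milder one
```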
Scaling method used to obtain data that are presumed to be interval in nature
Equal-appearing intervals (Thurstone)
Reservoir from which items will or will not be drawn for the final version of the test
Item pool
Parts of a multiple-choice item
stem (sentence)
correct option
distractors/foils
Also called a short-answer item
Completion item
Limitations of essay items
Focus on a limited area; subjectivity in scoring
Relatively large and easily accessible collection of test questions
item bank
Interactive, computer-administered test taking process wherein items presented to the testtaker are based in part on the testtaker’s performance on previous items
Computerized-adaptive testing (CAT)
Ability of the computer to tailor the content and order of the presentation of test items on the basis of responses to previous items
Item branching
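A minimal sketch of one simple branching rule (administer the unused item whose difficulty is nearest the current ability estimate); the item bank, difficulties, and update step are hypothetical simplifications, not any specific CAT algorithm:

```python
# Minimal sketch of item branching in computerized-adaptive testing.
# Item names, difficulties (0-1), and the crude ability update are
# all hypothetical simplifications.
item_bank = {"i1": 0.2, "i2": 0.4, "i3": 0.5, "i4": 0.7, "i5": 0.9}

def next_item(ability, administered):
    # Branch to the unadministered item whose difficulty is closest
    # to the current ability estimate.
    candidates = {k: v for k, v in item_bank.items() if k not in administered}
    return min(candidates, key=lambda k: abs(candidates[k] - ability))

ability, administered = 0.5, []
for answered_correctly in [True, False, True]:      # hypothetical responses
    item = next_item(ability, administered)
    administered.append(item)
    ability += 0.1 if answered_correctly else -0.1  # step the estimate up or down
    print(item, round(ability, 2))
```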
Most commonly used scoring model
Cumulative scoring
A type of scoring used by some diagnostic systems wherein individuals must exhibit a certain number of symptoms to qualify for a specific diagnosis
Class/categorical scoring
Compares a testtaker’s score on one scale within a test to a score on another scale within that same test
Ipsative scoring
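A minimal sketch contrasting cumulative and ipsative scoring; scale names and item scores are hypothetical:

```python
# Minimal sketch contrasting cumulative and ipsative scoring.
# Scale names and item scores are hypothetical.
scores = {"scale_A": [3, 4, 5], "scale_B": [2, 2, 3]}

# Cumulative scoring: the higher the total, the more of the trait.
cumulative = {s: sum(items) for s, items in scores.items()}

# Ipsative scoring: compare one scale to another *within* the same
# testtaker, e.g., the difference between the two scale totals.
ipsative_contrast = cumulative["scale_A"] - cumulative["scale_B"]

print(cumulative)         # {'scale_A': 12, 'scale_B': 7}
print(ipsative_contrast)  # 5: scale_A is relatively stronger for this person
```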
3rd step in test development
test tryout
4th step in test development
Item analysis
Items that spur motivation and a positive testtaking attitude and lessen anxiety
Giveaway items
Percentage of people who said yes to, agreed with, or endorsed the item (not who passed the item)
Item endorsement index
Range of the optimal item difficulty
0.3–0.8 (the higher the index, the easier the item)
Formula for OID
Midpoint between chance performance and 1.00: OID = (chance + 1.00) / 2
OID for true-false item
0.75 (chance=0.5)
OID for multiple choice item 4 options
0.63 (chance=0.25)
OID for multiple choice item 5 options
0.60 (chance=0.2)
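All three values above come from the midpoint formula; a minimal sketch (the function name is just for illustration):

```python
# Optimal item difficulty: midpoint between chance performance and 1.00.
def optimal_item_difficulty(chance):
    return (chance + 1.0) / 2

print(optimal_item_difficulty(0.5))   # 0.75         true-false
print(optimal_item_difficulty(0.25))  # 0.625 ~ 0.63 4-option multiple choice
print(optimal_item_difficulty(0.2))   # 0.6          5-option multiple choice
```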
Equal to the product of the item-score standard deviation and the correlation between the item score and the total test score
Item reliability index
Item Analysis Techniques for questions with right/wrong answers
Item Difficulty
Item Discrimination
Distractor Analysis
Item Analysis Techniques for either right/wrong answers or self-report scales
Item reliability index
Cronbach’s alpha
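A minimal sketch of Cronbach's alpha from its standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores); the item scores are hypothetical:

```python
# Minimal sketch of Cronbach's alpha. Item scores are hypothetical.
from statistics import pvariance

items = [
    [4, 3, 5, 2, 4],  # item 1 across 5 testtakers
    [3, 3, 4, 2, 5],  # item 2
    [5, 2, 4, 1, 4],  # item 3
]
k = len(items)
totals = [sum(col) for col in zip(*items)]  # total score per testtaker
alpha = (k / (k - 1)) * (1 - sum(pvariance(i) for i in items) / pvariance(totals))
print(round(alpha, 2))  # 0.87
```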
Equal to the product of the item-score standard deviation and the correlation between the item score and the criterion score
Item validity index
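The item-reliability index above and the item-validity index are parallel computations (item-score SD times a correlation); a minimal sketch of both, with hypothetical scores:

```python
# Minimal sketch of the item-reliability and item-validity indexes:
# item-score SD times the correlation with the total test score
# (reliability) or with an external criterion score (validity).
from statistics import mean, pstdev

def pearson_r(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (pstdev(x) * pstdev(y))

item      = [1, 0, 1, 1, 0, 1]              # one 0/1 item, 6 testtakers
total     = [48, 35, 52, 40, 30, 55]        # total test scores
criterion = [3.1, 2.0, 3.6, 2.8, 1.9, 3.9]  # hypothetical criterion (e.g., GPA)

item_reliability_index = pstdev(item) * pearson_r(item, total)
item_validity_index    = pstdev(item) * pearson_r(item, criterion)
print(round(item_reliability_index, 3), round(item_validity_index, 3))
```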
How adequately an item separates or discriminates between high scorers and low scorers on the entire test
Item discrimination index
What are the key properties of the Item-discrimination index?
- Symbolized by d
- Compares performance on a particular item by the high-ability group and the low-ability group (i.e., the top 27% and the bottom 27%)
- Items that discriminate well have a high positive d value (to a maximum of +1)
- A negative d value is a red flag: it means low scorers are doing better on that item than high scorers
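A minimal sketch of d with hypothetical counts:

```python
# Minimal sketch of the item-discrimination index d for one item.
# u = high scorers (top 27%) who passed the item,
# l = low scorers (bottom 27%) who passed it,
# n = testtakers per group. Counts are hypothetical.
def discrimination_index(u, l, n):
    return (u - l) / n

print(round(discrimination_index(24, 9, 27), 2))   #  0.56: discriminates well
print(round(discrimination_index(10, 16, 27), 2))  # -0.22: red flag
```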
The quality of each alternative within a multiple-choice item can be readily assessed with reference to the comparative performance of upper and lower scorers
Analysis of item alternatives (the test developer can get an idea of the effectiveness of a distractor by means of a simple eyeball test)
Graphic representation of item difficulty and item discrimination
Item characteristic curve (the steeper the slope, the greater the item discrimination)
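One common way to trace an ICC is with a logistic model; a sketch with hypothetical slope (discrimination) and difficulty values:

```python
# Minimal sketch of an item-characteristic curve via a two-parameter
# logistic model: P(correct) = 1 / (1 + exp(-a * (theta - b))).
# a (slope/discrimination) and b (difficulty) are hypothetical.
import math

def icc(theta, a=1.5, b=0.0):
    return 1 / (1 + math.exp(-a * (theta - b)))

for theta in [-2, -1, 0, 1, 2]:         # ability levels
    print(theta, round(icc(theta), 2))  # a steeper slope a -> sharper rise
```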
Test developer addresses the problem of guessing by including in the test manual…
- explicit instructions regarding this point for the examiner to convey to the examinees (e.g., instruct examinees to answer only if certain)
- specific instructions for scoring and interpreting omitted items
Can be used to identify biased items
item characteristic curves
Different shapes of item-characteristic curves for different groups when 2 groups do not differ in total test score
Differential item functioning
Rely primarily on verbal rather than mathematical procedures to explore how individual test items work
Qualitative item analysis (through group discussions, interviews)
Approach to cognitive assessment that entails having respondents verbalize thoughts as they occur
think aloud test administration (one-on-one basis)
Conducted during the test development process in which items are examined for fairness to all prospective testtakers and for the presence of offensive language, stereotypes or situations
Sensitivity review
last step in test development
test revision
Test revision in the life cycle of an existing test
APA suggests that an existing test be kept in its present form as long as it remains useful, but that it should be revised when significant changes in the domain represented, or new conditions of test use and interpretation, make the test inappropriate for its intended use
Revalidation of a test on a sample of testtakers other than those on whom test performance was originally found to be a valid predictor of some criterion
cross-validation (a key step in test development)
Decrease in item validities that inevitably occurs after cross-validation of findings
Validity shrinkage (expected and integral to the test development process)
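A minimal sketch of the idea: a validity coefficient estimated on the original sample is re-estimated on a fresh sample, where it typically comes out lower; all scores are hypothetical:

```python
# Minimal sketch of cross-validation and validity shrinkage.
# Test and criterion scores for both samples are hypothetical.
from statistics import mean, pstdev

def pearson_r(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (pstdev(x) * pstdev(y))

test1      = [10, 12, 15, 11, 14, 9]         # original sample
criterion1 = [2.4, 2.6, 3.4, 2.0, 2.9, 1.9]
test2      = [13, 10, 15, 12, 9, 14]         # new sample
criterion2 = [2.5, 2.4, 3.1, 2.4, 2.2, 2.6]

print(round(pearson_r(test1, criterion1), 2))  # validity in original sample
print(round(pearson_r(test2, criterion2), 2))  # lower here: shrinkage
```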
Test validation conducted on two or more tests using the same sample of testtakers
co-validation (also referred to as co-norming)
Examiners undergo training in test administration using the test manual
Quality assurance
A test protocol scored by a highly authoritative scorer that is designed as a model for scoring and a mechanism for resolving scoring discrepancies; ensure consistency in scoring
anchor protocol
A discrepancy between scoring in an anchor protocol and the scoring of another protocol
scoring drift
Evaluate how well an individual item is working to measure different levels of the underlying construct
IRT information curves
Item functions differently in one group of testtakers as compared to another group of testtakers known to have the same level of the underlying trait (by culture, gender, age)
Differential item functioning (DIF)
Test developers scrutinize group-by-group item response curves looking for DIF items
DIF analysis
Items that respondents from different groups, at the same level of the underlying trait, have different probabilities of endorsing as a function of their group membership
DIF items
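A minimal sketch of a rough DIF screen: at the same (matched) level of the underlying trait, do two groups endorse the item at different rates? The counts are hypothetical:

```python
# Minimal sketch of a DIF screen. Testtakers in the two groups are
# assumed matched on total score (a rough proxy for trait level);
# the endorsement counts are hypothetical.
def endorsement_rate(endorsed, n):
    return endorsed / n

group_a = endorsement_rate(endorsed=40, n=50)  # reference group
group_b = endorsement_rate(endorsed=22, n=50)  # focal group

# A large gap at the same trait level flags the item for DIF review.
print(round(group_a - group_b, 2))  # 0.36 gap -> candidate DIF item
```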
An advantage of the response format of the test
Great breadth (can cover many topics)