Test Development Flashcards

Question

a family of theories that specifies the functional relationship between a response to a single test item and the strength of the underlying latent trait

Answer 1

Item response theory (IRT)

Answer 2

the term for a trace line in item response theory

Answer 3

Model of measurement

Answer 4

Test tryout

Answer 5

Psychological traits exist

Answer 6

Cross validation

Answer 7

the consistent set of behaviours, thoughts or feelings that is the target of a psychological test

Answer 8

exploratory factor analysis (EFA)

Answer 9

the various forms the content of a psychological test can take

Answer 10

Dimensionality

Answer 11

* Test-taker has to choose one of two options (e.g., a statement, object, picture) on the basis of some rule
* The value (e.g., 1 or 0) of each option in each paired comparison is determined by judges prior to test administration

Answer 12

the hypothesised continuously and normally distributed dimension of individual differences that is the sole source of a consistent set of observable behaviours, thoughts and feelings, which is the target of a psychological test

Answer 13

The decrease in item validities that inevitably occurs after cross-validation

Answer 14

Testing/assessment can be fair and benefit society

Answer 15

* Write items using straightforward language that is appropriate for the reading level of the population
* Avoid double-barrelled items
* Avoid slang and colloquial expressions that may quickly become obsolete
* Consider whether using positively and negatively worded items is a good idea
* Write items that the majority of respondents can respond to appropriately
* Ask about sensitive issues using straightforward and nonjudgemental language
* Choose the item response carefully

Answer 16

the process of studying behaviour of items when administered to a group of respondents, usually with a view to the selection of some of the items to form a psychological test

Answer 17

Classical test theory

Answer 18

Content validity

Answer 19

The researcher aims to generate an item pool with good content validity

Answer 20

the possibility that a psychological test item may behave differently for different groups of respondents

Answer 21

Rasch model

Answer 22

a graphical scale, originally with five points, used by a respondent to represent the strength of an underlying attitude or emotion

Answer 23

Likert scale

Answer 24

refers to how many attributes a dataset has

Answer 25

the extent to which items on a test represent the universe of behaviour the test was designed to measure

Answer 26

Does the item separate ‘high’ and ‘low’ scorers?
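One common way to put a number on this question is an upper/lower-group discrimination index, d = (U - L) / n, where U and L are the counts of correct answers in the highest- and lowest-scoring groups and n is the size of each group. The sketch below is illustrative only: the function name, the argument names, and the default 27% group size are assumptions, not something the cards specify.

```python
def discrimination_index(item_correct, total_scores, fraction=0.27):
    """Upper/lower-group discrimination index d = (U - L) / n.

    item_correct: 1/0 per respondent for the item being analysed.
    total_scores: total test score per respondent (same order).
    fraction: share of respondents in each extreme group (assumed default).
    """
    if len(item_correct) != len(total_scores):
        raise ValueError("item_correct and total_scores must be the same length")
    if not total_scores:
        raise ValueError("at least one respondent is required")

    # Size of each extreme group, at least one respondent.
    n = max(1, round(len(total_scores) * fraction))

    # Respondent indices ordered from lowest to highest total score.
    order = sorted(range(len(total_scores)), key=lambda i: total_scores[i])
    lower, upper = order[:n], order[-n:]

    u = sum(item_correct[i] for i in upper)  # correct answers among high scorers
    l = sum(item_correct[i] for i in lower)  # correct answers among low scorers
    return (u - l) / n
```

Values near +1 suggest the item separates high and low scorers well; values near 0 or below suggest it does not.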

Answer 27

* Test conceptualisation
* Test construction
* Test tryout
* Item analysis
* Test revision

Answer 28

assumes factors are correlated

Answer 29

Test manual

Answer 30

* Allows guessing (T/F)
* Only suits content where a dichotomous response can be made
* Content not as rich

Answer 31

* What is the test designed to measure?
* What is the objective of the test?
* Is there a need for the test?
* Who will use the test?
* Who will take the test?
* What content will the test cover?
* How will the test be administered?
* What is the ideal format of the test?
* Should more than one form of the test be developed?
* What special training will be required of test users?
* What types of responses will be required of test takers?
* Who benefits from this test?
* Is there any potential for harm in developing this test?
* How will meaning be attributed to scores on this test?

Answer 32

The test is administered to a representative sample of test-takers under conditions that simulate the conditions under which the final version of the test will be administered

Answer 33

A stage in the process of test development that entails writing test items (or rewriting or revising existing items), as well as formatting items, setting scoring rules, and otherwise designing and building a test

Answer 34

* Number of response options needs to be considered
* Odd vs even number of responses

Answer 35

assumes factors are uncorrelated

Answer 36

Test construction

Answer 37

a way of constructing psychological tests that relies on collecting and evaluating data about how each of the items from a pool of items discriminated between groups of respondents who are thought to show or not show the attribute the test is to measure; also an approach to personality that relates the reports that people make about their characteristic behaviours to their social functioning and thereby provides tools for personality prediction

Answer 38

a way of constructing psychological tests that relies on both reasoning from what is known about the psychological construct to be measured in the test, and collecting and evaluating data about how the test and the items that comprise it actually behave when administered to a sample of respondents

Answer 39

the first stage of test development where the idea for a test begins

Answer 40

Exploratory factor analysis

Answer 41

Traits/states can be measured

Answer 42

* Item difficulty/distribution
* Dimensionality (i.e., factor analysis)
* Item reliability
* Item validity
* Item discrimination

Answer 43

Trace line

Answer 44

* Determine the number of underlying latent variables or constructs
* Help condense information
* Define the content or meaning of the factors
* Help identify items that are performing better or worse
* Items that do not fit into any factor, or that fit into more than one, can be considered for elimination

Answer 45

* Consider removing items with a highly skewed distribution
* These are items that virtually everyone answers in the same way
* Item conveys little information
* Limited variability, so will correlate weakly with other items (impacts on FA)
* Keep items with a high variance/distribution
* Likely to discriminate between the different levels of the construct
* Keep items with a mean close to the centre of the range of possible scores
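The checks above can be sketched as a per-item summary of mean, variance, and distance of the mean from the scale midpoint. This is a minimal illustration, not a prescribed procedure: the function name, the dictionary keys, and the layout of the input (one row per respondent, one column per item) are assumptions. No cut-off values are built in, since the cards do not give any.

```python
from statistics import mean, pvariance


def item_summary(responses, min_score, max_score):
    """Summarise each item's distribution.

    responses: list of rows, one per respondent; each row holds one score per item.
    min_score, max_score: ends of the response scale (e.g., 1 and 5).
    """
    midpoint = (min_score + max_score) / 2
    summaries = []
    # zip(*responses) turns respondent rows into per-item columns.
    for index, scores in enumerate(zip(*responses)):
        item_mean = mean(scores)
        summaries.append({
            "item": index,
            "mean": item_mean,
            "variance": pvariance(scores),  # 0 means everyone answered the same way
            "distance_from_midpoint": abs(item_mean - midpoint),
        })
    return summaries
```

Items with near-zero variance or a mean far from the midpoint are the candidates for removal; items with high variance and a mean near the midpoint are the ones to keep.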

Answer 46

a written statement of the attribute or construct that the test constructor is seeking to measure and the conditions under which it will be used

Answer 47

Test conceptualisation

Answer 48

Test specification

Answer 49

* Domains change
* Interpretations change
* The stimuli age
* Certain words change in their meaning
* Test norms become outdated
* Theories behind the test change

Answer 50

The probability of guessing correctly is taken into account when deciding the optimal item-difficulty index.
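A common textbook rule for selected-response items takes the optimal item difficulty as the midpoint between the chance success rate and 1.0. The sketch below assumes that rule and assumes every option is equally likely to be guessed; the function name is hypothetical.

```python
def optimal_item_difficulty(n_options):
    """Midpoint between the chance success rate (1 / n_options) and 1.0."""
    if n_options < 2:
        raise ValueError("an item needs at least two response options")
    chance = 1.0 / n_options
    return (chance + 1.0) / 2.0
```

Under this rule a true/false item (two options) has an optimal difficulty of 0.75, and a four-option multiple-choice item has an optimal difficulty of 0.625.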

Answer 51

Differential item functioning

Answer 52

a family of theories that specifies the functional relationship between a response to a single test item and the strength of the underlying latent trait

Answer 53

* Easy to construct
* Easy to score
* Quick to administer
* Large number of questions

Answer 54

Item analysis

Answer 55

Latent trait

Answer 56

Item characteristic curve

Answer 57

a graph of the probability of response to an item as a function of the strength of or position on a latent trait
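As an illustration, one widely used form of this curve is the logistic function P(theta) = 1 / (1 + exp(-a(theta - b))), where theta is the position on the latent trait, b is the item difficulty, and a is the item discrimination. The sketch below assumes that form; with a fixed at 1 it reduces to the one-parameter (Rasch-type) curve. The function and parameter names are hypothetical.

```python
import math


def icc(theta, difficulty, discrimination=1.0):
    """Probability of a keyed response at trait level theta (logistic form)."""
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))
```

When theta equals the item difficulty, the probability is 0.5; it rises towards 1 as theta increases and falls towards 0 as theta decreases, and a larger discrimination value makes the curve steeper around the difficulty point.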

Answer 58

the use of factor analysis inductively to identify the factor structure of a set of variables

Answer 59

the extent to which the score on an item correlates with an external criterion relevant to the attribute or construct that is the subject of test construction

Answer 60

Test behaviour is predictive

Answer 61

any of various similar model validation techniques for assessing how the results of a statistical analysis will generalize to an independent data set. It is mainly used in settings where the goal is prediction, and one wants to estimate how accurately a predictive model will perform in practice

Answer 62

* Complex, imaginative or original knowledge
* Written communication
* Information generated not recognised

Answer 63

the assignment of numbers to objects according to a set of rules for the purpose of quantifying an attribute

Answer 64

Is the test applicable to this population?

Answer 65

Tests have strengths/weaknesses/error
