Chapter 5 Flashcards
Reliability (def)
consistency in measurement
Reliability coefficient
A statistic, ranging from 0 to 1, that indexes reliability.
4 types of reliability coefficients
1) test-retest reliability
2) alternate-forms reliability
3) split-half reliability
4) inter-scorer reliability
Measurement error (textbook def)
The inherent uncertainty associated with any measurement, even after preventable mistakes have been minimized
2 influences that interfere with repeated measurement (in psych)
1) changes in the object of measurement (e.g., constant flux in mood, alertness, motivation)
2) the act of measurement itself (i.e., carryover effects such as fatigue and practice)
“True Score”
Not actually "true" in the conceptual sense; a true score is tied to the specific measurement instrument used.
Which 'score' reflects the truth independent of the measurement instrument?
Construct score.
The underlying level of some construct (e.g., depression)
Total variance is made up of what two subtypes of variance?
True variance (variance from actual differences between people) + error variance (random variance irrelevant to the construct)
Define reliability in terms of variance
The proportion of total variance attributable to true variance.
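As a sketch in standard CTT notation (symbols assumed here, not from the card): total variance decomposes into true plus error variance, and reliability is the true-variance share:
\sigma^2_X = \sigma^2_T + \sigma^2_E, \qquad r_{xx} = \frac{\sigma^2_T}{\sigma^2_X} = 1 - \frac{\sigma^2_E}{\sigma^2_X}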
Random vs. Systematic Error
Random: unpredictable, inconsistent, without pattern
Systematic: predictable, constant, can be adjusted for
Bias (in error)
The degree of systematic error that influences measurement
How does item/content sampling contribute to error variance?
The specific items included on a test can affect results (e.g., a testtaker thinking "I hope they ask this question and not that one").
What test administration effects contribute to error variance?
Environment: war, heat, someone chewing gum, pencil quality, etc.
Testtaker variables: lack of sleep, emotions, drugs, etc.
Examiner-related variables: physical appearance, presence or absence of the examiner
How does test scoring and interpretation contribute to error variance?
Subjectivity in scoring certain tests (e.g., essays, creativity tasks) can influence measurement.
test-retest reliability coefficient is also called what?
Coefficient of stability
What might affect test-retest reliability estimates?
Experience, practice, memory, fatigue, etc. may intervene.
What is the alternate-forms/parallel-forms reliability coefficient called?
Coefficient of equivalence
Parallel vs. Alternate forms reliability
Parallel forms: for each form, the means and variances of observed test scores are equal
Alternate forms: different versions of the same test that don't meet the strict requirements of parallel forms
2 similarities between parallel/alternate and test-retest reliability
1) both involve two test administrations with the same group
2) test scores can be affected by factors such as fatigue, practice, and learning
What additional source of error variance is present in alternate/parallel-forms reliability?
Item/Content sampling
Split-half reliability
Correlating two pairs of scores obtained from equivalent halves of a single test administered once.
Compute a Pearson r between one half of the test and the other half, then adjust upward with the Spearman-Brown formula (sketched below).
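A minimal sketch of that adjustment for two halves, assuming the standard Spearman-Brown form (r_{hh} = correlation between the two halves):
r_{SB} = \frac{2\,r_{hh}}{1 + r_{hh}}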
Odd-even reliability
Split-half reliability computed by splitting the test into odd- vs. even-numbered items
How does the number of items affect the reliability coefficient? What method estimates how many items are needed?
Spearman-Brown formula.
More items (of comparable quality) means higher reliability.
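In its general form (notation assumed: n = factor by which the test is lengthened or shortened, r_{xy} = reliability of the existing test), the formula can also be solved for n to estimate how many items a target reliability requires:
r_{SB} = \frac{n\,r_{xy}}{1 + (n - 1)\,r_{xy}}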
Which coefficient is used for inter-item consistency?
Coefficient alpha (Cronbach's alpha)
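For reference, the standard formula (notation assumed: k = number of items, \sigma_i^2 = variance of item i, \sigma_X^2 = total test score variance):
\alpha = \frac{k}{k - 1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_i^2}{\sigma_X^2}\right)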
Inter-scorer reliability
The degree of consistency between two or more scorers.
What is its coefficient called?
Coefficient of inter-scorer reliability
DSM-5 Inter-rater reliability
Kappa = 0.44 (a "fair" level of agreement, moderately greater than chance)
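For reference, Cohen's kappa in its standard form (notation assumed: p_o = observed proportion of agreement, p_e = proportion of agreement expected by chance):
\kappa = \frac{p_o - p_e}{1 - p_e}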
Transient error
Error due to the testtaker's feelings, moods, or mental state varying over time
Homogeneity vs. Heterogeneity of test items
Homogeneous: functionally uniform items measuring a single factor (e.g., one ability or trait); high internal consistency is expected
Heterogeneous: the test measures more than one factor
Does high internal consistency mean homogeneity of items?
Not necessarily.
A longer test tends to yield a high internal-consistency coefficient as long as its items are positively correlated, even if they measure more than one factor.
Dynamic vs. static characteristics
Dynamic: Presumed to be relatively situational and changing
Static: presumed to be relatively unchanging
Restriction/Inflation of range
When a sampled subgroup's scores span a narrower (restricted) or wider (inflated) range than the full population, the correlation coefficient, and with it the reliability estimate, is correspondingly deflated or inflated.
Power Test
Testtakers have enough time to attempt all items, but the items are so difficult that no one earns a perfect score.
Speed test
Items are of uniform difficulty (typically easy); given unlimited time, testtakers should answer everything correctly.
But under the time limit, only some testtakers will be able to complete the whole test.
How do the assumptions of CTT and IRT differ (broadly speaking)?
CTT assumptions are weak and easily met; IRT assumptions are more rigorous.
Domain Sampling Theory
A test's reliability reflects how well its score assesses the domain from which the test's items were sampled.
What is universe score in generalizability theory?
The analogue of the true score: under the same conditions, the same score would be obtained.
Generalizability study
Examines how generalizable scores from a particular test are if the test is administered in different situations.
What coefficient does it yield?
Coefficient of generalizability
Decision study
Examines the usefulness of test scores in helping the test user make decisions; follows a generalizability study.
Another way to say Item response theory
Latent-trait theory
Within CTT, what is the weight assigned to each item on a test?
Equal weight for every item; IRT assigns differential weight.
Dichotomous test items
Items that can be answered with only one of two responses (e.g., true/false)
Polytomous test items
Items with three or more alternative responses
Rasch Model
A type of IRT model with very specific assumptions about the underlying distribution
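A sketch of the one-parameter logistic form usually associated with the Rasch model (notation assumed: \theta = person ability, b = item difficulty):
P(X = 1 \mid \theta, b) = \frac{e^{\theta - b}}{1 + e^{\theta - b}}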
Which measure is used to evaluate whether the difference between two scores is significant?
Standard error of the difference
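One standard form, assuming the two scores share standard deviation \sigma and have reliabilities r_1 and r_2 (notation assumed):
\sigma_{\mathrm{diff}} = \sqrt{\sigma_{E_1}^2 + \sigma_{E_2}^2} = \sigma\sqrt{2 - r_1 - r_2}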