Lecture 4.2 Reliability Flashcards
Reliability
• The consistency with which a test measures what it purports to measure in any given set of circumstances
True
True or False: A reliable test will result in the same score every time it is used to measure the same thing under the same conditions
Reliability coefficient
An index of reliability that indicates the ratio between the true score variance on a test and the total variance (SD²)
> .90
Reliability coefficient of _______ is excellent for research purposes and appropriate for individual assessment
> .80
Reliability coefficient of _______ is good for research purposes, marginal for individual assessment
Reliability coefficient
- higher scores = higher reliability
- > .6 is marginal for research purposes
- > .70 is adequate for research purposes
Classical Test Theory
assumes that each person has an innate true score. It can be summed up with an equation:
X = T + E
where the observed score (X) equals the true score (T) plus error (E)
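Under this model, the reliability coefficient is the ratio of true-score variance to total (true + error) variance. A minimal sketch, using hypothetical variance values chosen only to illustrate the cut-offs above:

```python
# Classical Test Theory sketch: reliability as the ratio of
# true-score variance to total (observed) variance.
# The variance values below are hypothetical, for illustration only.

def reliability(true_variance, error_variance):
    """r_xx = true variance / (true variance + error variance)."""
    return true_variance / (true_variance + error_variance)

# Mostly true variance -> high reliability; mostly error -> low reliability.
print(reliability(90, 10))  # 0.9 -> excellent
print(reliability(60, 40))  # 0.6 -> marginal
```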
more reliable
higher proportion of true variance
less reliable
higher proportion of error variance
increase or decrease
error variance may ______ or ______ a test score by varying amounts, leading to lower reliability
Systematic and unsystematic error
Two types of testing error
Systematic error
Testing error that doesn’t affect reliability. Consistent and predictable (when you are aware of it) – e.g., a slowly leaking tyre
Unsystematic error
Testing error that affects reliability. Inconsistent and unpredictable – e.g., an intermittent electrical problem
Test construction
Sources of Error Variance T_______ C_______
The content covered by test items, the way questions are asked, and the response format all add to the error variance of a test
Test administration
Sources of Error Variance T_______ A_______
• Test environment (including test materials), test-taker variables (e.g., alertness, wellbeing, mistakes) & administrator-related variables (e.g., presence or absence, demeanour, departure from procedure, unconscious cues, etc.)
Test scoring & interpretation
Sources of Error Variance T_______ s_______ & i_______
Human error - data entry, transcription, coding, calculation, timing, etc.
Level of objectivity/subjectivity
Human fallibility
Sources of Error Variance h_______ f_______
• Forgetting or misremembering
• Failing to notice or not being aware
• Not understanding or following instructions
• Under- and over-reporting
• Differences of opinion
• Lying or misleading
Time and practice effects
Sources of Error Variance t_______ and p_______ e_______
Domain Sampling Model
This model assumes that the items that have been selected for any one test are just a sample of items from an infinite domain of potential items. Error that occurs in the development of a test.
Domain Sampling Model
• Seeks to determine how precisely the test score assesses the domain from which the test draws a sample
True score
The score you would get if you answered every item that could conceivably be drawn from the domain.
Standard Error of Measurement (SEM)
• Measures the precision of an observed score & provides an estimate of the amount of error inherent in an observed score or measurement
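The SEM is computed from the test's standard deviation and its reliability coefficient: SEM = SD × √(1 − r). A minimal sketch, with hypothetical values (SD = 15, r = 0.91) loosely echoing an IQ-style scale:

```python
import math

# Standard Error of Measurement sketch: SEM = SD * sqrt(1 - r_xx),
# where SD is the test's standard deviation and r_xx its reliability.
# The values below are hypothetical, for illustration only.

def sem(sd, reliability):
    return sd * math.sqrt(1 - reliability)

print(round(sem(15, 0.91), 2))  # 4.5
```

A higher reliability coefficient shrinks the SEM, so observed scores cluster more tightly around the true score.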
Standard Error of Difference (SED)
Can be used to compare:
• an individual’s scores on two different tests
• two different people’s scores on the same test
• two different people’s scores on two different tests
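The SED is commonly obtained by combining the SEMs of the two scores being compared: SED = √(SEM₁² + SEM₂²). A minimal sketch with hypothetical SEM values:

```python
import math

# Standard Error of Difference sketch: combines the SEMs of the two
# scores being compared. The SEM values below are hypothetical.

def sed(sem_1, sem_2):
    """SED = sqrt(SEM_1^2 + SEM_2^2)."""
    return math.sqrt(sem_1 ** 2 + sem_2 ** 2)

print(sed(3.0, 4.0))  # 5.0
```

A score difference is only interpreted as meaningful when it clearly exceeds the SED.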
Test-Retest Reliability
- Calculated by correlating scores from the same people on two different administrations of the same test
- Used for measuring characteristics that are thought to be stable (e.g. personality traits or intelligence)
amount of time between administrations
Any interventions, treatment or trauma, taking place between test administrations;
Test-retest reliability will be affected by
Parallel & Alternate Forms Reliability
Different versions of a test, matched for content and difficulty
Split-Half Reliability
Scores from one half of a test are correlated with the other half of the test, using equivalent halves
• Random, odds & evens, content & difficulty
Inter-Rater Reliability
The degree of agreement between two or more scorers. Improved by appropriate training.
Test-retest
correlate scores from 2 administrations of the same test
Parallel forms
correlate scores from 2 versions of the same test
Split-half
correlate scores from 2 equivalent halves of the same test
Internal consistency
correlate items within the same test
Inter-rater
correlate scores from 2 scorers for one test taker
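The internal-consistency entry above is most often operationalised as Cronbach's alpha, α = (k / (k − 1)) × (1 − Σ item variances / variance of totals). A minimal sketch with hypothetical item data:

```python
from statistics import pvariance

# Cronbach's alpha sketch for internal consistency:
# alpha = (k / (k - 1)) * (1 - sum(item variances) / variance of totals).
# Item data are hypothetical (rows = people, columns = items).
items = [
    [3, 4, 3, 5],
    [4, 4, 5, 5],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]

k = len(items[0])                                    # number of items
item_vars = [pvariance(col) for col in zip(*items)]  # variance of each item
total_var = pvariance([sum(row) for row in items])   # variance of total scores

alpha = (k / (k - 1)) * (1 - sum(item_vars) / total_var)
print(round(alpha, 2))
```

When the items all tap the same construct, item responses covary, the total-score variance dominates, and alpha approaches 1.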
reliability coefficients
Indicates the ratio between the true score variance on a test and the total variance
Range from 0 to 1: closer to 1, the higher the reliability
Homogenous
A __________ test is unifactorial, so consists of items measuring a single trait or factor
Heterogenous
A __________ test is multifactorial, so measures more than one trait or factor
static
a characteristic, trait, or ability that is presumed to be relatively unchanging
dynamic
a characteristic, state, or ability that is presumed to be ever changing as a function of situational and cognitive experiences
Restricted range or variance
sampling procedure used to gather the test scores does not result in a full spread of scores (e.g., having only university students complete an IQ test)
Inflated range or variance
when the sample includes people who are outside of the range of the test so the scoring range is inflated (e.g., adults completing a test designed for children)
speed test
all items of equal difficulty, and time limited so that no-one is likely to be able to answer all items
power test
time limit is long enough for all items to be attempted, but some items are so difficult that no-one is likely to get them all right
Criterion-Referenced
Designed to provide an indication of where a test taker stands with respect to some criterion (e.g., pass/fail tests)
Validity
The extent to which evidence supports the meaning and use of a psychological test (or other assessment device)
The validity coefficient
A correlation coefficient that provides a measure of the relationship between test scores and scores on the criterion measure
Validity
How well a test or measurement tool measures what it purports to measure in a particular context
Classic (trinitarian) Model
focuses on three categories of validity
Content Validity
Type of validity - scrutinizing the test’s content
Criterion-related validity
Type of validity - relating scores obtained on the test to other test scores or other measures
Construct validity
Type of validity - ‘umbrella validity’; comprehensive analysis of how test scores relate to scores on other tests/measures & how test scores relate to the construct that the test was designed to measure
Unitary Model of validity
_____________ view takes everything into account, from implications of test scores in terms of societal values to the consequences of use
Test validation
- The process of gathering and evaluating validity evidence.
- Test developer is responsible for supplying validity information in the test manual and/or through a ‘test validation’ journal article
Content Validity
• Describes a judgement of how adequately a test samples behaviour representative of the universe of behaviour that the test was designed to sample
Face Validity
Type of content validity
A judgement concerning how relevant the test items appear to be to the test-taker
Quantifying content validity
Important in employment settings, where tests are used to hire & promote
• Tests must be shown to include relevant items in terms of job skills required for the position
• Lawshe (1975):
• Is the skill or knowledge measured by this item: 1) Essential; 2) Useful but not essential; 3) Not necessary to the performance of the job?
Culture
C____________ has an impact on judgements concerning the validity of tests and test items
Criterion-Related Validity
C__________-r__________ v__________
A judgement of how adequately a test score can be used to infer an individual’s most probable standing on some measure of interest – the measure of interest being the criterion
criterion
A _____________ is the standard against which a test or test score is evaluated -can be almost anything:
- RELEVANT
- VALID
- UNCONTAMINATED
A criterion should be:
- R___________ – pertinent or applicable to the matter at hand
- V___________ for the purpose for which it is being used
- U____________ – not based on predictor measures
Predictive Validity
P___________ v___________ is the degree to which a test score predicts a criterion measure at a future time
Concurrent Validity
C___________ v_________ is the degree to which a test score is related to a criterion measure that is obtained at (about) the same time
Incremental Validity
I___________ V__________
The degree to which an additional predictor explains something about the criterion measure that is not explained by predictors already in use
False negatives
test takers predicted not to show the characteristic, but who do
False positives
test takers predicted to show the characteristic, but who don’t
Miss rate
M_____ r_______ - the proportion of people incorrectly classified
Hit rate
H_____ r_______ - the proportion of people correctly identified
Base rate
B_____ r_______ - the extent to which a particular trait, behaviour, characteristic or attribute exists in the population
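These rates can be computed by comparing predicted against actual classifications. A minimal sketch with hypothetical data (True = shows the characteristic):

```python
# Hit/miss-rate sketch: compare predicted vs. actual classifications.
# The data below are hypothetical, for illustration only.
predicted = [True, True, False, False, True, False, True, False]
actual    = [True, False, False, True, True, False, True, False]

pairs = list(zip(predicted, actual))
false_positives = sum(1 for p, a in pairs if p and not a)  # predicted yes, actually no
false_negatives = sum(1 for p, a in pairs if not p and a)  # predicted no, actually yes
hits = sum(1 for p, a in pairs if p == a)                  # correct classifications

hit_rate = hits / len(pairs)
miss_rate = 1 - hit_rate
base_rate = sum(actual) / len(actual)  # prevalence of the characteristic

print(hit_rate, miss_rate, base_rate)  # 0.75 0.25 0.5
```

Note that when the base rate is very low or very high, even a test with a good hit rate can produce many false positives or false negatives.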
Construct validity
C_________ v___________
A judgement about the appropriateness of inferences drawn from test scores regarding individual standings on a variable called a construct
Homogeneity of items; changes with age; pre-test to post-test changes; group differences; convergent evidence; divergent evidence; factor analysis
Evidence of construct validity: H__________ of items; changes with a____; pre-test to p________ changes; g_______ differences; c__________ evidence; d__________ evidence; f________ analysis
Evidence of homogeneity
E__________ of h___________ - How uniform the test is in measuring a single concept
Evidence of changes with age
Some constructs are expected to change with age, particularly during childhood/adolescence
Evidence of pre-test/post-test changes
Evidence that scores change as the result of some experience between a pre-test and a post-test can be evidence of construct validity
Evidence from distinct groups
Demonstrating that scores on the test vary in a predictable way as a function of membership in some group
Convergent evidence
When test scores on a new test are found to correlate highly in the predicted direction with scores on an older, more established and validated test designed to measure the same construct
Discriminant evidence
Shown when test scores are found to have little or no relationship with test scores or variables with which, theoretically, there should be no relationship
Factor Analysis
Can be used to determine both convergent and discriminant evidence of construct validity
Confirmatory Factor Analysis
A factor structure is explicitly hypothesised and is tested for its fit with the observed covariance structure of the measured variables
Exploratory Factor Analysis
Estimating or extracting factors, deciding how many factors to retain, rotating factors to an interpretable orientation