Reliability and Validity Flashcards
Define
Alternative
Two forms of the same test developed; different items selected according to the same rules. Different distribution of scores (mean and variance may not be equal)
Define
Base rate
the proportion of individuals in the population who show the behaviour of interest in a given psychological testing or assessment situation
Define
Classical test theory
a body of related psychometric theory that predicts outcomes of psychological testing such as the difficulty of items or the ability of test-takers
Define
Concurrent validity
a form of predictive validity in which the index of social behaviour is obtained close in time to score on the psychological test (or other assessment device)
Define
Construct underrepresentation
failure to capture important components of a construct
Define
Construct validity
the meaning of a test score made possible by knowledge of the pattern of relationships it enters into with other variables and the theoretical interpretation of those relationships
Define
Construct-irrelevant variance
measuring things other than the construct of interest
Define
Content validity
the extent to which items on a test represent the universe of behaviour the test was designed to measure
Define
Convergent and discriminant validity
the subjection of a multitrait-multimethod matrix to a set of criteria that specify which correlations should be large and which small in terms of a psychological theory of the constructs
Define
Criterion-related validity
the extent to which a measure is related to an outcome (e.g. marks in Year 12 are used to predict performance at university)
Define
Cronbach’s alpha
an estimate of reliability that is based on the average intercorrelation of the items in a test
Define
Cutting point
the test score or point on a scale, in the case of another assessment device, that is used to split those being tested or assessed into two groups predicted to show or not show some behaviour of interest
Define
Domain-sampling model
a way of thinking about the composition of a psychological test that sees the test as a representative sample of the larger domain of possible items that could be included in the test
Define
Equivalent forms reliability
the estimate of reliability of a test obtained by comparing two forms of a test constructed to measure the same construct
Define
Errors of measurement
Factors that contribute to inconsistency - characteristics of the test taker, test or situation that have nothing to do with the attribute being tested but affect scores
Define
Face validity
Does the test look like it measures the relevant construct?
Define
Factor analysis
a mathematical method of summarising a matrix of values (such as the intercorrelations of test scores) in terms of a smaller number of values (factors) from which the original matrix can be reproduced
Define
False negative decision
a decision that incorrectly allocates a test taker or person being assessed to the category of those predicted not to show some behaviour of interest on the basis of their score on a test or other assessment device
Define
False positive decision
a decision that incorrectly allocates a test taker or person being assessed to the category of those predicted to show some behaviour of interest on the basis of their score on a test or other assessment device
Define
Generalisability theory
a set of ideas and procedures that follow from the proposal that the consistency or precision of the output of a psychological assessment device depends on specifying the desired range of conditions over which this is to hold
Define
Incremental validity
the extent to which knowledge of score on a test (or other assessment device) adds to that obtained by another, pre-existing score or psychological characteristic
Define
Inter-rater reliability
the extent to which different raters agree in their assessments of the same sample of ratees
Define
Internal consistency
the extent to which a psychological test is homogeneous or heterogeneous
Define
Kuder-Richardson 20 (KR20)
a particular case of Cronbach’s alpha for dichotomously scored items (i.e. scored as 0 or 1)
Define
Method variance
the variability among scores on a psychological test or other assessment device that arises because of the form as distinct from the content of the test
Define
Multitrait-multimethod matrix
the patterns of correlations resulting from testing all possible relationships among two or more methods of assessing two or more constructs
Define
Parallel forms reliability
Two forms of the same test developed; different items selected according to the same rules. Same distribution of scores (mean and variance equal)
Define
Predictive validity
the extent to which a score on a psychological test (or other assessment device) allows a statement about standing on a variable indexing important social behaviour independent of the test
Define
Reliability
the consistency with which a test measures what it purports to measure in any given set of circumstances
Define
Reliability coefficient
an index - often a Pearson product-moment correlation coefficient - of the ratio of true score variance to total observed score variance in a test as used in a given set of circumstances
Define
Selection ratio
the proportion of those tested or assessed who can be allocated to the category of showing the behaviour of interest in a given psychological testing or assessment situation
Define
Social desirability bias
a form of method variance common in the construction of psychological tests of personality that arises when people respond to questions that place them in a favourable or unfavourable light
Define
Spearman-Brown formula
a formula applied to a split-half correlation to estimate the reliability the test would have if each half were the same length as the full test; i.e. it allows you to estimate internal consistency as if the test were longer or shorter
Define
Split-half reliability
the estimate of reliability obtained by correlating scores on the two halves of a test formed in some systematic way (e.g. odd versus even items)
Define
Stability over time
the extent to which test scores remain stable when a test is administered on more than one occasion
Define
Standard error of estimate
an index of the amount of error in predicting one variable from another
Define
Standard error of measurement
an expression of the precision of an individual test score as an estimate of the trait it purports to measure
Define
Test-Retest Reliability
the estimate of reliability obtained by correlating scores on the same test administered to the same group on two separate occasions
Define
True scores
Factors that contribute to consistency - stable attributes under examination
Define
Valid negative decision
a decision that correctly allocates a test taker or person being assessed to the category of those predicted not to show some behaviour of interest on the basis of their score on a test or other assessment device
Define
Valid positive decision
a decision that correctly allocates a test taker or person being assessed to the category of those predicted to show some behaviour of interest on the basis of their score on a test or other assessment device
Define
Validity
the extent to which evidence supports the meaning and use of a psychological test (or other assessment device)
What is reliability?
- The degree to which a test tool provides consistent results
- A test is considered reliable when it produces the same results again and again, when measuring the same thing
What is validity?
Validity can be broadly understood as the extent to which a test measures the construct it is intended to measure
John is feeling unwell and visits his GP. The GP orders some blood tests. The results of the blood tests indicate that John is iron deficient. The doctor prescribes iron supplements, which John immediately starts taking as prescribed. After a few weeks he returns to the doctor to repeat the blood tests. The results indicate that his iron levels have decreased!
What might be happening?
- The test may have poor validity (i.e. it is measuring some other variable)
- The test has poor reliability (i.e. when repeated, the test often shows different results)
Why are reliability and validity important?
- Diagnosis
- Assessment of ability
- Treatment
- Decisions around recommending treatment
- Monitoring treatment outcomes (e.g. reliability would be really important if you are repeating tests to see if the treatment is working)
- The conclusions you can draw rely on the reliability and validity of the tests/assessments you are using.
- Important clinically and in research
True or False:
A test can be reliable without being valid
True
True or False:
A test can be valid without being reliable
False
Tests cannot be valid without being reliable
According to Classical Test Theory, what are test scores the result of?
- Factors that contribute to consistency – stable attributes under examination (“True Scores”)
- Factors that contribute to inconsistency – characteristics of the test taker, test or situation that have nothing to do with the attribute being tested but affect scores. (“Errors of Measurement”)
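The decomposition above can be sketched numerically. The simulation below is illustrative only (the distributions, IQ-like scale and sample size are assumptions, not from the deck): observed scores are generated as true score plus random error, and the observed variance comes out as roughly the sum of the two components.

```python
import random

random.seed(1)

n = 10_000
# True scores: the stable attribute under examination (IQ-like scale assumed)
true_scores = [random.gauss(100, 15) for _ in range(n)]
# Errors of measurement: random noise unrelated to the attribute
errors = [random.gauss(0, 5) for _ in range(n)]
# Classical Test Theory: observed score = true score + error
observed = [t + e for t, e in zip(true_scores, errors)]

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

# With independent errors, var(observed) is approximately var(true) + var(error)
print(variance(observed), variance(true_scores) + variance(errors))
```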
What are some common sources of error on a test?
Item selection
Test administration
Test scoring
Systematic measurement error
How is item selection a potential source of error?
The sample of items chosen may not be equally reflective of every individual’s true score.
How is test administration a potential source of error?
General environmental conditions (e.g. temperature, lighting, noise); temporary “states” of the test taker (e.g. fatigue, anxiety, distraction).
E.g. completing an IQ test in a loud, noisy room, or an examiner providing non-standardised instructions.
Domain Sampling Theory considers the problem of using only a ________ of items to represent a construct
Domain Sampling Theory considers the problem of using only a sample of items to represent a construct
If the same test is administered to the same group twice at two different times, why might the scores not be the same?
- Imperfect test-retest reliability (measurement error)
- Practice effects
- Maturation
- Treatment effects or changes in setting
Which of these would test-retest be appropriate for?
- State anxiety
- Weight of a baby
- Extraversion
- Intelligence
Extraversion and intelligence: test-retest reliability suits relatively stable constructs. State anxiety and the weight of a baby change over time, so test-retest is not appropriate for them.
How do you maximise test-retest reliability?
- Test a relatively stable construct
- No intervention in between testing
- Shorter time between testing
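Test-retest reliability itself is simply the correlation between scores from the two administrations. A minimal sketch, with invented scores for five test takers (none of the numbers come from the deck):

```python
def pearson(x, y):
    # Pearson product-moment correlation between two score lists
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

# Same anxiety scale administered to the same five people, two weeks apart
time1 = [10, 12, 9, 15, 11]
time2 = [11, 13, 9, 14, 12]
print(round(pearson(time1, time2), 2))  # a high correlation indicates good stability
```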
What is the difference between parallel and alternate forms reliability?
They both involve developing two forms of the same test, with different items selected according to the same rules.
Parallel Forms: same distribution of scores (means and variance equal)
Alternate Forms: different distribution of scores (mean and variance may not be equal)
What is the split-half method?
- Test is split into halves (randomly, odd-even system, top vs bottom)
- Correlate the two halves
- The estimate of reliability based on a split half is an underestimate, because each half has fewer items than the full test
- The Spearman-Brown formula is applied to estimate the reliability each half would have if it were the same length as the full test
- i.e. it allows you to estimate internal consistency as if the test were longer or shorter
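The steps above (split odd-even, correlate the halves, then apply the Spearman-Brown correction) can be sketched as follows. The data layout — one list of item scores per test taker — is an assumption for illustration:

```python
def pearson(x, y):
    # Pearson product-moment correlation between two score lists
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def split_half_reliability(item_scores):
    """item_scores: one list of item scores per test taker."""
    odds = [sum(person[0::2]) for person in item_scores]   # odd-numbered items
    evens = [sum(person[1::2]) for person in item_scores]  # even-numbered items
    r_half = pearson(odds, evens)
    # Spearman-Brown: reliability the full-length test would have
    return (2 * r_half) / (1 + r_half)
```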
What is the rationale for the split-half method?
if scores on 2 half tests from single administration are highly correlated, scores on 2 whole tests from separate administrations should also be highly correlated.
What is Cronbach’s alpha?
A generalised reliability coefficient for scoring systems in which each item is graded (e.g. on a rating scale from agree to disagree)
- Mean of all possible split-half correlations, corrected by the Spearman-Brown formula
- Ranges from 0 (no similarity) to 1 (perfectly identical)
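In practice alpha is computed from item variances rather than by averaging all split-half correlations directly; the two are equivalent. A hedged sketch (the data layout, one list of item scores per person, is an assumption):

```python
def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def cronbach_alpha(item_scores):
    """alpha = k/(k-1) * (1 - sum of item variances / variance of total scores)."""
    k = len(item_scores[0])  # number of items
    sum_item_vars = sum(
        variance([person[i] for person in item_scores]) for i in range(k)
    )
    total_var = variance([sum(person) for person in item_scores])
    return (k / (k - 1)) * (1 - sum_item_vars / total_var)
```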
What are considered acceptable levels of reliability?
Depends on the purpose to some extent
- .70-.80 acceptable or good
- Greater than .90 may indicate redundancy in items
- High reliability is really important in clinical settings when making decisions for a person (e.g. decision making capacity assessment).
__________________: a particular case of Cronbach’s alpha for dichotomously scored items (i.e. scored as 0 or 1)
Kuder-Richardson 20 (KR20): a particular case of Cronbach’s alpha for dichotomously scored items (i.e. scored as 0 or 1)
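KR20 has the same structure as alpha, with each item's variance replaced by p × (1 − p) for 0/1 items. A sketch (the data layout is an assumption for illustration):

```python
def kr20(item_scores):
    """KR-20 for dichotomously scored (0/1) items.
    item_scores: one list of 0/1 item scores per test taker."""
    k = len(item_scores[0])  # number of items
    n = len(item_scores)     # number of test takers
    # p = proportion scoring 1 on each item; item variance = p * (1 - p)
    sum_pq = 0.0
    for i in range(k):
        p = sum(person[i] for person in item_scores) / n
        sum_pq += p * (1 - p)
    totals = [sum(person) for person in item_scores]
    mean = sum(totals) / n
    total_var = sum((t - mean) ** 2 for t in totals) / n
    return (k / (k - 1)) * (1 - sum_pq / total_var)
```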
The __________ the SEM, the less certain we are that the test score represents the true score.
The larger the SEM, the less certain we are that the test score represents the true score.
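The SEM is computed from the test's standard deviation and its reliability as SEM = SD × √(1 − reliability). A sketch using a Wechsler-style IQ scale; the SD of 15 and reliability of .91 are illustrative values, not from the deck:

```python
def standard_error_of_measurement(sd, reliability):
    # SEM = SD * sqrt(1 - reliability): the spread of observed scores
    # expected around a person's true score
    return sd * (1 - reliability) ** 0.5

sem = standard_error_of_measurement(15, 0.91)
print(round(sem, 1))  # 4.5 -> roughly a +/- 9-point 95% band around a score
```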
How do you maximise reliability?
- Clear administration and scoring instructions for test user
- Clear instructions for the test taker
- Unambiguous test items
- Standardised testing environment and procedure
- Reduced time between testing sessions
- Increase assessment length/items
- Test try-out and modification
- Discarding items that decrease reliability (item analysis)
- Maximise VALIDITY
Draw a diagram that demonstrates the different types of validity

What are the four main types of validity?
Face validity
Content validity
Criterion-related validity
Construct validity
What are some issues for content validity?
- Construct underrepresentation: failure to capture important components of a construct.
- e.g. A depression scale that assesses cognitive and emotional components of depression, but not behavioural components.
- Construct-irrelevant variance: measuring things other than the construct of interest.
- e.g. The wording of our depression scale may make it likely that people will respond in socially desirable ways.
What are some examples of criterion-related validity?
- e.g. marks in Year 12 are used to predict performance at university
- e.g. a marital satisfaction survey is used to predict divorce
- e.g. scores on an anxiety scale you developed are correlated with clinical observations.
What is an example of concurrent validity?
A test designed to measure anxiety may be administered in conjunction with a diagnostic interview by an experienced clinician using the DSM-5. The concurrent validity of the test represents the extent to which the test score corresponds with the clinician’s observations of the client’s anxiety levels.
What is an example of predictive evidence?
VCE marks or ATAR scores are used to predict performance at university
What is an example of convergent evidence?
e.g. Relationship between scores on a measure of psychopathy and low emotional arousal.
e.g. Relationship between low self-esteem and depression.
What are some examples of discriminant evidence?
Scores on an anxiety measure should differ from scores on a depression measure, if each measure is assessing these individual constructs.