Reliability & Validity Flashcards

1
Q

What can high reliability guarantee?

A

Consistency

2
Q

How can one test reliability?

A

Test-retest correlation

3
Q

How does test-retest correlation work?

A

Administering an instrument twice to the same population and correlating the two sets of scores

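As an illustration beyond the cards themselves, a minimal sketch in Python, assuming SciPy is available; the paired scores are invented:

```python
# Test-retest reliability: correlate two administrations of the same
# instrument in the same sample.
from scipy.stats import pearsonr

time1 = [12, 18, 9, 22, 15, 20, 11, 17]   # first administration (made-up scores)
time2 = [13, 17, 10, 21, 16, 19, 12, 18]  # retest, e.g. 2-14 days later

r, p = pearsonr(time1, time2)
print(f"test-retest r = {r:.2f}")  # values near 1 indicate consistent measurement
```
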
4
Q

How does one avoid the practice effect when doing test-retest?

A

The interval must be long enough that subjects do not simply recall their earlier answers
But short enough that the underlying state does not change

5
Q

Time difference for test-retest in psychiatric studies

A

2-14 days

6
Q

What measures internal consistency of a test?

A

Cronbach's alpha

7
Q

How does Cronbach's alpha test internal consistency?

A

By correlating each item with the total score and averaging the correlation coefficients

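A minimal sketch of the computation (the card's description is a shorthand), using the standard variance form of the formula, alpha = k/(k-1) x (1 - sum of item variances / variance of the total score); the response matrix is invented:

```python
# Cronbach's alpha from an items matrix (rows = respondents, columns = items).
import numpy as np

items = np.array([
    [3, 4, 3, 5],
    [2, 2, 3, 2],
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 4, 5, 5],
])  # made-up Likert responses

k = items.shape[1]
item_vars = items.var(axis=0, ddof=1)      # variance of each item
total_var = items.sum(axis=1).var(ddof=1)  # variance of the total score
alpha = k / (k - 1) * (1 - item_vars.sum() / total_var)
print(f"Cronbach's alpha = {alpha:.2f}")   # about 0.90 here; >= 0.7 is the usual cut-off
```
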
8
Q

Values of Cronbach's alpha

A

Negative infinity to 1

9
Q

What Cronbach's alpha values make sense?

A

Positive values only

10
Q

Cut-off for Cronbach's alpha for a test to be internally consistent?

A

0.7

11
Q

What is split half reliability?

A

Splitting the scale into two halves and examining the correlation between the half-scores

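A minimal split-half sketch, assuming an odd/even item split (any split works); the Spearman-Brown step, 2r/(1+r), corrects the half-test correlation up to full test length:

```python
import numpy as np
from scipy.stats import pearsonr

items = np.array([
    [3, 4, 3, 5, 2, 4],
    [2, 2, 3, 2, 1, 2],
    [4, 5, 4, 4, 5, 5],
    [3, 3, 2, 3, 3, 2],
    [5, 4, 5, 5, 4, 5],
])  # made-up responses: rows = respondents, columns = items

half1 = items[:, ::2].sum(axis=1)   # odd-numbered items
half2 = items[:, 1::2].sum(axis=1)  # even-numbered items
r, _ = pearsonr(half1, half2)
print(f"split-half r = {r:.2f}, Spearman-Brown corrected = {2 * r / (1 + r):.2f}")
```
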
12
Q

What is intraclass correlation coefficient used for?

A

Continuous variables

13
Q

What is the intraclass correlation coefficient?

A

The proportion of total measurement variance that reflects true between-subject variability

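A minimal sketch of a one-way random-effects ICC, computed directly from its ANOVA mean squares (between-subject MSB, within-subject MSW); the ratings are invented:

```python
# ICC(1,1) = (MSB - MSW) / (MSB + (k - 1) * MSW): the share of total variance
# attributable to true between-subject differences.
import numpy as np

ratings = np.array([
    [9.0, 10.0],
    [6.0,  7.0],
    [8.0,  8.0],
    [4.0,  5.0],
    [7.0,  6.0],
])  # made-up scores: 5 subjects (rows) rated by 2 raters (columns)

n, k = ratings.shape
grand = ratings.mean()
msb = k * ((ratings.mean(axis=1) - grand) ** 2).sum() / (n - 1)
msw = ((ratings - ratings.mean(axis=1, keepdims=True)) ** 2).sum() / (n * (k - 1))
icc = (msb - msw) / (msb + (k - 1) * msw)
print(f"ICC = {icc:.2f}")  # 0 = unreliable, 1 = perfect reliability
```
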
14
Q

Range of intraclass correlation coefficient?

A

0 (unreliable) - 1 (perfect reliability)

15
Q

What can ICC be measured for?

A

Relative or absolute agreement

16
Q

Difference between relative and absolute agreement

A

Relative agreement ignores systematic differences between raters, whereas absolute agreement does not; the relative ICC is therefore always at least as high as the absolute ICC

17
Q

Levels of ICC and their meanings

A

0.6 = fair
0.8 = very good
0.9 = excellent

18
Q

What is ANOVA intraclass coefficient used for?

A

Quantitative data with more than 2 raters/groups

19
Q

What is used to test reliability for nominal data with more than 2 categories?

A

Kappa or weighted kappa

20
Q

What is face validity?

A

A subjective judgement of whether, at face value, a test measures the construct of interest

21
Q

Types of construct validity

A

Content
Criterion
Convergent
Discriminant
Experimental

22
Q

What is criterion validity made up of?

A

Concurrent
Predictive

23
Q

What is construct validity?

A

Whether a test truly measures the construct of interest

24
Q

What is unified construct validity?

A

Both content and criterion validity

25
Q

What is content validity?

A

Whether the contents of the test are in line with the specifications the test was designed to measure

26
Q

What does content validity look for?

A

Good coverage of all domains thought to be related to the measured condition

27
Q

How does one measure content validity?

A

It cannot be statistically tested; instead, experts are asked to comment on this validity

28
Q

What is criterion validity?

A

Performance of a test against an external criterion, such as another instrument or a future diagnostic outcome

29
Q

What is concurrent validity?

A

Ability of a test to distinguish between subjects who differ concurrently on other measures (using other instruments)

30
Q

What is predictive validity?

A

Ability of a test to predict future group differences according to current group scores

31
Q

What is incremental validity?

A

Ability of a measure to predict or explain variance over and above other measures

32
Q

What can one divide construct validity into?

A

Concurrent & predictive
Convergent, discriminant & experimental
Factorial

33
Q

What is convergent validity?

A

Agreement between instruments that measure the same construct

34
Q

What is discriminant validity?

A

Degree of disagreement between two scales measuring different constructs

35
Q

What is experimental validity?

A

Sensitivity to change

36
Q

What is factorial validity?

A

Established via factor analysis of the items in a scale

37
Q

What is precision?

A

The degree to which the mean varies with repeated sampling

38
Q

What leads to imprecision?

A

Random errors

39
Q

Factors that reduce precision

A

Wide confidence interval limits
Requiring a higher confidence level (e.g. 99% rather than 95%)

40
Q

What is accuracy?

A

Correctness of the mean value, i.e. how close it is to the true population value

41
Q

What compromises both validity and accuracy?

A

Bias

42
Q

Disadvantages of percent agreement?

A

It overestimates the degree of agreement, because some agreement is expected by chance alone

43
Q

What does kappa indicate?

A

The level of agreement beyond that expected by chance

44
Q

What is kappa used for?

A

Agreement on categorical variables

45
Q

What is weighted kappa used for?

A

Ordinal variables
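
Covering the two cards above, a minimal sketch in Python, assuming scikit-learn is available (any package with Cohen's kappa would do); the ratings are invented:

```python
# Plain kappa for categorical ratings; weighted kappa for ordinal ratings.
from sklearn.metrics import cohen_kappa_score

rater1 = [0, 1, 2, 0, 1, 0, 2, 1]  # made-up ratings: 0 = mild, 1 = moderate, 2 = severe
rater2 = [0, 2, 2, 0, 1, 0, 1, 1]

print(cohen_kappa_score(rater1, rater2))                    # unweighted kappa
print(cohen_kappa_score(rater1, rater2, weights="linear"))  # weighted kappa for ordinal data
```
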
46
Q

What is used for beyond-chance agreement in continuous variables?

A

Bland-Altman plot

47
Q

Degree of agreement if kappa is 0

A

None

48
Q

Degree of agreement if kappa is 0-0.2

A

Slight

49
Q

Degree of agreement if kappa is 0.2-0.4

A

Fair

50
Q

Degree of agreement if kappa is 0.4-0.6

A

Moderate

51
Q

Degree of agreement if kappa is 0.6-0.8

A

Substantial

52
Q

Degree of agreement if kappa is 0.8-1.0

A

Almost perfect

53
Q

What affects kappa?

A

The prevalence of the outcome studied: the more common the condition, the higher the agreement expected by chance, and therefore the lower the kappa

54
Q

Calculation of kappa

A

kappa = (observed agreement - agreement expected by chance) / (100% - agreement expected by chance)
Equivalently: (observed agreement beyond chance) / (maximum possible agreement beyond chance)
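
A worked example of this formula with invented counts: two raters classify 100 patients as case/non-case, each calling 50 cases; they agree on 80 patients, and 50% agreement is expected by chance.

```python
# kappa = (observed agreement - chance agreement) / (1 - chance agreement)
both_case, both_noncase = 40, 40   # patients on whom the raters agree
rater1_case, rater2_case = 50, 50  # marginal totals for each rater
n = 100

observed = (both_case + both_noncase) / n                   # 0.80
chance = (rater1_case / n) * (rater2_case / n) \
       + ((n - rater1_case) / n) * ((n - rater2_case) / n)  # 0.50
kappa = (observed - chance) / (1 - chance)
print(f"kappa = {kappa:.2f}")  # 0.60 = 'substantial' agreement
```
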
55
Q

What numerical values are needed to calculate kappa?

A

The percentage of patients that the 2 assessors classified the same way (observed agreement)
The agreement expected by chance

56
Q

What is kappa dependent on?

A

Prevalence of the measured condition

57
Q

What type of disorders will kappa be low for?

A

Common disorders

58
Q

Disadvantage of kappa

A

One cannot test statistical significance from kappa values

59
Q

What is another way of calculating beyond-chance agreement for nominal variables?

A

Phi

60
Q

Advantages of phi

A

Statistical significance testing is possible
A small sample size can be used
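
A minimal phi sketch for a 2x2 agreement table (counts invented), using the identity phi = sqrt(chi-square / n), so that a chi-square test supplies the significance test that kappa lacks:

```python
# Phi for a 2x2 table, with a chi-square test for statistical significance.
import numpy as np
from scipy.stats import chi2_contingency

table = np.array([[40, 10],
                  [10, 40]])  # made-up agreement counts for two raters

chi2, p, _, _ = chi2_contingency(table, correction=False)
phi = np.sqrt(chi2 / table.sum())
print(f"phi = {phi:.2f}, p = {p:.3g}")  # here phi = 0.60
```
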
61
Q

What is plotted in a Bland-Altman plot?

A

For each pair of scores, the difference between the two measurements is plotted against their mean
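
A minimal Bland-Altman sketch with invented paired measurements: differences are plotted against means, with the mean difference (bias) and the 95% limits of agreement (mean ± 1.96 SD of the differences).

```python
import numpy as np
import matplotlib.pyplot as plt

method_a = np.array([10.2, 12.1, 9.8, 14.5, 11.0, 13.3])  # made-up scores, method A
method_b = np.array([10.0, 12.5, 9.5, 14.0, 11.4, 13.0])  # made-up scores, method B

diff = method_a - method_b
mean = (method_a + method_b) / 2
bias, sd = diff.mean(), diff.std(ddof=1)

plt.scatter(mean, diff)
plt.axhline(bias, linestyle="--")             # mean difference (bias)
plt.axhline(bias + 1.96 * sd, linestyle=":")  # upper limit of agreement
plt.axhline(bias - 1.96 * sd, linestyle=":")  # lower limit of agreement
plt.xlabel("Mean of the two measurements")
plt.ylabel("Difference between measurements")
plt.show()
```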