Module 5: Validity and reliability; screening and diagnostic tests Flashcards

1
Q

What are “constructs”?

A

Theoretical concepts are complex and cannot be directly observed (e.g., health status, quality of life, pain, depression, intelligence).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is operationalization?

A

The process of measuring unobserved constructs using observable indicators.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why are observable indicators helpful?

A

Indicators are our attempt to get at some truth. Even clinical conditions are based on observable indicators, e.g., high blood pressure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a nominal scale (categorical)? Provide two examples. What is the key property?

A

Uses numbers as labels for categories.

1 = Male; 2 = Female. Province of Residence

Key property is that A = B or A does not equal B.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an ordinal scale (ranks)? Provide two examples. What is the key property?

A

Uses numbers as labels for categories, but the numbers reflect ranked order of the characteristic being measured:

0 = no pain, 1 = a small amount of pain, …, 10 = the worst pain conceivable. 1 = lowest income group, 2 = middle income, 3 = highest income group.

Key property A < B < C. Higher numbers indicate more pain / income… distance from 1 to 2 not meaningful, does not relate to other distances (e.g., 2 to 3).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an interval scale (cardinal)? Provide an example. What is the key property?

A

Assigns numbers to response categories so that a unit change in scale values represents a constant change across the range of that scale.

Temperature in Celsius or Fahrenheit.

Key property is that A − B = (A + 10) − (B + 10). The distance from 20 to 30 is the same as the distance from 0 to 10. Ratios, however, don’t make sense: 40°C is not twice as hot as 20°C.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a ratio scale? Provide two examples. What is the key property?

A

A meaningful zero point exists.

Weight. Number of physician visits.

Key property is that A × B = C and C ÷ B = A. 10 visits are five times as many visits as 2 visits. A weight of 100 points is half the weight of 200 pounds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is random measurement error? What’s a term to summarize its extent?

A

The measure has random noise.

A term to summarize the extent the noise affects our ability to measure is “reliability” of an indicator.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is systematic measurement error? What can these errors create?

A

An error that is rooted in study design / the measurement tool / the data-to-estimation process such that the estimate will always deviate from the true value, aka “bias”.

Systematic errors can create invalid results (“validity”), which means a study result is incorrect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is reliability calculated?

A

Variability of the true construct / Variability of the true construct + Measurement error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When we express reliability via the base calculation it is between 0 and 1. If the ratio = 1, then measurement error equals what? What does this imply?

A

Zero. We are perfectly measuring our construct. If this ratio = 0, then our measure is entirely measurement error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A more reliable measure is less influenced by _____.

A

Error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When we describe something as “reliable” we mean that σ2e is low or non-existent compared to σ2T, which is a theoretical concept. What do we need to do in practice?

A

Express/quantify reliability, indicated by reproducibility: does the same measurement produce the same result when repeated?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Test-retest reliability asks what?

A

Does the measurement produce the same result when the same rater makes a second assessment? (The same subject and the same measure by the same rater at different times.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the consequences if a test-retest interval is too short? Too long? When is it appropriate?

A

Too short: the rater remembers first response.

Too long: the construct may change.

Appropriate interval: it depends (2-14 days?).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Gordis refers to test-retest reliability as _____ _____.

A

Intraobserver variation.

17
Q

Inter-rater reliability asks what?

A

Do different raters assessing the same thing obtain the same result? Same subject, same measure, different raters.

18
Q

Gordis refers to inter-rater reliability as _____ _____.

A

Interobserver variation.

19
Q

With what two ways can we calculate reproducibility?

A

Correlation coefficients (1: perfect positive correlation; 0: no correlation; 1: perfect inverse correlation).

The Kappa (κ) coefficient.

20
Q

Kappa is between 0 and 1; κ = 0 means what?

A

Observed agreement = Expected agreement, only agree by chance.

21
Q

We are very interested in instances of _____ when we talk about validity in measurement. Why?

A

Misclassification.

A measuring tool is not very good if it continually sorts people into the incorrect category.

22
Q

What is sensitivity?

A

The ability of the test to correctly identify the presence of disease.

23
Q

What is specificity?

A

The ability of the test to correctly identify the absence of disease.

24
Q

False-positives and false-negatives are both bad. Why is this?

A

False-positive: SCARY!, emotional and financial consequences, difficult to delabel a person (e.g., MRSA+).

False-negative: Missed what should have been found, “the doctor got it wrong!”

25
Q

When a test has a continuous variable as an outcome the discussion of validity focuses on what?

A

Acceptable cut-off for the test (e.g., blood sugar and diabetes).

26
Q

What is Positive Predictive Value (PPV)?

A

The probability of the patient having the disease given the test is positive.

27
Q

What is Negative Predictive Value (NPV)?

A

The probability of the patient not having the disease given the test is negative.

28
Q

Why are PPV and NPV useful in a clinical setting?

A

It’s what people want: If this patient tested positive (negative), what is the probability they have (don’t have) the disease?

29
Q

PPV and NPV are influenced by what?

A

Sensitivity, specificity, prevalence.

30
Q

What is a likelihood ratio?

A

Compares the probability that someone with the disease has a particular test result as compared to someone without the disease.

31
Q

How are post-test odds calculated?

A

Pre-test odds × Likelihood Ratio

32
Q

Often pre-test probability comes from what?

A

The prevalence of the condition in the group you draw patients from (e.g., general population).

33
Q

Post-test probability applies to whom?

A

The patient in front of you.

34
Q

What are three reasons screening is used?

A

Offered to asymptomatic people.

Detects latent disease (in detectable pre-clinical stage).

Used to guide a decision as to a diagnostic test.

35
Q

When are diagnostic tests used and why?

A

Offered to people who are suspected to have a disease. To aid the detection (and eventual diagnosis) of a suspected disease or condition.