2 - Measurement Flashcards
what is a discriminative instrument?
used to sort individuals into groups (ie based on whether they meet the criteria of interest), eg diagnostic tests, screening tools, methods of evaluating eligibility criteria
what are 2 ways to determine responsiveness? which is more common?
- anchor-based approach (more common) and distribution-based approach
what is more important reliability or validity?
- see answer to forum question
describe standard error of measurement
- estimate of the measure’s ability to differentiate among patients
- determines whether true change has occurred
- closer to 0 is better
- ie looking at people who haven’t changed (a blood glucose reading says x, but that doesn’t mean there is no error in that reading)
how do I make readers understand my results for comparing btw 2 groups?
- provide mean difference and 95% CI around that mean diff
- tell them the MCID and whether the MCID falls inside or outside the 95% CI (inside = inconclusive, outside = conclusive)
- provide the number needed to treat (based on the proportion of patients in the experimental vs control group who changed by an important amount)
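The NNT line above can be sketched as code; the proportions are made up for illustration.

```python
# Number needed to treat (NNT): reciprocal of the difference in the proportion
# of patients who improved by at least the MCID in each group. Hypothetical numbers.

def nnt(prop_treatment: float, prop_control: float) -> float:
    return 1 / (prop_treatment - prop_control)

# eg 60% of treated vs 40% of controls improve by at least the MCID:
print(round(nnt(0.60, 0.40)))  # -> 5: treat 5 patients for 1 extra important improvement
```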
what type of differences does mean difference look for? Is this a validity or reliability issue?
systematic differences (ie diff in the way people are measuring - a validity issue)
what is disease-specific HRQOL? examples?
- measures specific aspects of health (ie specific to the disease of interest)
- can’t compare across clinical areas (only within - for example, which treatment offers more relief)
- easier to detect change bc questions are more specific
- eg: WOMAC (for patients w osteoarthritis)
define cost analysis
does not consider the effect of treatment
from chart: examines only costs but there is a comparison btw 2 or more alternatives
define reliability in terms of the formula
- a ratio of the true score variance to the true score variance plus its associated error variance
what is sensitivity to change
the ability to measure change
define face validity
- face value (patients)
- are questions asked reflective of what they experience with this particular disease?
describe the information about incremental cost-effectiveness quadrants
- upper left (more costly, less effective: reject) and lower right (less costly, more effective: adopt) are easiest to make a decision on; the other 2 quadrants involve trade-offs between cost and effect
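As a sketch (a hypothetical helper, not from the chart), the quadrant logic can be written out:

```python
# Classify an intervention by its incremental cost and incremental effect
# relative to the comparator. Upper left and lower right give easy decisions.

def quadrant(delta_cost: float, delta_effect: float) -> str:
    if delta_cost > 0 and delta_effect < 0:
        return "upper left: more costly, less effective - reject"
    if delta_cost < 0 and delta_effect > 0:
        return "lower right: less costly, more effective - adopt"
    if delta_cost > 0 and delta_effect > 0:
        return "upper right: more costly, more effective - trade-off"
    return "lower left: less costly, less effective - trade-off"

print(quadrant(-500, 0.2))  # saves money and adds effect -> easy adoption
```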

what is agreement
- how 2 things change according to each other, taking into account systematic differences (the y-intercept)
- good for reliability (btw 0 and 1, 1 being perfect agreement)
- ICC/Kappa
- can’t have more validity than reliability

what is internal consistency reliability? most common example? what should values be at?
- extent to which items on the questionnaire are associated w each other
- eg a correlation of 100% means if you answer yes to 1 question, you will answer yes to the next etc - these questions are redundant, so take one out
- values should be 80-90% (0.8/0.9)
- common example = Cronbach’s alpha
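As a sketch, Cronbach's alpha can be computed by hand on made-up questionnaire data; the near-1 value here illustrates the redundancy point above:

```python
from statistics import pvariance

# rows = respondents, columns = the 3 questionnaire items (made-up scores)
scores = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 1],
    [3, 4, 3],
]

k = len(scores[0])                                   # number of items
item_vars = [pvariance(col) for col in zip(*scores)] # variance of each item
total_var = pvariance([sum(row) for row in scores])  # variance of total scores
alpha = k / (k - 1) * (1 - sum(item_vars) / total_var)
print(round(alpha, 2))  # ~0.99: well above 0.9, so the items are likely redundant
```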
How does one use a standard error of measure with a confidence interval? How to calculate 95% CI for score of 64, SEM 5.
SEM x 1.96 = 95% CI
SEM x 1.64 = 90% CI
SEM x 1.28 = 80% CI
- note the multiplier is the z-value, which is constant!
- 5 x 1.96 = +/- 10 and therefore 95% confident that score is btw 54 and 74
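The worked example above, written out as code:

```python
# 95% CI around an observed score of 64 with SEM = 5
score, sem, z95 = 64, 5, 1.96

margin = sem * z95                     # 9.8, roughly +/- 10
lower, upper = score - margin, score + margin
print(round(lower), round(upper))      # -> 54 74
```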
define pearson’s r, intraclass correlation coefficient (ICC), spearman’s rho and weighted kappa wrt association vs agreement and continuous vs categorical
pearson’s r: association, continuous
intraclass correlation coefficient (ICC): agreement, continuous
spearman’s rho: association, categorical
weighted kappa: agreement, categorical
what is criterion validity?
- predictive vs concurrent
- behaves as expected compared to gold standard (predictive/concurrent)
- the correlation of a scale with some other measure of the trait or disorder (ideally a gold standard or criterion measure) * gold standard needed for this!!
- predictive = administer new scale and see how well it predicts the event in the future
- concurrent = simultaneously administer the new scale with the criterion measure and determine the association
for the ICF (international classification of functioning, disability, and health), what are the 4 defining health areas? what are the modifiers?
1) body function: physiological/psychological (includes pain and mental disorder)
2) body structures: anatomical
3) activity: performance of a task or action
4) participation: involvement in meaningful, fulfilling, and satisfying activities
contextual factors (the modifiers): age, coping strategies, social attitudes, education, experience etc (can modify your health in any of these areas)
what is a predictive instrument?
used to predict the future (or the result/product of the experiment) - measure something now that predicts something happening in the future; an important validity indicator; eg MCAT, LSAT etc
challenges: applicability (costs vary)

compare and contrast self-reported function and performance based measures
- both attempt to measure activity limitations
- performance: ie walk test, strength, ROM
- self-reported function: a patient-reported outcome measure (PRO), more clinically relevant, eg lower extremity functional scale
what are the 3 types of full economic evaluation?
- cost-effectiveness analysis
- cost-utility analysis
- cost-benefit analysis
why do we use surrogate outcomes?
- they increase efficiency
- easier faster and cheaper to measure
describe the distribution-based approach for measuring responsiveness
- for people who aren’t expected to change (ie maybe chronic disease) - average of T1-T2 will be 0 (not expected to change)
- again measuring at 2 different time points
- plot the distribution and decide a cut-off point above which significant change has occurred
- for people who are expected to change, same thing but this time arbitrary line is to left of bell curve
what is the tool’s metric?
- interpreting your results or making sure your results are interpretable to readers
define: precision
- a measure of the extent to which repeated measurements come up with the same value
- this is about the error - how much can you trust that the value is representative of the true score?
what does PRO stand for? example?
patient reported outcome measure
eg health related QOL
what is a surrogate outcome? examples?
outcome measures that are not of direct practical importance but are believed to reflect outcomes that are important
- it is only indirectly important to patients (they don’t care about the surrogate itself)
- these outcomes aren’t perfect; improving the surrogate doesn’t prove an effect on the patient-important outcome
eg: cholesterol level
from what perspective can costs be represented as an outcome? (4) - what is the most common?
- individual
- ministry of health (most common)
- society (sick days etc)
- third-party payer (insurance company)
* there is usually more than 1 of these views being represented
describe STC wrt responsiveness
- STC is a necessary but insufficient condition for responsiveness
- the problem with responsiveness is how are we going to determine/define what is clinically important?
- see lecture notes p 22, last slide
what are systematic errors a measure of?
validity
examples of continuous outcomes
- weight, blood pressure, etc
what does it mean if your score exceeds MDC (CI = 95%)
- we can be 95% confident that a true change has occurred
- OR upon repeated assessments, 95% of stable patients will change by less than the reported interval
How do we use SEM to detect real change (ie change assessed over time)?
- calculate the difference btw the present score and the previous score
- use Minimal Detectable Change (MDC) (aka smallest detectable difference)
- SEM x 1.96 x √2 = MDC95
- then take difference in score (ie first score was 64, next was 80, so 16 diff) and compare w MDC95
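The MDC95 example above, as code:

```python
from math import sqrt

sem = 5
mdc95 = sem * 1.96 * sqrt(2)       # ~13.9
observed_change = 80 - 64          # 16-point change between assessments

# the change exceeds MDC95, so we can be 95% confident a true change occurred
print(round(mdc95, 1), observed_change > mdc95)  # -> 13.9 True
```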
what is a patient-important outcome? examples? what part of ICF is this related to?
- outcome measures that are of direct practical importance (patients consider them to be important)
- eg: survival, pain, PROs (patient-reported outcome measure) (eg QOL, functional ability)
- related to ICF activity/participation
what is health related QOL
an attempt to measure the broad concept of health (physical mental social)
define: cost effectiveness
- measurement of resource consumption and outcome of the intervention
- requires a common outcome btw interventions being compared
- eg effect per unit cost (life year gained per dollar spent), costs per unit of effect (cost per case detected etc)
describe pearson’s R in terms of whether or not it is a good measure of validity
- pearson’s r is good for validity (association) but it is not the best measure for reliability (precision/agreement)
What is construct validity?
- convergent vs discriminant
- like a mini-theory to explain the relationships among various behaviours or attitudes
- more abstract than criterion validity
- convergent = a measure of construct x correlates w other measures of the same construct (eg using participant observation and a survey to assess anger) - they change in the same way
- discriminant = a measure of construct x does not correlate with measurements of dissimilar/unrelated constructs (eg a measurement of age should not change in the same way as a survey measurement of anger) - predicting change in one instrument while the other stays the same
define accuracy
- a measure of how close a measurement comes to a true score for a variable
- ie how accurately a measure measures what you want it to
what is something that can greatly enhance instrument interpretability?
- knowing MID (minimally important difference)
- this is the smallest difference in the score that informed patients perceive as important, leading patients or clinicians to consider a change in management
inter- vs intra- rater reliability
- both = test-retest (need a time 1 and time 2 measurement - either by same person or diff person)
- inter = between 2 different people and how well they agree
- intra = the same rater measuring the same thing at diff times (of the day, for example)
describe mean difference
- systematic difference between groups (ie not at the individual level!)
- a t-test will give us a p-value saying whether there is a statistically significant diff btw the 2 group means, but that’s it
- closer to 0 is better
- ie take 100 patients and measure at t1 then colleague measures at t2 and compare results
what is validity
the extent to which an instrument measures what it is intended to measure
informing applicability - what is a sensitivity analysis?
- substituting uncertainty in cost based on differences btw places (or a reflection of the uncertainty of the analysis - ie uncertainty around the treatment effect)
- helps us to increase readership in terms of applicability
- uncertainty around many things, could be methods of administration, unsure about proportions of patients who will experience an adverse effect, etc
what terms go together: accuracy, precision, validity, and reliability?
accuracy = validity
precision = reliability
what is reliability
the extent to which an instrument yields the same results in repeated administrations in a stable population
explain pros and cons of trying to improve precision with increased measurements (n-size)
pro: reduces the amount of random error in the study, narrows the CI
con: if experiment contains systematic errors (procedural or measurement), these are not corrected by increasing n-size, you are simply increasing your ability to reproduce a measurement of the wrong thing!
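A small sketch (made-up numbers) of the point above: the random-error part of the uncertainty shrinks as SD/√n, but a systematic bias in the measurements never does:

```python
from math import sqrt

sd, bias = 12.0, 5.0   # spread of individual measurements; fixed measurement bias

for n in (25, 100, 400):
    se = sd / sqrt(n)             # standard error: this is what narrows the CI
    print(n, round(se, 1), bias)  # the bias column never shrinks
```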
describe the anchor-based approach for responsiveness
- measure at T1 before anything has changed, then again at T2 after (ie at time 2 use the original questionnaire with a global rating of change (GRC) scale)
- can get an idea of MCID (difference in averages at t1 and t2 for people who scored 2 or 3 on the GRC)
- can also give yourself some construct validity using this method
- for people whose scores are the same, can’t use them for responsiveness but can use them for reliability
- see notes pg 23, slide 1 and 2
define content validity
- representative of the content domains of the construct (experts)
- the same thing as face validity but from experts (more broad experience w disease as opposed to a single patient)
define cost minimization analysis
- between 3b and 4 on chart
- when the effect of treatment is similar across groups (no longer need to consider effect bc we know it’s the same, so this is better than strict cost analysis)
what are spearman and pearsons r examples of?
criterion validity
- look at how strongly related or correlated 2 measures are (expected and new measure)
what is generic health related QOL
- measures general health status, very vague, can span across diff medical conditions (can compare across diff states of health)
- relevant to all health states
- eg SF-12
what is an evaluative instrument?
used to evaluate change, can track change over time (must have properties that can detect change) - therapy studies use this
what are the 4 features of a good outcome measure?
- validity
- reliability
- sensitivity to change
- responsiveness
what are the 4 ways of measuring validity?
- face
- content
- criterion
- construct
another way of defining costs = methods of evaluation - review chart!
- top left 2, if there is no comparison group
- bottom left = RCT
- bottom right = study with more than 1 group and also includes both cost and effectiveness
- rarely see just cost analysis

what is association
- how 2 things change according to each other
- good for validity (btw 0 and 1, 1 being perfect association)
- can’t have more validity than reliability

define: cost utility analysis - common measures
- the value you place on health benefits and avoiding poor health outcomes (measuring the value people place on certain health outcomes)
- how they would value avoiding another poor outcome - not direct, requires different measures
- can measure impacts of different interventions on different diseases
- common measures: EQ5D (most common) or HUI; results expressed as QALYs - quality-adjusted life years (utility!)
difference btw kappa and weighted kappa
- kappa is dichotomous (categorical) and weighted kappa is ordered (ordinal)
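A sketch of plain (unweighted) kappa on dichotomous ratings; weighted kappa extends the same idea by giving partial credit to near-misses on ordered categories. The raters and data are made up.

```python
def cohens_kappa(a, b):
    """Chance-corrected agreement between two raters' categorical ratings."""
    n = len(a)
    categories = set(a) | set(b)
    p_observed = sum(x == y for x, y in zip(a, b)) / n
    p_chance = sum((a.count(c) / n) * (b.count(c) / n) for c in categories)
    return (p_observed - p_chance) / (1 - p_chance)

rater1 = ["yes", "yes", "no", "no", "yes", "no"]
rater2 = ["yes", "no", "no", "no", "yes", "yes"]
print(round(cohens_kappa(rater1, rater2), 2))  # -> 0.33
```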
what is responsiveness
the ability to measure clinically meaningful change
what makes a surrogate endpoint valid?
- a causal relationship btw changes in the surrogate and changes in the patient-important outcome (strongly predictive)
what are the common summary measures for reporting reliability? association or agreement for bottom 3?
- mean difference and standard error of measurement (for both the greater deviation from 0 the worse the agreement)
- pearson’s R (association), intraclass correlation coefficient (agreement), and kappa/weighted kappa (agreement) (range from 0 no agreement to 1 perfect agreement)
what is sensitivity to change (STC)/how is it measured?
- often represented by standard response mean (SRM)
- administered before and after change in a population expected to change
- calculate mean change (T1avg-T2avg) over SD change (>1 = good)
- this is saying whether we can see the signal over the noise (>1), and if so, we have an instrument that is sensitive to change
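The SRM calculation above, on made-up before/after scores for a group expected to improve:

```python
from statistics import mean, stdev

t1 = [40, 35, 50, 45, 38]   # scores before treatment
t2 = [55, 48, 60, 62, 50]   # scores after treatment

change = [after - before for before, after in zip(t1, t2)]
srm = mean(change) / stdev(change)   # mean change over SD of change
print(round(srm, 1))                 # > 1: signal exceeds noise
```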
ICF model of disability

define: cost benefit
- value of resources used up compared to those saved or created (eg willingness to pay)
- rare to use this one
what are dichotomous outcomes? disadvantages/advantages to this?
- referred to as events (dead/alive, healed/not healed, etc)
- disadvantage = can’t detect change easily
- advantages = easily interpretable (even without CIs)