lecture 5: standardized assessment and psychometrics (1) Flashcards
what is an outcome?
the end result of clinical activity/intervention
what is an outcome measure?
- instrument shown to measure desirable traits accurately
- any measurement system used to uncover or identify the health outcome of treatment
- the process by which change is measured over two or more points in time
why is it important to measure?
- help determine status at the start of intervention
- help determine if someone is actually improving during and at the end of intervention
- improves clinical decision-making, care and client outcomes
- component of EBP
- aid w/ objectivity -> concrete evidence
what is the instrument evaluation process (IEP)?
- used to guide process of appraising outcome collection
- if the answer to any question is no = need to select another instrument
- full IEP flowchart IN SLIDES**
what is step 1 of the IEP?
- is the assessment clinically useful?
- determine usefulness and usability for specific setting/purpose
what factors do you need to consider in step 1 of the IEP?
- clinical applicability
- specificity
- availability
- time/training demands
- acceptability to clients
- cost
what is step 2 of the IEP?
- is it standardized?
what are standardized assessments?
- questions/methods/conditions for the administration, scoring, and interpretation of the results are consistent
- allows for trustworthy comparison of scores (time to time, client to similar group)
- any deviation from standardized procedures may = invalid conclusions about test performance (ex. modification of test instructions)
** want to stay as close to the standardized instructions as possible - straying = losing standardization
what are the components of a standardized assessment?
- assessment manual
- instructions for administration
- standardized equipment/questions
- data on test construction, reliability, validity
- normative data
what is step 3 of the IEP?
- what is the instrument's purpose?
what are three possible purposes to standardized assessments?
- descriptive - describe the status of the person/group of interest
- predictive - predict the client's future status
- evaluative - evaluate the change in status of a client over time
- within each measure, need to look at construction, reliability, and validity
what are descriptive measures?
- describe 1+ aspects of a person’s status at one moment in time
- ex. occupational strengths/limitations, characteristics, behaviours
- often used to classify an individual via comparison with norms
- information collected can be used to identify problems and to evaluate the need for intervention
what are predictive measures?
- show/foretell 1+ aspects of the client’s future status
- used to make predictions about the potential of the client (ex. safety at home)
- often have norms
- used to screen individuals to determine eligibility for intervention/potential to benefit from a program
what are evaluative measures?
- evaluate change in status of a client over time
- used at more than one point in time (beginning and end)
- must be sensitive to change (i.e. responsive)
- ex. COPM
what is test construction?
- instrument development process
- item inclusion/exclusion - does it include the questions you’d expect to see?
- scaling/weighting - how does scoring work? how is it totaled?
descriptive construction = descriptive items
predictive construction = predictive items
evaluative construction = responsive items
what is a norm-referenced measure?
- measures of the average or typical performance form the basis of how scores are interpreted
- norm = reference point for test score
- norm-referenced interpretation = comparing examinee’s test score to scores obtained by others in normative sample
- IMPORTANT to consider…
> characteristics of the sample in which the norms were developed
> how they were obtained
> how much that group is representative of the population the measure is intended for
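norm-referenced interpretation usually amounts to standardizing the client's raw score against the normative sample's mean and SD; a minimal Python sketch (the normative values and client score here are made up for illustration):

```python
from scipy.stats import norm

# hypothetical normative data for the client's age group (illustrative values)
norm_mean, norm_sd = 50.0, 10.0
client_score = 35.0

# z-score: how many SDs the client sits from the normative mean
z = (client_score - norm_mean) / norm_sd

# percentile rank within the normative sample (assumes ~normal score distribution)
percentile = norm.cdf(z) * 100
print(f"z = {z:.2f}, percentile = {percentile:.1f}")  # z = -1.50, percentile = 6.7
```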
what is reliability?
- consistency
- trustworthiness of a measure and its results
- reliable measure yields dependable and consistent measurement of what you are trying to measure
- degree to which the measure yields results that are free from measurement error (i.e. works to decrease measurement error)
what is a measurement error?
- difference between the true value of a phenomenon and its measured value
- caused by factors that…
> are irrelevant to what is being measured by the test
> have an unpredictable effect on the test score
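- in classical test theory this is written as X = T + E (observed score = true score + error); a reliable measure is one where E contributes little to the variance in X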
what are sources of measurement error?
- test construction
- test administration
- test scoring
- test interpretation
what are test construction errors?
- test questions worded unclearly
- test administration instructions unclear
- scoring procedure not clear
what are test administration errors?
- test environment issues
- test-taker motivation and attention
- examiner-related variables
what are test scoring/interpretation errors?
- hand vs. computer scoring
- level of training
- subjectivity
what are ways to minimize measurement error?
- choose assessment with strong psychometric properties
- pilot-test assessments and instruments
- follow standardized instructions
- train interviewers or observers
- make observation/measurement as unobtrusive as possible
- keep test environment/equipment consistent with standardization
- double-check data
how is reliability usually scored?
- reported as a value from 0 to 1
> 1 = perfect reliability (no error)
> closer to 1 = better reliability and less measurement error
> ex. 0.1 = poor reliability, 0.9 = high reliability
- statistics used to measure it:
> Pearson product-moment correlation coefficient (r)
> intra-class correlation coefficient (ICC)
> Spearman rank-order correlation (rho)
> Kappa statistic (k)
> Cronbach’s alpha
what are the ways to establish reliability?
- test-retest
- inter-rater
- internal consistency
what is test-retest reliability?
- stability of the measure over time
- determined by calculating the agreement of scores at two different times for a characteristic that has NOT changed
> don't use with clients whose status is often variable
- ICC > .70 = acceptable test-retest reliability (under = poor)
- time interval can vary depending on what is being measured
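a minimal sketch of computing a test-retest ICC, assuming the third-party pingouin package and made-up scores for five clients at two time points:

```python
import pandas as pd
import pingouin as pg  # third-party stats package (pip install pingouin)

# made-up scores: 5 clients measured at two time points, status unchanged
df = pd.DataFrame({
    "client": [1, 2, 3, 4, 5] * 2,
    "time":   ["t1"] * 5 + ["t2"] * 5,
    "score":  [12, 18, 25, 9, 30,    # time 1
               13, 17, 26, 10, 29],  # time 2
})

icc = pg.intraclass_corr(data=df, targets="client", raters="time", ratings="score")
# ICC2 (two-way random effects, absolute agreement) is a common choice
# for test-retest; compare against the > .70 cut-off from the card above
print(icc.set_index("Type").loc["ICC2", "ICC"])
```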
what is inter-rater reliability?
- degree to which scores by different raters yield the same results
- applies to assessments where test administrator assesses result
- determined by having several raters measure the same phenomena
- acceptable inter-rater = > .70
- descriptive, evaluative and predictive assessments should have high inter-rater reliability
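for categorical ratings, the kappa statistic from the list above is a common inter-rater choice because it corrects raw agreement for chance; a minimal sketch with made-up ratings from two hypothetical raters:

```python
from sklearn.metrics import cohen_kappa_score

# toy ratings: two hypothetical raters scoring the same 10 clients
# on a 3-category item (0 = dependent, 1 = assisted, 2 = independent)
rater_a = [0, 1, 2, 1, 0, 2, 1, 1, 0, 2]
rater_b = [0, 1, 2, 1, 0, 2, 0, 1, 0, 1]

# kappa corrects the raw agreement rate for agreement expected by chance
kappa = cohen_kappa_score(rater_a, rater_b)
print(f"kappa = {kappa:.2f}")  # ~0.70 here, right at the acceptability cut-off
```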
what is internal consistency?
- degree of the relatedness among the items of an instrument
- used to determine if items on test are consistent with one another
- an estimate of the homogeneity of the structure of the test
- high internal consistency = items closely related
- measured with Cronbach’s alpha
- acceptable = 0.8 - 0.9
- if too high (0.97) = item redundancy
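Cronbach's alpha has a simple closed form, so it can be computed directly; a sketch with made-up item scores:

```python
import numpy as np

def cronbach_alpha(item_scores):
    """alpha = k/(k-1) * (1 - sum of item variances / variance of total score)"""
    x = np.asarray(item_scores, dtype=float)  # rows = respondents, cols = items
    k = x.shape[1]
    item_vars = x.var(axis=0, ddof=1)         # variance of each item
    total_var = x.sum(axis=1).var(ddof=1)     # variance of summed total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# made-up data: 4 respondents answering a 3-item scale
scores = [[4, 5, 4],
          [2, 3, 2],
          [5, 5, 4],
          [1, 2, 2]]
print(round(cronbach_alpha(scores), 2))  # 0.97 - per the card, likely item redundancy
```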
what are the most important reliabilities for each instrument purpose?
descriptive:
- internal consistency
- observer
predictive:
- test-retest
- observer
evaluative:
- test-retest
- observer
what is validity?
- accuracy
- extent to which assessment measures what it is intended to measure (ex. fatigue, balance, OP)
what are the types of validity?
- face
- content
- criterion (concurrent, predictive)
- construct (convergent, divergent, discriminative)
- responsiveness
what is face validity?
- assumption of validity based on a measure’s appearance
- subjective judgement - does the test appear to measure what it says?
- least rigorous form of validity - ONLY used as preliminary screening
- if the minimum requirement of face validity cannot be established, it's unlikely that it'll hold up against other validity measures
what is content validity?
- degree to which the instrument items are a comprehensive reflection of what the instrument purports to be measuring
- does measure include ALL elements of a given concept
- established based on theoretical frameworks, expert opinions, or literature review
what is criterion validity?
- extent to which scores of assessments relate to gold standard/valid external criterion
- assessed by correlating the scores of a sample of individuals on the predictor with the scores on the criterion
- test = predictor, gold standard = criterion
what are the two types of criterion validity?
- concurrent: criterion data collected at the same time as data on the predictor test
> ex. TB test (skin test = predictor, chest x-ray = criterion)
- predictive: criterion data collected after the predictor test was administered
> ex. scores on the MCAT (predictor) predict performance in medical school (criterion)
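criterion validity is typically quantified by correlating predictor scores with criterion scores; a minimal sketch with made-up scores for eight clients:

```python
from scipy.stats import pearsonr

# made-up scores: a hypothetical new short test (predictor) and the
# gold-standard assessment (criterion) given to the same 8 clients
predictor = [22, 35, 28, 40, 18, 31, 25, 37]
criterion = [48, 61, 55, 70, 40, 58, 50, 66]

r, p = pearsonr(predictor, criterion)
print(f"r = {r:.2f} (p = {p:.3f})")  # high r = the test tracks the gold standard
```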
what is construct validity?
- degree to which scores of an instrument are consistent with a hypothesis about how they should perform
- based on testing a measure against an idea based on theory
- involves…
1. developing theoretical hypotheses relevant to the construct being assessed
2. investigating whether these hypotheses are upheld when the assessment is used
- most difficult of all validities to establish
what are the types of construct validity?
- convergent: degree to which scores are consistent with a hypothesis that the instrument WILL CORRELATE with another measure
- divergent: degree to which the scores of an instrument are consistent with a hypothesis that the instrument WILL NOT CORRELATE with another measure
- discriminative: degree to which the scores of an instrument are consistent with a hypothesis concerning DIFFERENCES BETWEEN GROUPS
what is responsiveness validity?
- ability of an instrument to detect change over time in what it purports to be measuring
- aka. sensitivity to change
- evaluative assessments must have evidence of responsiveness
- done by taking a group that does change and seeing if the measure picks up that change
- results expressed as effect size/standardized response mean
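the standardized response mean (SRM) mentioned above is simply the mean change score divided by the SD of the change scores; a sketch with made-up before/after scores:

```python
import numpy as np

# made-up scores for 6 clients before and after intervention
before = np.array([10, 14, 9, 12, 11, 13], dtype=float)
after  = np.array([15, 18, 12, 17, 14, 19], dtype=float)

change = after - before
# standardized response mean: mean change / SD of the change scores
srm = change.mean() / change.std(ddof=1)
print(f"SRM = {srm:.2f}")  # larger SRM = measure is more responsive to change
```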
what are two characteristics of responsiveness validity?
- minimal detectable change (MDC): what amount of change, taking error into account, means that a change has actually occurred
> change by 1 point = probably error
> change by 3 points = change has actually occurred
- minimal clinically important difference (MCID): what a patient would notice to be a meaningful change
> ex. grip strength increased by 3, MCID = 2 (can assume a meaningful change occurred)
> ex. grip strength increased by 3, MCID = 5 (yes, grip strength improved BUT not enough to affect QOL)
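one common way the MDC is derived (a standard formulation, not necessarily the lecture's) uses the standard error of measurement (SEM), computed from a baseline SD and the test-retest reliability; illustrative numbers:

```python
import math

# illustrative values for some measure (not from the lecture)
sd_baseline = 8.0      # SD of scores in a reference sample
test_retest_r = 0.90   # test-retest reliability (e.g. an ICC)

# standard error of measurement
sem = sd_baseline * math.sqrt(1 - test_retest_r)

# MDC at 95% confidence: observed change must exceed this to be
# interpreted as real change rather than measurement error
mdc95 = 1.96 * math.sqrt(2) * sem
print(f"SEM = {sem:.2f}, MDC95 = {mdc95:.2f}")  # SEM = 2.53, MDC95 = 7.01
```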
what are the most important validities for each instrument purpose?
descriptive:
- content
- construct
predictive:
- content
- criterion
evaluative:
- content
- construct
what is cross-cultural validity?
- degree to which the performance of items on a translated or culturally adapted measure is a reflection of the performance of items on the original version
what is ecological validity?
- degree to which a measure reflects real life
- ex. ability to memorize random words vs an address/name
what are tips for evaluating a measure?
- obtain a copy of the measure
> not always easy - takes detective work
- refer to books that evaluate the measure
- read literature (especially the first publication of a measure)
- check wide range of literature
- follow a template for evaluation and do your own evaluation
how do you find measures to use?
- search engine
- textbooks
- library database
- measurement cupboard