Psychometric Properties Flashcards
sensitivity
-the proportion of people who have a disease who have a positive result; how the test accurately assesses correctly those who do have a diagnosis
-decrease the threshold/cutoff to increase this
-e.g., assessments for suicidal ideation need to be very ____
specificity
-the proportion of people who have a disease who have a positive test result; how the test accurately assesses correctly those who do NOT have a diagnosis
-increase the threshold/cutoff to increase this
-e.g., ASD dx needs to be more ____
true positive
have the disease and have a positive test; hit rate
false positive
do NOT have the disease, but have a positive test; false alarm
false negative
have the disease, but have a negative test (the test does not pick it up); fall thru the cracks
true negative
do not have the disease and have a negative test; correct rejection rate
criterion referenced
test developer/publisher sets the cutoffs/thresholds and interpretation guidelines; an outside criterion influences the score thresholds
norm-referenced
comparison scores in which individual scores are compared to a population norm - the group that originally took the test to determine the scores that an individual receives
raw scores
no inherent meaning, summer of item responses (total points) when scored; needs interpretation guidelines from the test developer
standard scores
raw scores that have been converted to an interpretable scale that are based on normal distribution and the norm group
-e.g., z-scores, T scores, etc.
percentages
raw score that reflects the number of correct responses obtained out of the total possible number of correct responses on a test (no inherent meaning)
percentiles
scores that reflect the rank or position of an individual’s test performance in comparison to others who took the test
reliability
consistency or stability of the scores/responses across time
inter-rater reliability
-consistency of scores across examiners (across coders)
-must operationalize the constructs measured
-if a test does not have this, the effects observed may be due to the individual who coded
-ideal to have higher ___ ____ (r = .90)
internal reliability
-consistency of the structure (across items); homogeneity of the group of items in a response set or within subscales
-how well do the items measured in the test strongly associate with the construct measured
-e.g., in the BSI –> depression subscale has highest level of ___ ____