Reliability and Validity Flashcards
what is psychometrics?
it is a branch of psychology with the design, administration, and interpretation of quantitative tests for the measurement of psychological variables
- involves the measurement of directly unobservable mental constructs
- what does something have to be to be considered a measured construct
what is a mental construct?
theoretical, intangible quality or trait
- A psychological test aims to measure mental constructs by inferring an individual’s true score on the construct from their observed test score
- try to get as close a match of the observed score and the true score
true score: a latent construct most correct to the specific individual,
what are the first two (2/5) assumptions of psychometrics?
- it is possible to measure psychological states and traits
- you must be able to measure, and assign values to the state
- you must define the construct being studied (operationalisation) - various methods of measuring aspects of the same thing can be useful, including
- format - multiple choice, interview, etc
- administration - individual vs group, timed, self-reported or administered
- scoring - is it categorical or dimensional
what are the last three (3-5) assumptions of psychometrics?
- assessment can provide answers to some of life’s most momentous questions
- for example hsc means you are competent for a job - error is part of the assessment process
- true score + measurement error = observed score - testing and assessment can be conducted in a fair and unbiased manner
- cultural bias, use criteria that applies to the culture
what is reliability?
consistency of measurement (does the test give consistent results when it’s given at different times, by different raters)
what is validity?
accuracy of measurement (does it measure what it’s supposed to measure)
what are the two types of measurement error?
random error: unpredictable influences that vary from measurement to measurement
- in two directions - May increase or decrease a score
systematic error: biases that influence scores in a similar way across multiple measurements
- in one direction - only increasing or decreasing a score
what are the types of reliability?
temporal stability (test-retest reliability), inter-rater reliability, internal consistency reliability
what is test-retest reliability?
does taking the assessment again yield the same results?
- potential limitations may include the effect of practice or fatigue and the fact that it can be expensive to undertake.
what is internal consistency reliability?
are there positive correlations among items that are intended to measure the same construct?
- the extent to which test items that propose to measure the same construct actually show similar scores
it can be increased by increasing the number of items, or to remove the items that are uncorrelated with majority of the others
what is inter-rater reliability?
can raters agree on the scores of an instrument? can they yield the same results?
- helps watch for rater bias etc
what are the types of validity?
content validity, construct validity, criterion validity
what is content validity?
the degree to which the items adequately sample the mental construct in question
- for example, when measuring intelligence if you used a scale that only measured vocabulary in comparison to a scale that measured vocabulary, arithmetic, etc then the latter would have better content validity
what is construct validity?
it is seperated into two subtypes:
convergent validity - the extent to which a scale correlates with other tests that appear to measure the same construct
discriminant validity - the extent to which a scale does not correlate with other tests that do not appear to measure the same construct
what is criterion validity?
the degree to which the test is able to predict present or future performance on a real-world outcome
it is seperated into two subtypes:
concurrent validity: present
predictive validity: future