L11 - Psych Assessment: Reliability Flashcards
What’s the difference between psychological assessment and psychological testing?
Psychological Assessment is the process of gathering all relevant psychological data about a person and interpreting those data in the context of the person’s broader well-being, and the benefit of society. BROADER
Psychological Testing is the process of administering one or more psychological tests. However, since both Testing and Assessment should be conducted in an ethical framework, there should be little difference in practice, that is, testing should only be conducted in a broader ethical and professional standards framework.
Assumptions underlying the use of psychological tests?
- Psychological traits and states exist.
- Psychological traits and states can be quantified and measured.
- Test-related behaviour predicts non-test related behaviour.
- Tests and related techniques have strengths and weaknesses.
- Various sources of error or unreliability are part of the assessment.
- Assessment can be conducted in a fair and unbiased manner.
- Psychological assessment benefits society.
What should all psychological tests posses?
- Established Validity.
- Standardised administration and scoring
- Clear rules for administration and scoring. - Adequate normative sample
- Known reliabilities and standard errors of measurement.
- Test publications and manuals.
- Ongoing test development & revision.
What is reliability?
The consistency of test scores – the study of relationships between items on the test on one or more testing occasions is usually reported as “reliability”. Study of reliability provides the key to interval estimation for one person’s score on a test. It is the extent to which a test correlated with itself in another form (version) or on another occasion. It is a necessary but not sufficient condition for validity, and sometimes termed as “internal validity”.
What is Validity?
Degree to which inferences made on the basis of test scores are scientifically justified.
What is variance?
Variance is the measure of the spread of individual differences on a test.
For two test with equivalent reliability and both measuring the same construct, the test with more variance (a larger S.D.) is more informative than a test with less variance.
What is error variance?
Measurement error
the part of variance of scores that we don’t understand - isn’t conceptually theorised.
What is the general model of reliability in classical test theory?
X = T (true score) + E (measurement error)
any observed score (X) has 2 components.
Assumptions involved in general model of reliability?
Mean error of measurement = 0
True scores and errors are not correlated: rte=0
Errors on different measures are not correlated: re1e2= 0
- even in situations of unreliable measurement, average true score effects may emerge over multiple data collections / replications
What can the variance of an observed score be deconstructed into?
σ2(X) = σ2(tryue) + σ2(error)
What is the theoretical reliability coefficient?
r(tt) = true σ^2 / observed σ^2
this is the proportion of variance in observed scores that is due to variance in true scores.
a highly reliable test will have little error in its observed score variance.
What is a theoretical entity?
True σ^2
hard to measure in real life..
What do we do since true variance is actually a theoretical entity and is hard to measure?
We use a correlation coefficient instead, by using re-test correlations.
If r_tt = .5 then 50% of the variance in observed scores is due to the variance in true scores.
What are some methods for estimating reliability?
➢ Test-retest reliability!!!!!
➢ Internal consistency (KR20 or Alpha)
➢ Split-half (correlation coefficient)
➢ Parallel-form or alternate-form (correlation coefficient)
➢ Inter-rater or inter-scorer reliability (correlation coefficient or Kappa).
What is the standard error?
Term used to describe the standard deviation of a distribution for an inferred statistic, e.g. sample mean (standard error of the mean), or individual test score (standard error of measurement).