Lecture 2 Flashcards
What are reliability and validity in tests?
Reliability: The test measures consistently, with little random error (precision)
Validity: The test measures what it’s supposed to measure
What are the test standards?
Recommendations for using and interpreting test scores, developed and distributed by:
- American Psychological Association (APA)
- American Educational Research Association (AERA)
- National Council on Measurement in Education (NCME)
What is the Test Standards definition of “validity”?
The degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests
Why are the Test Standards important?
They act as a framework that provides:
♣ The current consensus (and therefore the current operational guidelines)
♣ Alternative viewpoints, arguments, and propositions
♣ Psychometric models for evaluating validity, reliability, and bias (including generalisability theory)
What is the criterion view of validity?
Validity of a test: How well the test predicts the outcome it was designed to predict
o Validity as an absolute and static property of a test
o Validity = correlation with criterion (e.g. intelligence with school grades)
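Under the criterion view, the validity coefficient is just the correlation between test scores and the criterion. A minimal sketch, assuming made-up scores for eight test-takers (the data and the function name `pearson_r` are hypothetical, chosen only for illustration):

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares of deviations from the means
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: intelligence test scores and school grades (the criterion)
iq_scores = [95, 100, 105, 110, 115, 120, 125, 130]
school_grades = [60, 58, 66, 70, 68, 75, 80, 78]

validity_coefficient = pearson_r(iq_scores, school_grades)
print(f"criterion validity coefficient: {validity_coefficient:.2f}")
```

A single number like this is what the criterion view treats as "the" validity of the test, which is why the view struggles when there is no obvious criterion or when the test is used in different groups.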
What are the problems with the criterion view of validity?
o Not always one obvious criterion available (no pure measures of the attribute)
o Some tests are used for different purposes and in different groups (e.g. groups that differ in language or age)
o Validity is dependent on:
♣ Test purpose and use
♣ Characteristics of the test-takers
What are the three key components of the tripartite view of validity?
- Criterion validity (correlations with a criterion)
- Content validity
- Construct validity
What is criterion validity?
♣ Concurrent: Criterion measured at same time as test administered
♣ Predictive: Criterion measured at some time after test administered
What is content validity?
- The theoretical framework about what the test should measure
- Content of test is both relevant to domain and representative of domain
What is construct validity?
♣ Convergent: concepts that are theoretically related demonstrate empirical relationships
♣ Discriminant: concepts that are theoretically unrelated show no empirical relationships
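Convergent and discriminant evidence can be illustrated by comparing correlations. A rough sketch, assuming made-up scores for six people on a vocabulary test, a reading comprehension test (theoretically related) and shoe size (theoretically unrelated); all data and names here are hypothetical:

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical scores for the same six people
vocabulary = [10, 12, 14, 16, 18, 20]
reading_comprehension = [11, 13, 13, 17, 18, 21]  # theoretically related
shoe_size = [9, 7, 8, 8, 7, 9]                    # theoretically unrelated

# Convergent evidence: related concepts should correlate strongly
r_convergent = pearson_r(vocabulary, reading_comprehension)
# Discriminant evidence: unrelated concepts should correlate near zero
r_discriminant = pearson_r(vocabulary, shoe_size)

print(f"convergent r = {r_convergent:.2f}, discriminant r = {r_discriminant:.2f}")
```

The pattern to look for is a clearly larger correlation with the related measure than with the unrelated one; the exact cut-offs are a matter of judgement, not fixed rules.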
What are the problems with the tripartite view?
o Too much emphasis on different forms of validity
♣ A test can have “convergent validity” but not “predictive validity” – what does this mean?
♣ Distinction between convergent and concurrent not always clear
• E.g. Correlation between a vocabulary test and English-language grade?
o Overemphasis on correlations as proof
o No explicit mention of test use and consequences
What is ‘validity’ in the 1999/2014 Test Standards?
Validity is a property of the interpretation of test scores, not of the test itself
What are the sources of evidence in the Test Standards that an interpretation of test scores is valid?
- The content of the test
- The response processes captured by the test
- The internal structure of the test
- The relationship of the test to other variables
- The intended and unintended consequences of testing
What is the test content evidence for validity?
♣ Relevance
♣ Representativeness (items must be representative of, and relate to, the important and critical parts of the domain)
What are the response processes as evidence for validity?
♣ Evidence should show that the test does measure the specific process it’s meant to capture
♣ Sources include think-aloud protocols, eye-tracking, computer models, susceptibility to manipulations, coaching, etc. (in line with theory)