Lecture 2 Flashcards

1
Q

What are reliability and validity in tests?

A

Reliability: The test measures one and only one thing (precision)
Validity: The test measures what it’s supposed to measure

2
Q

What are the test standards?

A

Recommendations for using and interpreting test scores, developed and distributed by:

  • American Psychological Association (APA)
  • American Educational Research Association (AERA)
  • National Council on Measurement in Education (NCME)
3
Q

What is the Test Standards definition of “validity”?

A

The degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests

4
Q

Why are the Test Standards important?

A

They act as a framework:
  • Represent the current consensus (and therefore the current operational guidelines)
  • Present alternative viewpoints, arguments, and propositions
  • Provide psychometric models for evaluating validity, reliability, and bias (including generalisability theory)

5
Q

What is the criterion view of validity?

A

Validity of a test: How well the test predicts the outcome it was designed to predict
  • Validity as an absolute and static property of a test
  • Validity = correlation with a criterion (e.g. intelligence with school grades)
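Under the criterion view, the validity coefficient is just the Pearson correlation between test scores and the criterion. A minimal sketch (all scores below are made-up illustrative data, not from the lecture):

```python
import numpy as np

# Made-up data: an intelligence-test score and a school-grade criterion
# for the same 8 test-takers (illustrative only)
test_scores = np.array([95, 110, 102, 120, 88, 130, 105, 98])
grades = np.array([2.9, 3.4, 3.1, 3.8, 2.5, 3.9, 3.2, 2.8])

# Under the criterion view, "validity" is simply this correlation
r = np.corrcoef(test_scores, grades)[0, 1]
print(f"criterion validity coefficient r = {r:.2f}")
```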

6
Q

What are the problems with the criterion view of validity?

A

  • Not always one obvious criterion available (no pure measures of the attribute)
  • Some tests are used for different purposes, in different groups (e.g. due to language, age)
  • Validity is dependent on:
      ◦ Test purpose and use
      ◦ Characteristics of the test-takers

7
Q

What are the three key components of the tripartite view of validity?

A
  • Criterion validity (correlations with a criterion)
  • Content validity
  • Construct validity
8
Q

What is criterion validity?

A

Correlation of test scores with a criterion; two forms:
  • Concurrent: criterion measured at the same time as the test is administered
  • Predictive: criterion measured at some time after the test is administered

9
Q

What is content validity?

A
  • The theoretical framework about what the test should measure
  • Content of test is both relevant to domain and representative of domain
10
Q

What is construct validity?

A

  • Convergent: concepts that are theoretically related demonstrate empirical relationships
  • Discriminant: concepts that are theoretically UNrelated show no empirical relationships
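Convergent and discriminant evidence can be read straight off correlations. A small sketch with made-up scores (illustrative only): vocabulary and reading comprehension should converge, while reaction time should be unrelated to either:

```python
import numpy as np

# Made-up scores for 8 people (illustrative only):
# vocab and reading both tap verbal ability (theoretically related);
# reaction_time is theoretically unrelated to verbal ability.
vocab = np.array([12, 18, 15, 20, 10, 22, 16, 14])
reading = np.array([30, 41, 36, 45, 28, 48, 38, 33])
reaction_time = np.array([250, 310, 270, 240, 300, 260, 320, 280])

r_convergent = np.corrcoef(vocab, reading)[0, 1]          # should be high
r_discriminant = np.corrcoef(vocab, reaction_time)[0, 1]  # should be near zero
print(f"convergent r = {r_convergent:.2f}, discriminant r = {r_discriminant:.2f}")
```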

11
Q

What are the problems with the tripartite view?

A

  • Too much emphasis on different forms of validity
      ◦ A test can have “convergent validity” but not “predictive validity” – what does this mean?
      ◦ The distinction between convergent and concurrent is not always clear
          – e.g. the correlation between a vocabulary test and an English-language grade
  • Overemphasis on correlations as proof
  • No explicit mention of test use and consequences

12
Q

What is ‘validity’ in the 1999/2014 Test Standards?

A

Validity is a property of the interpretation of test scores, not the test scores themselves

13
Q

According to the Test Standards, what are the sources of evidence that an interpretation of test scores is valid?

A
  1. The content of the test
  2. The response processes captured by the test
  3. The internal structure of the test
  4. The relationship of the test to other variables
  5. The intended and unintended consequences of testing
14
Q

What is the test content evidence for validity?

A

  • Relevance
  • Representativeness (items must be representative of, and relate to, the important and critical parts of the domain)

15
Q

What are the response processes as evidence for validity?

A

  • Evidence should show that the test does measure the specific process it’s meant to capture
  • Think-aloud protocols, eye-tracking, computer models, susceptibility to manipulation, coaching, etc. (in line with theory)

16
Q

What is the internal structure as evidence for validity?

A

  • The number of sub-components discovered empirically equals the number of sub-components theoretically expected
      ◦ Number of elements found = number of elements expected
      ◦ e.g. the 6 facets of Conscientiousness in the NEO-PI-R personality model
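One simple way to check internal structure is to count the dimensions in the items’ correlation matrix and compare with theory. A rough numpy sketch using the Kaiser (eigenvalue > 1) criterion on a made-up correlation matrix (illustrative only; real analyses would use factor analysis):

```python
import numpy as np

# Made-up correlation matrix for 4 items (illustrative only):
# items 1-2 and items 3-4 are written to tap two distinct sub-components.
R = np.array([
    [1.0, 0.8, 0.1, 0.1],
    [0.8, 1.0, 0.1, 0.1],
    [0.1, 0.1, 1.0, 0.8],
    [0.1, 0.1, 0.8, 1.0],
])

# Crude dimensionality check: count eigenvalues of R greater than 1
# (the Kaiser criterion); compare with the theoretically expected number.
eigenvalues = np.linalg.eigvalsh(R)
n_found = int((eigenvalues > 1).sum())
print(f"components found: {n_found} (expected: 2)")
```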

17
Q

What is the relationship to other variables as evidence of validity?

A

  • Convergent and discriminant evidence (as in the tripartite model)
  • Test–criterion relationships (as in the tripartite model)
      ◦ Suitability and technical quality of the criteria (relation to test scores)
  • Validity generalisation
      ◦ Replication in different situations
          – Different POPULATIONS (e.g. different countries, states, sectors)
          – Different CONDITIONS (e.g. proctored/unproctored, timed/untimed)
      ◦ Replication for different purposes
          – Job performance vs. academic achievement
          – Performance on different types of jobs

18
Q

What are the intended and unintended consequences of testing as evidence for validity?

A

  • Consider the consequences of testing (which can be unforeseen by the test developer and test user)
  • EXAMPLE: Test scores (e.g. NAPLAN) intended by the developer to identify progress and to target policy towards areas of need
      ◦ ACTUAL CONSEQUENCES: league tables, high-SES flight from low-NAPLAN schools
  • EXAMPLE: Test scores measuring Attribute A result in different hiring rates for members of different groups. For the test to show evidence of validity:
      ◦ The difference must be solely due to an unequal distribution of Attribute A (the relevant attribute)
          – Differences are not due to construct-irrelevant variance/test bias
          – e.g. a spatial-skill test for pilot selection (men on average have better spatial skills than women)
      ◦ Attribute A must be important for the job