Chapter 4 Flashcards
What’s sufficient for a ‘good test’?
Reliability AND Validity
Reliability
Consistency. Does not have to be valid. (can be consistently wrong!)
Validity
How well a test measures the definition of its concept
norm-referenced testing and assessment
Derive meaning from test scores by comparing an individual’s test score with a group of testtakers.
Normative sample
group of people’s performance analyzed for reference to evaluate individual testtakers
Norming
process of deriving norms
Stratified Sampling
Samples that include different subgroups of a population
Purposive Sampling
Selecting a sample that you think is representative (eg. Cleveland to infer nation)
Incidental sampling
Also convenience sampling. Using sample that’s convenient
Standardization sample vs. Normative sample
After standardization, there might be changes in normative sample to make in representative (eg. more Hispanic immigration, greater proportion needed). They could be different.
Percentile
Expresses the percentage of people whose score falls below a raw score.
Why might percentiles be deceiving for something normally distributed?
In the middle of distribution, raw scores might differ slightly but have large percentile differences.
Opposite for extremes of scores.
Age-equivalent scores or Age norms
Average performance of samples who were at various ages.
(eg. height, mental age (highly debated))
Grade norms
Average test performance in a given school grade.
What are age norms and grade norms referred to more generally as?
Developmental norms
National norms
Nationally representative of a population.
National anchor norms
Equipercentile Method
Equivalency table for scores on two different tests, providing stability to test scores.
Equipercentile is using percentile scores to calculate equivalency.
Need same sample to take both the tests.
Local norms
Modifying a test to fit the purpose of a local population (?)
Fixed reference group scoring system
eg. SAT.
There are scaled scores to calibrate. Purpose is so different years can compare scores (2021 score of 10 means the same in 1995 score of 10 even if raw scores differ)
Criterion-referenced testing and assessment
A set standard. Different to norm-referenced. Not relative to others.
eg. get higher than a certain raw score to get liscense.