Chapter 11 - Assessing Psychometric Quality Flashcards

Q

Item Analysis

A
  • How developers evaluate the performance of each test item.
Q

Quantitative Item Analysis

A
  • Statistical analyses of the responses test takers gave to individual items.
Q

Item Difficulty

A
  • The percentage of test takers who respond to an item correctly.
  • We calculate each item’s difficulty, or p-value, by dividing the number of persons who answered the question correctly by the total number of persons who responded to it.
  • This information comes from the pilot test.
  • Test developers disregard questions that are too hard or too easy.
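The p-value calculation described above can be sketched in Python; the responses and the `item_difficulty` helper name are hypothetical:

```python
# Item difficulty (p-value): proportion of respondents answering correctly.
# Responses are coded 1 = correct, 0 = incorrect; data are hypothetical.

def item_difficulty(responses):
    """Return the p-value for one item given a list of 0/1 responses."""
    return sum(responses) / len(responses)

# Hypothetical pilot-test responses for one item (8 test takers, 6 correct).
pilot_responses = [1, 1, 0, 1, 1, 1, 0, 1]
p = item_difficulty(pilot_responses)  # 6 / 8 = 0.75
```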
Q

Discrimination Index

A
  • Compares the performance of those who obtained very high test scores with the performance of those who obtained very low test scores.
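One common way to compute a discrimination index is to subtract the proportion correct in the low-scoring group from the proportion correct in the high-scoring group (often defined as the top and bottom 27% of total scores). A sketch, with hypothetical data and helper name:

```python
# Discrimination index: difference between the proportion of high scorers
# and the proportion of low scorers who answered an item correctly.
# Group assignments and responses are hypothetical.

def discrimination_index(upper_group, lower_group):
    """D = p(upper) - p(lower) for one item, given 0/1 responses."""
    p_upper = sum(upper_group) / len(upper_group)
    p_lower = sum(lower_group) / len(lower_group)
    return p_upper - p_lower

# Responses to one item from the top and bottom scorers on the whole test.
upper = [1, 1, 1, 0, 1]   # 80% of high scorers answered correctly
lower = [0, 1, 0, 0, 1]   # 40% of low scorers answered correctly
d = discrimination_index(upper, lower)  # 0.8 - 0.4 = 0.4
```

A positive index means the item separates strong from weak test takers; an index near zero (or negative) flags an item for revision.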
Q

Item-Total Correlation

A
  • Another way to assess the ability of individual test items to discriminate high-scoring individuals from lower-scoring ones.
  • This is a measure of the strength and direction of the relationship between the way test takers responded to one item and the way they responded to all of the items as a whole.
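The item-total correlation is ordinarily a Pearson correlation between the 0/1 responses to one item and the total test scores. A minimal sketch with hypothetical data:

```python
import statistics

# Item-total correlation: Pearson correlation between responses to one
# item (0/1) and total scores on the whole test. Data are hypothetical.

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

item = [1, 0, 1, 1, 0, 1]          # responses to one item
totals = [18, 9, 15, 20, 11, 16]   # total scores on the whole test
r = pearson(item, totals)          # positive r: item discriminates well
```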
Q

Interitem Correlation Matrix

A
  • Displays the correlation of each item with every other item.
  • Usually, each item has been coded as a dichotomous variable: correct = 1, incorrect = 0.
Q

Phi Coefficients

A
  • The result of correlating two dichotomous variables.
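The phi coefficient can be computed from the 2x2 table of joint responses to two dichotomous items, using φ = (ad − bc) / √((a+b)(c+d)(a+c)(b+d)). A sketch with hypothetical counts:

```python
import math

# Phi coefficient: correlation between two dichotomous (0/1) items,
# computed from the 2x2 table of joint response counts. Data hypothetical.

def phi_coefficient(a, b, c, d):
    """a = both correct, b = only item 1 correct,
    c = only item 2 correct, d = both incorrect."""
    num = a * d - b * c
    den = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    return num / den

phi = phi_coefficient(a=40, b=10, c=10, d=40)  # strong positive association
```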
Q

Empirically Based Tests

A
  • Tests designed so that test scores can be used to sort individuals into two or more categories based on their scores on the criterion measure.
Q

Subtle Questions

A
  • Questions that have no apparent relation to the criterion.
Q

Item Response Theory (IRT)

A
  • This theory relates the performance of each item to a statistical estimate of the test taker’s ability on the construct being measured.
  • A measure of the relationship between an individual’s performance on one test item and that individual’s level of performance on the overall measure of the construct the test is measuring.
Q

Item Characteristic Curves (ICCs)

A
  • The line that results when we graph the probability of answering an item correctly against the level of ability on the construct being measured.
  • The ICC provides a picture of the item’s difficulty and how well it discriminates high performers from lower performers.
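One standard way to model an ICC is the two-parameter logistic (2PL) IRT model, in which the item's difficulty sets the curve's location and its discrimination sets the steepness. A sketch with illustrative parameter values:

```python
import math

# Item characteristic curve under a two-parameter logistic (2PL) IRT model:
# P(correct | ability theta) = 1 / (1 + exp(-a * (theta - b))),
# where b is item difficulty and a is item discrimination.
# Parameter values here are illustrative only.

def icc(theta, difficulty, discrimination):
    """Probability of a correct answer at ability level theta."""
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))

# At theta == difficulty, the probability of a correct answer is 0.50;
# a steeper (higher-discrimination) curve separates ability levels better.
p_at_difficulty = icc(theta=0.0, difficulty=0.0, discrimination=1.2)  # 0.5
```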
Q

Computerized Adaptive Testing (CAT)

A
  • All test takers start with the same small set of questions.
  • As the test progresses, the computer software chooses and presents each test taker with harder or easier questions depending on how well the test taker answered previous questions.
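The adaptive loop described above can be sketched with a simple rule: step the working difficulty up after a correct answer and down after a miss. Real CAT systems choose items by estimating ability with an IRT model; this rule-based version is only illustrative, and all names are hypothetical:

```python
# Minimal sketch of computerized adaptive item selection.
# Real CAT software estimates ability with an IRT model; here we just
# step the target difficulty level up or down after each answer.

def next_difficulty(current, answered_correctly, step=1):
    """Move to a harder item after a correct answer, easier after a miss."""
    return current + step if answered_correctly else current - step

# Hypothetical test taker: right, right, wrong, right.
level = 0
for correct in [True, True, False, True]:
    level = next_difficulty(level, correct)
# level ends at 2 after three steps up and one step down
```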
Q

Item Bias

A
  • When an item is easier for one group than for another group.
Q

Acculturation

A
  • The degree to which an immigrant or a minority member has adapted to a country’s mainstream culture.
Q

Qualitative Item Analysis

A
  • Non-statistical means of evaluating test items, which normally refers to the analysis of text.
  • Used when test developers ask test takers for verbal or written feedback about test questions.
Q

Construct Bias

A
  • Arises when items do not have the same meaning from one culture or subculture to another.
Q

Method Bias

A
  • Arises when the mechanics of the test work differently for various cultural groups.
Q

Differential Item Functioning

A
  • Arises when test takers from different cultures have the same ability level on the test construct, but the item or test yields very different scores for the two cultures.
Q

Why is revision important in test development?

A
  • Test developers use different kinds of analysis in order to pick the questions that best fulfill the test’s goal.
Q

How are the final items chosen?

A
  • Choosing the items that make up the final test requires the test developer to weigh each item’s evidence of validity, item difficulty and discrimination, interitem correlation, and bias.
Q

What is the first part of the validation process?

A
  • Establishing evidence of validity based on test content.
  • Carried out as the test is developed.
Q

Generalizable

A
  • Meaning the test can be expected to produce similar results even though it has been administered in different locations.
Q

Replication

A
  • The process of replication involves a final round of test administration to another sample of test takers representative of the target audience.
Q

Cross-Validation

A
  • This process breaks the original sample used in the original validation study into two parts.
  • This can be done without having to administer the test to a second group of test takers.
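The split described above can be sketched as a random division of the original sample, with weights derived on one half and checked on the other; the record representation and helper name here are hypothetical:

```python
import random

# Cross-validation sketch: split the original validation sample in two,
# derive scoring weights on one half, and check the validity coefficient
# on the held-out half. Records are hypothetical stand-ins.

def split_sample(records, seed=0):
    """Randomly split a list of test-taker records into two halves."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    mid = len(shuffled) // 2
    return shuffled[:mid], shuffled[mid:]

records = list(range(100))  # stand-ins for 100 test-taker records
derivation, holdout = split_sample(records)
# Weights would be estimated on `derivation` and evaluated on `holdout`,
# with no need to administer the test to a second group of test takers.
```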
Q

Measurement Bias

A
  • When the scores on a test taken by different subgroups in the population need to be interpreted differently because of some characteristic of the test not related to the construct being measured.
Q

Predictive Bias

A
  • Occurs when the predictions made about a criterion score based on a test score are different for subsets of test takers.
Q

Differential Validity

A
  • When a test yields significantly different validity coefficients for subgroups.
Q

Single-Group Validity

A
  • The test is valid for one group but not for another group.
Q

Slope Bias

A
  • Occurs when the slopes of the separate regression lines relating the predictor to the criterion are not the same for one group as for another.
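A simple way to check for slope bias is to fit a separate least-squares line relating test score to criterion for each subgroup and compare the slopes. A sketch with hypothetical data and helper names:

```python
import statistics

# Slope bias check: fit a separate least-squares line relating the
# predictor (test score) to the criterion for each subgroup and compare
# the slopes. All data are hypothetical.

def slope(xs, ys):
    """Least-squares regression slope of ys on xs."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    return (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
            / sum((x - mx) ** 2 for x in xs))

group_a_scores, group_a_criterion = [1, 2, 3, 4], [2, 4, 6, 8]  # slope 2.0
group_b_scores, group_b_criterion = [1, 2, 3, 4], [2, 3, 4, 5]  # slope 1.0
slope_gap = (slope(group_a_scores, group_a_criterion)
             - slope(group_b_scores, group_b_criterion))
# A large gap suggests the test predicts the criterion differently per group.
```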
Q

Why is test fairness important?

A
  • Psychological tests are used to compare individuals; their purpose is to identify or illuminate the differences among individuals.
  • Any differences in results should be based only on the trait or characteristic being measured.
Q

Accessibility

A
  • Pertains to the opportunity test takers have to demonstrate their standing on the constructs the test is designed to measure.
Q

Universal Design

A
  • The idea behind universal design is that tests should be constructed from the outset in such a way that accessibility is maximized for all individuals who may take the test in the future.
Q

Cut Scores

A
  • Decision points for dividing test scores into pass-fail groupings.
Q

What is the purpose of test norms?

A
  • To provide a reference point or structure for understanding one person’s score.
Q

Subgroup Norms

A
  • Statistics that describe subgroups of the target audience.