Ch.6 for UNIT 4 Flashcards

1
Q

Validity

A

it is a judgment based on evidence about the appropriateness of inferences drawn from test scores. An inference is a logical result or deduction.

2
Q

reasonable boundaries

A

No test or measurement technique is “universally valid” for all time, for all uses, with all types of testtaker populations. Rather, tests may be shown to be valid within what we would characterize as reasonable boundaries of a contemplated usage. If those boundaries are exceeded, the validity of the test may be called into question.

3
Q

Validation

A

process of gathering and evaluating evidence about validity. Both the test developer and the test user may play a role in the validation of a test for a specific purpose.

4
Q

validation studies vs. local validation studies, and when the latter are required

A

It may sometimes be appropriate for test users to conduct their own validation studies with their own groups of testtakers. Such local validation studies may yield insights regarding a particular population of testtakers as compared to the norming sample described in a test manual. Local validation studies are absolutely necessary when the test user plans to alter in some way the format, instructions, language, or content of the test. For example, a local validation study would be necessary if the test user sought to transform a nationally standardized test into Braille for administration to blind and visually impaired testtakers. Local validation studies would also be necessary if a test user sought to use a test with a population of testtakers that differed in some significant way from the population on which the test was standardized.

5
Q

trinitarian view of validity: criterion validity, content validity, construct validity

A

Content validity. This measure of validity is based on an evaluation of the subjects, topics, or content covered by the items in the test.

Criterion-related validity. This measure of validity is obtained by evaluating the relationship of scores obtained on the test to scores on other tests or measures.

Construct validity. This measure of validity is arrived at by executing a comprehensive analysis of how scores on the test relate to other test scores and measures, and how scores on the test can be understood within some theoretical framework for understanding the construct that the test was designed to measure.

6
Q

“umbrella validity”

A

construct validity has been described as “umbrella validity” because every other variety of validity falls under it.

7
Q

ecological validity

A

a judgment regarding how well a test measures what it purports to measure at the time and place that the variable being measured (typically a behavior, cognition, or emotion) is actually emitted. In essence, the greater the ecological validity of a test or other measurement procedure, the greater the generalizability of the measurement results to particular real-life circumstances.

8
Q

Face validity

A

relates more to what a test appears to measure to the person being tested than to what the test actually measures. Face validity is a judgment concerning how relevant the test items appear to be. Stated another way, if a test definitely appears to measure what it purports to measure “on the face of it,” then it could be said to be high in face validity.
On the one hand, a paper-and-pencil personality test labeled The Introversion/Extraversion Test, with items that ask respondents whether they have acted in an introverted or an extraverted way in particular situations, may be perceived by respondents as a highly face-valid test. On the other hand, a personality test in which respondents are asked to report what they see in inkblots may be perceived as a test with low face validity.

9
Q

problems with lack of face validity

A

A test’s lack of face validity could contribute to a lack of confidence in the perceived effectiveness of the test, with a consequential decrease in the testtaker’s cooperation or motivation to do their best.
Ultimately, face validity may be more a matter of public relations than psychometric soundness.

10
Q

Content validity

A

describes a judgment of how adequately a test samples behavior representative of the universe of behavior that the test was designed to sample.

11
Q

test blueprint

A

A test blueprint is a plan for the “structure” of the evaluation, that is, a plan regarding the types of information to be covered by the items, the number of items tapping each area of coverage, the organization of the items in the test, and so forth. In many instances the test blueprint represents the culmination of efforts to adequately sample the universe of content areas that conceivably could be sampled in such a test.

12
Q

Criterion-related validity

A

judgment of how adequately a test score can be used to infer an individual’s most probable standing on some measure of interest—the measure of interest being the criterion.

13
Q

concurrent and predictive validity (UNDER CRITERION-RELATED VALIDITY)

A

the extent to which one measurement is backed up by a related measurement obtained at about the same point in time. Concurrent validity is an index of the degree to which a test score is related to some criterion measure obtained at the same time (concurrently). Predictive validity is an index of the degree to which a test score predicts some criterion measure.

14
Q

criterion

A

as the standard against which a test or a test score is evaluated.

15
Q

Characteristics of a criterion

A

-An adequate criterion is relevant.
-An adequate criterion measure must also be valid for the purpose for which it is being used
-An adequate criterion is also uncontaminated. Criterion contamination is the term applied to a criterion measure that has been based, at least in part, on predictor measures.

=As an example, consider a hypothetical “Inmate Violence Potential Test” (IVPT) designed to predict a prisoner’s potential for violence in the cell block. In part, this evaluation entails ratings from fellow inmates, guards, and other staff in order to come up with a number that represents each inmate’s violence potential. After all of the inmates in the study have been given scores on this test, the study authors then attempt to validate the test by asking guards to rate each inmate on their violence potential. Because the guards’ opinions were used to formulate the inmate’s test score in the first place (the predictor variable), the guards’ opinions cannot be used as a criterion against which to judge the soundness of the test. If the guards’ opinions were used both as a predictor and as a criterion, then we would say that criterion contamination had occurred.

Can’t reuse the predictor in the criterion.

16
Q

Concurrent Validity

A

If test scores are obtained at about the same time as the criterion measures are obtained, measures of the relationship between the test scores and the criterion provide evidence of concurrent validity: Statements of concurrent validity indicate the extent to which test scores may be used to estimate an individual’s present standing on a criterion. If, for example, scores (or classifications) made on the basis of a psychodiagnostic test were to be validated against a criterion of already diagnosed psychiatric patients, then the process would be one of concurrent validation. = once the validity of the inference from the test scores is established, the test may provide a faster, less expensive way to offer a diagnosis or a classification decision
COMPARING ONE THING TO SOMETHING ELSE

17
Q

Predictive Validity

A

Measures of the relationship between the test scores and a criterion measure obtained at a future time provide an indication of the predictive validity of the test; that is, how accurately scores on the test predict some criterion measure.

18
Q

base rate vs hit rate vs miss rate (predictive validity)

A

a base rate is the extent to which a particular trait, behavior, characteristic, or attribute exists in the population (expressed as a proportion). In psychometric parlance, a hit rate may be defined as the proportion of people a test accurately identifies as possessing or exhibiting a particular trait, behavior, characteristic, or attribute. A miss rate may be defined as the proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute. Here, a miss amounts to an inaccurate prediction.

19
Q

false positive vs false negative in miss rates

A

false positive is a miss wherein the test predicted that the testtaker did possess the particular characteristic or attribute being measured when in fact the testtaker did not. A false negative is a miss wherein the test predicted that the testtaker did not possess the particular characteristic or attribute being measured when the testtaker actually did.
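A minimal sketch in Python (hypothetical data, not from the text; here any accurate prediction counts as a hit) showing how base, hit, and miss rates and the two kinds of misses could be tallied for a dichotomous test decision:

```python
import numpy as np

# 1 = has the attribute, 0 = does not (hypothetical records)
actual    = np.array([1, 0, 1, 1, 0, 0, 1, 0, 0, 0])  # true standing on the criterion
predicted = np.array([1, 0, 0, 1, 1, 0, 1, 0, 0, 0])  # what the test predicted

base_rate = actual.mean()                  # proportion of the group with the attribute
hit_rate  = (predicted == actual).mean()   # proportion of accurate predictions
miss_rate = 1 - hit_rate                   # proportion of inaccurate predictions

false_positives = ((predicted == 1) & (actual == 0)).sum()  # predicted yes, actually no
false_negatives = ((predicted == 0) & (actual == 1)).sum()  # predicted no, actually yes

print(f"base rate {base_rate:.1f}, hit rate {hit_rate:.1f}, miss rate {miss_rate:.1f}")
print(f"false positives: {false_positives}, false negatives: {false_negatives}")
```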

20
Q

The validity coefficient

A

correlation coefficient that provides a measure of the relationship between test scores and scores on the criterion measure.
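A minimal sketch (illustrative numbers, not from the text): the validity coefficient is simply a correlation, such as Pearson’s r, between test scores and criterion scores:

```python
import numpy as np

test_scores      = np.array([10, 12, 15, 18, 20, 22, 25])
criterion_scores = np.array([ 3,  4,  4,  6,  7,  8,  9])  # e.g., later supervisor ratings

# validity coefficient = correlation between test and criterion
r = np.corrcoef(test_scores, criterion_scores)[0, 1]
print(f"validity coefficient r = {r:.2f}")
```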

21
Q

How high should a validity coefficient be for a user or a test developer to infer that the test is valid?

A

There are no rules for determining the minimum acceptable size of a validity coefficient. In fact, Cronbach and Gleser (1965) cautioned against the establishment of such rules. They argued that validity coefficients need to be large enough to enable the test user to make accurate decisions within the unique context in which a test is being used. Essentially, the validity coefficient should be high enough to result in the identification and differentiation of testtakers with respect to target attribute(s).

22
Q

Incremental validity

A

incremental validity, defined here as the degree to which an additional predictor explains something about the criterion measure that is not explained by predictors already in use.
Incremental validity assesses whether a new assessment adds predictive value beyond what’s already provided by existing methods, essentially determining if the new tool offers unique information.

23
Q

hierarchical regression.

A

First we estimate how well a criterion can be predicted with existing predictors, and then we evaluate how much the prediction improves when the new predictor is added to the prediction equation. Incremental validity is highest when a predictor is strongly correlated with the criterion and minimally correlated with other predictors.

=To the degree that a predictor is strongly correlated with other predictors, it gives us redundant information. There is little point in going to the trouble of measuring a variable that gives us information we already had.
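A minimal sketch of this two-step logic on synthetic data, using plain least squares (all names and coefficients are hypothetical; a real study would use a regression package and a significance test for the R-squared change):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
existing  = rng.normal(size=(n, 2))   # predictors already in use
new_pred  = rng.normal(size=n)        # candidate predictor
criterion = existing @ [0.5, 0.3] + 0.4 * new_pred + rng.normal(size=n)

def r_squared(X, y):
    """R^2 of an ordinary least-squares fit with an intercept."""
    X = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return 1 - resid.var() / y.var()

r2_step1 = r_squared(existing, criterion)                               # existing predictors only
r2_step2 = r_squared(np.column_stack([existing, new_pred]), criterion)  # new predictor added
print(f"R^2 gain from the new predictor: {r2_step2 - r2_step1:.3f}")    # incremental validity
```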

24
Q

Construct validity

A

the degree to which a test or instrument is capable of measuring a concept, trait, or other theoretical entity; a judgment about the appropriateness of inferences drawn from test scores regarding individual standings on a variable called a construct. A construct is an informed, scientific idea developed or hypothesized to describe or explain behavior. Constructs are unobservable, presupposed (underlying) traits that a test developer may invoke to describe test behavior or criterion performance.

25
Q

Evidence of homogeneity in construct validity (reasons why findings would be contrary to prediction / how to improve the homogeneity of a dichotomous test)

A

homogeneity refers to how uniform a test is in measuring a single concept. =One way a test developer can improve the homogeneity of a test containing items that are scored dichotomously (such as a true–false test) is by eliminating items that do not show significant correlation coefficients with total test scores. If all test items show significant, positive correlations with total test scores and if high scorers on the test tend to pass each item more than low scorers do, then each item is probably measuring the same construct as the total test. Each item is contributing to test homogeneity.
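A minimal sketch of that screening step on hypothetical true–false response data (the 0.20 cutoff is only illustrative; in practice a significance test, and a total score that excludes the item under review, would be used):

```python
import numpy as np

rng = np.random.default_rng(1)
responses = rng.integers(0, 2, size=(50, 10))  # 50 testtakers x 10 dichotomous items
totals = responses.sum(axis=1)                 # total test scores

for item in range(responses.shape[1]):
    r = np.corrcoef(responses[:, item], totals)[0, 1]  # item-total correlation
    flag = "  <- candidate for elimination" if r < 0.20 else ""
    print(f"item {item}: item-total r = {r:.2f}{flag}")
```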
26
Q

Evidence of homogeneity in construct validity (reasons why findings would be contrary to prediction / how to improve the homogeneity of a multipoint scale test, e.g., Likert)

A

Each response is assigned a numerical score, and items that do not show significant Spearman rank-order correlation coefficients are eliminated. If all test items show significant, positive correlations with total test scores, then each item is most likely measuring the same construct that the test as a whole is measuring (and is thereby contributing to the test’s homogeneity).
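The same screening logic sketched for multipoint items, using scipy.stats.spearmanr on hypothetical Likert-type data:

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(2)
responses = rng.integers(1, 6, size=(50, 8))  # 50 testtakers x 8 items scored 1-5
totals = responses.sum(axis=1)

for item in range(responses.shape[1]):
    rho, p = spearmanr(responses[:, item], totals)  # rank-order item-total correlation
    keep = (p < 0.05) and (rho > 0)                 # retain significant, positive items
    print(f"item {item}: rho = {rho:.2f}, p = {p:.3f}, keep = {keep}")
```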
27
Q

Evidence of changes with age and why this should be high for good construct validity

A

If a test score purports to be a measure of a construct that could be expected to change over time, then the test score, too, should show the same progressive changes with age to be considered a valid measure of the construct.
28
Q

Evidence of pretest–posttest changes and why this should be high for good construct validity

A

Evidence that test scores change as a result of some experience between a pretest and a posttest can be evidence of construct validity.
29
Q

Evidence from distinct groups and why this should be high for good construct validity

A

Also referred to as the method of contrasted groups, one way of providing evidence for the validity of a test is to demonstrate that scores on the test vary in a predictable way as a function of membership in some group.
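A minimal sketch (synthetic scores) of the method of contrasted groups: checking that two groups expected to differ on the construct also differ on the test, here with scipy’s independent-samples t-test:

```python
import numpy as np
from scipy.stats import ttest_ind

rng = np.random.default_rng(3)
clinical  = rng.normal(loc=60, scale=10, size=40)  # group expected to score high
community = rng.normal(loc=50, scale=10, size=40)  # group expected to score low

t, p = ttest_ind(clinical, community)
print(f"t = {t:.2f}, p = {p:.4f}")  # a difference in the predicted direction supports validity
```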
30
Q

Convergent evidence

A

when scores on the test undergoing construct validation tend to correlate highly in the predicted direction with scores on older, more established, and already validated tests designed to measure the same (or a similar) construct, this is convergent evidence of construct validity.
31
Q

discriminant evidence and why this should be high in construct validity

A

A validity coefficient showing little (a statistically insignificant) relationship between test scores and/or other variables with which scores on the test being construct-validated should not theoretically be correlated provides discriminant evidence of construct validity (also known as discriminant validity).
32
Q

multitrait-multimethod matrix

A

the matrix or table that results from correlating variables (traits) within and between methods. Values for any number of traits (such as aggressiveness or extraversion) as obtained by various methods (such as behavioral observation or a personality test) are inserted into the table, and the resulting matrix of correlations provides insight with respect to both the convergent and the discriminant validity of the methods used.
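A minimal sketch (hypothetical traits, methods, and data) of assembling such a matrix by correlating every trait-method combination with every other:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(4)
n = 100
aggression   = rng.normal(size=n)  # latent trait scores (simulated)
extraversion = rng.normal(size=n)

data = pd.DataFrame({
    "aggression_observation":   aggression   + rng.normal(scale=0.5, size=n),
    "aggression_selfreport":    aggression   + rng.normal(scale=0.5, size=n),
    "extraversion_observation": extraversion + rng.normal(scale=0.5, size=n),
    "extraversion_selfreport":  extraversion + rng.normal(scale=0.5, size=n),
})

mtmm = data.corr()  # 4 x 4 multitrait-multimethod matrix
print(mtmm.round(2))
# Same trait / different method should correlate highly (convergent evidence);
# different traits should correlate weakly (discriminant evidence).
```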
33
Q

convergent validity

A

the correlation between measures of the same trait but different methods.
34
Q

method variance

A

the similarity in scores due to the use of the same method.
35
Q

Factor analysis

A

shorthand term for a class of mathematical procedures designed to identify factors or specific variables that are typically attributes, characteristics, or dimensions on which people may differ. In psychometric research, factor analysis is frequently employed as a data reduction method in which several sets of scores and the correlations between them are analyzed.
36
Q

Exploratory factor analysis vs confirmatory factor analysis

A

Exploratory factor analysis typically entails “estimating, or extracting factors; deciding how many factors to retain; and rotating factors to an interpretable orientation.” In confirmatory factor analysis, researchers test the degree to which a hypothetical model (which includes factors) fits the actual data.
37
Q

factor loading

A

“a sort of metaphor. Each test is thought of as a vehicle carrying a certain amount of one or more abilities.” Factor loading in a test conveys information about the extent to which the factor determines the test score or scores. A new test purporting to measure bulimia, for example, can be factor-analyzed with other known measures of bulimia, as well as with other kinds of measures (such as measures of intelligence, self-esteem, general anxiety, anorexia, or perfectionism). High factor loadings by the new test on a “bulimia factor” would provide convergent evidence of construct validity.
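A minimal sketch of extracting loadings with scikit-learn’s FactorAnalysis on synthetic scores built from a single latent factor (the one-factor setup is illustrative, and estimated loadings are recovered only up to sign):

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis

rng = np.random.default_rng(5)
n = 300
latent = rng.normal(size=n)                        # one underlying ability
true_loadings = np.array([0.9, 0.8, 0.7, 0.2, 0.1, 0.1])
X = np.outer(latent, true_loadings) + rng.normal(scale=0.5, size=(n, 6))

fa = FactorAnalysis(n_components=1).fit(X)
print(np.round(fa.components_, 2))  # estimated loadings: large for the first
                                    # three measures, small for the rest
```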
38
Q

bias

A

a factor inherent in a test that systematically prevents accurate, impartial measurement.
39
Q

intercept bias vs slope bias

A

Intercept bias occurs when the use of a predictor results in consistent underprediction or overprediction of a specific group’s performance or outcomes. Slope bias occurs when a predictor has a weaker correlation with an outcome for specific groups. For example, on high-stakes educational tests, some individuals with math disabilities are allowed to use calculators as a part of their testing accommodations.
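A minimal sketch (synthetic data) of how the two patterns could be checked by fitting a separate regression line per group and comparing intercepts and slopes:

```python
import numpy as np

rng = np.random.default_rng(6)
n = 150
# group B is simulated with the same slope but a lower intercept than group A,
# the intercept-bias pattern; differing fitted slopes would suggest slope bias
for group, (intercept, slope) in {"A": (0.0, 1.0), "B": (-0.5, 1.0)}.items():
    predictor = rng.normal(size=n)
    outcome = intercept + slope * predictor + rng.normal(scale=0.5, size=n)
    b, a = np.polyfit(predictor, outcome, 1)  # returns [slope, intercept]
    print(f"group {group}: slope = {b:.2f}, intercept = {a:.2f}")
# A single common regression line fit to both groups would then systematically
# overpredict group B's outcomes.
```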
40
Q

rating / rating scale / rating error

A

A rating is a numerical or verbal judgment (or both) that places a person or an attribute along a continuum identified by a scale of numerical or word descriptors known as a rating scale. Simply stated, a rating error is a judgment resulting from the intentional or unintentional misuse of a rating scale.
41
Q

leniency error (also known as a generosity error)

A

as its name implies, an error in rating that arises from the tendency on the part of the rater to be lenient in scoring, marking, and/or grading.
42
Q

central tendency error vs severity error

A

A severity error is the opposite of a leniency error. Movie critics who pan just about everything they review may be guilty of severity errors; of course, that is only true if they review a wide range of movies that might consensually be viewed as good and bad. In a central tendency error, the rater, for whatever reason, exhibits a general and systematic reluctance to give ratings at either the positive or the negative extreme. Consequently, all of this rater’s ratings would tend to cluster in the middle of the rating continuum.
43
Q

restriction-of-range rating errors and how to overcome them

A

Central tendency, leniency, and severity errors all restrict the range of ratings used. One way to overcome them is rankings, a procedure that requires the rater to measure individuals against one another instead of against an absolute scale. By using rankings instead of ratings, the rater (now the “ranker”) is forced to select first, second, third choices, and so forth.
44
Q

Halo effect

A

describes the fact that, for some raters, some ratees can do no wrong. More specifically, a halo effect may also be defined as a tendency to give a particular ratee a higher rating than the ratee objectively deserves because of the rater’s failure to discriminate among conceptually distinct and potentially independent aspects of a ratee’s behavior. Men have been shown to receive more favorable evaluations than women in traditionally masculine occupations. Except in highly integrated situations, ratees tend to receive higher ratings from raters of the same race (Landy & Farr, 1980). It is also possible that a particular rater may have had particularly great (or particularly distressing) prior experiences that lead them to provide extraordinarily high (or low) ratings on that irrational basis.
45
Q

fairness

A

the extent to which a test is used in an impartial, just, and equitable way.
46
Q

common misunderstandings about test fairness

A

-Some tests have been labeled “unfair” because they discriminate among groups of people. We would all like to believe that people are equal in every way and that all people are capable of rising to the same heights given equal opportunity; a more realistic view would appear to be that each person is capable of fulfilling a personal potential.
-Another misunderstanding is that it is unfair to administer to a particular population a standardized test that did not include members of that population in the standardization sample. In fact, the test may well be biased, but that must be determined by statistical or other means. The sheer fact that no members of a particular group were included in the standardization sample does not in itself invalidate the test for use with that group.
47
Q

group-related test-score adjustment (arguments in favor)

A

-Arguments in favor of group-related test-score adjustment have been made on philosophical as well as technical grounds. From a philosophical perspective, increased minority representation is socially valued to the point that minority preference in test scoring is warranted.
-In the same vein, minority preference is viewed both as a remedy for past societal wrongs and as a contemporary guarantee of proportional workplace representation.
-It is argued that some tests require adjustment in scores because (1) the tests are biased, and a given score on them does not necessarily carry the same meaning for all testtakers; and/or (2) “a particular way of using a test is at odds with an espoused position as to what constitutes fair use.”
48
Q

group-related test-score adjustment (arguments against)

A

-Some view such adjustments as part of a social agenda for preferential treatment of certain groups. These opponents of test-score adjustment reject the subordination of individual effort and ability to group membership as criteria in the assignment of test scores.
-There is also concern for “minority applicants who are selected under a quota system but who also would have been selected under unqualified individualism and must therefore pay the price, in lowered prestige and self-esteem.”
49
Q

techniques for Preventing or Remedying Adverse Impact and/or Instituting an Affirmative Action Program