Test 2 Flashcards
Define Reliability
the degree to which test scores for an individual test taker or group of test takers are consistent over repeated applications
reliability coefficient
the results obtained from the statistical evaluation of reliability
define systematic error
when a single source of error always increases or decreases the true score by the same amount
define true score
the amount of the observed score that truly represents what you are intending to measure
define error component
the portion of the observed score produced by other variables that can impact the observed score
what is internal consistency
it bases the reliability of test scores on the number of items on the test and the intercorrelation among the items; therefore it compares each item to every other item
- How related the items (or groups of items) on the test are to one another. This is whether knowing how a person answered one item on the test would give you information that would help you correctly predict how he or she answered another item on the test
what is the bench mark number for internal consistency
.70 and above (roughly 70% true score and 30% error)
what is item-total correlations
the correlation of each item with the total of the remaining items
define average intercorrelation
the extent to which each item represents an observation of the same underlying thing (the connection between the items)
what is a split half
refers to determining a correlation between the first half of the measurement and the second half of the measurement
- divide the test into two halves and then compare the set of individual test scores on the first half with the set of individual test scores on the second half
what is the odd even method
refers to the correlation between even items and odd items of a measurement tool
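As a sketch (using made-up item scores, not data from these notes), the odd-even method can be computed by summing each person's odd items and even items, then correlating the two half-scores:

```python
import numpy as np

# hypothetical 0/1 item scores for 5 test takers on a 6-item test
# (rows = people, columns = items)
scores = np.array([
    [1, 0, 1, 1, 0, 1],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [1, 0, 0, 1, 0, 1],
    [0, 1, 0, 0, 1, 0],
])

odd_half = scores[:, 0::2].sum(axis=1)   # items 1, 3, 5
even_half = scores[:, 1::2].sum(axis=1)  # items 2, 4, 6

# Pearson correlation between the two half-test scores
r_half = np.corrcoef(odd_half, even_half)[0, 1]
```

Note this r is the half-test reliability; the Spearman-Brown formula (later in these notes) is usually applied afterwards to project it to the full-length test.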
advantages and disadvantages of the split half/odd-even method
Advantages:
- simplest method- easy to perform
- time and cost effective
- because you only need one administration
Disadvantages
- many ways of splitting (odd-even, 1st vs 2nd half, random)
- each split yields a somewhat different reliability estimate
- which one is the real reliability of the test?
what is test-retest reliability
measured by computing the correlation coefficient between the scores of two administrations
the same test is administered to the same group of people but there is a certain amount of time in between each test administration
what is the benchmark number for test - retest reliability
.50 and above
define practice effects
occurs when test takers benefit from taking the test the first time (practice), which enables them to solve problems more quickly and correctly the second time they take the test
define memory effects
a respondent may recall the answers from the original test, thereby inflating the reliability estimate
what is interrater reliability
- Interrater reliability means that if two different raters scored the scale using the scoring rules, they should attain the same result
how is interrater reliability measured?
measured by the % of agreement between raters, or by computing the correlation coefficient between the scores of two raters for the same set of respondents (the raters’ scoring is the source of error)
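A minimal sketch of the percent-agreement calculation, using hypothetical pass/fail ratings from two raters:

```python
# hypothetical 0/1 ratings from two raters on 10 respondents
rater_a = [1, 0, 1, 1, 0, 1, 1, 0, 1, 1]
rater_b = [1, 0, 1, 0, 0, 1, 1, 1, 1, 1]

# count cases where the two raters gave the same rating
agreements = sum(a == b for a, b in zip(rater_a, rater_b))
percent_agreement = 100 * agreements / len(rater_a)  # 8 of 10 -> 80.0
```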
intrascorer reliability
whether each clinician was consistent in the way he or she assigned scores from test to test
what is the benchmark score for interrater reliability
- Here the criterion of acceptability is pretty high (ex. a correlation of at least .80 or agreement above 75%), but what is considered acceptable will vary from situation to situation
.80 and above
define parallel/alternative forms method
refers to the administration of two alternate forms of the same measurement device and then comparing the scores.
- Both forms of the tests are given to the same person and then you compare the scores
advantages and disadvantages of parallel/alternative forms method
Advantages
- eliminates the problem of memory effect
- reactivity effects (i.e. the experience of taking the test) are also partially controlled
- can address a wider array of sampling of the entire domain than the test-retest method
possible disadvantages
- are the two forms of the test actually measuring the same thing (same construct)
- more expensive and requires additional work, because two measurement tools have to be developed
what is generalizability theory
- theory of measurement that attempts to determine the multiple sources of consistency and inconsistency- known as factors or facets
- Identifies both systematic and random sources of inconsistency, allowing for the evaluation of interactions among different types of error sources
- Looks at all possible sources of errors and then separates each source of error and evaluates its impact on reliability
what are the limitations of generalizability theory
- you cannot measure every single source of error
- tougher to carry out because a lot of the work has to be done upfront: deciding what data to collect, how much data to collect, and which measures to use. All these sources of error have to be thought about in advance. With CTT you can administer the test first and then look at the factors with regard to reliability.
what is standard error of measurement (SEM)
an estimate of how much the observed test score might differ from the true test score
a statistic used to build a confidence interval around an obtained score. It represents the hypothetical distribution of scores we would have if someone took a test an infinite # of times
how to calculation SEM
SEM = SD x sqrt(1 - reliability coefficient)
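The formula can be sketched directly in Python (the SD of 15 and reliability of .91 below are just illustrative numbers):

```python
import math

def sem(sd, reliability):
    """Standard error of measurement: SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)

# e.g. a test with SD = 15 and a reliability coefficient of .91:
# sem(15, 0.91) = 15 * sqrt(0.09) = 4.5
```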
define confidence interval
Give an estimate of how much error is likely to exist in an individual’s observed score, that is, how big the difference between the individual’s observed score and his or her true score is likely to be
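A sketch of how the SEM is used to build such an interval (the observed score of 100, SD of 15, and reliability of .91 are hypothetical; 1.96 is the z value for a 95% interval):

```python
import math

def confidence_interval(observed, sd, reliability, z=1.96):
    """Return a (low, high) confidence interval around an observed score."""
    sem = sd * math.sqrt(1 - reliability)
    return observed - z * sem, observed + z * sem

# observed score 100, SD = 15, reliability = .91 -> SEM = 4.5,
# so the 95% interval is roughly 91.2 to 108.8
```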
what is cronbachs alpha
coefficient of internal consistency- commonly used. Used with interval-scale items. Determines which questions on the scale are interrelated. Used for test questions, such as rating scales, that have more than one possible answer
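A minimal sketch of the alpha formula, k/(k-1) * (1 - sum of item variances / variance of total scores):

```python
import numpy as np

def cronbach_alpha(items):
    """Cronbach's alpha; items: rows = respondents, columns = items."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)      # variance of each item
    total_var = items.sum(axis=1).var(ddof=1)  # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# perfectly correlated items yield alpha = 1.0
```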
what is Kuder Richardson (KR-20)
used for dichotomous items (ex. 0 or 1, true or false). Dichotomous scale, ordinal in nature. Used when there is either a right or wrong answer, i.e. only one correct answer
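A sketch of KR-20 for 0/1 items. Sources differ on whether the total-score variance uses n or n-1; population variance (n) is used here so that it matches the p*q item variances:

```python
import numpy as np

def kr20(items):
    """KR-20 for dichotomous (0/1) items; rows = respondents."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    p = items.mean(axis=0)               # proportion answering 1 per item
    pq = (p * (1 - p)).sum()             # sum of item variances (p*q)
    total_var = items.sum(axis=1).var()  # population variance of totals
    return (k / (k - 1)) * (1 - pq / total_var)
```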
what is spearman brown
used in split-half analysis to adjust the reliability coefficient. It is designed to estimate what the reliability would be if the test had not been cut in half
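For two halves the formula is 2r/(1+r), where r is the half-test correlation; a sketch:

```python
def spearman_brown(r_half):
    """Project full-test reliability from a split-half correlation."""
    return 2 * r_half / (1 + r_half)

# a split-half correlation of .60 projects to 2(.6)/1.6 = .75
```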
what is cohens kappa
a measure of interrater reliability for categorical ratings that corrects for chance agreement
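Kappa is (po - pe)/(1 - pe): observed agreement corrected by the agreement expected by chance. A sketch with hypothetical categorical ratings:

```python
def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters' categorical ratings."""
    n = len(rater_a)
    categories = set(rater_a) | set(rater_b)
    # observed proportion of agreement
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # agreement expected by chance from each rater's category proportions
    p_expected = sum((rater_a.count(c) / n) * (rater_b.count(c) / n)
                     for c in categories)
    return (p_observed - p_expected) / (1 - p_expected)
```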
what is the benchmark for split-half
.70 and above
what is the benchmark for parallel or alternative form
.70 and above
define heterogeneity of items
the greater the heterogeneity (differences in the kind of questions or difficulty of the questions) of the items, the greater the chance for low reliability correlation coefficients. Ex. a test contains multiple choice, true and false, fill in the blank, etc
define homogeneity of items
the greater the homogeneity (similarity in the kind of questions or difficulty of the question) of the items, the greater the chance for high reliability correlation coefficients. The similarity of the questions ex. test contains only multiple choice
define validity
refers to whether we are measuring what we intended to measure, and whether we can do it accurately
what does the validity coefficient represent
the amount or strength of evidence of validity based on the relationship of the test and criterion
define construct validity
gradual accumulation of evidence that the scores on the test relate to observable behaviours in the way predicted by the underlying theory
involves comparing a new measure to an existing, valid measure
Usually an existing valid measure doesn’t exist. That is often why the new scale is being created in the first place
what is evidence based on test content
Involves logically examining and evaluating the content of a test (including the test questions, format, wording, and tasks required of test takers) to determine the extent to which the content is representative of the concepts that the test is designed to measure
what is evidence based on relations to other variables
Involves correlating test scores with other measures to determine whether those scores are related to other measures to which we would expect them to relate. We would also like to know if the test measures are not related to other measures to which we would not expect them to relate to
what is evidence based on internal structure
Focuses on whether the conceptual framework used in test development could be demonstrated using appropriate analytical techniques
what is evidence based on response processes
Involves observing test takers as they respond to the test or interviewing them when they complete the test
what is evidenced based on consequences of testing
Differentiating between intended and unintended consequences of testing
define content validity
is when we evaluate the test and we look at things such as test questions, the format, the scoring and the wording
define psychological construct
traits or characteristics that tests are designed to measure (usually not observable)
define concrete construct
attributes or characteristics that are easier to define and create items for; these are easily observable when compared to abstract characteristics or traits. ex. playing a piano
define abstract construct
characteristics or attributes that are harder to observe, for instance intelligence
what is construct explication
process of providing a detailed description of the relationship between specific behaviours and abstract constructs. the process of trying to figure out which items are inside or outside the test construct/content
the 3 steps of construct explication
- identify behaviours related to the construct
- identify other constructs and decide whether they are related or unrelated to the construct being measured
- identify behaviours that are related to the additional constructs and determine if these are related or unrelated to the construct being measured
define nomological network
a method for defining a construct by illustrating its relation to as many other constructs and behaviours as possible
define content validity ratio (CVR)
provides a measure of agreement among the judges/experts
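Lawshe's CVR formula is (ne - N/2)/(N/2), where ne is the number of judges rating an item "essential" and N is the total number of judges; a sketch:

```python
def content_validity_ratio(n_essential, n_judges):
    """Lawshe's CVR: (ne - N/2) / (N/2); ranges from -1 to +1."""
    half = n_judges / 2
    return (n_essential - half) / half

# 8 of 10 judges call an item essential: (8 - 5) / 5 = 0.6
```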
define face validity
Face validity answers the question “does it appear to the test taker that the question on the tests are related to the purpose for which the test is given”
Face validity is only concerned with how test takers perceive the appropriateness of the test
advantages of face validity
- If the respondent knows what information we are looking for, they can use “context” to help interpret the questions and provide more useful, accurate answers
- The respondent can make an educated decision
disadvantages of face validity
- If the respondent knows what information we are looking for, they might try to bend & shape their answers to what they think we want
- Ie. Faking good or faking bad
define convergent validity
the extent to which the scale correlates with measures of the same or related concepts
define divergent/discriminant validity
the extent to which the measure does not correlated with measures of unrelated or distinct concepts
what is the multitrait-multimethod (MTMM) matrix method
The researcher chooses two or more constructs that are unrelated in theory and two or more types of test to measure each of the constructs
used to assess a test’s construct validity
define heterotrait heteromethod
multiple traits and multiple ways of assessing those traits
define heterotrait monomethod
more than one trait measured using the same method of assessment
define monotrait-heteromethod correlations
same trait measured by two different methods
define monotrait-monomethod correlation
same trait using the same method
list of the multitrait- multi method matrix pairs from highest to lowest correlation
Highest- monotrait monomethod > monotrait heteromethod > heterotrait monomethod > heterotrait heteromethod -Lowest
define factor
a combination of variables that are intercorrelated and thus measure the same characteristics
define factor analysis
statistical techniques used to analyze patterns of correlations among different variables and measures
- Factor analysis looks at the relationship between all the factors and creates groups of factors based on the relationships between the factors
what is the goal of factor analysis
to reduce the number of dimensions needed to describe data derived from a large number of variables
how is factor analysis done
a series of mathematical calculations, designed to extract patterns of intercorrelations among a set of variables (ex. division questions are correlated with division question and multiplication questions with multiplication)
what is the subjective element to factor analysis
There is a subjective element to factor analysis because once the statistical results have been computed the researcher must review the grouping to see if they make sense based on the construct the test items were designed to measure
define exploratory factor analysis
Researchers do not propose a formal hypothesis about the factors that underlie a set of test scores, but instead use the procedure broadly to help identify underlying components
define confirmatory factor analysis
The researcher specifies in advance what they believe the factor structure of their data should look like and then statistically tests how well that model actually fits the data
The researcher relies on existing theoretical or empirical knowledge to design the model that is being tested
Evidence for construct validity would be provided if the results from the factor analysis fit the model created by the researcher. If not the model should be revised and retested
define kaiser guttman criteria
retains factors with eigenvalues greater than 1.0
to be retained, a factor must have an eigenvalue greater than 1.0
define eigenvalue
the amount of variance in the set of variables that is accounted for by a factor
define scree plot
factors are plotted on the horizontal axis and eigenvalues on the vertical axis; look for an elbow (the point where the curve levels off) and retain the factors before it
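A sketch tying the eigenvalue and Kaiser-Guttman cards together: compute the eigenvalues of a (hypothetical) correlation matrix, then retain factors with eigenvalues above 1.0.

```python
import numpy as np

# hypothetical correlation matrix for 3 variables: the first two are
# strongly related, the third is largely independent
R = np.array([
    [1.0, 0.8, 0.1],
    [0.8, 1.0, 0.1],
    [0.1, 0.1, 1.0],
])

eigenvalues = np.sort(np.linalg.eigvalsh(R))[::-1]  # largest first

# Kaiser-Guttman rule: retain factors with eigenvalue > 1.0
retained = eigenvalues[eigenvalues > 1.0]  # one factor here (~1.82)
```

The eigenvalues always sum to the number of variables (the trace of the correlation matrix), which is why a value above 1.0 means the factor explains more variance than a single variable would on its own.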
advantages of factor analysis
- Simplifies interpretation
- Can learn more about the composition of variables
disadvantages of factor analysis
- Does the combination of factors capture the essential aspects of what is being measured?
- Are the factors generalizable to other populations (ex. different cultures, gender, individuals with disabilities)
define criterion related validity
measures the relationship between the predictor and the criterion, and the accuracy with which the predictor is able to predict performance on the criterion
define concurrent criterion related validity
criterion data are collected before or at the same time that the predictor is administered
define predictive criterion related validity
criterion data are collected after the predictor is administered
define subjective criteria
based upon an individual’s judgement, ex. peer ratings
define objective criteria
based upon specific measurements (how fast someone is, how many absences from class)