Reliability Flashcards

1
Q

OBSERVED SCORE

A

The actual score the test taker received on the test.

2
Q

TRUE SCORE

A

The true, 100% accurate reflection of the test taker's ability, skills, or knowledge (their score if the test/assessment were perfect and without error).

3
Q

The greater the amount of measurement error on test scores, the _______ the reliability/precision

A

Lower

4
Q

MEASUREMENT ERROR

A

Any FLUCTUATION in scores that results from factors related to the measurement process, but not related to what is being measured.

5
Q

RELIABILITY/PRECISION

A

The degree that a measurement’s test scores are dependable, consistent, and stable across different forms of the test, items of the test, and repeat administrations of the test.

6
Q

Why is it imperative to make sure test scores are reliable?

A

Because many important decisions are made about individuals based on test scores.

7
Q

Regarding reliability/precision, what is your job as a professional counselor?

A

To INTERPRET the reliability and DETERMINE the acceptable degree of reliability for the assessment being utilized.

8
Q

What is the formula for Observed Score?

A

Observed Score = True Score + Measurement Error

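The formula on this card can be illustrated with a small Python simulation (illustrative only; the true score, error spread, and seed are invented):

```python
import random

random.seed(42)

# Hypothetical example: simulate observed scores as true score plus random
# measurement error, per the formula Observed = True + Error.
true_score = 100          # the test taker's (unknowable) true score
n_administrations = 10000

observed_scores = []
for _ in range(n_administrations):
    error = random.gauss(0, 3)          # random measurement error, SD = 3
    observed_scores.append(true_score + error)

mean_observed = sum(observed_scores) / len(observed_scores)

# Over many administrations, random error averages out, so the mean
# observed score approaches the true score.
print(round(mean_observed, 2))
```

Because the error is random (mean zero), it inflates the spread of observed scores without shifting their long-run average away from the true score.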
9
Q

What are the sources of measurement error?

A
Time sampling error
Content-sampling error
Interrater differences 
Quality of test items
Test length
Test-taker variables
Test administration
10
Q

TIME SAMPLING ERROR

A

Fluctuation in test scores obtained from repeated testing of the same individual.

11
Q

What is the CARRYOVER EFFECT in time-sampling error?

A

Occurs when the interval between tests is too short and the first test-taking session influences the second; for example, test takers may remember their answers from the first test administration.

12
Q

What is the PRACTICE EFFECT in a time-sampling error?

A

When a test-taker’s skills have improved by having taken the test the first time (some skills improve with practice).

13
Q

What two issues are involved in time-sampling error when the interval between tests is too short?

A

Carryover effect

Practice effect

14
Q

What issues are involved in time-sampling error when the interval between tests is too long?

A

LEARNING, MATURATION (i.e., changes in the test takers themselves that occur over time), or other INTERVENING EXPERIENCES (e.g., treatment).

15
Q

What is the assumption about constructs in time-sampling error?

A

Constructs may vacillate over time.

16
Q

What types of constructs are not as prone to time-sampling errors?

A

Personality traits and abilities

17
Q

What types of constructs are prone to time-sampling errors?

A

Emotional states (depression, anxiety) and achievement.

18
Q

CONTENT SAMPLING ERROR

A

An instrument that does not include items that adequately represent the content domain; that is, error that results from selecting test items that do not adequately cover the content area the test is supposed to evaluate.

19
Q

What is considered the largest source of error in instrument scores?

A

Content-sampling error

20
Q

INTERRATER DIFFERENCES

A

This occurs when instrument scores rely heavily on the subjective judgment of raters. Different raters will rarely assign exactly the same scores or ratings to a given test performance, even when the scoring directions are specified, the test manual is explicit, and the raters are conscientious.

21
Q

When referring to sources of measurement error, what does QUALITY OF TEST ITEMS refer to?

A

How well the test items are constructed (clear and focused vs. vague and ambiguous).

22
Q

What does TEST LENGTH refer to when thinking of sources of measurement errors?

A

As the number of items on a test increases, the more accurately the test represents the content domain being measured. The greater the number of items, the greater the reliability/precision.

23
Q

What TEST-TAKER VARIABLES can be sources of error variance in reliability/precision?

A

A test-taker’s motivation, fatigue, illness, physical discomfort, or mood can all affect the test-taker’s performance on a test and affect the reliability/precision of an assessment.

24
Q

How is TEST ADMINISTRATION a source of measurement error?

A

Factors such as an examiner not following the specified administration instructions, room temperature, lighting, noise, and critical incidents during test administration can all cause measurement error.

25
Q

What are the major methods of estimating reliability?

A
  • TEST-RETEST
  • ALTERNATE FORMS (simultaneous administration, delayed administration)
  • INTERNAL CONSISTENCY (Split-Half, KR Formulas, Coefficient Alpha)
  • INTERRATER
26
Q

What is a RELIABILITY COEFFICIENT?

A

1) The proportion of test-score variance that reflects real differences among test takers rather than random error.
2) A reliability coefficient always pertains to a GROUP of test scores, not individual scores.
3) The methods most often used to estimate reliability/precision report a reliability coefficient (r).

27
Q

The closer reliability coefficients are to zero, the more that test scores represent _____ ______, not ______ _______ __________.

A

random error, test-taker performance

28
Q

What is the formula to determine the percentage of error?

A

Error = 1 - r (reliability coefficient)

29
Q

If the reliability coefficient is 0.85, what is the error?

A

1 - 0.85 = 0.15. Answer: the error is 15%, meaning 15% of the score variance is attributed to random error.

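The error formula from the two cards above can be sketched as a tiny Python helper (a sketch; the function name is mine, not from the source):

```python
def error_proportion(r):
    """Proportion of score variance attributed to random error,
    given a reliability coefficient r (Error = 1 - r)."""
    if not 0.0 <= r <= 1.0:
        raise ValueError("reliability coefficient must be between 0 and 1")
    return 1.0 - r

# Worked example from the card: r = 0.85 -> 15% random error.
print(f"{error_proportion(0.85):.0%}")   # -> 15%
```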
30
Q

What is the oldest and most commonly used method to estimate reliability/precision?

A

Test-retest method.

31
Q

What is the test-retest method?

A

The same test is given twice, with a time interval between the two administrations.

32
Q

What is the term for correlating the scores of tests administered on two separate occasions and what does it reflect?

A

It is called the coefficient of stability.

It reflects the stability of the test scores over time.

33
Q

Because test-retest reliability estimates error related to time sampling, the _____ ______ between the two test administrations must be specified since it will affect the _____ of the test scores.

A

time interval; stability

34
Q

When is the test-retest method most useful?

A

When measuring variables (constructs) that do not change over time, such as traits, abilities, and characteristics.

35
Q

When is the test-retest method inappropriate?

A

When measuring variables (constructs) that are transient and constantly changing, such as someone’s mood.

36
Q

What are the two things to consider when evaluating test-retest reliability reported in an instrument manual?

A

1) The length of the time interval between test administrations.
2) The type of construct (variable) being tested.

37
Q

What is another name for ALTERNATE FORMS reliability?

A

Parallel forms reliability

38
Q

What does ALTERNATE FORMS RELIABILITY determine?

A

It helps us determine if two equivalent forms of the same test are really equivalent. In other words, it tests whether different tests with similar items measure the same content, knowledge, or skill.

39
Q

In alternate forms reliability, the tests must be different but cover the same content domain. What else must the two tests have in common?

A

The two tests should

  • have the same number of items,
  • use the same type of format,
  • use the same directions for administering, scoring, and interpreting the test.
40
Q

What are the two procedures for establishing alternate forms reliability?

A

Simultaneous administration

Delayed administration

41
Q

What is the procedure for simultaneous administration in alternate forms reliability testing?

A

The two forms of the test are given simultaneously to the same group of people on the same day.

42
Q

What is the procedure for delayed administration in alternate forms reliability testing?

A

Giving the two forms of the test on two different occasions.

43
Q

What is the procedure for delayed administration in alternative forms reliability testing?

A

Giving the two forms of the test on two different occasions.

44
Q

What coefficient does alternate forms reliability based on simultaneous administration provide?

A

It provides a coefficient of equivalence because simultaneous administration detects errors related to content sampling.

45
Q

What coefficients do alternate forms reliability based on delayed administration provide?

A

Delayed administration provides a coefficient of equivalence and a coefficient of stability because it detects errors related to content and time-sampling.

46
Q

Why is alternate forms reliability rarely used?

A

Because very few tests have alternate forms. The process of developing an equivalent test that mirrors the other but is still distinct is time consuming, so most test developers do not pursue this option.

47
Q

What does INTERNAL CONSISTENCY RELIABILITY evaluate?

A

The interrelatedness of items within an instrument; that is, the extent to which the items on the test measure the same ability or trait.

48
Q

What does high internal consistency mean?

A

The test items are homogeneous, which increases confidence that the items assess a single construct.

49
Q

Why are internal consistency estimates appealing to test designers/publishers?

A

They require only a single test and a single test administration to estimate reliability.

51
Q

What are the three typical ways of computing internal consistency reliability coefficients?

A
  • Split-Half Reliability
  • Kuder-Richardson Formulas
  • Coefficient Alpha
52
Q

What is the procedure for split-half reliability?

A

A test is divided into two comparable halves, and both halves are given during one testing session. The results on one half of the test are then correlated with the results on the other half.

53
Q

What is the resulting coefficient in split-half reliability?

A

Coefficient of equivalence. It detects errors related to content sampling. Its value indicates how consistently the items on the test measure the same construct.

54
Q

What is the Spearman-Brown prophecy formula?

A

When a test is split into two halves, this formula estimates what the reliability coefficient would be if each half had been the length of the whole test.

55
Q

What are other names for the Kuder-Richardson formulas?

A

KR20 and KR21

56
Q

When are the KR 20 and KR 21 calculations used?

A

Used on tests that have dichotomous items (answers that are scored right or wrong, with 0 indicating an incorrect answer and 1 indicating a correct answer). Reliability can be assessed without splitting the test in half.

57
Q

What is the coefficient alpha?

A

Used when items in the test are not dichotomous. For dichotomous items, it is equivalent to KR 20.
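Coefficient alpha can be computed directly from its standard definition. The sketch below is illustrative, with invented 0/1 data (for which alpha is equivalent to KR 20):

```python
from statistics import pvariance

def cronbach_alpha(item_scores):
    """Cronbach's alpha: (k / (k - 1)) * (1 - sum of item variances / total-score variance).
    item_scores: one row of item scores per test taker."""
    k = len(item_scores[0])                       # number of items
    totals = [sum(row) for row in item_scores]    # total score per person
    item_vars = [pvariance([row[i] for row in item_scores]) for i in range(k)]
    return (k / (k - 1)) * (1 - sum(item_vars) / pvariance(totals))

# Hypothetical dichotomous (0/1) data for 5 test takers on a 4-item test.
data = [
    [1, 1, 1, 1],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 0],
    [0, 1, 0, 0],
]
alpha = cronbach_alpha(data)
print(round(alpha, 3))   # -> 0.79
```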

58
Q

What is the other name for coefficient alpha?

A

Cronbach’s alpha

59
Q

What is more commonly used, KR 20 or Cronbach’s alpha? Why?

A

Cronbach’s alpha, because the results are just as good as KR 20 and it is not limited to dichotomous tests.

60
Q

What measure has become primary in assessing reliability/precision?

A

Cronbach’s alpha.

61
Q

What is INTERRATER RELIABILITY?

A

The extent to which two or more raters agree.

62
Q

What is the basic method for assessing level of agreement between two or more observers?

A

Correlating the scores obtained independently by two or more raters.

63
Q

What does interrater reliability reflect? What does it not reflect?

A

Reflects - interrater agreement

Does not reflect - content sampling error or time-sampling error.

64
Q

If a test is designed to be given more than once, which reliability estimation method would be chosen, and why?

A

Test-retest or alternate forms reliability with delayed administration because both are sensitive to time-sampling errors.

65
Q

If a test involves two or more raters, which reliability estimation method would be chosen?

A

Interrater reliability.

66
Q

What methods would be used to compute internal consistency reliability in a test with heterogeneous content?

A

1) Split-half method: the test would be divided into two equivalent halves, each consisting of constructs A and B, and the halves correlated.
2) KR 20 and coefficient alpha may be used if the differing constructs are placed in homogeneous subgroups. Each subgroup would be tested with KR 20 or coefficient alpha to calculate the reliability/precision of internal consistency.

67
Q

What reliability coefficient values are considered very high, high, acceptable, moderate/acceptable, and low/unacceptable?

A
There is no set threshold, but according to Sheperis et al. (2020) the following are the thresholds:
(A) >.90  Very high
(B) .80-.89  High
(C) .70-.79  Acceptable
(D) .60-.69  Moderate/Acceptable
(F) <.60  Low/Unacceptable
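The thresholds above can be turned into a small lookup helper (a sketch; the boundary handling, e.g. whether .90 itself counts as "Very high", is my assumption, not from the source):

```python
def interpret_reliability(r):
    """Map a reliability coefficient to the interpretive categories
    attributed to Sheperis et al. (2020) on this card."""
    if r >= 0.90:
        return "Very high"
    if r >= 0.80:
        return "High"
    if r >= 0.70:
        return "Acceptable"
    if r >= 0.60:
        return "Moderate/Acceptable"
    return "Low/Unacceptable"

print(interpret_reliability(0.85))   # -> High
```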
68
Q

What is the standard error of measurement (SEM) used for?

A

It is a simple measure of how much an individual’s test score would fluctuate (due to test error) if they took the test repeatedly. It is an estimate of the accuracy of an individual’s observed score relative to the true score, had the individual been tested an infinite number of times.

69
Q

What is the standard error of measurement (SEM)?

A

SEM is the measure of the spread of scores obtained by a SINGLE INDIVIDUAL if the individual was tested multiple times.

70
Q

What is the difference between the SEM and standard deviation?

A

Standard deviation is the spread of scores obtained by a GROUP OF TEST TAKERS on a SINGLE TEST. SEM is the measure of the spread of scores obtained by a SINGLE INDIVIDUAL if the individual was tested MULTIPLE times.

71
Q

What is the SEM used for?

A

The SEM is used to create confidence intervals around specific observed scores, which can guide score interpretations.

72
Q

SEM formula

A

SEM = SD × √(1 − r), where SD is the standard deviation of the test scores and r is the reliability coefficient (see page 144 of the textbook).
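Assuming the standard SEM formula, SEM = SD × √(1 − r), a minimal sketch (the example values SD = 15 and r = 0.94 are invented):

```python
import math

def standard_error_of_measurement(sd, r):
    """Standard SEM formula: SEM = SD * sqrt(1 - r), where SD is the
    standard deviation of the test scores and r is the reliability
    coefficient."""
    return sd * math.sqrt(1 - r)

# Example: a test with SD = 15 and reliability r = 0.94.
sem = standard_error_of_measurement(15, 0.94)
print(round(sem, 2))   # -> 3.67
```

Note how higher reliability shrinks the SEM: at r = 1.0 the SEM would be zero, because a perfectly reliable test has no random error.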

73
Q

What does a confidence interval tell us?

A

The upper and lower limits within which a person’s true score is likely to fall.

74
Q

In assessment, what confidence intervals are we interested in?

A

68%, 95%, and 99.5%

75
Q

What z scores are associated with the confidence levels?

A

68% is associated with a z score of 1.00
95% is associated with a z score of 1.96
99.5% is associated with a z score of 2.58

76
Q

Compute the confidence intervals for an individual’s score of 100 on a test with an SEM of 3.67.

A

See page 145 of the textbook
Answers: 68% probability that the individual’s true score falls between about 96 and 104 (100 ± 1.00 × 3.67)
95% probability that the individual’s true score falls between about 93 and 107 (100 ± 1.96 × 3.67)
99.5% probability that the individual’s true score falls between about 91 and 109 (100 ± 2.58 × 3.67)
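The interval arithmetic can be sketched as (a sketch; the function name is mine):

```python
def confidence_interval(observed, sem, z):
    """Confidence interval around an observed score: observed ± z * SEM."""
    half_width = z * sem
    return observed - half_width, observed + half_width

# Worked example from the card: observed score 100, SEM 3.67.
observed, sem = 100, 3.67
for label, z in [("68%", 1.00), ("95%", 1.96), ("99.5%", 2.58)]:
    lo, hi = confidence_interval(observed, sem, z)
    print(f"{label}: {lo:.1f} to {hi:.1f}")
```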

77
Q

How do researchers decrease the error and increase the reliability/precision of test scores?

A
  1. Increase the number of items on the test (reduces content-sampling error).
  2. Write understandable, unambiguous test items.
  3. Use selected-response items (multiple choice) rather than constructed-response items (essays).
  4. Make sure items are not too difficult or too easy.
  5. Have clearly stated administration and scoring procedures.
  6. Require training before individuals administer, grade, or interpret a test.