Week 4: Survey Methods Flashcards

1
Q

What is descriptive research?

A
  • Descriptive research: description of individual variables
  • Describing how things are, rather than explaining why they are like that
  • Observational & Survey Research
  • RQ: What is the typical number of hours spent studying each week?
  • Survey the participants then perform descriptive statistics
2
Q

Advantages of surveys?

A
  • Avoids the limits of behavioural observation
  • Allows gathering large amounts of information fairly easily
  • Online surveys enable global research
3
Q

Disadvantages of surveys?

A
  • Based on self-report
  • Biases, e.g. social desirability (intentional or unintentional)
  • Memory errors & limits of insight
4
Q

An example of a research survey that measures individual differences?

A

E.g., the Five Factor Model of personality (Costa & McCrae, 1992)
Measures psychological traits or characteristics

5
Q

An example of a research survey that measures ability?

A

Intelligence tests

6
Q

An example of a research survey that measures attitudes?

A

E.g. a Likert-scale questionnaire about how much you like work
(Measure particular beliefs toward something e.g. work)
e.g. I am satisfied with the work I do.
Strongly disagree 1 2 3 4 5 6 7 Strongly agree

7
Q

What two things should you consider ethically/morally about surveys?

A

1) Ensure that the measures being used are reliable and valid.
2) Be aware of equal opportunities and cultural biases.
Knife example

8
Q

What are open format questions?

Adv and Dis?

A

Introduces a topic and allows participants to respond in their own words. Have no determined set of responses.

“Tell us about the occasions when you have been academically vindictive.”

Adv: leads to richer qualitative data; gives the participant the greatest flexibility. May reveal things the interviewer hadn't thought about before.

Dis: time-consuming AND difficult to analyse. Difficult to compare answers with different content across participants. Sometimes the researcher has to impose their own subjective interpretation on an answer in order to code it.

9
Q

What are closed format questions?

Adv and Dis?

A

Short questions or statements followed by a number of options. Has a limited number of response alternatives, like a multiple choice.

E.g. I feel bitter towards those who do better than me on my course.
Strongly disagree Disagree Not certain Agree Strongly agree

Adv: easy to analyse and summarise. Can code things numerically.

Dis: you won't find out anything beyond what you were expecting

10
Q

What are three ways you could get advice on how to write your questions for your survey?

A

Theoretical literature: Ideas that appear in the theoretical literature should be used as a basis.
Experts: Recruit experts in the area to suggest items.
Colleagues: Can help you to generate more items.

11
Q

What is the problem with this survey question and how could you fix it?

“If I had the opportunity, resources and ability to change other students’ exam grades so that mine was the best, I would do it.”

A

Problem: asking about too many factors. Respondents may concentrate on opportunity, resources, and ability to different extents.

Solution - remove ambiguity:
If I had the opportunity to change other students’ exam grades so that mine was the best, I would do it.

12
Q

Should we use or avoid leading questions?

A

Avoid.

13
Q

What are leading questions?

A

A question phrased in such a way that you lead the person to answer in a certain way.

e.g. Mr Woolley, are you worried about the danger of war?

e.g. “Do you agree with the majority of Australians that it is wrong to falsely tell a classmate the wrong exam date, so that they miss the exam?”

Problem:
Leads the respondent in a particular direction by indicating what “a majority of Australians” think.

Solution:
“Is it wrong to falsely tell a classmate the wrong exam date, so that they miss the exam?”

14
Q

What is reverse wording?

A

Reversing the wording/phrasing on some questions to get a stronger and more valid measure.

For example, if we want to measure retirement confidence we might ask respondents how much they agree with the statement “I am confident that I will be able to live comfortably in retirement” and “I worry about being able to make ends meet in retirement.”
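Reverse-worded items must be reverse-scored before totalling, so that high scores always point the same way. A minimal sketch with hypothetical responses to the two retirement items above, on a 7-point scale:

```python
def reverse_score(score, scale_min=1, scale_max=7):
    """Flip a response on a Likert-type scale: 1 <-> 7, 2 <-> 6, and so on."""
    return scale_max + scale_min - score

# Hypothetical responses to the two retirement-confidence items:
confident = 6   # "I am confident..."  (agreeing = more confident)
worry = 2       # "I worry..."         (agreeing = LESS confident)

# Reverse-score the negatively worded item before summing.
confidence_total = confident + reverse_score(worry)
print(confidence_total)  # → 12
```

Both items now contribute in the same direction, so a higher total consistently means greater retirement confidence.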

15
Q

What does reverse wording reduce the problem of?

A

Positivity bias (someone who tends to agree with all questions)

Negativity bias (someone who tends to disagree with all questions)

Also raises questions about the data of people just clicking random answers to finish the survey.

16
Q

What is a dichotomous response format?

A

Only two opposite answers/options.

e.g. yes/no, true/false

Problem: it is also a forced choice. This can be a bad thing: it removes nuance, and if respondents genuinely don't know or don't care, forcing them to choose isn't necessarily good either.
e.g. Do you suffer from nerves? Yes or no?

17
Q

What is a ‘frequency of behaviour’ response format?

A

Asks about activities/behaviour

e.g. I get upset and let my emotions out.
• I usually don’t do this at all
• I usually do this a little bit
• I usually do this a medium amount
• I usually do this a lot

18
Q

What response format has “Strongly agree – strongly disagree”?

A

Likert scale

e.g. Overall, I expect more good things to happen to me than bad.
Strongly Disagree Disagree Not certain Agree Strongly Agree

19
Q

What is a numerical scale response format?

How does it differ from likert scale?

A

It is actually a type of Likert scale and works much like the agree/strongly agree format,
BUT the end points are anchored semantically and respondents choose a number in between.

e.g. I feel unsure of myself.
Not at all like me Very much like me
1 2 3 4 5 6 7

20
Q

Why are instructions important for a test?

A

Because they:

  • can, for example, ask respondents not to overthink the questions and to answer intuitively (e.g. a personality test), or ask them to consider their responses carefully.
  • can indicate whether it is a state or a trait questionnaire. State means “Indicate to what extent you feel this way right now, that is, at the present moment” VS trait, which tests something generally stable over time.
21
Q

Explain this equation to me:

OBSERVED score = TRUE score + Error

What is it even referring to?

A

It is referring to the Classical theory of error in measurement. It is related to measuring reliability (consistency of a measure).

E.g. trying to measure your intelligence with IQ test.
Score you get is partially determined by your true score (actual level of intelligence, quite stable) + your error (a bunch of other factors like fatigue, level of health, current mood, hunger, luck).

Note: over multiple tests, the increases and decreases caused by error should average to zero.
Note: observed score can also be called measured score.

More notes:
• Standard error of measurement
• Universe of items
• All items correlate to some extent with the true score
• Reliability is related to the average correlation between items and test length
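The equation can be illustrated with a small simulation. This is a minimal sketch with hypothetical numbers (not from the lecture); error is modelled as symmetric random noise:

```python
import random

random.seed(42)                 # reproducible illustration
TRUE_SCORE = 100                # hypothetical stable true level (e.g. IQ)

def take_test():
    # "Error" lumps together fatigue, mood, hunger, luck... as random noise.
    error = random.gauss(0, 5)
    return TRUE_SCORE + error   # OBSERVED = TRUE + Error

scores = [take_test() for _ in range(10_000)]
mean_observed = sum(scores) / len(scores)

# Across many tests the errors average out towards zero, so the mean
# observed score sits very close to the true score.
print(abs(mean_observed - TRUE_SCORE) < 1)
```

Any single observed score can be off by quite a bit, but the average over many administrations recovers the true score, which is exactly the note above.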

22
Q

OBSERVED score = TRUE score + Error

If error is small, my scores would be relatively …..
from one measurement to another, and therefore……..

A

consistent

reliable

23
Q

OBSERVED score = TRUE score + Error

If error is big, my scores would be relatively …..
from one measurement to another, and therefore…….

A

inconsistent

not reliable

24
Q

What are four measures of internal reliability?

A

– Split-half reliability
– Parallel forms
– Cronbach’s Alpha
– KR-20

25
Q

What is split half internal reliability?

A

Internal reliability: are all questions measuring the same thing?
Split-half: take the first half and the second half of the test, calculate a score for each half, and the two scores should correlate with each other.

Note:
Items are split into two halves, based on:
• Odd vs. even numbers
• Randomly selecting items for each half
• First half vs. second half of the test

Correlate the total scores for each half
Pearson's r correlation of 0.80 or higher indicates good/adequate reliability

26
Q

What is parallel forms and how does it test internal reliability?

A

• Create a large pool of items.
• Randomly divide the items into two separate tests.
• Administer the two tests to the same participants.
• Calculate the correlation between the two forms.
Problem: Difficult to generate the large number of items required.

27
Q

What is Cronbach’s alpha? What does it measure?

A

Measures internal reliability

• Cronbach’s Alpha is mathematically equivalent to the average of all possible split-half estimates.
• Usually a figure of +0.7 or greater indicates acceptable internal reliability.
• Calculates the correlation for every possible way of splitting the test into two halves.
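The usual computational formula is alpha = (k / (k − 1)) × (1 − sum of item variances / variance of total scores). A minimal sketch with hypothetical responses (4 items, 6 respondents, 1–5 scale):

```python
def variance(xs):
    """Population variance of a list of numbers."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def cronbach_alpha(items):
    """items: one list of responses per item (respondents in the same order)."""
    k = len(items)
    totals = [sum(resp) for resp in zip(*items)]
    sum_item_var = sum(variance(item) for item in items)
    return (k / (k - 1)) * (1 - sum_item_var / variance(totals))

# Hypothetical data: 4 items (rows) answered by 6 respondents.
items = [
    [5, 2, 4, 1, 3, 5],
    [4, 1, 4, 2, 3, 5],
    [5, 2, 3, 1, 4, 4],
    [4, 2, 4, 1, 3, 5],
]
print(round(cronbach_alpha(items), 2))  # → 0.96, well above the 0.7 threshold
```

In practice you would use a stats package rather than hand-rolling this, but the formula itself is this small.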

28
Q

What does this do:

Kuder-Richardson Formula 20 (KR – 20)

A

Basically does the same thing as Cronbach's Alpha, but only for dichotomous scales.

  • Measures internal reliability for measures with dichotomous choices (i.e., 2 choices Yes/No).
  • Usually a figure of +0.7 or greater indicates acceptable internal reliability.
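KR-20 replaces each item variance with p(1 − p), where p is the proportion answering “yes” to that item. A sketch with hypothetical yes/no data:

```python
def kr20(items):
    """KR-20 internal reliability for dichotomous (0/1) items.
    Mathematically it is Cronbach's Alpha specialised to yes/no items."""
    k = len(items)
    n = len(items[0])                      # number of respondents
    totals = [sum(resp) for resp in zip(*items)]
    mean_t = sum(totals) / n
    total_var = sum((t - mean_t) ** 2 for t in totals) / n
    sum_pq = 0.0
    for item in items:
        p = sum(item) / n                  # proportion answering "yes"
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / total_var)

# Hypothetical data: 4 yes/no items (rows) answered by 5 respondents (1 = yes).
items = [
    [1, 0, 1, 0, 1],
    [1, 0, 1, 1, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 1, 0, 1],
]
print(round(kr20(items), 2))  # → 0.75, just above the 0.7 cut-off
```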
29
Q

Which test of internal reliability involves the creation of two different versions of a questionnaire?

a) Split-half reliability
b) Cronbach’s alpha
c) Parallel forms reliability
d) KR-20

A

c) Parallel forms reliability

30
Q

External reliability vs internal reliability? Difference?

A

Internal reliability assesses the consistency of results across items within a test. External reliability refers to the extent to which a measure varies from one use to another.

31
Q

What is a measure of external reliability. Describe it.

A

Test-retest reliability: measures the stability of a test over time.

  • Perform the same survey, with the same respondents, at different points in time.
  • The closer the results, the greater the test-retest reliability of the survey.
  • The correlation coefficient between the two sets of responses is often used as a quantitative measure of the test-retest reliability.
32
Q

Problems with test retest reliability ?

A

Practice effects (respondents may remember or learn from the first administration)

33
Q

What is INTER-rater reliability?

Note: this doesn't seem to be classified as external or internal in the lecture (but in the textbook it appears to be external).

A

Inter-rater reliability determines the extent to which two or more raters obtain the same result when coding the same response.

Cohen’s Kappa: larger numbers indicate better reliability, used when there are two raters.
Fleiss’ Kappa: an adaptation which works for any fixed number of raters.
NOTE: Measures agreement, not accuracy. IMPORTANT DISTINCTION.
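Cohen's Kappa can be sketched in a few lines: kappa = (p_observed − p_chance) / (1 − p_chance), where p_chance comes from each rater's category proportions. Hypothetical codes from two raters:

```python
from collections import Counter

def cohens_kappa(rater1, rater2):
    """Chance-corrected agreement between two raters coding the same
    responses. Note: this measures agreement, not accuracy."""
    n = len(rater1)
    p_observed = sum(a == b for a, b in zip(rater1, rater2)) / n
    c1, c2 = Counter(rater1), Counter(rater2)
    categories = set(rater1) | set(rater2)
    # Chance agreement: both raters pick the same category independently.
    p_chance = sum((c1[c] / n) * (c2[c] / n) for c in categories)
    return (p_observed - p_chance) / (1 - p_chance)

# Hypothetical codes assigned by two raters to six open-format answers.
r1 = ["vindictive", "vindictive", "neutral", "neutral", "vindictive", "neutral"]
r2 = ["vindictive", "neutral",    "neutral", "neutral", "vindictive", "neutral"]
print(round(cohens_kappa(r1, r2), 2))  # → 0.67
```

The raters agree on 5 of 6 answers (0.83 raw agreement), but kappa drops to 0.67 once agreement expected by chance is subtracted out.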

34
Q

What is INTRA-rater reliability?

what is a drawback?

A

The same assessment is completed by the same rater on two or more occasions.
These different ratings are then compared, generally by means of correlation.
Problem: Since the same individual is completing both assessments, the rater’s subsequent ratings are contaminated by knowledge of earlier ratings.

35
Q

What are some sources of unreliability in a survey?

A
  • Guessing
  • Ambiguous items
  • Test length
  • Instructions
  • Temperature, illness
  • Item order effects
  • Response rate
  • Social desirability
36
Q

Which measure of reliability measures stability over time?

a) Split-half reliability
b) Cronbach's Alpha
c) Test-retest reliability
d) Cohen's Kappa

A

c) Test-retest reliability

37
Q

what is validity?

A

the degree to which the measurement process measures the variable that it claims to measure

38
Q

Name 5 types of validity

A
• Faith
• Face
• Content
• Construct
     - Convergent
     - Discriminant 
• Predictive
39
Q

what is FAITH validity?

A

Faith Validity is the least defensible type of validity but the most difficult to influence. It is simply a conviction, a belief of blind faith that a selection test is valid. There is no empirical evidence and, what is more, none is wanted.

40
Q

What is FACE validity?

A

Again the least scientific type of validity along with faith.
The superficial appearance, or face value, of a measurement procedure: does the measurement technique look like it measures the variable we want to measure?

Can have high face validity: it is obvious what is being tested through the measure (problem: participants may adjust their answers to appear socially desirable).
Low face validity: the purpose of the measure is ambiguous.

In relation to the academic vindictiveness scale:
Find experts in academic vindictiveness, and ask them to judge whether the questionnaire represents a good measure of that construct.

41
Q

What is content validity?

A

The extent to which a measure represents all facets of the phenomena being measured.

So, in the case of academic vindictiveness:
There might be different types: 
Academic vindictive behaviours 
Academic vindictive attitudes 
Academic vindictive feelings

Content would need to measure all three of these factors for instance.

42
Q

What is construct validity? What are the two components that make up this type of validity?

A

Construct v: Seeks to establish a clear relationship between the construct at a theoretical level and the measure that has been developed.

TWO SUBTYPES:
Convergent validity:
That the measure shows associations with measures that it should be related to, e.g., academic vindictiveness should be related to other aspects of vindictiveness; such as a tendency to seek revenge, or spitefulness.

Discriminant validity:
That the measure is NOT related to things that it should not be related to. If they don't correlate strongly with each other, then that's good in this instance.

e.g. Academic vindictiveness (revenge) should be measuring something different from the Five Factors of personality, so it should not correlate highly with extraversion, neuroticism, agreeableness, openness and conscientiousness.

43
Q

What is predictive validity?

A

Assesses whether a measure can accurately predict future behaviour.

Scores on the academic vindictiveness scale should be able to predict people acting in an academically vindictive way in the future:
Not sharing notes
Not helping other people revise

44
Q

Fill in sentence:

“Reliability is a necessary but …. …………. condition for validity.”

A

not sufficient

45
Q

Distinguish once more for me between reliability and validity.

A

Reliability:
A questionnaire is reliable if all of the questions in the test consistently measure the same underlying concept, and if this remains stable over repeated administrations of the test.

Validity:
A test is valid if it is actually measuring what you intend it to measure.

46
Q

Which measure of validity states that a test should represent all facets of the phenomena being measured?

a) Face validity
b) Predictive validity
c) Content validity
d) Faith validity

A

Content validity

47
Q

Tell me what these scales do:

Nominal
Ordinal
Interval
Ratio

A

Nominal: tells us only that difference exists

Ordinal: tells us the direction of the difference (which is more and which is less)

Interval: can determine the direction and the magnitude of the difference.

Ratio: tells us the direction, magnitude and ratio of the difference.

(Remember this order; it will help you. As you go down the list, each scale becomes more informative and tells us more.) Mnemonic: NOIR (French for “black”).

48
Q

Non response bias ?

(a random question I'm throwing in from the textbook)

A

The idea that you're only testing the motivated people who respond, instead of the less motivated ones who don't respond to surveys.