Stats Flashcards

1
Q

Tests of structure

A

Factor analysis, principal components analysis, cluster analysis

What is the underlying structure of a construct?

2
Q

Discriminant analysis

A

Predicts group membership

3
Q

When would you use a factorial/two-way vs. a split-plot/mixed ANOVA?

A

Factorial: 2 IVs, both independent groups (e.g. TX and gender)

Split-plot: 2 IVs, mixed independent and correlated groups (e.g. TX and time)

4
Q

Cross-sequential research

A

A combination of cross-sectional and longitudinal designs: follow cohorts over time, but for shorter periods than a longitudinal study

5
Q

What impacts statistical power? (The ability to correctly reject the null / detect an effect)

A

Larger sample size
Stronger intervention
Less error
Parametric test
One-tailed test

6
Q

How to tell if data is independent or correlated?

A

Independent:
Random assignment
Groups based on pre-existing differences (e.g. gender)

Correlated:
Measured over time, or repeatedly
Matched subjects
Related subjects

7
Q

Assumptions for parametric tests

A

Interval/ratio data
Homoscedasticity (same spread between groups)
Normally distributed data

8
Q

Assumptions for chi-square test

A

Independent observations: subjects cannot be measured more than once or over time

9
Q

Multicollinearity

A

In multiple regression, when the predictors are highly correlated with each other

10
Q

Formula for z-scores

A

(Raw score - mean)/SD
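The formula is simple enough to sketch in Python (the numbers below are illustrative, e.g. an IQ-style scale with mean 100 and SD 15):

```python
def z_score(raw, mean, sd):
    """Standardize a raw score: how many SDs it falls from the mean."""
    return (raw - mean) / sd

# Illustrative: a raw score of 115 on a scale with mean 100, SD 15
print(z_score(115, 100, 15))  # 1.0
```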

11
Q

Canonical R & Canonical Analysis

A

Two or more X's and two or more Y's

Canonical R = correlation
Canonical analysis = prediction/regression

12
Q

Discriminant function analysis

A

Predicts a nominal Y (which category people fall into) from interval/ratio X's; a special multiple regression equation. For example, college admissions, or pass/fail on an exam

13
Q

Log-linear analysis

A

Predicting a nominal Y from nominal X's

14
Q

Path analysis and structural equation modeling

A

Uses correlational techniques to test out causal models

Path analysis tests out researcher-identified relationships

Structural equation modeling tests out different paths; uses LISREL

15
Q

Test of difference for ordinal/nominal data, or when I/R data violates assumptions of parametric tests

A

Nominal: Chi-square
- 1 IV
- 2+ IVs = multiple sample Chi-square
- Correlated data = McNemar

Ordinal, 1 IV
- 1 group = Kolmogorov (single vodka)
- 2 gps, independent = Kolmogorov-Smirnov, Mann-Whitney (double vodka)
- 2 gps, correlated = Wilcoxon (oxen are yoked/correlated)

16
Q

Scheffé & Tukey vs. Fisher's LSD

A

Post-hoc ANOVA tests to identify where a significant group difference is coming from

Scheffé & Tukey: best protection from Type I error
Fisher's LSD: best protection from Type II error

17
Q

Assumptions of bivariate correlations

A
  1. Homoscedasticity
  2. Unrestricted range on both variables
18
Q

Homoscedasticity

A

Equal variability:

Across the entire scatterplot for correlations

Between groups for tests of difference

19
Q

Which correlation coefficient to use?

A
  • both interval/ratio: Pearson r
  • both ordinal: Spearman's rho, Kendall's tau
  • interval/ratio and dichotomous: point-biserial/biserial (point-biserial for a true dichotomy)
  • two true dichotomies: phi
  • two artificial dichotomies: tetrachoric
  • curvilinear relationship: eta
20
Q

Latent trait model, or item response theory (IRT)

A

Item performance is related to the amount of the respondent's latent trait. Latent trait models are used to establish a uniform scale of measurement that can be applied to individuals of varying ability and test content of varying difficulty

21
Q

Classical test theory

A

Total variability in scores is the combination of true score variability and error variability

22
Q

Cluster analysis

A

Looking for naturally occurring subgroups in data without a priori hypothesis (e.g. profiles of individuals)

23
Q

Relationship of standard error with standard deviation

A

All of the standard errors have a direct relationship with their related standard deviation

24
Q

Standard error of the mean versus measurement versus estimate

A

All are measures of average variability

Standard error of the mean: variability of a sample mean around the population mean. SDpop/sqrt(N)

Standard error of measurement: variability of individual scores due to measurement error. Formula includes SDx and the reliability coefficient rxx. Range: 0 to SDx

Standard error of the estimate: variability of prediction error. Formula includes SDy (why, why, why??) and the validity coefficient rxy. Range: 0 to SDy
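The three standard errors can be sketched with their standard textbook formulas (SEmean = SD/√N, SEmeas = SDx·√(1−rxx), SEest = SDy·√(1−rxy²)); all numbers below are illustrative:

```python
import math

def se_mean(sd_pop, n):
    """Standard error of the mean: SDpop / sqrt(N)."""
    return sd_pop / math.sqrt(n)

def se_measurement(sd_x, r_xx):
    """Standard error of measurement: SDx * sqrt(1 - rxx); ranges 0..SDx."""
    return sd_x * math.sqrt(1 - r_xx)

def se_estimate(sd_y, r_xy):
    """Standard error of the estimate: SDy * sqrt(1 - rxy**2); ranges 0..SDy."""
    return sd_y * math.sqrt(1 - r_xy ** 2)

print(se_mean(15, 25))                       # 3.0
print(round(se_measurement(15, 0.91), 2))    # 4.5
print(round(se_estimate(10, 0.6), 2))        # 8.0
```

Note how each formula scales its related standard deviation (card 23's point): more reliability or validity shrinks the error, never beyond 0; less shrinks it toward the SD itself.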

25
Q

Criterion deficiency and relevance

A

Criterion relevance: the extent to which the actual criterion (e.g. whatever measure you're using) truly measures the conceptual criterion

Criterion deficiency: the actual criterion is deficient in measuring the conceptual criterion

Greater deficiency, less relevance
26
Q

Eigenvalue / characteristic root

A

Number that tells you how strong a factor is in factor analysis - how much variance is explained
27
Q

Zero-order vs. partial vs. semi-partial correlations

A

Zero-order: most basic, just X and Y

Partial/first-order: remove the effect of a third variable Z from both X and Y

Semi-partial/part: remove the effect of a third variable Z from only one variable, X or Y
28
Q

Multiple R

A

Multivariate test of: two or more X's correlated with one Y
29
Q

Coefficient of determination

A

In a correlational relationship, the amount of variability in Y explained by X

Calculated by squaring the correlation coefficient

For multiple R, it's called the coefficient of multiple determination (amount of variability in Y explained by multiple X's)
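The "square the coefficient" step in one line of Python (r = .70 is an illustrative value):

```python
# Coefficient of determination: square the correlation coefficient
r = 0.70                    # illustrative Pearson r between X and Y
r_squared = r ** 2
print(round(r_squared, 2))  # 0.49 -> X explains 49% of the variability in Y
```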
30
Q

Stepwise vs. hierarchical regression

A

Stepwise regression: computer decides which X variable to enter first based on strength of correlation with Y

Hierarchical regression: researcher has a theory-based order of entering the X variables into the regression
31
Q

Factor loadings

A

Correlation between a variable (e.g. test item or subtest) and the underlying factor

Interpret if greater than or equal to .3
32
Q

Orthogonal vs. oblique rotation

A

Methods of doing factor analysis

Orthogonal rotation: factors end up uncorrelated

Oblique rotation: factors are correlated (e.g. WAIS)
33
Q

Communality

A

Amount of variability explained by the factors combined

Calculated by squaring and adding factor loadings
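The "square and add" calculation, sketched with made-up loadings on two factors:

```python
# Communality of one variable = sum of its squared factor loadings
loadings = [0.6, 0.5]                        # illustrative loadings on two factors
communality = sum(l ** 2 for l in loadings)
print(round(communality, 2))                 # 0.61 -> 61% of the variable's variance
```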
34
Q

Principal components analysis vs. principal factor analysis

A

Subtypes of factor analysis

Principal components analysis: no a priori hypothesis, factors are empirically derived

Principal factor analysis: theoretically derived factors, communalities already known
35
Q

Three sources of error in test construction

A

1. Content sampling: by chance you know more or less on a test
2. Time sampling: different scores due to the passage of time
3. Test heterogeneity: items tap multiple domains
36
Q

Four factors affecting reliability

A

1. Number of items: more items, more reliable
2. Homogeneity of items: more homogeneous, more reliable. You want the items to be testing a similar thing
3. Range of scores: greater range, more reliability. So you want a heterogeneous sample
4. Ability to guess: more ability to guess, lower reliability. True/false is less reliable than multiple choice
37
Q

Four types of reliability

A

1. Test-retest, AKA coefficient of stability
2. Parallel forms, AKA alternate forms, AKA equivalent forms
3. Internal consistency reliability
- split-half
- Kuder-Richardson or coefficient alpha
4. Interrater reliability
38
Q

Test-retest reliability

A

AKA coefficient of stability

Same subjects, same test, different time points

Main source of error: time
39
Q

Parallel forms reliability

A

AKA alternate forms, AKA equivalent forms; yields the coefficient of equivalence

Same subjects, different tests, different time points

Main sources of error: time and content sampling
40
Q

Internal consistency reliability

A

Consistency of scores/items within the test

Same subjects, same test, administered once

1. Split-half reliability
- underestimates reliability because of fewer items
- to correct for this, use the Spearman-Brown prophecy formula: how much more reliable is the test with X number of items?
- bad for speeded tests, good for power tests
- main source of error: content sampling

2. Kuder-Richardson and Cronbach's coefficient alpha
- analyzes all possible ways of splitting the test in half
- main sources of error: content sampling and test heterogeneity
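The Spearman-Brown prophecy formula mentioned above (rkk = k·r / (1 + (k−1)·r), where k is the factor by which test length changes) can be sketched as:

```python
def spearman_brown(r, k):
    """Projected reliability when test length is multiplied by k."""
    return (k * r) / (1 + (k - 1) * r)

# Correcting a split-half correlation back to full test length (k = 2);
# r = .60 is an illustrative half-test correlation
print(round(spearman_brown(0.60, 2), 2))  # 0.75
```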
41
Q

Interrater reliability

A

Agreement between scorers on a subjectively scored test

Improves with group discussion, practice, and feedback

Measures: % agreement, r, kappa, Yule's Y
42
Q

What do you need to calculate a confidence band?

A

1. Raw score
2. Standard error of measurement

NOT the group mean
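A sketch of the calculation (a 95% band is roughly raw score ± 1.96 × SEM; the score, SD, and reliability below are illustrative):

```python
import math

def confidence_band(raw, sd_x, r_xx, z=1.96):
    """Raw score +/- z * standard error of measurement (95% band by default)."""
    sem = sd_x * math.sqrt(1 - r_xx)  # SEM ranges from 0 to SDx
    return raw - z * sem, raw + z * sem

# Illustrative: raw score 100, SDx = 15, reliability rxx = .91 -> SEM = 4.5
low, high = confidence_band(100, 15, 0.91)
print(round(low, 1), round(high, 1))  # 91.2 108.8
```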
43
Q

Three types of validity

A

1. Content validity: skills and knowledge
2. Criterion-related validity: prediction
- concurrent validity: X and Y measured at the same time
- predictive validity: X and Y measured with a delay
3. Construct validity: traits
- convergent
- divergent
44
Q

Expectancy tables

A

Probability that a person's criterion (Y) score will fall in a range given their predictor (X) score
45
Q

Taylor-Russell tables

A

How much improvement hiring/selection decisions will gain from using a certain test

1. Base rate: rate of successful employees without the test. Best when MODERATE (.5)
2. Selection ratio: proportion of available openings to applicants. Best when LOW (.1)
3. Incremental validity: amount of improvement in the success rate over the base rate when you use the test. Optimized when the test has good criterion-related validity
46
Q

Item analysis

A

When developing a predictor test, how you determine which items to keep

Factors:
1. Item difficulty: proportion of people who got the item right
2. Item discrimination: how well the item discriminates between high and low scorers
3. Item validity: correlation between the item and the whole test score
4. ICC
5. Test revision: calculate the criterion-related validity coefficient with the revised set of items
- cross-validate on a new sample, which always results in shrinkage of the criterion-related validity coefficient
47
Q

Item response theory (IRT) / latent trait theory

A

To what extent an item correlates with an underlying trait

Developed from the item characteristic curve (ICC)

Used to develop individually tailored adaptive tests
48
Q

Shrinkage

A

Relates to cross-validation when you are developing a predictor test. When you compare the new sample against the original, the criterion-related validity coefficient will always be smaller, because the first sample was used to tailor the items
49
Q

Factors affecting criterion-related validity

A

1. Range of scores: broad range, heterogeneous subjects
2. Reliability of the predictor: reliability puts a ceiling on validity
3. Correction for attenuation: formula that tells you, if X and Y were perfectly reliable, how much more valid the instrument would be
4. Criterion contamination: with subjectively scored tests, when the rater knows how the person did on the predictor; artificially inflates validity
50
Q

Correction for attenuation

A

Formula that tells you, if the predictor (X) and criterion (Y) were perfectly reliable, how much more valid the instrument would be
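The standard correction-for-attenuation formula (rtrue = rxy / √(rxx · ryy)) sketched with illustrative reliabilities:

```python
import math

def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Estimated validity if both predictor and criterion were perfectly reliable."""
    return r_xy / math.sqrt(r_xx * r_yy)

# Illustrative: observed validity .30, predictor reliability .81, criterion reliability .64
print(round(correct_for_attenuation(0.30, 0.81, 0.64), 3))  # 0.417
```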
51
Q

Criterion contamination

A

With subjectively scored criterion tests, when the rater knows how the person did on the predictor test; artificially inflates the validity of the criterion
52
Q

Multitrait-multimethod matrix

A

Way to test construct validity, including convergent and divergent validity

1. Convergent validity: how much do scores on the new test converge with other measures of the same or similar traits? ---> MONO-trait, HETERO-method, HIGH correlation
2. Divergent validity, AKA discriminant validity: how much does the new test differentiate between measures of different traits? ---> HETERO-trait, MONO-method, LOW correlation
53
Q

Time-series design

A

Establish a longitudinal baseline by taking many measurements over a period of time, then introduce your experimental manipulation and see if the trend changes

Main threat: history
54
Q

Central limit theorem

A

Mean of means = population mean

As sample size increases, the distribution of means becomes more normal
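Both claims are easy to check with a quick simulation using only the standard library (the exponential population is just an example of a skewed distribution):

```python
import random
import statistics

random.seed(0)
# A deliberately skewed population (exponential, mean around 1.0)
population = [random.expovariate(1.0) for _ in range(100_000)]
pop_mean = statistics.mean(population)

# Draw many samples of size 50 and look at the distribution of their means
sample_means = [statistics.mean(random.sample(population, 50)) for _ in range(1_000)]

# The mean of means sits on the population mean, with much less spread
print(abs(statistics.mean(sample_means) - pop_mean) < 0.05)  # True
print(statistics.stdev(sample_means) < statistics.stdev(population))  # True
```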
55
Q

Item characteristic curve (ICC)

A

Part of an item analysis; basis of item response theory (IRT)

Graphs that depict individual test items in terms of the percentage of individuals in different ability groups who answered the item correctly

For example, on one item, 80% of people in the highest ability group got it correct, 40% in the middle group, etc.