Research Design, Statistics, & Test Construction Flashcards
Quasi-experimental design
at least one IV is manipulated, but there is no random assignment of participants (typically because participants are already in pre-existing groups)
Within-subjects design
groups compared are correlated or related; three conditions lead to this: repeated measures of same participants, subjects matched prior to assignment to groups, subjects have an inherent relationship (e.g., twins)
Latin square
most sophisticated form of counterbalancing subjects in a repeated measures design
Mixed design
includes groups that are both independent and correlated (e.g., patients randomly assigned to two different treatment groups and measured before and after treatment)
Idiographic
refers to single subject approaches (single or few participants studied intensely); AB, ABAB, multiple baseline, simultaneous treatment, and the changing criterion
Nomothetic
group approaches to research design (as opposed to single subject)
Autocorrelation
effect of measuring same person repeatedly; results in highly correlated data; problem of single subject design
AB design
baseline condition (A) followed by treatment condition (B); most significant problem is threat of history (difficult to determine whether intervention or other event caused change)
ABAB design
baseline (A) and treatment (B) alternated in ABAB sequence; protects against threat of history; two potential problems: failure of behavior to return to baseline, issues of ethics with removing effective treatment
Multiple baseline design
treatment is applied sequentially or consecutively across subjects, situations, or behaviors
Simultaneous (alternating) treatment design
two or more interventions implemented concurrently during the treatment phase, balanced and varied across conditions such as time of day
Changing criterion design
attempt is made to change behavior in increments to match a changing criterion (e.g., slowly reducing number of cups of coffee)
Momentary time sampling
simply recording whether target behavior is present or absent at moment that time interval ends
Whole-interval sampling
scoring target behavior positively only if exhibited for full duration of time interval
Analogue research
evaluates treatment under conditions that only resemble or approximate clinical situations; typically for less severe conditions; tight experimental control but limited generalizability (e.g., grad student clinicians using manual)
Clinical trials
outcome investigations conducted in clinical settings; often involve methodological compromises and sacrifices
Cross-sequential research
also called cohort-sequential research; takes several cross sections and follows them over briefer periods of time
Stratified random sampling
population is first divided into strata (e.g., age levels, income levels, ethnic groups), and then a random sample of equal size from each stratum is selected
Proportional sampling
individuals are randomly selected in proportion to their representation in the general population
Systematic sampling
selecting every kth element after a random start (e.g., if 100 out of 1,000 persons are needed, every tenth person is selected); the list must be arranged so that its ordering does not introduce bias
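The selection rule above can be sketched in Python; the function name, the population of 1,000, and the sample of 100 are illustrative:

```python
# Systematic sampling sketch: pick every kth element after a random start.
import random

def systematic_sample(population, sample_size):
    k = len(population) // sample_size   # sampling interval
    start = random.randrange(k)          # random start within the first interval
    return population[start::k][:sample_size]

people = list(range(1000))               # hypothetical population of 1,000
sample = systematic_sample(people, 100)  # every 10th person after a random start
```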
Cluster sampling
identifying naturally occurring groups of subjects (clusters) and randomly selecting certain clusters (e.g., classes or departments at a university, or schools within a particular school district)
History
threat to internal validity; incidents that intervene between measuring points, either in or outside of the experimental situation; best control is a control group
Maturation
threat to internal validity; factors that affect the subjects’ performance because of the passing of time (fatigue, maturing); best control is a control group
Testing or test practice
threat to internal validity; occurs when familiarity with testing affects scores on repeated testing; best control is Solomon Four-Group design
Solomon Four-Group design
controls for testing threats to internal validity; subjects are divided into four groups: (1) pretest, intervention, posttest; (2) pretest, no intervention, posttest; (3) intervention and posttest only; (4) posttest only
Instrumentation
threat to internal validity; changes in observers or the calibration of equipment; control group corrects for this
Statistical regression
threat to internal validity; tendency for extreme scores (scores very much above or below the mean ) to become less extreme (closer to the mean) on retesting, even without any type of intervention; control group controls for this
Selection bias
threat to internal validity; caused by non-random assignment; best avoided with random sampling
Attrition or experimental mortality
threat to internal validity; differential loss of subjects from the groups; to assess for this, compare subjects who drop out with those who remain using t-tests on relevant variables
Diffusion
threat to internal validity; occurs when the no-treatment (control) group receives some of the treatment; difficult to eliminate completely, but tighter control over the experimental situation can help
Construct validity
refers to factors other than the desired specifics of our intervention that result in differences; often lumped under threats to external validity; not measuring what you think you are measuring
Attention and contact with clients
threat to construct validity; difficult to tell whether changes are due to treatment or attention
Experimenter expectancies
threat to construct validity; cues or clues transmitted to the subjects by the experimenter; Rosenthal effect; can be controlled by masking experimenter to conditions
Rosenthal effect
refers to experimenter expectancies
Demand characteristics
threat to construct validity; factors in the procedures that suggest how the subject should behave; control by masking subjects to their condition
John Henry effect
threat to construct validity; occurs when persons in a control group try harder than usual in the spirit of competition with the experimental group; control by making sure experimental and control groups do not know about each other and, if not possible, do not give groups any sense of competition
Threats to external validity
interfere with generalizability of effects
Sample characteristics
threat to external validity; difference between sample and population
Stimulus characteristics
threat to external validity; features of the study with which the intervention is associated (e.g., research assessing memory functioning in the laboratory may not be generalizable to memory functioning in naturalistic settings)
Contextual characteristics
threat to external validity; conditions in which intervention is embedded; e.g., reactivity
Reactivity
subjects behave in a certain way just because they are participating in research and being observed
Low power
threat to statistical conclusion validity; diminished ability to find significant results; small sample size and inadequate interventions can contribute
Unreliability of measures
threat to statistical conclusion validity; unreliable outcome measure
Variability in procedures
threat to statistical conclusion validity; inconsistency in treatment procedures; especially of concern in psychotherapy outcome research
Subject heterogeneity
threat to statistical conclusion validity; subject heterogeneity makes it more difficult to find significant differences between groups
Varies directly with
as one variable increases so does the other (e.g., a varies directly with b in a = b/c)
Varies indirectly with
as one variable increases the other decreases (e.g., a varies indirectly with c in a = b/c)
Ordinal data
involve tallying people to see which ordered category a person falls into (e.g., Likert scale, SES, percentile rank); group means cannot be calculated
Interval data
involve obtaining numerical scores for each person, where the score values have equal intervals; there is no absolute zero (any zero point is arbitrary) (e.g., IQ test, t-score, temperature); group means can be calculated
Ratio data
involve obtaining numerical scores for each person, where the score values have equal intervals and an absolute zero (e.g., score on EPPP, money in bank, weight, number of children)
Standard deviation
average deviation (or spread) from the mean in a given set of scores
Variance
standard deviation squared
Positive skew
higher proportion of scores in the lower range of values (mode has lowest value, mean highest)
Negative skew
higher proportion of scores in the higher ranges of values (mean has lowest value, mode highest)
Kurtosis
refers to how peaked a distribution is
Leptokurtic
distribution with a very sharp peak
Platykurtic
distribution that is very flat
Criterion-referenced or domain-referenced score
example is percentage correct
Norm-referenced score
provides information on how person performed relative to group
Standard scores
based on standard deviation from the sample
Z-scores
standard scores that correspond directly to standard deviation units; transforming into Z-scores does not normalize a distribution (exact same distribution shape); z score = (score - mean)/SD
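The z-score formula can be sketched in Python; the IQ-like scores below are hypothetical, and note that the transformation is linear, so the shape of the distribution is unchanged:

```python
# z-score sketch: (score - mean) / SD for each score in a set.
def z_scores(scores):
    n = len(scores)
    mean = sum(scores) / n
    sd = (sum((x - mean) ** 2 for x in scores) / n) ** 0.5
    return [(x - mean) / sd for x in scores]

zs = z_scores([70, 85, 100, 115, 130])  # hypothetical scores, mean 100
```

After the transformation the z-scores have a mean of 0 and an SD of 1, but relative standing and distribution shape are exactly what they were before.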
Z-scores and percentile ranks
-3 = .1, -2 = 2.5, -1 = 16, 0 = 50, +1 = 84, +2 = 97.5, +3 = 99.9
Parameters
population values
Statistics
sample values
Standard error of the mean
average amount of deviation of sample means from the population mean; equal to population SD divided by square root of sample size
Central limit theorem
states that if an infinite number of equal-sized samples (of large enough size) are drawn from the population and the means of these samples are plotted, a normally distributed distribution of means will result; the mean of the means (the grand mean) will equal the population mean, and the standard deviation of the means will equal the standard deviation of the population divided by the square root of sample size (the standard error of the mean); allows the researcher to calculate whether an obtained mean is most likely due to treatment or experimental effects, or to chance
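The theorem can be illustrated with a small simulation; the skewed population, sample size, and number of samples below are arbitrary choices for the sketch:

```python
# Central limit theorem sketch: means of many equal-sized samples drawn from a
# skewed (non-normal) population cluster normally around the population mean,
# with spread approximated by the standard error of the mean (SD / sqrt(n)).
import random
import statistics

random.seed(0)
population = [random.expovariate(1.0) for _ in range(100_000)]  # skewed
n = 50
sample_means = [statistics.mean(random.sample(population, n))
                for _ in range(1000)]

grand_mean = statistics.mean(sample_means)        # approximates population mean
se_observed = statistics.stdev(sample_means)      # approximates SD / sqrt(n)
se_theoretical = statistics.pstdev(population) / n ** 0.5
```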
Rejection region
also called region of unlikely values; unlikely researcher will obtain values by chance; size corresponds to alpha level
Two factors that contribute to conclusions about statistical significance
treatment effects and chance (sampling error)
Type I error
incorrectly reject null hypothesis; likelihood directly corresponds to size of alpha
Type II error
incorrectly retain (fail to reject) null hypothesis; corresponds to beta
Beta
provides probability of making Type II error
Power
ability to correctly reject null hypothesis; increased when sample size is large, magnitude of intervention is large, random error is small, statistical test is parametric, test is one-tailed; inversely related to beta (power = 1- beta); direct relationship with alpha
Parametric test
three assumptions must be met: data are interval or ratio, homoscedasticity, normally distributed
Nonparametric test
used for nominal or ordinal DV
Statistic for testing differences, more than one DV
MANOVA
Statistics for testing differences, interval or ratio DV
t-test, ANOVA
Statistics for testing differences, nominal or ordinal DV
Chi-Square, Mann-Whitney, Wilcoxon
Homoscedasticity
similar variability or standard deviations in the different groups
Assumption for Chi-Square test
independence of observations
Degrees of freedom
number of possible variations in outcomes that can be obtained
Degrees of freedom for single sample chi-square
df = # of columns - 1
Degrees of freedom for multiple sample chi-square
df = (# rows - 1)(# columns - 1)
Degrees of freedom for single sample t-test
df = N - 1
Degrees of freedom for matched or correlated samples t-test
df = # of pairs - 1
Degrees of freedom for independent samples t-test
df = N - 2
Degrees of freedom total for ANOVA
df = N - 1
Degrees of freedom within for ANOVA
df within = df total - df between (equivalently, N - # of groups)
Degrees of freedom between for ANOVA
df between = # of groups - 1
Expected frequency for Chi-Square when data are given in each cell
expected frequency for any cell = (sum of row * sum of column)/N
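The expected-frequency rule, together with the multiple-sample df formula above, can be sketched in Python; the 2x3 table of observed counts is hypothetical:

```python
# Chi-square sketch: expected cell = (row total * column total) / N,
# df = (# rows - 1)(# columns - 1).
observed = [[10, 20, 30],
            [20, 30, 40]]   # hypothetical 2x3 table of observed counts

row_totals = [sum(row) for row in observed]            # [60, 90]
col_totals = [sum(col) for col in zip(*observed)]      # [30, 50, 70]
N = sum(row_totals)                                    # 150

expected = [[r * c / N for c in col_totals] for r in row_totals]
df = (len(observed) - 1) * (len(observed[0]) - 1)      # (2-1)(3-1) = 2
```

The expected frequencies always sum to the same N as the observed counts, which is a quick sanity check on the computation.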
F ratio
F ratio = Mean Square between groups/Mean Square within groups; values above roughly 2.0 often reach significance, though the critical value depends on degrees of freedom
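A one-way ANOVA F ratio can be built from sums of squares and the df rules above; the three small groups of scores are hypothetical:

```python
# F-ratio sketch: MS between / MS within for a one-way ANOVA.
groups = [[4, 5, 6], [6, 7, 8], [9, 10, 11]]   # hypothetical data

all_scores = [x for g in groups for x in g]
N, k = len(all_scores), len(groups)
grand = sum(all_scores) / N
means = [sum(g) / len(g) for g in groups]

ss_between = sum(len(g) * (m - grand) ** 2 for g, m in zip(groups, means))
ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

ms_between = ss_between / (k - 1)   # df between = # of groups - 1
ms_within = ss_within / (N - k)     # df within = N - # of groups
F = ms_between / ms_within
```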
Mean Square
measure of average variability
ANOVA post-hoc tests in order of most to least protection against Type I error
in order of most to least conservative: Scheffe, Tukey, Duncan/Dunette/Neuman-Kuels, Fisher’s LSD
Two-way ANOVA
when groups are being compared on two IVs; permits analysis of main effects and interaction effects; when interaction is significant, main effects must be interpreted in context of interaction effect
Examining for interaction effects in an ANOVA table
sum the diagonals of each individual 2x2 set of cell means; unequal diagonal sums suggest an interaction effect
MANOVA
used when there are multiple DVs
Coefficient of determination
calculated by squaring correlation coefficient; represents amount of variability in Y that is shared with or explained by X
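Squaring a Pearson r can be sketched in Python; the helper function and the small X/Y data are hypothetical:

```python
# Coefficient of determination sketch: square the Pearson r between X and Y;
# r^2 is the proportion of Y's variability shared with (explained by) X.
def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

r = pearson_r([1, 2, 3, 4], [2, 4, 5, 9])   # hypothetical scores
r_squared = r ** 2                           # coefficient of determination
```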
Assumptions of bivariate correlations
linear relationship between X and Y, homoscedasticity, unrestricted range of scores on X and Y
Bivariate correlation coefficient for two interval/ratio variables
Pearson r
Bivariate correlation coefficient for two ordinal variables
Spearman’s rho, Kendall’s tau
Bivariate correlation coefficient for interval/ratio and true dichotomy
point-biserial
Bivariate correlation coefficient for interval/ratio and artificial dichotomy
biserial
Bivariate correlation coefficient for two true dichotomies
Phi
Bivariate correlation coefficient for two artificial dichotomies
tetrachoric
Coefficient for curvilinear relationship between X and Y
Eta
Zero-order correlation
examines relationship between X and Y when it is believed there are no extraneous variables affecting the relationship
Partial correlation
also called first-order correlation; examines the relationship between the predictor and the criterion with the effect of a third variable removed that is thought to be affecting both variables
Part correlation
also called a semi-partial correlation; examines the relationship between the predictor and the criterion with the influence of a third variable removed from only one of the original variables
Multivariate tests
involve several predictors and one or more criteria (DVs)
Multiple R
multiple correlation; correlation between two or more IVs and one DV, where Y is always interval or ratio data, and at least one X is interval or ratio data
Coefficient of multiple determination
obtained by squaring multiple R; index of the amount of variability in the criterion (Y) that is accounted for by the combination of all the predictors (Xs)
Multiple regression
Has multiple predictors
Multicollinearity
problem that occurs in a multiple regression equation when the predictors are highly correlated with one another, and therefore essentially redundant
Stepwise regression
computer-generated; in forward regression, the computer adds predictor variables one at a time, starting with the predictor that has the highest correlation with criterion outcome; in backward regression, predictor variables are removed one at a time, starting with the variable that contributes the least to criterion outcome; allows for fewest possible predictors
Hierarchical regression
researcher controls regression analysis, adding variables in order consistent with theory
Canonical R
correlation between two or more IVs and two or more DVs; evaluate relationship between two sets of variables
Discriminant function analysis
special case of multiple regression; two or more predictors and one criterion that is nominal (rather than interval or ratio); allows to predict membership in group
Loglinear analysis
sometimes referred to as logit analysis; used to predict a categorical criterion based on categorical predictors
Approaches for causal modeling
not correlations and regressions; path analysis and SEM
Path analysis
applies multiple regression techniques to testing a model that specifies causal links among variables; relies on researcher having developed theoretically-based causal model; straight arrows denote causal relationships, curved denote correlations; path coefficients are analyzed to see if the pattern predicted by the model has emerged
Factor analysis
test of structure that extracts as many significant factors from set of data as possible
Characteristic root
another name for eigenvalues for factors (indicate strength of factors); less than 1.0 usually not interpreted
Factor loadings interpreted
equal to or exceed 0.30
Orthogonal rotation
factor rotation in which axes remain perpendicular; results in factors with no correlation
Communality
how much of a test’s variability is explained by the combination of all the factors; can be calculated in orthogonal rotation by squaring the test’s factor loadings and adding them together
Oblique rotation
factor rotation in which angle between axes is non-perpendicular and factors are correlated
Principal components analysis
subtype of factor analysis; trying to extract factors and there is no empirical or theoretical guidance on the values of the communalities; results in a few uncorrelated factors called components; no prior hypotheses
(Principal) factor analysis
communality values ascertained before analysis
Classical test theory
also called true score model; total variability = true score variability + error variability
Reliability
proportion of true score variability; often symbolized as rxx or rtt; minimum acceptable is 0.80
Content sampling error
occurs when a test, by chance, has items that do or do not tap into a test-taker’s knowledge base
Time sampling error
occurs when a test is given at different points in time and scores differ because of factors related to passage of time
Test heterogeneity error
occurs when a test has heterogeneous items tapping more than one domain
Factors affecting reliability
number of items, homogeneity of items, range of scores, ability to guess
Four estimates of reliability
test-retest reliability, parallel forms reliability, internal consistency reliability, interrater reliability
Coefficient of stability
expression of test-retest reliability
Coefficient of equivalence
expression of parallel forms reliability
Spearman-Brown prophecy formula
used when calculating split-half reliability; tells us how much more reliable the test would be if it were longer
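The prophecy formula (n·r / (1 + (n − 1)·r), where n is the factor by which the test is lengthened) can be sketched in Python; the .60 split-half value is hypothetical:

```python
# Spearman-Brown sketch: predicted reliability when a test is lengthened by
# factor n; n = 2 corrects a split-half correlation up to full test length.
def spearman_brown(r, n=2):
    return n * r / (1 + (n - 1) * r)

full_length = spearman_brown(0.60)   # split-half r of .60 -> .75 at full length
```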
Split-half reliability and speeded tests
split-half reliability inappropriate for speeded tests because only easy items included; preferred test of reliability is alternate forms
Power tests
have items that are of varying difficulty level, and subjects are provided sufficient time to complete them all
Kuder-Richardson (KR-20 and KR-21) and Cronbach’s coefficient alpha
sophisticated estimates of internal consistency reliability; involve analysis of the correlation of each item with every other item in the test; equivalent to taking the mean of the correlation coefficients for every possible split-half; KR-20 (items vary in difficulty) and KR-21 (items of consistent difficulty) are used when items are scored dichotomously, coefficient alpha when they are not
Standard error of measurement
standard deviation of a theoretically normal distribution of test scores obtained by one individual on equivalent tests; assumed to be consistent across all persons; SEM = SDx * sqrt(1 - rxx); ranges from 0 (perfectly reliable test) to the standard deviation of the test (completely unreliable test)
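The standard error of measurement formula (SD × √(1 − rxx)) can be sketched in Python; the IQ-like SD of 15 and reliability of .89 are hypothetical:

```python
# Standard error of measurement sketch: SEM = SD * sqrt(1 - reliability).
def sem(sd, rxx):
    return sd * (1 - rxx) ** 0.5

error_band = sem(15, 0.89)   # hypothetical test: SD 15, reliability .89
```

The boundary cases follow the definition: a perfectly reliable test (rxx = 1.0) has SEM 0, and a completely unreliable test (rxx = 0) has SEM equal to the test's SD.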
Content validity
how adequately a test samples a particular content area; quantified by asking a panel of experts if each item is essential, useful/not essential, or not necessary, yet no numerical validity coefficient is derived
Criterion-related validity
how adequately a test score can be used to infer, predict, or estimate criterion outcome; calculated by using a Pearson r to correlate the test scores (also known as predictor scores) with criterion scores (also known as outcome scores)
Concurrent validity
subtype of criterion-related validity; predictor and criterion measured and correlated at about the same time
Predictive validity
subtype of criterion-related validity; delay between the measurement of the predictor and the criterion
Standard error of the estimate
average amount of error in estimating each person’s criterion score; standard deviation of a theoretically normal distribution of criterion scores obtained by one person measured repeatedly; Sest = SDy * sqrt(1 - rxy^2); ranges from 0 to the standard deviation of the criterion
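The standard error of estimate formula (SDy × √(1 − rxy²)) can be sketched the same way; the criterion SD of 10 and validity coefficient of .60 are hypothetical:

```python
# Standard error of estimate sketch: Sest = SDy * sqrt(1 - rxy^2).
def see(sd_y, r_xy):
    return sd_y * (1 - r_xy ** 2) ** 0.5

est_error = see(10, 0.60)   # 10 * sqrt(1 - .36) = 8.0
```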
Applications of criterion-related validity coefficient for prediction
expectancy tables, Taylor-Russell tables, decision-making theory
Expectancy tables
list the probability that a person’s criterion score will fall in a specified range based on the range in which that person’s predictor score fell
Taylor-Russell tables
numerically describe the amount of improvement that occurs in selection decisions when a predictor test is introduced
Selection ratio
proportion of available openings to number of applicants
Incremental validity
amount of improvement in success rate that results from using predictor test (e.g., if proportion of successful improves from base rate of .4 to .65, incremental validity is .25 or 25%)
Three variables that affect incremental validity
criterion-related validity coefficient of the predictor test, the company’s base rate, and the selection ratio
Decision-making theory
takes the predictions of performance that were based on the predictor tests and compares them with the actual criterion outcome
Item difficulty setting formula
(1.0 + probability of getting item by chance)/2.0
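The optimal-difficulty formula can be sketched in Python; the four-option multiple-choice and true/false chance probabilities are the standard worked examples:

```python
# Optimal item-difficulty sketch: (1.0 + probability correct by chance) / 2.0.
def optimal_difficulty(chance):
    return (1.0 + chance) / 2.0

mc4 = optimal_difficulty(0.25)         # 4-option multiple choice -> 0.625
true_false = optimal_difficulty(0.5)   # true/false item -> 0.75
```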
Item validity
correlation between item score and criterion score
Item-characteristic curve
plot of the relationship between item performance and total score
Item response theory
used to calculate to what extent a specific item on a test correlates with an underlying construct; subject’s performance on a test item as representing the degree to which the subject has a latent trait
Factors affecting criterion-related validity
range of scores, reliability of the predictor and the criterion, criterion contamination
Relationship of reliability of predictor and criterion-related validity
a test must have some reliability to be valid, but a reliable test is not necessarily valid; the validity coefficient cannot exceed the square root of the reliability coefficient, so reliability sets the ceiling for validity even though validity can numerically exceed reliability itself
Correction for attenuation
calculates how much higher validity would be if predictor and criterion were both perfectly reliable
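The correction-for-attenuation formula (rxy / √(rxx · ryy)) can be sketched in Python; the coefficients of .40 validity and .80 reliabilities are hypothetical:

```python
# Correction for attenuation sketch: estimated validity if predictor and
# criterion were both perfectly reliable: rxy / sqrt(rxx * ryy).
def corrected_validity(r_xy, r_xx, r_yy):
    return r_xy / (r_xx * r_yy) ** 0.5

corrected = corrected_validity(0.40, 0.80, 0.80)   # .40 / .80 = .50
```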
Criterion contamination
occurs with subjectively-scored criterion outcomes when the rater is informed of subjects’ predictor scores before assigning them criterion ratings
Ways evidence of construct validity can be obtained
factor analysis or multi-trait, multi-method matrix
Multi-trait, multi-method matrix
table with information about convergent and divergent validity, both of which are necessary for construct validity