Methods Flashcards

Question

When is Glass' delta normally used?

Answer 1

when several treatments are compared to the control group

Answer 2

compares the means of two measurements taken from the same individual, object or related units ie. a measurement taken at two different times

Answer 3

d between 0.2 and 0.49 = small d between 0.5 and 0.79 = medium d of 0.8 and higher = large

Answer 4

Can be used for a factorial design (ANOVA). Measures linear and nonlinear association (in contrast to correlation)

Answer 5

a measure of effect size that tells you how much of the total variation in your outcome (dependent variable) can be explained by group differences or the factor you are studying.

Answer 6

how much of the variation in the outcome (dependent variable) can be explained by a specific factor (independent variable), after accounting for other factors in the analysi

Answer 7

1- Relative Risk 2- Odds Ratio 3- Risk Difference

Answer 8

smokers are 23 times more likely to develop lung cancer

Answer 9

When risk is small the odds ratio approximates the relative risk

Answer 10

Difference between proportion of treatment group that contract the disease and proportion of controls that contract the disease. Can be used to estimate number of cases avoided by a treatment. Reflects overall probability of getting disease.

Answer 11

Type 1 error

Answer 12

Type 2 error

Answer 13

power is the probability of correctly rejecting the null hypothesis. Power determines how likely an effect is detected

Answer 14

-Effect Size (unreliable measures reduce effect size by inflating estimate of sigma) -Alpha level (one tailed test has higher power than two tailed) -Sample size

Answer 15

Computed before the study’s data are collected. estimate how likely your planned study is to detect an effect of a certain size, assuming that effect truly exists. Researchers use this analysis to determine the minimum sample size needed to achieve a desired level of power three steps; hypothesis effect size; alpha level; planned sample size

Answer 16

-do a pilot experiment and compute effect size -do a meta analysis and compute weighted effect size -use Cohen's estimates for small, medium, and large effect size

Answer 17

A power of at least 80% is usually considered acceptable. Underpowered (power<80%) studies are useless and unethical (waste of resources and people’s time)

Answer 18

Computed after study is completed. Assumes effect size in the sample equals effect size in the population Generally not very useful

Answer 19

In a meta-analysis. It provides an indication as to which results to assign a higher weight

Answer 20

-Adding Participants -Choose a less stringent significance level (usually not an option) -Increase the hypothesised effect size -Use as few groups as possible -Use covariates variables -Use a repeated measure design -Use measures sensitive to change

Answer 21

1. Calculate covariance between the X and Y variables, and then standardize 2. Convert the X and Y scores to z-scores (standard scores), then divide by n

Answer 22

correlation coefficient: measuring a linear correlation between two variables

Answer 23

a statistical relationship between two variables where the data points tend to fall along a straight line when plotted on a graph

Answer 24

Correlation: is there a relationship between 2 variables? Regression: how well does one variable predict the other variable?

Answer 25

Prediction requires calculating a line of best fit (an equation)

Answer 26

the variable that you are trying to predict

Answer 27

the variable that you are trying to predict from

Answer 28

Simple linear regression = 1 predictor variable Multiple regression = 1+ predictor variables

Answer 29

Y=a + bX (where a= intercept and b=slope)

Answer 30

Line of Best Fit: find a regression line that provides the best prediction possible i.e., a regression line that minimizes error.

Answer 31

Step1: for each data point, calculate the deviation, then square it. Step2: across the dataset, add up all deviations (→ sum of squared deviations). Best fit: the equation that produced the smallest SSERROR.

Answer 32

Step1: convert X and Y into z-scores Step2: multiply z(X) by z(Y) Step3: add up Step4: divide by n-1

Answer 33

Format: Y=a + bX (where b is the slope, and a is the starting point) Calculate B: b = r × (SD of Y / SD of X) (where r is the correlation between X and Y) Calculate A: a = mean(Y) - b × mean(X)

Answer 34

SSREGRESSION + SSERROR (SSX) (SSRESIDUAL) Sum of Squares Y (SStotal)= how much variance there is in Y in total Sum of Squares X (SSregression)= variance X can explain Sum of Squares Residual (SSerror)= how much variance is not explained

Answer 35

Total variance in Y Step1: calculate the difference (deviation) between each score and the mean Step2: square the deviations Step3: add up

Answer 36

Step1: calculate the difference (deviation) between the predicted and the observed score Step2: square the deviations Step3: add up

Answer 37

SSTotal= SSregression (variance in Y) + SSerror (variance in X)

Answer 38

-Calculate R2

Answer 39

F statistic Step1: calculate Mean Squares (SS / df) Step2: calculate F ratio (MSregression/ MSresidual)

Answer 40

relationship between 2 variables * calculate r and R2

Answer 41

relationship between 2 variables while accounting for another variable or variables * calculate partial r and R2

Answer 42

predicting one variable from another variable * calculate R and R2

Answer 43

predicting one variable from 2+ other variables * calculate multiple R and multiple R2

Answer 44

1) Least squares 2)Variance accounted for (R^2)

Answer 45

R2 (coefficient of determination) is the amount of variance explained by that single predictor

Answer 46

Multiple R2 (coefficient of multiple determination) is the amount of variance explained by those multiple predictors

Answer 47

Y=a+bx+bx+bx+bx+.......bx -where a is the Y value when all predictor variables are zero -where b is a partial regression coefficient and represent the change in Y associated with a 1 unit change in a particular x

Answer 48

variance in the model that is not explained

Answer 49

find regression line that provides the best prediction possible, i.e., a regression line that minimizes error

Answer 50

X is not a strong predictor

Answer 51

variance that can be attributed only to 1 variable

Answer 52

variance that can be attributed to 2+ variables

Answer 53

1. no change in correlation 2. weaker (still significant) correlation 3. stronger correlation

Answer 54

you start with a specific hypothesis → use only the predictors necessary to test this hypothesis

Answer 55

you start with a broad hypothesis → use as many predictors as necessary to test this hypothesis or as indicated by previous literature

Answer 56

all the variables are entered in together, irrespective of their absolute or relative importance

Answer 57

you decide (you can enter variables in blocks, with your decisions being driven by previous research and hypotheses

Answer 58

Forward regression: your computer programme (e.g., SPSS) will find the single best predictor and enter it as the first variable; the variable that accounts for the highest proportion of the remaining variance is entered next and so on Backward regression: all variables are entered initially and the worst predictors (i.e., the predictors that account for the least variance) are removed in turn

Answer 59

what the model can explain / what it cannot explain

Answer 60

unexplained variance

Answer 61

variance explained by the regression

Answer 62

all the coefficients for the predictor variables in the model are equal to zero. under the null hypothesis, the model does not explain the outcome variable any better than would be expected by chance,

Answer 63

Regression

Answer 64

Two predictors are said to be collinear if they are highly correlated with one another (r>.75). In other words, they may be measuring the same construct and it is difficult to estimate independent contributions of each variable

Answer 65

Unless you have a research question that specifically requires keeping both predictors, the best approach is to drop one of these predictors from your model. Or conduct a PCA (principal component analysis).

Answer 66

any type of variable can be employed as a predictor (i.e., independent) variable. Researchers often want to use, or are forced to use, categorical variables

Answer 67

continuous and normally distributed

Answer 68

a method used in statistics to convert categorical variables (like colors, types, or groups) into numerical values so they can be used in regression and other analyses To code category membership where k = number of categories, you need k - 1 dummy variables -2 categories need 1 dummy variable -3 categories need 2 dummy variables

Answer 69

Linear regressions only provide a valid measure of the relationship between two variables when that relationship is linear (when it can be described by a straight line)

Answer 70

Polynomials...a trick to get a linear method to model a non linear relationship -non linear quadratic: modelled by entering square of predictor/independent variable -non-linear cubic: modelled by entering cube of the predictor/independent variable

Answer 71

Hierarchal regression: 1- enter predictor variable 2- then the square of it (f there is a significant change, then there is a significant non-linear (quadratic) component to the relationship between the predictor and criterion 3- then the cube of it. ( if there is a significant change in R2 then there is a significant non-linear cubic component to the relationship between the predictor and criterion)

Answer 72

An alternative is to estimate the indirect effect and its significance using the Sobel test (Sobel. 1982). If the Sobel test is significant, there is significant mediation

Answer 73

Are the differences between groups still there after we take into account another factor that could affect the results?

Answer 74

the continuous variable

Answer 75

ANOVA is multiple regression with only categorical predictors

Answer 76

ANCOVA is an ANOVA (and hierarchical multiple regression), where 1 continuous variable is entered first into the model, to “control for” that variable

Answer 77

“partialling out” the influence of that variable (the covariate) on the outcome. You adjust the means of the categorical predictor to account for the influence of the covariate

Answer 78

You measured a continuous variable that covaries with the outcome, but which is not of interest. The ANCOVA first takes away any variance in Y that is due to this covariate, then proceeds with the rest of the analysis

Answer 79

* To test for differences between group means when we know that an extraneous variable affects the outcome variable. * Used to adjust for known extraneous variables

Answer 80

Reduces Error Variance * By explaining some of the unexplained variance (SSR) the error variance in the model can be reduced. Greater Experimental Control: * By controlling known extraneous variables, we gain greater insight into the effect of the predictor variable(s).

Answer 81

-No collinearity between the two IV’s, i.e. no high correlation between the categorical IV and the covariate -Homogeneity of slopes: The relationship between covariate and and DV has to be similar in all conditions of the categorical IV

Answer 82

The combined effect of two variables on another is known conceptually as moderation, and in statistical terms as an interaction effect

Answer 83

makes the model’s intercept and group effects easier to interpret by anchoring them at the average value of the covariate, and it can also improve statistical stability and align with standard calculation methods

Answer 84

Centring refers to the process of transforming a variable into deviations around a fixed point. centering turns your variable into “how much above or below the average” each person is.

Answer 85

Refers to a situation when the relationship between a predictor variable and outcome variable can be explained by their relationship to a third variable (the mediator) aka it means that the reason one thing (the predictor) is related to another thing (the outcome) is because both are connected through a third thing (the mediator)

Answer 86

Mediation is tested through three regression models 1. Predicting the outcome from the predictor variable. 2. Predicting the mediator from the predictor variable. 3. Predicting the outcome from both the predictor variable and the mediator.

Answer 87

1. The predictor must significantly predict the outcome variable. 2. The predictor must significantly predict the mediator. 3. The mediator must significantly predict the outcome variable. 4. The predictor variable must predict the outcome variable less strongly in model 3 than in model 1

Answer 88

How much of a reduction in the relationship between the predictor and outcome is necessary to infer mediation? * people tend to look for a change in significance, which can lead to the ‘all or nothing’ thinking that p-values encourage

Answer 89

An alternative is to estimate the indirect effect and its significance using the Sobel test (Sobel. 1982). If the Sobel test is significant, there is significant mediation

Methods Flashcards

(114 cards)