Research Methods III Flashcards
Covariance
- Reflects the degree to which 2 variables vary together.
- Relationship between two continuous variables in original RAW (unstandardized) scale.
- Scale dependent.
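A minimal numerical sketch (hypothetical data; numpy assumed) of computing a sample covariance and of its scale dependence:

```python
import numpy as np

# Hypothetical data: hours studied (X) and exam score (Y).
x = np.array([2.0, 4.0, 6.0, 8.0, 10.0])
y = np.array([55.0, 60.0, 70.0, 75.0, 90.0])

# Sample covariance: sum of the products of deviations, divided by n - 1.
cov_xy = np.sum((x - x.mean()) * (y - y.mean())) / (len(x) - 1)
print(cov_xy)                   # positive: X and Y vary together

# Scale dependence: rescaling X (hours -> minutes) rescales the covariance.
print(np.cov(x * 60, y)[0, 1])  # 60x the value above
```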
Sum of Squares
Compute the deviations of X (or Y) from the mean, square them, and sum (Σ); SS cannot be a negative value.
Sum of Products
Compute the product of the X and Y deviations for each observation and sum (Σ); SP can be negative.
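A sketch of both computations on the same hypothetical data (numpy assumed):

```python
import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0, 10.0])
y = np.array([55.0, 60.0, 70.0, 75.0, 90.0])

dx, dy = x - x.mean(), y - y.mean()  # deviations from each mean

ss_x = np.sum(dx ** 2)    # Sum of Squares: squared deviations, never negative
ss_y = np.sum(dy ** 2)
sp_xy = np.sum(dx * dy)   # Sum of Products: negative if X and Y move oppositely
print(ss_x, ss_y, sp_xy)
```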
Correlation
Standardized (z-score) measure of linear relationship between 2 continuous variables.
- Standardized.
- Z-score.
- Scale invariant.
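A sketch showing r as the covariance of z-scores, and its scale invariance (same hypothetical data; numpy assumed):

```python
import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0, 10.0])
y = np.array([55.0, 60.0, 70.0, 75.0, 90.0])

# r is the covariance of the z-scores (ddof=1 throughout).
zx = (x - x.mean()) / x.std(ddof=1)
zy = (y - y.mean()) / y.std(ddof=1)
r = np.sum(zx * zy) / (len(x) - 1)

print(r, np.corrcoef(x, y)[0, 1])    # same value
print(np.corrcoef(x * 60, y)[0, 1])  # scale invariant: unchanged by rescaling X
```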
Fisher z-test
Tests whether 2 independent sample correlations differ.
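A sketch of the test using the standard r-to-z transformation, with hypothetical correlations and sample sizes (numpy and scipy assumed):

```python
import numpy as np
from scipy import stats

def fisher_z_test(r1, n1, r2, n2):
    """Compare two independent sample correlations via Fisher's r-to-z."""
    z1, z2 = np.arctanh(r1), np.arctanh(r2)    # Fisher transformation
    se = np.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))  # SE of the difference
    z = (z1 - z2) / se
    return z, 2 * stats.norm.sf(abs(z))        # z and two-tailed p

# Hypothetical example: r = .50 (n = 100) vs r = .30 (n = 120).
print(fisher_z_test(0.50, 100, 0.30, 120))
```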
Effect size of r
- Pearson correlation.
- Provides a measure of effect size because it is based on standardized scores.
- ±.10 = small effect, ±.30 = medium effect, ±.50 = large effect.
Regression
The statistical technique that produces the best-fitting straight line for predicting Y from X.
Regression equation
Yi = b0 + b1Xi + Ei
Yi
Dependent or outcome variable, criterion variable
Xi
Independent variable, predictor variable
b1
Regression coefficient for the predictor.
Gradient (slope).
b0
y intercept
value of y when x = 0
Ei
the errors in prediction based on the regression
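Putting the pieces of the regression equation together on simulated data (hypothetical values; numpy assumed):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=100)                              # Xi: predictor
y = 2.0 + 1.5 * x + rng.normal(scale=0.5, size=100)   # true b0 = 2.0, b1 = 1.5

b1, b0 = np.polyfit(x, y, deg=1)   # fitted slope and y-intercept
resid = y - (b0 + b1 * x)          # Ei: errors in prediction
print(b0, b1)                      # close to 2.0 and 1.5
```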
Assumptions of Regression:
Linearity
Based on linear correlations, assumes linear bivariate relationship between each x and y, and also between y and predicted y.
Assumptions of regression:
Normality
Normally distributed, both univariate and multivariate distributions of residuals.
Y scores are independent and normally distributed (Shapiro-Wilk)
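A sketch of the Shapiro-Wilk check applied to residuals (simulated residuals; scipy assumed):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
resid = rng.normal(size=100)   # residuals from some fitted model (simulated)

w, p = stats.shapiro(resid)    # Shapiro-Wilk test of normality
print(w, p)                    # p > .05: no evidence against normality
```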
Assumptions of regression:
Independence of scores
Independence of Y (outcome: DV) scores.
Assumptions of regression:
Independence of errors
Errors (residuals) from observations should not be correlated with each other (Durbin-Watson test)
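A sketch of the Durbin-Watson statistic computed directly from its formula (numpy assumed):

```python
import numpy as np

def durbin_watson(resid):
    """DW ~ 2 means uncorrelated residuals; < 2 suggests positive
    autocorrelation, > 2 negative autocorrelation."""
    return np.sum(np.diff(resid) ** 2) / np.sum(resid ** 2)

rng = np.random.default_rng(2)
print(durbin_watson(rng.normal(size=200)))             # near 2: independent
print(durbin_watson(np.cumsum(rng.normal(size=200))))  # far below 2: correlated
```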
Assumptions of regression: Minimal multicollinearity
Predictors (IVs) should not be highly correlated with each other.
No higher than r = .80 for predictors.
Want Variance Inflation Factor (VIF) to be less than 10.
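A sketch computing VIF from its definition, VIF_j = 1 / (1 − R²_j), where R²_j comes from regressing predictor j on the other predictors (hypothetical correlated predictors; numpy assumed):

```python
import numpy as np

def vif(X, j):
    """VIF for column j of the predictor matrix X."""
    others = np.delete(X, j, axis=1)
    A = np.column_stack([np.ones(len(X)), others])   # intercept + other IVs
    coef, *_ = np.linalg.lstsq(A, X[:, j], rcond=None)
    resid = X[:, j] - A @ coef
    r2 = 1 - np.sum(resid ** 2) / np.sum((X[:, j] - X[:, j].mean()) ** 2)
    return 1 / (1 - r2)

rng = np.random.default_rng(3)
x1 = rng.normal(size=200)
x2 = x1 + rng.normal(scale=0.3, size=200)  # nearly redundant with x1
X = np.column_stack([x1, x2])
print(vif(X, 0), vif(X, 1))                # large (> 10) flags multicollinearity
```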
Assumptions of regression:
Homoscedasticity
The variance of the residuals is uniform across all values of Y (test with Levene’s test); can be assumed when the sample size is large, but not when it is small.
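One rough way to apply Levene's test to residuals is to split them at the median predicted value and compare variances (a sketch, not the only approach; scipy assumed):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
y_hat = rng.normal(size=200)   # predicted values (simulated)
resid = rng.normal(size=200)   # residuals (simulated, homoscedastic)

low = resid[y_hat < np.median(y_hat)]
high = resid[y_hat >= np.median(y_hat)]
stat, p = stats.levene(low, high)  # Levene's test for equal variances
print(stat, p)                     # p > .05: homoscedasticity not rejected
```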
Ordinary Least Squares regression (OLS)
- Yields values for b-weights (regression coefficients) and the y-intercept that will result in the sum of the squared residuals being at the minimum (smallest).
Best fitting line = smallest total error.
Resulting regression line = least-square error solution.
The b-weights and y-intercept are the values for which SS residuals is at its minimum.
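The closed-form OLS solution for one predictor, tying back to the SS and SP cards above (a sketch; numpy assumed):

```python
import numpy as np

rng = np.random.default_rng(5)
x = rng.normal(size=50)
y = 1.0 + 2.0 * x + rng.normal(size=50)

dx, dy = x - x.mean(), y - y.mean()
b1 = np.sum(dx * dy) / np.sum(dx ** 2)  # slope = SP_xy / SS_x
b0 = y.mean() - b1 * x.mean()           # y-intercept

resid = y - (b0 + b1 * x)
print(b0, b1, np.sum(resid ** 2))  # no other line yields a smaller SS residual
```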
Partially standardized regression coefficient
Regression coefficient predicting Y from X.
Only standardized on x, not y.
The Regression Model & Sum of Squares
Squaring each of the deviations and summing across observations yields SS for each source of variability of y.
ANOVA to test the Regression Model:
Regression
Variability in Y that can be explained by the predictor(s) – represents the component of Y that is shared with X1.
ANOVA to test the Regression Model:
Residual
Variability in Y that cannot be explained by the predictor – simply what is ‘left over’ after accounting for X.
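A sketch verifying the partition SS total = SS regression + SS residual on simulated data (numpy assumed):

```python
import numpy as np

rng = np.random.default_rng(6)
x = rng.normal(size=100)
y = 1.0 + 2.0 * x + rng.normal(size=100)

b1, b0 = np.polyfit(x, y, deg=1)
y_hat = b0 + b1 * x

ss_total = np.sum((y - y.mean()) ** 2)
ss_reg = np.sum((y_hat - y.mean()) ** 2)  # variability explained by the predictor
ss_resid = np.sum((y - y_hat) ** 2)       # what is 'left over'
print(ss_total, ss_reg + ss_resid)        # equal: the SS partition
```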
t-test to test the Regression coefficient
b-coefficients
unstandardized (raw) regression coefficients
t-test to test the Regression coefficient:
β (beta): standardized (z-score) regression coefficients
One standard deviation increase in X results in an expected change of beta standard deviation units in Y.
Increase in X, change in Y
Suppression
A second predictor variable (X2) that is unrelated to Y (dependent variable) raises the amount of variance explained by the first predictor by eliminating certain irrelevant aspects of the first predictor (X1).
X2 suppresses some of the “error” or “irrelevant” variance in X1.
Classical suppression
r = 0, but beta ≠ 0 (the suppressor is uncorrelated with Y yet receives a nonzero beta).
Surprising Suppression
beta > r
Surprising because the beta values for both X1 and X2 exceed their respective correlations with Y.
Partial Correlation
Correlation between one DV and one IV (Y and X1) with one or more other variables (e.g., X2, X3) partialed from BOTH the DV and the first IV.
Semi-partial Correlation
The part correlation has X2 partialed out of predictor X1 ONLY. It is the correlation of Y with the part of X1 that is independent of X2.
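A sketch contrasting the two, computing each from residualized variables (simulated data; numpy assumed):

```python
import numpy as np

def residualize(v, w):
    """Residuals of v after regressing out w (with intercept)."""
    b1 = np.sum((w - w.mean()) * (v - v.mean())) / np.sum((w - w.mean()) ** 2)
    return v - (v.mean() + b1 * (w - w.mean()))

rng = np.random.default_rng(7)
x2 = rng.normal(size=300)
x1 = 0.6 * x2 + rng.normal(size=300)
y = 0.5 * x1 + 0.5 * x2 + rng.normal(size=300)

x1_res = residualize(x1, x2)
# Partial: X2 removed from BOTH Y and X1.
print(np.corrcoef(residualize(y, x2), x1_res)[0, 1])
# Semi-partial (part): X2 removed from X1 ONLY.
print(np.corrcoef(y, x1_res)[0, 1])
```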
Coding
Regression framework is flexible.
Categorical or nominal independent variables can be used in Multiple Regression/Correlations.
Vectors
g − 1 (one vector fewer than the number of categories).
The vectors represent the df of the IV.
All coding systems come up with the same correlations, BUT they produce different regression equations.
Dummy Coding
0s and 1s.
Representation of a variable consisting of g categories by creating g − 1 variables (vectors); each of the g − 1 categories is coded 1 on its own vector while the remaining categories are coded 0.
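A sketch with g = 3 groups, so g − 1 = 2 dummy vectors (made-up scores; numpy assumed):

```python
import numpy as np

group = np.array([0, 0, 0, 1, 1, 1, 2, 2, 2])       # group 0 = reference
y = np.array([4., 5., 6., 7., 8., 9., 1., 2., 3.])  # group means: 5, 8, 2

d1 = (group == 1).astype(float)  # coded 1 only for group 1
d2 = (group == 2).astype(float)  # coded 1 only for group 2

A = np.column_stack([np.ones(len(y)), d1, d2])
b, *_ = np.linalg.lstsq(A, y, rcond=None)
print(b)  # [5, 3, -3]: reference-group mean, then differences from it
```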
Regression = …
ANOVA
Contrast Coding
Identification of specific comparisons of interest & assigning values that enable the treatments to be directly compared.
In a two group design, one group is assigned +1 and the other group is assigned -1.
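A sketch of the two-group +1/−1 case (made-up scores; numpy assumed). Note that with ±1 codes the b-weight is half the difference between the two group means:

```python
import numpy as np

c = np.array([1., 1., 1., -1., -1., -1.])  # +1 / -1 contrast codes
y = np.array([10., 12., 14., 4., 6., 8.])  # group means: 12 vs 6

A = np.column_stack([np.ones(len(y)), c])
b, *_ = np.linalg.lstsq(A, y, rcond=None)
print(b)  # b0 = 9 (mean of the group means), b1 = 3 (half of 12 - 6)
```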
Effects Coding
A method of coding categorical variables in which each group is compared to the weighted or unweighted mean of all the groups.
Effects Coding:
Unweighted effects coding
Used when you want to compare the mean of a particular group with the grand mean, regardless of the proportions in the population or in the sample.
Effects coding:
The group coded -1 is known as…
The reference group.
Effects coding:
The group coded 1 may be referred to as…
A coded group.
b0 = ?
Grand mean across all groups.
b1 = ?
Difference between the group mean and the grand mean.
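A sketch of unweighted effects coding with three equal-sized groups, confirming b0 = grand mean and each b = group mean − grand mean (made-up scores; numpy assumed):

```python
import numpy as np

group = np.array([0, 0, 0, 1, 1, 1, 2, 2, 2])       # group 2 = reference (-1)
y = np.array([4., 5., 6., 7., 8., 9., 1., 2., 3.])  # group means: 5, 8, 2

e1 = np.where(group == 0, 1., np.where(group == 2, -1., 0.))
e2 = np.where(group == 1, 1., np.where(group == 2, -1., 0.))

A = np.column_stack([np.ones(len(y)), e1, e2])
b, *_ = np.linalg.lstsq(A, y, rcond=None)
print(b)  # b0 = 5 (grand mean), b1 = 0 (5 - 5), b2 = 3 (8 - 5)
```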
Dummy Coding b-weight
Indicate differences between conditions.
Contrast Coding b-weight
Used to calculate difference between conditions compared.
Effect Coding b-weight
Indicates the difference between the group mean (Ȳa) and the grand mean (Ȳt).
Dummy Coding y-intercept
Y-bar when all X’s are 0; the mean of the reference group
Contrast Coding y-intercept
Y-bar; mean of all the groups
Effect Coding y-intercept
Y-bar T; mean of all the groups
Ordinal regression
When the range is sufficient (i.e., approximately > 6 levels), treat as continuous; if not, treat as nominal.
Only needs 1 df if treated as continuous.
The df is larger (k − 1) if treated as nominal, which can reduce power.
Power of regression predictor type from highest to lowest:
Continuous, ordinal, nominal
Interaction
The effect of 1 IV on DV changes based on level of another IV.
If each factor has 2 levels (or 2 groups), only one vector per factor is needed to differentiate its groups in Multiple Regression (MRC).
How to code for interaction
The code for the interaction is simply the multiplication of Vector A and Vector B.
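A sketch of a balanced 2 × 2 design: one ±1 vector per factor, and their elementwise product as the interaction vector (made-up scores; numpy assumed):

```python
import numpy as np

a = np.array([1., 1., -1., -1., 1., 1., -1., -1.])   # factor A codes
b_ = np.array([1., -1., 1., -1., 1., -1., 1., -1.])  # factor B codes
ab = a * b_                                          # interaction = A x B product

y = np.array([10., 6., 7., 5., 12., 4., 9., 3.])
X = np.column_stack([np.ones(len(y)), a, b_, ab])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(coef)  # the last b-weight carries the A x B interaction
```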
Correlation when there is no IV effect
The correlation is 0; it differs from 0 only when there is an IV effect, because then the means of the two treatment groups differ.
R-squared for overall regression
Sum the R-squareds for the main effect of A, the main effect of B, and the interaction (the coding vectors are orthogonal in a balanced design, so the R-squareds add).
R-squared is analogous to Sum of Squares
SS is used for ANOVA; R-squared plays the analogous role in MR, partitioning the variability in Y.
Is it a different or same result in MR (multiple regression) as in ANOVA?
Same!
Multicollinearity problem if you don’t center
Centering removes the shared variance between the interaction (A × B) term and the independent variables that make it up.
Correlations for the variables X & Z with Y do not change when you center.
The individual predictors’ correlations with the interaction…
…change to 0 when you center.
Centering corrects for multicollinearity and…
…avoids accounting for parts of Y more than once.
By centering, the regression coefficients of the individual predictors become…
…the main effects of the predictors.
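A sketch showing the predictor-by-interaction correlation collapsing toward 0 once the predictors are centered (simulated, non-centered predictors; numpy assumed):

```python
import numpy as np

rng = np.random.default_rng(8)
x = rng.normal(loc=5.0, size=500)  # predictor with a nonzero mean
z = rng.normal(loc=3.0, size=500)

print(np.corrcoef(x, x * z)[0, 1])     # uncentered: X correlates with X*Z

xc, zc = x - x.mean(), z - z.mean()    # center first, then form the product
print(np.corrcoef(xc, xc * zc)[0, 1])  # near 0 after centering
```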
In regression equations without interaction terms…
The y-intercepts are different but the b-weights are exactly the same.
In regression equations with interaction terms…
The y-intercepts and b-weights are different.
Interpreting the plots for possible interactions between 2 continuous variables
If the lines are not parallel (e.g., they cross), there is an interaction.
MR Formula:
Y = b0 + b1X1 + b2X2 + b3(X1 × X2) + e
b1
Regression coefficient specific to when the value of predictor X2 = 0
b2
Regression coefficient specific to when the value of predictor X1 = 0
b3
The interaction’s predictive effect.
Describes how b1 and b2 change as a function of X2 and X1.
The value of b1 changes by b3 units….
For every one unit increase in X2.
The value of b2 changes by b3 units…
For every one unit increase in X1.
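The arithmetic of those two cards as a tiny sketch, using hypothetical coefficient values:

```python
# Simple-slope arithmetic for Y = b0 + b1*X1 + b2*X2 + b3*(X1*X2) + e.
b0, b1, b2, b3 = 2.0, 1.5, 0.8, 0.5  # hypothetical coefficients

def slope_of_x1(x2):
    """Slope of Y on X1 at a fixed value of X2."""
    return b1 + b3 * x2

print(slope_of_x1(0.0))  # 1.5: b1 is the X1 slope when X2 = 0
print(slope_of_x1(1.0))  # 2.0: a one-unit increase in X2 shifts it by b3
```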
Interpreting the coefficient for the interaction term (b3):
The tobs statistic for b3 provides…
A NHT (null hypothesis test) of the X1 × X2 interaction (H0: b3 = 0).
Interpreting the coefficient for the interaction term (b3):
If p(tobs) < .05, we…
Reject H0 & conclude the magnitude of b1 depends on the level of X2 and that b2 depends on X1.
Interpreting the coefficient for the interaction term (b3):
If p(tobs) > .05, we…
Fail to reject H0 & conclude the magnitudes of b1 & b2 are constant across all values of X2 & X1 – the X1 × X2 term is usually dropped to improve precision.