exam 2 Flashcards

Question

how can bias be decreased

Answer 1

having a larger sample = gerater proportion of the population = less bias

Answer 2

the research objectives, resources/cost/times, and population characteristics

Answer 3

- pre-experimental - quasi- experimental - true experimental

Answer 4

- exploratory - used when rigorous approaches are not feasible - weak evidence of causality - no control group or random assignment

Answer 5

- moderate evidence of causality - no random assignment

Answer 6

- random assignment of participants to treatment or control groups - strong evidence of causality

Answer 7

- case study - pretest-posttest - static group comparison - nonequivalent groups

Answer 8

- interupted time series - natural experiment

Answer 9

- independent groups - matched groups - randomized controlled trial - repeated measures - factoral - pretest-posttest - solomon four-group

Answer 10

single group is exposed to an intervention/treatment and the outcome is measured

Answer 11

theres no way of knowing whether other factors contributed to the outcome

Answer 12

a single group is measured before and after an intervention/treatment

Answer 13

- one of two groups recieves the intervention/treatment - includes a control group - no randomization

Answer 14

pre-existing differences between groups may influence the outcomes

Answer 15

multiple measurements taken before and after intervention/treatment

Answer 16

observation of effects of natural occurances/changes

Answer 17

non labratory based setting and in the natural environment can be used to study how the real world operates and generalize findings

Answer 18

- random assignment to study groups - can have more than two groups

Answer 19

minimizes the effect of individual differences

Answer 20

- participants matched on key attributes then randomly assigned to groups - helps to minimize individual differences further

Answer 21

when specific attributes are expected to interact with the IV

Answer 22

- participants are randomly assigned to treatment or control group - uses blinding to reduce bias

Answer 23

Randomized controlled trials

Answer 24

particicpants unaware of grouping

Answer 25

participants and researchers unaware of grouping

Answer 26

participants complete all conditions - the participants are their own controls - random/counterbalanced order to minmize carryover effects - smaller sample sizes needed comapted to equiavalent independent/matched groups design

Answer 27

the effects of the IV

Answer 28

- examines the effects of multiple IVs on a single DV - can be incorporated into between, within subjects design

Answer 29

- separate treatment and control groups may or may not be "pretested" - controls for carryover effects from the pretest, improved internal validitiy - requires a larger sample size and randomization into each group

Answer 30

source: population calculated: no constants: yes examples: mean, standard deviation, population

Answer 31

source: sample calculated: yes constants: no examples: mean, SD, n

Answer 32

estimating population parameters from sample statistics

Answer 33

amount of error in th estimate of a population paramter that is derived from a sample statistic

Answer 34

because of the uncertainty in our parameter estimate

Answer 35

as a sample size increases, the sample mean approaches the population mean if 1. samplesa are independenct 2. samples are identically distributed

Answer 36

means of small random samples = larger sampling error

Answer 37

means of large random samples = smaller sampling error

Answer 38

theoretical frequency distribution of all possible sample means that can be calculated from a population

Answer 39

the variability of the sampling distribution decreases as sample size of each sample mean increases

Answer 40

the variability of sampling distribution is smaller than the variability of the population

Answer 41

how much the sample mean (statistics) is likely to differ from the true population mean (parameter)

Answer 42

the standard deviation of the sampling distribution of the mean

Answer 43

- they are homogenous - they have a larger sample size

Answer 44

the accuracy of a parameter estimate is inversely proportional to the square root of the sample size

Answer 45

it willl half the SEm (half the variablity) according to the square root law

Answer 46

just like SD on a normal curve - e.g. SEm = Z score of +/- 1.0

Answer 47

there is a 68% chance the population mean is within 163.5 <= mu <= 182.5 lbs

Answer 48

that this is also the confidence interval of 68%

Answer 49

that there is also a 32% probability of error, or a chance that the mean is not within that range - p = 0.32

Answer 50

the area under the curve that represents the probability of error, the liklihood of chance ocurrence

Answer 51

that there is 5% chance of rejecting the null hypothesis incorrectly

Answer 52

C.I. = Z-score mean +/- Z-score * SEm

Answer 53

A 95% CI will give the mean +/- 1.96(SEm) - the 1.96 is the interval where 95% of the data is found

Answer 54

with 95% confidence we conclude that the mean weight of all college-ages men is between 154.38 and 191.62 lbs. However, there is a 5% chance (p = 0.05) that the true mean falls outside of this range

Answer 55

- less likely to be wrong - less precise

Answer 56

with two mutually exclusive, exhaustive mathematical statements about the relationship between variables/groups are formed

Answer 57

- null hypotheses (H0) (this is assumed to be true unless evidence is found to the contrary) - alternative hypothesis

Answer 58

only one can be true

Answer 59

that no other option exists

Answer 60

H0: mean 1 = mean 2 H1L mean 1 does not equal mean 2

Answer 61

H0: mean 1 < mean 2 H1: mean 1 > mean 2

Answer 62

indicates the probabilituy of obtaining the data collected IF the null hypothesis H0 is true

Answer 63

that the result is statistically significant and the H0 can be rejected and you accept the alternative hypothesis

Answer 64

depending on what the H0 is, it would be indicating that there is a difference bettwen the two variables or that there the treatment group is significant

Answer 65

H0: mean 1 = mean 2 HA: mean 1doesnt = mean 2

Answer 66

- set by alpha value - split between tails of the distribution (each 2.5% AUC)

Answer 67

when prior research/logical reasoning does not suggest a direction or different, a difference should be expected

Answer 68

H0: category 1 > category 2 HA: category 1

Answer 69

- set by alpha value - concentrated at one tail of the distribution

Answer 70

when there is a strong evidence to think a difference exists

Answer 71

H0 is rejected when it is actually true (a false positive)

Answer 72

conclude that an effect/relationship exists when, in reality, if does not

Answer 73

by decreasing alpha

Answer 74

H0 is accepted when it is actually false (falsse negative)

Answer 75

that no effect/relationshiup exists when it really does

Answer 76

through decreasing beta

Answer 77

the probability of committing a Type II error (typically strive for beta=0.2)

Answer 78

probability of rejecting H0 when H0 is false

Answer 79

power = 1 - beta - typically strive for 0.8

Answer 80

- measurement error - lack of random sample - alpha value too liberal (a=0.10) - investigator bias - improper use of one tailed test

Answer 81

- measurement error - lack of sufficient power (N too small) - alpha value too conservative ( a = 0.01) - treatment effect not properly applied

Answer 82

- decrease a priori significance level a (a bonferonni correction) - control confounding variables - increase sample size

Answer 83

you will increase the chance of a Type II error

Answer 84

correction to the alpha value dividing 0.05/# of tests

Answer 85

increase a priori significance level alpha

Answer 86

may increase the chance of a Type I error

Answer 87

conducting a power analysis

Answer 88

the degree of association between betwen two interval- level variables

Answer 89

a coefficient between +1.00 and -1.00

Answer 90

- perfect positive correlation - the size of deviations from the mean in both variables are equal in the same direction

Answer 91

- perfect negative correlation - or the size of deviations from the mean in both variables are euqal in opposite directions

Answer 92

- no correlation - there is no pattern to the size and direction of deviations from the mean between variables

Answer 93

to visualize the correlation between variables

Answer 94

best linear estimate of the relationship between variables given the data used to calculate it

Answer 95

error between measured and predicted values by the lines equation

Answer 96

pearon's product moment correlation coefficient

Answer 97

r = sum of (ZxZy)/N - Zx being number of score pairs - Zy being product of z-scores for each variable

Answer 98

r = (sum of (x-mean)(y-mean))/sqrt(sum of (x-mean x)^2) sum(y-mean y)^2))

Answer 99

- both variables must be on a continuous (interval or ratio) scale - each pair of variables must be indepoendent - both variables should be approximately normally distributed - the relationship between variables (if one exists) must be linear - the dataset should not contain outliers

Answer 100

use spearman's rank

Answer 101

it is really sensitive to outliers so it may creat an overly strong correlation or weak correlation

Answer 102

p = 1- (6*sum of di^2)/(n(n^2-1)) - di^2: the difference between variable ranks - n = number of observations

Answer 103

- a nonparametric test - w/ fewer assumptions including about the data distribution

Answer 104

- variables do not need to be normally distributed - variables can be discrete - relationship between variables can be non-loinear but must be monotonic - less sensitive to outliers

Answer 105

r^2 - quantifies the shared variance betwen variables - how well the indeoendent variables explain the variation in the dependent variables

Answer 106

_____% of the variance in the dataset can be explained by the variance in what is being looked at.

Answer 107

the number of scores that are free to vary when the sum the scores is set

Answer 108

df = N-#of variables in the correlation

Answer 109

correlation does not necessarily mean that a change in one variable will result in a change in the other

Answer 110

strong enough correlations allow for predictions of one variable based on the values of another variable

Answer 111

y = beta not + beta1x + e

Answer 112

- the relationship between variables must be linear - each pair of variables must be independent - for any value of a predictor (independent variable) the dependent variable must be approximately normally distributed - the variance of the residuals must be consistent across the range of predictor values

Answer 113

when the spread of residuals is relatively consistent within the regression model

Answer 114

beta1: (r(SDy)/(SDx)) beta0: mean y - ((rSDy)/(SDx))mean x

Answer 115

yes, unless there is a perfect correlation between variables

Answer 116

- using the standard error of the estimate - or the SD of the residuals

Answer 117

SEe = sqrt ((sum(yactual-ypred)^2)/(n-2)

Answer 118

SEe = SDysqrt(1-r^2)

Answer 119

it underestimates SEe when the sample size is small

Answer 120

beta1 does not equal 0

Answer 121

determine significance of beta1

Answer 122

quantifies the degree of relationship/association betwen a function of independent variables and one dependent variabl

Answer 123

a coefficient R between 0 and 1

Answer 124

no correaltion, or there is no relationship/association between independent variables and the dependent variable

Answer 125

perfect correaltion, or the independent variables completely explain the dependent variable

Answer 126

- R^2 - same interpretation as bivariate r^2

Answer 127

quantifies the relationship between an independent variable and dependent variable after removing the effect of another variable

Answer 128

an independent variable that can influence the outcome of a given statistical trial, but which is not of direct interest

Answer 129

the variance in Y explained by X1 after removing the effects of X2 on both

Answer 130

- interested in association between children's age (X1) and muscle strength (Y) - children grow and get heavier with age (X2) and may be a covariate - using partial correlation = partial out the effect of weight and can leave the variance in strength due solely to age

Answer 131

- (1-R^2) - the amount of variation in a dependent variable that a model can explain using the independent variables

Answer 132

- both variables must be on a continious (interval or ratio) scale - each pair of variables must be independent - both variables should be approx. normally distributed - the relationship between variabels (if one exists) must be linear - the dataset should not contain outliers

Answer 133

prediction of one dependent variable from multiple predictor variables (independent variables)

Answer 134

Y = a + b1X1 + b2X2 + .... bkXk - b values are the slope coefficients - x values are the independent variables - a is the Y-intercept

Answer 135

reseracher has full control over the model equation and which predictors are included

Answer 136

when hypothesis testing is the goal rather than accurate, efficient dependent variable prediction

Answer 137

computer software/algorithms construct the model equation

Answer 138

- forward selection - backward elimination - stepwise

Answer 139

starts with the intercept only, predictors are added to the model one-by-one and assessed, if R^2 increases that shows unique variablility

Answer 140

- starts with all predictors - eliminates predictors one-by-one and assesses the resulting model - if the removal of the variable decreses explained the varible the least (not sig decrease) the variable is eliminated

Answer 141

same as forward selection but previously entered variables can be eliminated in later steps - if R^2 is not affected by the inclusion or exclusion

Answer 142

requires a larger sample size compared to other methods to return reliable results

Answer 143

20:1 to 40:1 ratio

Answer 144

the values are the r values that indicate the strength of correaltion between variables - values close to 1.00 indicate strong correalation - this is then squared to report how much variance of the dataset is explained through this variable

Answer 145

- based on if the variable is highly correlated with other variables - if the addition of the variable in the R^2 calculation increases significantly, if it does this indicates unique variance

Answer 146

- if the circle overlaps heavily with the dependent variable - if the overlap is present but more overlap is seen with another variable, it doesnt explain that much for the variance and therefore isnt unique

Answer 147

- the relationship between variables must be linear - each pair of variables must be independent - for any value of a predictor (independent variable), the dependent variables must be approx normally distributed - variance of the residuals must be consistent across the range of predictor values - independent variables (predictors) should not be correlated with each other

Answer 148

leads to inflated confidence intervals for slope coefficient estimates and unstable slope coefficient estimates when addtional predictors are added

Answer 149

greater than 10 should be suspicious

Answer 150

VIF = 1/1-R^2

Answer 151

two IVs are perfectly related (r=1.00) usually because one was mathematically derived from the other

Answer 152

the process of testing regression equations on a separate and equivalent sample from which they were built to ensure accuracy in their predictions

Answer 153

higher prediction errors

Answer 154

training data - the correlation coefficient will undergo shrinkage and would be smaller on different samples

Answer 155

william sealy

Answer 156

- we do not know the distribution of the population - we have a relatively small sample relative to the population

Answer 157

as sample size increases, the t distribution approaches a normal distribution

Answer 158

the ratio between mean differences and variability

Answer 159

the value that must be met to reach statistical significance at a given alpha level

Answer 160

t = mean difference/SEof mean difference

Answer 161

a signal to noise ratio

Answer 162

the variability of the difference between two groups

Answer 163

- the data must be normally distributed - the data must be on the interval or ratio scales - the sample is randomly selevted from the greater population - when two samples are taken, they should have homogeneity of variance

Answer 164

used to compare a single sample mean with a known population meanat i

Answer 165

t = (sample mean - population mean)/SEM

Answer 166

the degrees of freedom

Answer 167

if the |t statistic| > criticial statistic

Answer 168

C.I. = sample mean +/- tcv(SEM)

Answer 169

when unequal sample sizes are present

Answer 170

SED = Square root([((n1-1)(SD1^2)+(n2-1)(SD2^2))/(n1+n2-2)][(1/n1)+1/n2)]

Answer 171

used ot compare two means from the same or correlated samples

Answer 172

t = sample mean pre - sample mean post /SED

Answer 173

SED = square root((SD1^@)/n1)+(SD2^2/n2)-2r(SD1^2/n1)(SD2^2/n2))

Answer 174

t = d/SED - d = mean difference between individual's scores SED = SDd/sqrt n - SDd = standard deviationof the difference

Answer 175

CI= mean difference between individuals socres +/-tcv(SED)

Answer 176

single samples: Wilcoxon signed rank test independent samples: Mann-Whitney U test paired samples: Tilcoson signed rank test

Answer 177

the stregnth of the relationship between variables

Answer 178

estimate of the varinace explained by the influence of the independent variable

Answer 179

percent change