Quiz 4 Flashcards
Three General Assumptions of Parametric Statistical Tests
- Normality of sampling distributions/population residuals
- When you compare more than 1 population, the variances of the populations are equal
- Data from your population are independent
Parametric Tests
- statistical tests used to estimate a specific parameter value (e.g. t-tests)
Normality
Inferential statistics assume that our sample data are drawn from normal sampling distributions
How can we know about normality?
- If we have a large enough sample (typically more than 30), we have met the assumption
- If we have a small sample, we examine our sample to infer normality of the sampling distribution
- If you have multivariate data, multivariate normality requires that any linear combination of the variables (e.g. aX + bY) is normally distributed; in practice, examine each variable by itself to see if it is normally distributed
- If we have a normally distributed sample data, it is likely that it is from a normally distributed population → sampling distribution would be normal
Skewness
- When a distribution is perfectly normal, the values of skewness and kurtosis are zero
- Positive skewness means that there is a pile up of cases on the left and a long right tail (skewed to the right)
- Negative skewness means that there is a pileup of cases to the right and a long left tail (skewed to the left)
When does the CLT not work/apply?
- When distributions have thick tails
- If your sample is small
The Central Limit Theorem
- The CLT is one of the most remarkable results of the theory of probability
- In its simplest form, the theorem states that the mean of a large number of independent observations from the same distribution has, under certain general conditions, an approximately normal distribution
- Note: exception of distributions with heavy tails
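A small simulation sketch of the CLT in action (numpy/scipy assumed available; the exponential population and sample size are arbitrary choices for illustration):

```python
import numpy as np
from scipy.stats import skew

rng = np.random.default_rng(42)

# Population: exponential (strongly right-skewed). Draw 10,000 samples of n = 50
# and keep each sample's mean.
sample_means = rng.exponential(scale=1.0, size=(10_000, 50)).mean(axis=1)

# Individual draws are heavily skewed, but the distribution of sample means is
# approximately normal, so its skewness is much closer to zero.
print(f"skewness of raw draws:    {skew(rng.exponential(1.0, 10_000)):.2f}")
print(f"skewness of sample means: {skew(sample_means):.2f}")
```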
Testing Normality in Single Variables
TB p. 183-191
- Is the sample size big enough to assume that the sampling distribution is normally distributed?
- Look at histogram of each continuous variable → starting at visual inspection of normality
- Perform the Kolmogorov-Smirnov (K-S) test or the Shapiro-Wilk test
- Significant results would suggest that the data are NOT normally distributed
- Caveat: the power of the test depends on the sample size, and is often a moot point because in large samples without thick tails, we would assume normality anyway
- Shapiro-Wilk test is highly sensitive to even small deviations from normality in large samples
Look at skewness and kurtosis statistics
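A minimal sketch of the K-S and Shapiro-Wilk tests listed above using scipy; the sample array x is a placeholder:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
x = rng.normal(loc=100, scale=15, size=40)            # placeholder sample data

# Shapiro-Wilk: a significant p (< .05) suggests the data are NOT normal
w_stat, w_p = stats.shapiro(x)

# Kolmogorov-Smirnov against a normal with the sample's own mean and SD
# (estimating these from the same data makes the p-value only approximate)
ks_stat, ks_p = stats.kstest(x, "norm", args=(x.mean(), x.std(ddof=1)))

print(f"Shapiro-Wilk: W = {w_stat:.3f}, p = {w_p:.3f}")
print(f"K-S:          D = {ks_stat:.3f}, p = {ks_p:.3f}")
```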
Formula for skewness
- Convert raw scores into z scores
- Take the average of the z scores raised to the third power
–> Raising to the third power increases the influence of outliers
- If skewed to the right, we will get a positive skewness score
- If skewed to the left, we will get a negative value
- No skewness → Formula results in zero
Cutoffs: ± 2
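A minimal sketch of the skewness formula above; note that statistical packages often apply a small-sample correction, so their values can differ slightly from this basic version:

```python
import numpy as np

def skewness(x):
    """Average of the z scores raised to the third power."""
    x = np.asarray(x, dtype=float)
    z = (x - x.mean()) / x.std(ddof=0)   # convert raw scores to z scores
    return np.mean(z ** 3)               # cubing amplifies the influence of outliers

print(skewness([1, 2, 2, 3, 3, 3, 10]))   # positive: long right tail
print(skewness([1, 4, 4, 5, 5, 5, 6]))    # negative: long left tail
print(skewness([1, 2, 3, 4, 5]))          # ~0: symmetric
```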
If skewed to the right, we will get a _______ skewness score
positive
If skewed to the left, we will get a ______ value
negative
Formula for Kurtosis K4
- Kurtosis values above zero indicate a distribution that is too peaked, with thick tails
- Kurtosis values below zero indicate a platykurtic (flatter) distribution
Leptokurtic (thicker tails)→ positive kurtosis statistic
Platykurtic (thinner tails) → negative kurtosis statistic
Kurtosis value below zero is _________
platykurtic
Leptokurtic (thicker tails)→ ______ kurtosis statistic
positive
Platykurtic (thinner tails) → ________ kurtosis statistic
negative
Rule of thumb cutoffs for kurtosis
±7 → be concerned
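A minimal sketch of excess kurtosis (average of the z scores raised to the fourth power, minus 3 so a normal distribution scores zero); as with skewness, software may apply a bias correction:

```python
import numpy as np

def excess_kurtosis(x):
    """Average of the z scores to the fourth power, minus 3 (normal ~ 0)."""
    x = np.asarray(x, dtype=float)
    z = (x - x.mean()) / x.std(ddof=0)
    return np.mean(z ** 4) - 3          # > 0 leptokurtic, < 0 platykurtic

rng = np.random.default_rng(1)
print(excess_kurtosis(rng.normal(size=10_000)))            # ~0 for normal data
print(excess_kurtosis(rng.standard_t(df=5, size=10_000)))  # > 0 (thicker tails)
print(excess_kurtosis(rng.uniform(size=10_000)))           # < 0 (thin tails, flat)
```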
Significance Tests for Skewness and Kurtosis
Step 1: convert skewness and kurtosis scores into z scores
Step 2: compare z scores to critical values of ±1.96 for small samples and ±2.58 for large samples. If greater than the critical value, significant skewness/kurtosis
- More stringent for larger samples
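A sketch of the two steps, using the common large-sample approximations SE(skewness) ≈ √(6/N) and SE(kurtosis) ≈ √(24/N); these standard errors, and the sample-size cutoff of 200, are assumptions of this sketch rather than something stated above:

```python
import numpy as np
from scipy import stats

x = np.random.default_rng(2).exponential(size=60)   # placeholder sample
n = len(x)

# Step 1: convert skewness and kurtosis to z scores
z_skew = stats.skew(x) / np.sqrt(6 / n)
z_kurt = stats.kurtosis(x) / np.sqrt(24 / n)        # excess kurtosis

# Step 2: compare to the critical value (±1.96 small samples, ±2.58 large samples)
crit = 1.96 if n < 200 else 2.58
print(f"z_skew = {z_skew:.2f}, z_kurt = {z_kurt:.2f}; significant if |z| > {crit}")
```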
What is the Big Deal if a Distribution Isn’t “Normal?”
- We could get inaccurate results from our analysis
- Messes with Type I and Type II error rates
- Meaning that the null could be true when our stats tell us it isn’t or vice versa
What to do if normality assumption is NOT met
- Data transformation
Appropriate when there is skewness in distribution
- Replacing the data with a function of all the data within that variable
- Non-parametric tests
- Modern methods (e.g. bootstrapping)
Data Transformation
Most common → square root transformation
Most useful when data are skewed to the right
Pulls more extreme values closer to the middle
Bigger impact on bigger values
Square Root Transformation is most useful when _______
data are skewed to the right
When data are skewed left what transformation can be done?
When data are skewed to the left:
- Reflect the scores and then do a square root transformation
- To reflect, subtract each value from a large number (e.g. the highest score + 1)
Log transformations
For extreme positive skew → reduces positive skew
Pulls in values to a greater degree than square root transformation
Inverse transformation
- Transforms data with extreme positive skew to normal
- 1 / (value of data)
- Need to add a constant so that no value equals zero; values CAN be negative as long as there is no zero
- Table 6.1 in TB
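A minimal sketch of the transformations listed above, assuming 1-D numpy arrays; the added constants are illustrative and would be chosen so that no value is zero (or negative, for the log):

```python
import numpy as np

x = np.array([1.0, 2.0, 2.0, 3.0, 4.0, 15.0])    # right-skewed placeholder data

sqrt_x    = np.sqrt(x)         # square root: pulls bigger values in more than small ones
log_x     = np.log(x + 1)      # log: pulls values in more strongly than the square root
inverse_x = 1 / (x + 1)        # inverse: strongest; constant keeps every value nonzero

# Left skew: reflect first (subtract from a number bigger than the max), then transform
x_left    = np.array([1.0, 10.0, 12.0, 13.0, 13.0, 14.0])
reflected = (x_left.max() + 1) - x_left
sqrt_reflected = np.sqrt(reflected)
```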
To Transform … or Not
- The CLT: sampling distribution will be normal in samples >30 (unless sample distribution has a substantial skew; heavy tails)
- Transformations sometimes fail to restore normality and equality of variances
- They make the interpretation of results difficult, as findings are based on transformed data
Rank Order Tests (AKA Non-Parametric Tests)
Can be used in place of their parametric counterparts when it is questionable that the normality assumption holds
sometimes more powerful in detecting population differences when certain assumptions are not satisfied
Nonparametric tests cannot assess interactions
Modern Methods
Refers to approaches dealing with non normal data that require a lot of computing power
- E.g. bootstrapping methods
Bootstrapping
Goal is to observe the sampling distribution shape directly to allow us to do hypothesis testing
Uses sample data to estimate the sampling distribution itself
By drawing random samples with replacement from the sample data
Because sampling distribution is estimated directly, no need to assume normality
P-value can be calculated based on how rare it is to get the observed test-statistic value or more extreme values in the estimated sampling distribution (regardless of whether it is normal or not)
Rule of thumb for Bootstrapping
5,000 - 10,000 bootstrap samples
Can use bootstrapping to create a CI
Find central value (mean) of the data points and look at what falls at lower 2.5th percentile and upper 97.5th percentile and use them as upper and lower bounds for CI
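A minimal sketch of a percentile bootstrap CI for a mean, assuming a 1-D numpy array of sample data; the 10,000 resamples follow the rule of thumb above:

```python
import numpy as np

rng = np.random.default_rng(3)
data = rng.exponential(scale=2.0, size=40)     # placeholder (non-normal) sample

# Draw 10,000 bootstrap samples WITH replacement; compute each resample's mean
boot_means = np.array([
    rng.choice(data, size=len(data), replace=True).mean()
    for _ in range(10_000)
])

# Percentile CI: 2.5th and 97.5th percentiles of the estimated sampling distribution
lower, upper = np.percentile(boot_means, [2.5, 97.5])
print(f"Bootstrap 95% CI for the mean: [{lower:.2f}, {upper:.2f}]")
```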
The Three Assumptions of Parametric Statistics
- The sampling distribution(s) is(are) normally distributed
- Homogeneity (equality) of Variance
- Data from your population are independent
Homogeneity (equality) of Variance
The assumption that the dependent variable exhibits similar amounts of variance across the range of values for an independent variable
Assessing Homogeneity of Variance
Visual inspection of graphs
- Scatter plot, residual plot
Levene's test
- Can become overly sensitive in large samples
Variance Ratio (Hartley's Fmax)
Variance Ratio (Hartley’s Fmax)
With 2 or more groups
VR = largest variance / smallest variance
If VR < 2, homogeneity of variance can be assumed
If the group sizes are roughly equal, hypothesis testing results are robust to the violation of homogeneity
- You would still likely get valid results even if the variances are not equal
- If the largest group size is smaller than 1.5 times the smallest group size, the groups can be considered roughly equal
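A minimal sketch of the variance ratio check with placeholder groups:

```python
import numpy as np

groups = [np.array([4.0, 5.0, 6.0, 7.0, 8.0]),     # placeholder group scores
          np.array([3.0, 5.0, 7.0, 9.0, 11.0]),
          np.array([5.0, 5.0, 6.0, 6.0, 7.0])]

variances = [g.var(ddof=1) for g in groups]
vr = max(variances) / min(variances)               # largest variance / smallest variance

print(f"Variance ratio = {vr:.2f}; homogeneity of variance assumed if VR < 2")
```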
Leven’s Tests
- Tests if variance in
different groups are
the same- Significant = variances not equal - Non significant = variances are equal Null hypothesis is that there’s homogeneity of variance
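A minimal sketch of Levene's test using scipy, again with placeholder groups; a significant result rejects the null of equal variances:

```python
from scipy import stats

group1 = [4, 5, 6, 7, 8]          # placeholder groups
group2 = [3, 5, 7, 9, 11]
group3 = [5, 5, 6, 6, 7]

stat, p = stats.levene(group1, group2, group3)
print(f"Levene's test: W = {stat:.2f}, p = {p:.3f}")
print("Variances NOT equal" if p < .05 else "No evidence variances differ")
```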
Visual inspection of graphs for assessing homogeneity of variance
- Scatter plot, residual plot
*Space between line of best fit and actual point
–> Deviation, error, residual
Addressing Homogeneity of Variance
- Using robust methods
- Bootstrapping
- Transforming an outcome variable
Why is independence of data important?
- The general formula for a test statistic involves two types of variability, one in the numerator and one in the denominator
- Formula for test-statistic = explained variability/unexplained variability
- We want test statistic to be bigger such that we have greater explanatory power
BUT with dependent data, unexplained variability becomes artificially smaller
In the case of dependent data → increased Type I error rate
Measuring Relations Between Variables
We can see whether, as one variable deviates from its own mean, the other deviates in the same way from its own mean, the opposite way, or stays the same
This can be done by calculating the covariance
If there is a similar (or opposite) pattern, we say the two variables covary
Variance
measure of how much a group of scores deviates from the mean of a single variable
Average squared deviation from the mean
Covariance
Tells us by how much a group of scores on two variables differ from their respective means
Covariance Steps
Calculate the deviation (error) between the mean and each subject's score for the first variable (x)
Calculate the deviation (error) between the mean and their score for the second variable (y)
Multiply these deviations (error) values
These multiplied numbers are called the cross product deviations
Add up these cross product deviations
Divide by N-1 → result is covariance
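A minimal sketch of the steps above with placeholder paired scores; numpy's built-in covariance should give the same answer:

```python
import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0, 10.0])    # placeholder scores on variable x
y = np.array([1.0, 3.0, 2.0, 5.0, 6.0])     # placeholder scores on variable y

dev_x = x - x.mean()                        # deviations from the mean of x
dev_y = y - y.mean()                        # deviations from the mean of y
cross_products = dev_x * dev_y              # cross-product deviations
covariance = cross_products.sum() / (len(x) - 1)   # average cross-product deviation

print(covariance, np.cov(x, y, ddof=1)[0, 1])      # the two values should match
```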
COVARIANCE IS THE AVERAGE _________
CROSS-PRODUCT DEVIATION
Limitations of Covariance
depends upon the units of measurement
E.g. the covariance of two variables measured in miles and dollars would be much smaller than if the variables were measured in feet and cents, even if the relationship were exactly the same
Solution to the limitations of covariance
Solution –> standardize it
Divide by the product of standard deviations of both variables
The standardized version of covariance is known as the correlation coefficient
Relatively unaffected by units of measurement
Correlation Coefficient
When x and y are both continuous, their correlation is called the Pearson Product Moment Correlation Coefficient
Correlation statistics are standardized and can range from -1 to 1
It can be used as a measure of effect size
The equation for Correlation is similar to z scores….
Both are standardized statistics that can be compared across samples
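A minimal sketch continuing the covariance example above: dividing the covariance by the product of the two standard deviations standardizes it into Pearson's r (scipy gives the same value):

```python
import numpy as np
from scipy import stats

x = np.array([2.0, 4.0, 6.0, 8.0, 10.0])    # same placeholder scores as above
y = np.array([1.0, 3.0, 2.0, 5.0, 6.0])

cov_xy = np.cov(x, y, ddof=1)[0, 1]
r = cov_xy / (x.std(ddof=1) * y.std(ddof=1))   # standardized covariance

print(r, stats.pearsonr(x, y)[0])              # should match; always between -1 and 1
```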
Convention for effect size in correlation
Convention
.1 = small effect
.3 = medium effect
.5 = large effect
Correlation and Causality
Correlation is a necessary but not sufficient criterion for causality
Possible directions of causality:
X → Y
X ← Y
A third factor leads to changes in both x and y
Correlation is by coincidence
Things needed to determine causality
Temporal precedence
Demonstrating empirical association between variables
Control for confounds
Types of Correlations
Pearson’s Correlation (r)
Spearman's ρ (Greek rho) (rs)
Kendall's tau (τ)
Point Biserial Correlation (rpb)
Biserial Correlation (rb)
Phi Coefficient (φ)
Pearson’s Correlation (r)
For analyzing relationship between two continuous variables
Assumes normality, homogeneity of variance, independence of data, AND a linear relationship
Spearman's ρ (Greek rho) (rs)
When one or both variables are ordinal
E.g. SAT score and high school class standing
Nonparametric alternative to pearson’s coefficient
Does not assume linear relationship
Kendall's tau (τ)
Better for smaller samples
Helpful when data are ranked, particularly when there are many tied ranks
Point Biserial Correlation (rpb)
One continuous and one dichotomous variable
Used when the dichotomous variable is a true (discrete) dichotomy
Biserial Correlation (rb)
One continuous and one dichotomous variable
When the dichotomous variable is not a true dichotomy but reflects an underlying continuum treated as dichotomous
E.g. pass/fail class grades (categories based on continuum)
Median split → create groups
Phi Coefficient (φ)
Two categorical variables
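A sketch of where these coefficients live in scipy, with placeholder arrays; scipy has no dedicated biserial function, and the phi coefficient can be obtained by running Pearson's r on two 0/1-coded variables:

```python
import numpy as np
from scipy import stats

cont_a = np.array([2.0, 4.0, 6.0, 8.0, 10.0])    # continuous placeholder variable
cont_b = np.array([1.0, 3.0, 2.0, 5.0, 6.0])     # continuous placeholder variable
binary = np.array([0, 0, 1, 1, 1])               # true-dichotomy placeholder (0/1 coded)
binary2 = np.array([0, 1, 1, 1, 0])              # second 0/1 placeholder

r, _   = stats.pearsonr(cont_a, cont_b)          # Pearson's r (two continuous variables)
rho, _ = stats.spearmanr(cont_a, cont_b)         # Spearman's rho (ordinal / non-linear)
tau, _ = stats.kendalltau(cont_a, cont_b)        # Kendall's tau (small samples, tied ranks)
rpb, _ = stats.pointbiserialr(binary, cont_a)    # point-biserial (binary x continuous)
phi, _ = stats.pearsonr(binary, binary2)         # phi: Pearson's r on two 0/1 variables
```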
Procedure in Experimental research to Rule Out Confounding Variables
Random assignment
Coefficient of Determination, R2
By squaring the value of r, you get the proportion of variance in one variable shared by the other(s), R2
Can only take the value of 0-1, because it is a squared value (must be positive)
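A quick worked example of the point above: r = .50 (a "large" effect by the earlier convention) gives R2 = .25, so the two variables share 25% of their variance; r = -.30 and r = .30 both give R2 = .09, since squaring removes the sign.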
Spurious Relationship
The two variables have no causal connection, yet it may be inferred that they do, due to an unknown confounding factor
E.g. ice cream sales and deaths by drowning → the third, confounding factor is the outdoor temperature
Caveat for biserial and point biserial correlations
when group sizes for binary variable are unequal → correlation values can become smaller