Final Exam Terms Flashcards

Question 1

Q

When are sample proportions normally distributed (i.e. what assumptions must be met?)

Answer

A

np >= 10

n(1 - p) >= 10

Question 2

Q

The sample proportions might NOT follow a normal distribution if …

Answer

A

p is close to 0 or 1
small n

Question 3

Q

When do we use two-sided or one-sided tests?

Answer

A

When the sign of Ha is does not equal, use both tails ( p x 2)

If Ha is < or >, use one tail to compute p

Question 4

Q

What assumptions must be met for a one proportion CI?

Answer

A

nphat >= 10

n(1 - phat) >= 10

Question 5

Q

How do we calculate a sample size for a one proportion test?

Answer

A

n = (z*/ME)^2 (p squiggle)(1 - p squiggle)

p squiggle is an estimate for the proportion

ALWAYS ROUND UP

Question 6

Q

What is p~ ( p squiggle)

Answer

A

it is the estimated proportion

If not provided p squiggle is 0.5

Question 7

Q

How is a hypothesis test set up for a two proportion test?

Answer

A

H0: p1 = p2
Ha: p1 does not equal p2

Question 8

Q

How do you interpret the confidence interval of a two proportion test?

Answer

A

We are __% confident that the difference in the population proportions of (insert variables) is between ___ and ____.

Question 9

Q

How do you set up a hypothesis test for a chi square test?

Answer

A

H0: p1 = p2 = p3 ….
Ha: some p does not equal some value

Question 10

Q

What is the formula for chi square goodness of fit test?

Answer

A

X(chi)^2 = the sum of (observed - expected)^2/expected

Question 11

Q

In a chi square test, what is the formula for expected counts?

Answer

A

n(p sub i)

pi is given in the null hypothesis

Question 12

Q

When do we use a t distribution?

Answer

A

1 or 2 paired means

Question 13

Q

When do we use a z distribution?

Answer

A

1 or 2 proportions

Question 14

Q

When do we use a chi square distribution?

Answer

A

When we have more than 2 proportions

Question 15

Q

How does a chi-square distribution appear? (i.e. what shape?)

Answer

A

right skewed

Question 16

Q

What happens to a chi-square distribution as df increases?

Answer

A

The degree of skew decreases and approaches a normal distribution.

Question 17

Q

What assumptions must be met for chi-square distribution?

Answer

A

each of the expected counts must be >= 5

Question 18

Q

What tail test do we use for finding p-value with a chi-square test?

Answer

A

always the right tail

Question 19

Q

What chi-square test do we use for two categorical variables?

Answer

A

chi square test for association

(goodness of fit for one categorical variable)

Question 20

Q

How do we set up a hypothesis test for a chi square test for association?

Answer

A

H0: variable A is not associated with variable B
Ha: variable A is associated with variable B

Question 21

Q

How do we calculate expected counts for chi-square test of association?

Answer

A

expected count = (row total x column total)/sample size (n)

Question 22

Q

How do we calculate degrees of freedom for chi-square tests?

Answer

A

goodness of fit: df = k -1

association: df = (r - 1)(c - 1)

Question 23

Q

How do we graph two quantitative variables?

Answer

A

scatterplot

Question 24

Q

What does correlation do?

Answer

A

Measures the strength and direction of a linear relationship between two quantitative variables.

Question 25

Q

How do we describe correlation in terms of paramaters and statistics?

Answer

A

paramater: rho

statistic: r aka correlation coefficient

Question 26

Q

What is the correlation coefficient range?

Answer

A

the smallest r can be is -1, the largest it can be is 1

Question 27

Q

What does it mean if r is positive?

Answer

A

as one variable increases, the other variable increases

direct/positive relationship

Question 28

Q

What does it mean if r is negative?

Answer

A

as one variable increases, the other variable decreases

inverse relationship

Question 29

Q

What does it mean if r is 0?

Answer

A

There is no linear relationship

Question 30

Q

The farther r is away from zero …

Answer

A

The stronger the linear relationship

(min -1, max +1)

Question 31

Q

How do you set up a correlation hypothesis test?

Answer

A

H0: rho = 0
(no linear relationship, variables are not correlated)

Ha: rho does not equal 0
(linear relationship, variables are correlated)

Question 32

Q

Does correlation imply causation?

Answer

A

Not always, must consider if an experiment is observational or experimental, or if there are any confounding variables

Question 33

Q

Is r resistant to outliers?

Question 34

Q

What does linear regression do?

Answer

A

Uses one quantitative variable to predict changes in another quantitative variable.

or using an explanatory variable to predict changes in the response variable

Question 35

Q

What is the linear regression equation?

Answer

A

y hat = a + bx

y hat: predicted response value
a = y intercept; predicted value of y when x = 0
b = slope; change in y for one unit change in x

Question 36

Q

What is the difference between simple and multiple linear regression.

Answer

A

simple has one explanatory variable

multiple has two or more explanatory variables

Question 37

Q

How is the residual calculated?

Answer

A

residual = actual y - predicted y

or = y - y hat

Question 38

Q

What does it mean if the residual is positive or negative?

Answer

A

positive residuals are above the line of best fit

negative residuals are below the line of best fit

Question 39

Q

What is ANOVA?

Answer

A

analysis of variance; helps determine if there is a difference between two or more means

Question 40

Q

For ANOVA, what are the factor and response

Answer

A

factor is the x variable, a categorical variable

response is the y variable, a quantitative variable

Question 41

Q

How is the hypothesis test set up for ANOVA?

Answer

A

H0: mu1 = mu2 = mu3 …
Ha: at least one mu does not equal another mu

Question 42

Q

How is df error found?

Answer

A

Df total (calculated as normal, n - 1) - Df factor (#groups - 1)

Question 43

Q

Describe F-distributions

Answer

A

F-distributions are right skewed, must use a right tail test when using the F-statistic to find the p value

Question 44

Q

How do you interpret a Tukey Comparison?

Answer

A

They ensure that the Type-1 error rate is not inflated.

As long as the data spread is not overlapping 0, the means are different.