Final Exam Terms Flashcards

1
Q

When are sample proportions normally distributed (i.e. what assumptions must be met?)

A

np >= 10

n(1 - p) >= 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The sample proportions might NOT follow a normal distribution if …

A
  1. p is close to 0 or 1
  2. small n
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When do we use two-sided or one-sided tests?

A

When the sign of Ha is does not equal, use both tails ( p x 2)

If Ha is < or >, use one tail to compute p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What assumptions must be met for a one proportion CI?

A

nphat >= 10

n(1 - phat) >= 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we calculate a sample size for a one proportion test?

A

n = (z*/ME)^2 (p squiggle)(1 - p squiggle)

p squiggle is an estimate for the proportion

ALWAYS ROUND UP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is p~ ( p squiggle)

A

it is the estimated proportion

If not provided p squiggle is 0.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is a hypothesis test set up for a two proportion test?

A

H0: p1 = p2
Ha: p1 does not equal p2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you interpret the confidence interval of a two proportion test?

A

We are __% confident that the difference in the population proportions of (insert variables) is between ___ and ____.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you set up a hypothesis test for a chi square test?

A

H0: p1 = p2 = p3 ….
Ha: some p does not equal some value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the formula for chi square goodness of fit test?

A

X(chi)^2 = the sum of (observed - expected)^2/expected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In a chi square test, what is the formula for expected counts?

A

n(p sub i)

pi is given in the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When do we use a t distribution?

A

1 or 2 paired means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When do we use a z distribution?

A

1 or 2 proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When do we use a chi square distribution?

A

When we have more than 2 proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does a chi-square distribution appear? (i.e. what shape?)

A

right skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What happens to a chi-square distribution as df increases?

A

The degree of skew decreases and approaches a normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What assumptions must be met for chi-square distribution?

A

each of the expected counts must be >= 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What tail test do we use for finding p-value with a chi-square test?

A

always the right tail

19
Q

What chi-square test do we use for two categorical variables?

A

chi square test for association

(goodness of fit for one categorical variable)

20
Q

How do we set up a hypothesis test for a chi square test for association?

A

H0: variable A is not associated with variable B
Ha: variable A is associated with variable B

21
Q

How do we calculate expected counts for chi-square test of association?

A

expected count = (row total x column total)/sample size (n)

22
Q

How do we calculate degrees of freedom for chi-square tests?

A

goodness of fit: df = k -1

association: df = (r - 1)(c - 1)

23
Q

How do we graph two quantitative variables?

A

scatterplot

24
Q

What does correlation do?

A

Measures the strength and direction of a linear relationship between two quantitative variables.

25
Q

How do we describe correlation in terms of paramaters and statistics?

A

paramater: rho

statistic: r aka correlation coefficient

26
Q

What is the correlation coefficient range?

A

the smallest r can be is -1, the largest it can be is 1

27
Q

What does it mean if r is positive?

A

as one variable increases, the other variable increases

direct/positive relationship

28
Q

What does it mean if r is negative?

A

as one variable increases, the other variable decreases

inverse relationship

29
Q

What does it mean if r is 0?

A

There is no linear relationship

30
Q

The farther r is away from zero …

A

The stronger the linear relationship

(min -1, max +1)

31
Q

How do you set up a correlation hypothesis test?

A

H0: rho = 0
(no linear relationship, variables are not correlated)

Ha: rho does not equal 0
(linear relationship, variables are correlated)

32
Q

Does correlation imply causation?

A

Not always, must consider if an experiment is observational or experimental, or if there are any confounding variables

33
Q

Is r resistant to outliers?

A

no

34
Q

What does linear regression do?

A

Uses one quantitative variable to predict changes in another quantitative variable.

or using an explanatory variable to predict changes in the response variable

35
Q

What is the linear regression equation?

A

y hat = a + bx

y hat: predicted response value
a = y intercept; predicted value of y when x = 0
b = slope; change in y for one unit change in x

36
Q

What is the difference between simple and multiple linear regression.

A

simple has one explanatory variable

multiple has two or more explanatory variables

37
Q

How is the residual calculated?

A

residual = actual y - predicted y

or = y - y hat

38
Q

What does it mean if the residual is positive or negative?

A

positive residuals are above the line of best fit

negative residuals are below the line of best fit

39
Q

What is ANOVA?

A

analysis of variance; helps determine if there is a difference between two or more means

40
Q

For ANOVA, what are the factor and response

A

factor is the x variable, a categorical variable

response is the y variable, a quantitative variable

41
Q

How is the hypothesis test set up for ANOVA?

A

H0: mu1 = mu2 = mu3 …
Ha: at least one mu does not equal another mu

42
Q

How is df error found?

A

Df total (calculated as normal, n - 1) - Df factor (#groups - 1)

43
Q

Describe F-distributions

A

F-distributions are right skewed, must use a right tail test when using the F-statistic to find the p value

44
Q

How do you interpret a Tukey Comparison?

A

They ensure that the Type-1 error rate is not inflated.

As long as the data spread is not overlapping 0, the means are different.