Midterm Flashcards

1
Q

What is the total area under the density curve?

A

100% or 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What percent of observations will fall within 1 standard deviation of the mean of an approx. Normal distribution?

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What percent of observations will fall within 2 standard deviation of the mean of an approx. Normal distribution?

A

95%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What percent of observations will fall within 3 standard deviation of the mean of an approx. Normal distribution?

A

99.7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is “r”?

A

the correlation r measures the strength of a linear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What values can r take? What does it mean when r is less than 0?

A

r can take any value from -1 to 1. If it’s less than zero, it describes a negative correlation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does it mean when r = 1? r = -1?

A

perfect correlation/points on a scatterplot lie exactly on a straight line; r = -1 means perfect negative correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the slope b of a regression line y-hat = a + bx?

A

the predicted change in y-hat when x increases by 1 unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the standard deviation of the residuals measure?

A

typical size of prediction errors when using the regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the coefficient of determination r^2 measure?

A

fraction of the variation in the response variable that is accounted for by the least-squares regression on the explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define influential observations

A

Individual points that substantially change the correlation or the regression line; outliers are often influential for the regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The least squares regression line of Y on X is the line with slope B= ? and intercept A = ?

A

r(Sy / Sx)

YMean - bXMean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the 4 basic principles of experimental design?

A
  1. Comparison: use a design that compares 2 or more treatments
  2. Random assignment: use chance to assign experimental units to treatments. This helps create roughly equivalent groups before treatments are imposed.
  3. Control: keep as many other variables as possible the same for all groups. Control helps avoid confounding and reduces the variation in responses, making it easier to decide whether a treatment is effective.
  4. Replication: impose each treatment on enough experimental units so that the effects of the treatments can be distinguished from chance differences between the groups.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe a randomized block design

A

A randomized block design forms groups of experimental units that are similar with respect to a variable that is expected to affect the response. Treatments are assigned at random with in each block. Responses are then compared with in each block and combined with the responses of other blocks after accounting for the differences between the blocks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Describe a matched pairs design

A

A matched pairs design is a common form of blocking for comparing just two treatments. And some matched pairs designs each subject receives both treatments in a random order. And others to very similar subjects are paired and the two treatments are randomly assigned within each pair

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What makes something a simple random sample?

A

It gives every possible sample of the same size an equal chance to be selected

17
Q

How should you organize a matched pairs experiment?

A

Subjectively divide the sample into pairs to make the pairs as similar to each other as possible, and then randomly assigned the treatment to one of the members of the pair

18
Q

What is the law of large numbers?

A

The law of large numbers says that the proportion of times that a particular outcome occurs in many repetitions will approach a single number.

19
Q

What will the probability of the sample space always equal?

A

1

20
Q

What is the addition rule for mutually exclusive events?

A

P(A or B) = P(A) + P(B)

21
Q

What probability does the union of A and B describe?

A

P(A or B)

22
Q

What probability does the intersection of A and B provide?

A

P(A and B)

23
Q

What is the general addition rule to find P(A or B)?

A

P(A or B) = P(A) + P(B) - P (A and B)

24
Q

What does the general multiplication rule state?

A

P(A and B) = P(A) * P(B | A)

25
Q

If two events are mutually exclusive, they can/cannot be independent

A

Cannot

26
Q

What are the sums of the means of X + Y?

A

X mean + Y mean

27
Q

What are the variance of X + Y?

A

variance X + variance Y

28
Q

What are the variance of X - Y?

A

variance X + variance Y

29
Q

What are the qualifications of a binomial setting?

A

Binary- The possible outcomes of each trial can be classified as a success or failure
Independent- trials must be independent; that is, knowing the result of one trial must not tell us anything about the result of any other trial
Number- The number of trials of the chance process must be fixed in advance
Success- there is the same probability of success on each trial

30
Q

What is the 10% condition?

A

The binomial distribution gives a good approximation to the count of successes in a simple random sample from a large population containing proportion P of success. This is true as long as the sample size is no more than 10% of the population size.

31
Q

What is the large counts condition?

A

You can use a normal approximation for a binomial distribution when the sample size times the probability of success is greater than 10 and the sample size times the complement of the probability of success is also greater than 10

32
Q

How do you find the geometric probability?

A

P(Y=k) = (1-p)^(k-1) (p)

33
Q

How do you find a mean or expected value of a geometric random variable?

A

The mean is equal to one divided by the probability of success

34
Q

The central limit theorem states that when n is ?, the sampling distribution of XBar will be approx. Normal in most cases

A

n > or = 30