Midterm Flashcards

Question 1

Q

What is the total area under the density curve?

Answer

A

100% or 1

Question 2

Q

What percent of observations will fall within 1 standard deviation of the mean of an approx. Normal distribution?

Question 3

Q

What percent of observations will fall within 2 standard deviation of the mean of an approx. Normal distribution?

Question 4

Q

What percent of observations will fall within 3 standard deviation of the mean of an approx. Normal distribution?

Question 5

Q

What is “r”?

Answer

A

the correlation r measures the strength of a linear relationship

Question 6

Q

What values can r take? What does it mean when r is less than 0?

Answer

A

r can take any value from -1 to 1. If it’s less than zero, it describes a negative correlation.

Question 7

Q

What does it mean when r = 1? r = -1?

Answer

A

perfect correlation/points on a scatterplot lie exactly on a straight line; r = -1 means perfect negative correlation

Question 8

Q

What is the slope b of a regression line y-hat = a + bx?

Answer

A

the predicted change in y-hat when x increases by 1 unit

Question 9

Q

What does the standard deviation of the residuals measure?

Answer

A

typical size of prediction errors when using the regression line

Question 10

Q

What does the coefficient of determination r^2 measure?

Answer

A

fraction of the variation in the response variable that is accounted for by the least-squares regression on the explanatory variable

Question 11

Q

Define influential observations

Answer

A

Individual points that substantially change the correlation or the regression line; outliers are often influential for the regression line

Question 12

Q

The least squares regression line of Y on X is the line with slope B= ? and intercept A = ?

Answer

A

r(Sy / Sx)

YMean - bXMean

Question 13

Q

What are the 4 basic principles of experimental design?

Answer

A

Comparison: use a design that compares 2 or more treatments
Random assignment: use chance to assign experimental units to treatments. This helps create roughly equivalent groups before treatments are imposed.
Control: keep as many other variables as possible the same for all groups. Control helps avoid confounding and reduces the variation in responses, making it easier to decide whether a treatment is effective.
Replication: impose each treatment on enough experimental units so that the effects of the treatments can be distinguished from chance differences between the groups.

Question 14

Q

Describe a randomized block design

Answer

A

A randomized block design forms groups of experimental units that are similar with respect to a variable that is expected to affect the response. Treatments are assigned at random with in each block. Responses are then compared with in each block and combined with the responses of other blocks after accounting for the differences between the blocks.

Question 15

Q

Describe a matched pairs design

Answer

A

A matched pairs design is a common form of blocking for comparing just two treatments. And some matched pairs designs each subject receives both treatments in a random order. And others to very similar subjects are paired and the two treatments are randomly assigned within each pair

Question 16

Q

What makes something a simple random sample?

Answer

A

It gives every possible sample of the same size an equal chance to be selected

Question 17

Q

How should you organize a matched pairs experiment?

Answer

A

Subjectively divide the sample into pairs to make the pairs as similar to each other as possible, and then randomly assigned the treatment to one of the members of the pair

Question 18

Q

What is the law of large numbers?

Answer

A

The law of large numbers says that the proportion of times that a particular outcome occurs in many repetitions will approach a single number.

Question 19

Q

What will the probability of the sample space always equal?

Question 20

Q

What is the addition rule for mutually exclusive events?

Answer

A

P(A or B) = P(A) + P(B)

Question 21

Q

What probability does the union of A and B describe?

Answer

A

P(A or B)

Question 22

Q

What probability does the intersection of A and B provide?

Answer

A

P(A and B)

Question 23

Q

What is the general addition rule to find P(A or B)?

Answer

A

P(A or B) = P(A) + P(B) - P (A and B)

Question 24

Q

What does the general multiplication rule state?

Answer

A

P(A and B) = P(A) * P(B | A)

Question 25

Q

If two events are mutually exclusive, they can/cannot be independent

Question 26

Q

What are the sums of the means of X + Y?

Answer

A

X mean + Y mean

Question 27

Q

What are the variance of X + Y?

Answer

A

variance X + variance Y

Question 28

Q

What are the variance of X - Y?

Answer

A

variance X + variance Y

Question 29

Q

What are the qualifications of a binomial setting?

Answer

A

Binary- The possible outcomes of each trial can be classified as a success or failure
Independent- trials must be independent; that is, knowing the result of one trial must not tell us anything about the result of any other trial
Number- The number of trials of the chance process must be fixed in advance
Success- there is the same probability of success on each trial

Question 30

Q

What is the 10% condition?

Answer

A

The binomial distribution gives a good approximation to the count of successes in a simple random sample from a large population containing proportion P of success. This is true as long as the sample size is no more than 10% of the population size.

Question 31

Q

What is the large counts condition?

Answer

A

You can use a normal approximation for a binomial distribution when the sample size times the probability of success is greater than 10 and the sample size times the complement of the probability of success is also greater than 10

Question 32

Q

How do you find the geometric probability?

Answer

A

P(Y=k) = (1-p)^(k-1) (p)

Question 33

Q

How do you find a mean or expected value of a geometric random variable?

Answer

A

The mean is equal to one divided by the probability of success

Question 34

Q

The central limit theorem states that when n is ?, the sampling distribution of XBar will be approx. Normal in most cases

Answer

A

n > or = 30