Highlighted memory Flashcards

1
Q

FIve Number summary

A

Min, Q1, Med, Q3, Max

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describing or comparing distributions

A

For quantitative data discuss

shape center spread outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Outlier

A

An extreme observation is an outlier if it is smaller than Q1-(1.5IQR) or larger than Q3+(1.5IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Choosing Measure of Center and spread

A

The mean and standard deviation are used to compare roughly symmetric distributions. The median and IQR are used to compare distributions where at least one is skewed, because they are resistant to outliers and skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

percentile

A

Percent of the distribution that is below the value of that distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Z-Score

A

How many standard deviations x lies above or below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Density curve

A

A density curve always
remains on or above the horizontal axis
has total area 1 underneath it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describing scatterplots

A

discuss

direction form strength outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

residual plot

A

when the residual plot has no obvious pattern, the linear model is appropriate for the actual data
when the residual plot has an obvious pattern, the linear model is NOT appropriate for the actual data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

S:

A

Standard deviation of the residuals, the typical size of the prediction errors (residuals) when using the regression line, in context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

r^2

A

Coefficient of determination: r^2 percent of the in variation in y is accounted for by the least squares regression line relating x to y, in context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

4 basic principals of experimental design

A

Comparison: Use a design that compares two or more treatments
Random Assignment: Use chance to assign experiment units to treatments. This helps create roughly equivalent groups before treatments are imposed
Control: Keep as many other variables as possible the same for all groups. Control helps avoid confounding and reduces the variation in responses, making it easier to decide whether a treatment is effective.
Replication: Impose each treatment on enough experiment units so that the effects of the treatment can be distinguished from chance differences between the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Statistically significant

A

When an observed difference in responses between the groups in an experiment is too large to be explained by chance variation in the random assignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

scope of inference

A

We can infer about population if individuals taking part in a study were randomly selected from the population
We can infer about cause and effect if a well-designed experiment that randomly assigns experimental units to treatments is used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Law of large numbers/probability

A

the law of large numbers says that the proportion of times that a particular outcome occurs after many repetitions will approach a single number, its probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

independent

A

two events are independent when knowing the occurence of one event does not raise or lower the likeihood of the other event occuring
a and b are independent if P(A|B)=P(A)

17
Q

mean of a discrete random variable

A

also called the expected value. After repeating many trials, the expected value is the average value of the outcome per trial (in context)

18
Q

Standard deviation of a discrete random variable

A

the typical distance that each trial’s outcome is from the mean value in context

19
Q

Mean and standard deviation of a sum of random variables

A

for any two random variables x and y, if s=X+Y
E(s)=Ms=Mx+My
os^2=ox^2+Oy^2, or equivalently Os={Ox^2+Oy^2}
THe sum of independent Normal random variables follows a normal distribution

20
Q

Mean and standard deviation of a difference of random variables

A

For any two random variables X and Y, if S=X+Y
E(S)=Ms=Mx+My
Os^2=Ox^2+Oy^2 or equivalently Od={Ox^2+Oy^2}
The difference of independent normal random variables follows a normal distribution

21
Q

Binomial setting

A
consists of fixed n independent trials of the same chance process, each resulting in a success or a failure, with probability of success p on each trial. The 4 conditions are BINS
Binomial
Independent 
Number of trials fixed
success probability fixed
22
Q

Geometric setting

A

consists of repeated trials of the same chance process in which the probability p of success is the same on each trial, and the goal is to count the number of trials it takes to get one success