AP Stats all concepts Flashcards

1
Q

1.5 IQR rule

A

Low outliers < Q1-1.5 (IQR)
High outliers > Q3+1.5(IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Interpret slope

A

As the (x-variable) increases by 1 (unit), the predicted (y-variable)increases/decreases by (slope units).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Interpret Y-intercept

A

When the (x-variable) is 0 (units), the predicted (y-variable) is (y-intercept units).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interpret the mean:

A

If many, many (context) are randomly selected, the average (context) will be about (mean value).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Interpret standard deviation

A

The (context) typically vary from the (mean value) by about (standard deviation value).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Interpret Confidence Interval (Conclusion):

A

We are (confidence level%) confident that the true (parameter in context) is
between (lower bound and upper bound in context.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interpret Confidence Level:

A

l: If we constructed many, many confidence intervals from random samples of size (n), about
(confidence level%) of the intervals would capture the true (population parameter in context).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interpret the p-value:

A

Assuming the (H0 in context) is true, there is a (p-value%) chance of getting a sample (proportion/mean) of
(sample value) or something more extreme by chance in random samples of size (n).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interpret the power:

A

If the true (population parameter in context) is (Ha), there is a (power%) probability of finding convincing
evidence to reject (H0 in context).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describing/Comparing Distributions

A

Center: compare the median or mean
Shape: describe the distibution, peaks, symmetrical, bimodal, or skewed
Unusual features- gaps ro outliers
Spread- SD or range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Resistance

A

IF extreme values have an impact
median is resistant
mean, range, and SD is not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

mean & Median in roughly symmetric distributions

A

mean=median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mean & median in skewed to the left distributions

A

mean < median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mean & Median in skewed to the right distributions

A

mean > median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

z score formula

A

value- mean/ SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Transforming Data - Addition & Multiplication

A

Addition= center & location are affected (mean, 5# summaries, and percentiles)
Mutiplying- center, location, variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the empirical rule

A

68% of the data falls under 1 standard deviation away from the mean

95% of the data falls under 2 standard deviations away from the mean

99.7% of the data falls under 3 standard deviations away from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explaining correlation

A

There is a (strength- strong or weak, direction- negative or positive, form- linear or curved form) relationship between (context)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define extrapolation

A

using the line for x-values outside for the interval which is not accurate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define residual

A

y-y(hat) = actual value - predicted value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Interpretation- residual

A

(actual y -variable) is less than/ greater than/ equal to than the (y-variable) predicted by the regression line with x=(context of x-variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Standard deviation of regression line interpretation

A

The actual (y-variable) is typically about (standard deviation) away from the (y-variable) predicted by the least-square regression line x= (x-variable).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Coefficient of Determination interpretation (r^2)

A

About (r^)% of the variability in the (y-variable with context) is accounted for by the least squares regression line with x= (x-variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Simple random sampling define

A

size n chosen with equal chance of being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Pros of SRS
unbiased- equal chance and easy
24
Cons of SRS
Issue of undercoverage- one type of factor may have more people
25
Stratified sampling description
Divide the population into groups (strata) based on a similar characteristic, then use an SRS to choose from EACH group
26
Stratified Pros
More precise than an SRS and can be cheaper if the groups are already available
27
Stratified cons
Difficult to divide into groups, more complex than SRS, and must have known population
28
Cluster sampling description
Divide the population into groups (usually by location), randomly select a group and sample everything in THAT group
29
Pros of cluster sampling
Cost is reduced, is unbiased, and don’t need to know entire population
30
Cluster sampling cons
Sample may not be representative of overall population
31
Systematic sampling description
Use a system (every nth number) after choosing randomly where to begin
32
Systematic random sampling pros
Unbiased, the sample is evenly distributed across the population, and don’t need to know entire population
33
Systematic Sampling cons
Large variation and can be affected by trends
34
Bad sampling procedures
Voluntary sampling and convenience sampling, but both are highly unrepresentative
35
Nonresponse bias
not willing to participate in sample
36
response bias
false/incorrect answers
37
Confounding
multiple variables and it is difficult to distinguish which variable has an effect
38
randomized block design description
sample is separated into two groups, then they get randomly assigned and then they are split up and one group gets a treatment and other does not
39
sampling variability define
samples may be different from population
40
matched pairs design
subjects are grouped into pairs and then between those pairs one gets a treatment; in some cases you can give two treatmets
41
Complementary rule for independent events
P(Ac)=1-P(A)
42
define mutually exclusive
no outcomes in common and can never be together
43
Mutually exclusive addition rule
P(A or B)= P(A)+ P(B)
44
General Addition Rule for Dependent Events
P(A or B)= P(A)+P(B)-P(A and B)
45
Conditional Probability equation
P(A|B)= P(A & B)/P(B)= P(both events together)/P(given event)
46
General Multiplication Rule
P(A and B)= P(A) * P(B)
47
What makes a probability model valid?
Probabilities add up to 1 All probabilities are between 0 and 1
48
Binomial Probability Conditions
1. Binary- successes or failures 2. Independent- 3. Number- # of tirals 4. Same probability- for each outcome
49
binompdf
p(x=k)
50
binomialcdf
p(X less than or equal to k)
51
how do you find a binomial/geometric probability when it states at least k
1- binomial P less than or equal to J
52
Defining a Binomial Distribution
Random variable____ has a binomial distribution with n= # and p=#
53
Defining a geometric distribution
Random Variable ____ had a geometric distribution with p= #
54
Checking for independent events
P(A|B)= P(A|Bc)= P(A)
55
Significance Rule
< 5%= statistically significant >5%= not statistically significant
56
Unbiased estimator
statistic of sample= match the population
57
sample variability for sample proportion
SD is larger for values of p closer to 0.5 SD is smaller for values of p closer to 0 or 1
58
Point estimate
adding interval/2
59
Margin of Error
subtracting interval/2 or Subtracting the larger number of the interval by the point estimate
60
Impacts of margin or error on confidence level and sample sizes
Decreasing margin or error= decreasing confidence level Sample sizes increase= reduces margin of Error ME= 1/sqrrt n e.g. n=4 ME= 1/2
61
Type 1 Error
the test finds convincing evidence that Ha is true when it isn't Reject H0 but Ho is true
62
Type 2 error
the test doesn't find convincing evidence that Ha is true when it really is Ha True but fail to reject H0
63
Effect on Type 1 Error and Type 2 error when Significance level is decreased
Type 1 error decreases but Type 2 error increases
64
Effect of Type 1 error and Type 2 when significance level is increased
Type 1 error increases but Type 2 error is decreased
65
what do you do if there is a two sided z test?
multiply the p value by 2
66
What effect does a large power have on the Type 2 error and Type 1 error?
Type 2 error: decreases it Type 1 error: increases
67
What effect does sample size, n, and significance level has on the power?
Directly proportionally N and significance level are large= power is large N and significance level small= power is small
68
What effects does a small power have on the Type 2 error and Type 1 error?
Type 2 error: increases it Type 1 error: decreases it
69