AP Stats Flashcards

1
Q

categorical variable

A

labels that place each individual into a particular group
ex: race, sex, age group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

quantitative

A

takes number values that are quantities, counts, or measurements
ex: height, weight, cost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

quantitative discrete

A

a fixed set of possible values
ex: how many green marbles you draw out of a bag

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

quantitative continuous

A

any value in an interval on the number line
ex: time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

relative frequency table

A

shows the proportion or percent of individuals having each value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

two-way table

A

a table of counts that summarizes data on the relationship between two categorical variables for some group of individuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

categorical graphs

A
  • Pie Charts
  • Pictographs
  • Dot-plots
  • Bar Graphs
  • Side-by-side Bar Graphs
  • Segmented Bar Graphs
  • Mosaic Plots
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Association

A

if knowing the value of one variable
helps us predict the value of the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Simpson’s Paradox

A

a contradiction between what we see when looking at individual
categories and the subtotals for our distributions when dealing with categorical
variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Quantitative Graphs

A
  • Dot-plots
  • Stem-plots
  • Histogram
  • Boxplots
  • Ogives
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how we describe distributions

A

CUSS + BS
-C: center
-U: unusual outliers
-S: spread
-S: shape
-BS: be specific

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

mean

A

average of all individual data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Statistic

A

a number that describes some characteristic of a sample
-ex: asking 20 random people their height and averaging results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Parameter

A

a number that describes some characteristic of a population
-ex: asking everyone in the population
their height and averaging results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Range

A

difference between the maximum value
and the minimum value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Standard Deviation

A

the typical or average distance of the values in a distribution from the _mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Interquartile Range (IQR)

A

IQR = Q3 – Q1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

lower Outlier Test

A

Lower Outliers < Q1 – 1.5(IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

higher outlier test

A

Higher Outliers > Q3 + 1.5(IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

response variable

A

measures an outcome of a study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

explanatory variable

A

may help predictor explain changes in a
response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

population

A

the entire group we want to know about

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

census

A

collects data from every individual in a population

23
Q

sample

A

a subset of individuals in the population

24
population parameter
ex: google says that 79% of people everywhere have a dog *it's from everyone
25
sample statistic
student asks 100 people if they have a dog and 70% say yes *it's from a small sample
26
convenience sampling
selects individuals who are easy to reach
27
voluntary response sampling
allows people to respond if they want to
28
SUDS
*Used for describing data -S: strength (strong, moderate, weak) -U: unusual values (outliers) -D: direction (positive, negative) -S: Shape (bell, Bimodal, skewed, uniform)
29
correlation(r)
measures the direction and strength: -strong: close to 1 or -1 -does not imply causation -does not measure form -only for linear relationships
30
Regression Line(LSRL)
line that models how a response variable y changes as an an explanatory variable x changes ŷ=a+bx
31
ŷ=a+bx
ŷ: predicted y a: y-intercept(a=ȳ -bx̄) b: slope(b= r sy/sx) x: x variable
32
extrapolation
uses the regression to predict a value outside of the interval
33
residual
actual y - predicted y OR (y-ŷ)
34
least squared regression line(LSRL)
the sum of the squared residuals as small as possible
35
coefficient of determination(r²)
n% of variability in y can be explained by the linear model
36
stratified random sampling
selects a sample by choosing an SRS from each group and combining the SRSs into one big sample
37
cluster sampling
selects a sample by randomly choosing clusters and including each member of selected clusters in the sample
38
systemic random sampling
selects a sample from an ordered arrangement of the population by randomly selecting one of the first k individuals and choosing every kth after that
39
under-coverage
when some members of the population are less likely to be chosen or can't be chosen for a sample
40
non-response
when an individual chosen for tyhe sample cannot be contacted or refuses to answer
41
response bias
when there is a systematic pattern of inaccurate answers
42
observational
observes results without trying to influence them
43
experiment
deliberately and randomly imposes treatments to measure responses
44
confounding
when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other
45
principles of experimental design
1.) how many in each 2.) randomization 3.) repeats? 4.) rest/stop 5.) assign to treatment groups
45
match pair design
a common experimental design for comparing two treatments that use blocks of 2
45
block
a group of experimental units that are known before the experiment to be similar in some way in terms of response to treatment
46
random selection
allows inference about the population from which the individuals were chosen
47
random assignment
allows for inference about cause and effect
48
law of large numbers
if we observe more and more trials of any random process, the proportion of times that a specific outcome occurs approaches its probability
49
mutually exclusive
when you have one, you CANT have the other
50
categorical conditions
☑ randomization ☑ independence OR 10% rule ☑ np rules
51
quantitative conditions
☑ randomization ☑ independence OR 10 % rule ☑ central limits theorem: n>30
52