Summer Stats Flashcards

1
Q

What is variability?

A

Differences? How things differ. There is variability everywhere. We all look different, like different things, act differently. Statisticians look at these differences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are 2 branches of AP Stats?

A

Inferential and Discriptive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are descriptive statistics?

A

Tell me what you got! Describe your data that you collected through pictures and summaries like mean, median, standard deviation, range, etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are inferential statistics?

A

Looking at your data (sample) and making an educated guess about the whole. It’s like tasting soup - a little bit (sample) can tell you a lot about the whole pot (population).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Compare descriptive and inferential statistics

A

Descriptive statistics tells you about the data that you have and inference uses that data to draw conclusions about the populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is data?

A

A collection of values or categories measured. You may count the number of words someone can type per minute like 30, 57, 25, or you may collect yes or no answers to a question posed to a group of people.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a population?

A

The entire group you’re interested in = all. It can be massive, such as “all teenagers in the US” or it can be small such as “all juniors in our high school”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a sample?

A

A small part of the population = some. We use the sample to learn about the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a statistic?

A

A value that characterizes a sample: sample mean, sample proportion, sample median, sample range, etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a parameter?

A

A value that characterizes a population: population mean, population proportion, population median, population range, etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Compare data to statistics

A

Data is the individual values we measure on each member of our sample, the little things we collect, like “your GPA”, while statistics are numerical summaries (mean, median, etc) of that data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Compare data to parameters

A

Data is the individual values we measure on each member of our sample, the little things we collect, like “how tall you are”, while parameters are summaries (mean, median, etc) of the whole population. In other words, we collect data (i.e. we take a sample), we calculate summaries (statistics) that we then use to estimate the values of the parameters which represent the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

We are interested in the average wait time at Wendy’s drive through in our neighborhood. You randomly sample cars at Wendys one afternoon and find that the average wait time is 2.7 minutes. What is the sample? What is the population? What is the parameter? What is the statistic? What is the data?

A

The sample is formed by the cars you “sampled”. The population is formed by all cars driving through at Wendy’s ever and forever, all of them. The parameter of interest is the average wait time for all cars at this Wendy’s. You will NEVER know this number. The statistic is the sample mean of 2.7 minutes wait time. We use this 2.7 to estimate and make inferences about the whole population. The data is formed by all the individual times you collected: 3.5 minutes, 1.9 minutes, 5 minutes, and so on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a census?

A

Data collected on all individuals of the population - we do it every 10 years. Can we collect data from ALL individuals?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Does a census make sense?

A

For a small population, like our school, yes, but not if we want to collect data on all girls in the US

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the difference between parameters and statistics?

A

Although both are single numbers summarizing a large group of numbers, pppp parameters come from pppp populations and ssss statistics come from ssss samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

If I ask 25 random students how many pets they have and one of them says 7, then the number 7 is?

A

a datum, or a data value, or a data point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

If I ask 25 random students how many pets they have and the average number of pets is 7 pets, then 7 is____

A

a statistic (summary of the sample)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

If I ask 25 random students what how many pets they have and I do this because I want to know the true average number of pets the students in our school have, the true average number of pets is considered ______

A

a parameter, a one number summary of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the difference between a sample and a census? ?

A

With a sample you get information from a small part of the population. In a census, you get info from the entire population. You can get a parameter from a census, but only a statistic from a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Use the following words in one sentence: population, parameter, census, sample data, statistics, inference, population of interest.

A

I was curious about a population parameter, but a census was too costly so I decided to choose a sample, collect some data, calculate a statistic and use that statistic to make an inference about the population of interest.

22
Q

If you are tasting soup, then the flavor of each individual thing in the spoon is the ________, the entire spoon is a ______. The flavor of all of that stuff together is like the _______and you use that to _________about the flavor of the entire pot of soup, which would be the ________

A

If you are tasting soup, then the flavor of each individual thin in the spoon is DATA, the entire spoon is a SAMPLE. The flavor of all of that stuff together is like the STATISTIC, and you use that to MAKE AN INFERENCE about the flavor of the entire soup, which would be the PARAMETER. Notice you are interested in the parameter to begin with…..that is why you take a sample.

23
Q

What are random variables?

A

If you randomly choose people from a list, then their hair color, height, weight, and any other data collected from them can be considered random variables.

24
Q

What is the difference between quantitative and categorical variables?

A

Quantitative variables are numerical measures, like height and IQ. Categorical are categories, like eye color and music preference.

25
Q

What is the difference between quantitative and categorical data?

A

The data is the actual gathered measurements. If it’s eye color, then the data would look like “blue, green, green, brown, brown, blue, blue…etc” - words for categorical. If it’s weight, then the data would look like “100, 156, 238, 167, 155, …etc’ - numbers for quantitative.

26
Q

What is the difference between discrete and continuous variables?

A

Discrete can be counted, like “number of pets a student has” - they are generally integers (except shoe size), while continuous can take any real number value such as the time it takes to complete an obstacle course, like 5.324 minutes

27
Q

What is a quantitative variable?

A

Quantitative variables are numeric like: height, age, number of M&Ms, SAT
score

28
Q

What is a categorical variable?

A

Categorical variables are like categories: Blonde, Listens to Hip Hop, Female, yes,
no … etc.

29
Q

What do we sometimes call a categorical variable?

A

Qualitative

30
Q

What do we sometimes call a quantitative variable?

A

Numerical

31
Q

What is quantitative data?

A

The actual numbers gathered from each subject. 211 pounds. 67 beats per
minute.

32
Q

What is categorical data?

A

The actual individual category from a subject, like “blue” or “female” or
“sophomore”

33
Q

What is a random sample?

A

When you choose a sample by rolling dice, choosing names from a hat, or other REAL RANDOMLY generated sample. Humans can’t really do this well without the help of a calculator, cards, dice, or slips of paper.

34
Q

What is frequency?

A

How often something comes up

35
Q

Data or datum?

A

datum is singular .. Like “hey dude, come see this datum I got from this rat!” data is the plural .. “hey look at all these data Edgar got from those chipmunks over there!!”

36
Q

What is a frequency distribution?

A

A table, or a chart, that shows how often certain values or categories occur in a data set.

37
Q

What is meant by relative

frequency?

A

The PERCENT of time something comes up

38
Q

How do you find relative frequency?

A

just divide frequency by TOTAL. … frequency relative to the whole

39
Q

What is meant by cumulative frequency?

A

ADD up the frequencies as you go. Suppose you are selling 25 pieces of candy. You sell 10 the first hour, 5 the second, 3 the third and 7 in the last hour, the
cumulative frequency would be 10 for the first hour, 15 for the first 2 hours, 18 for the first 3 hours, 25 for all 4 hours….so eventually, you’d cumulate the whole data set

40
Q

Make a guess as to what relative cumulative frequency is …

A

It is the ADDED up PERCENTAGES .. An example is selling candy, 25 pieces sold overall … , with 10 the first hour, 5 the second, 3 the third, and 7 the fourth hour, we’d take the cumulative frequencies, 10, 15, 18 and 25 and divide by the total giving cumulative percentages … .40, .60, .64, and 1.00. Relative cumulative
frequencies always end at 100 percent.

41
Q

What is the difference between a bar chart and a histogram

A

bar charts are for categorical data (bars don’t touch) and histograms are for quantitative data (bars touch)

42
Q

What is the mean?

A

It’s what your parents/grandparents would call average. It is the balancing point of the histogram

43
Q

What is the difference between a population mean and a sample

A

Population mean is the mean of a population, it is a parameter; sample mean is a mean of a sample, so it is a statistic. We use sample statistics to make
inferences about population parameters.

44
Q

What symbols do we use for population mean and sample mean?

A

Mu (µ) for population mean (parameter), x-bar for sample mean (statistic)

45
Q

How can you think about the mean and median to remember the difference when looking at a histogram?

A

Mean is balancing point of histogram, median splits the area of the histogram in half.

46
Q

What is the median?

A

The number in the middle after ordering the data; it splits area in half (always in the position (n+1/2)

47
Q

What is the mode?

A

With categorical variables. For instance, to describe the average teenagers
preference, we often speak of what “most” students chose, which is the mode. It is also tells the number of bumps in a histogram for quantitative data (unimodal, bimodal, etc … ).

48
Q

When we say “the average teenager” are we talking about mean, median or mode?

A

It depends, if we are talking height, it might be the mean, if we are talking about parental income, we’d probably use the median, if we were talking about music preference, we’d probably use the mode to talk about the average teenager.

49
Q

what is a clear example of where the mean would change but median
wouldn’t? (this would show its resilience)

A

Imagine if we asked eight people how much money they had in their wallet. We
found they had {1, 2, 2, 5, 5, 8, 8, 9}. The mean of this set is 5, and the median is also 5. You might say “the average person in this group had 5 bucks.” But imagine if one of them just got back from the casino, and instead it was ( 1, 2, 2, 5, 5, 8, 8, 9000}, in this case, the median would still be 5, but the mean goes up to over
1000. Which number better describes the amount of money the average person in the group carries, 5 bucks or 1000 bucks? I think 5 is a better description of the
average person in this group and the 9000 is simply an outlier.

50
Q

How are mean, median and mode positioned in a skewed left histogram?

A

It goes in order from left to right: Mean-Median-Mode

51
Q

How are mean, median and mode positioned in a skewed right histogram?

A

It goes in order from left to right: Mode-Median-Mean

52
Q

Who chases the tail?

A

The mean chases the tail, the mean chases the tail, high-ho the derry-oh the mean chases the tail. .. and outliers …….