Summer Vocab Flashcards

1
Q

What is statistics?

A

The study of variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is variability?

A

Differences, how things differ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are two branches of AP stats?

A

Inferential & Descriptive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are descriptive stats?

A

Description of data collected, using pictures or summaries like mean, median, range, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are inferential stats?

A

Using data to get a sample of the bigger picture.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Compare descriptive and inferential stats.

A

Descriptive explains the data that you have, and inferential uses that data to say something about an entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is data?

A

Any collected information or measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a population?

A

The group you’re interested in when using statistics. It could be big, like “all the teenagers in the US”, or small, like “all AP stats students in my school”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a sample?

A

A subset of a population, often used to make inferences about the population. Statistics are calculated using samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Compare a population and a sample.

A

Populations are generally larger, and samples are small subsets of these populations. Samples are taken to make inferences about populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Compare data and statistics.

A

Data is each little bit of info collected from subjects that is summarized using things like mean. If it is a sample, then that mean is called a “statistic”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a parameter?

A

A numerical summary of a population, like its mean, median, or range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Compare data and parameters.

A

Data is summarized by using things like mean. If we have data from each member of a population, then the mean is called a “parameter”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a statistic?

A

A numerical study of a sample using things like mean, median, and range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a census?

A

A sample of the entire population, information is taken from every member of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Does a census make sense?

A

A census is okay for small populations, but impossible if you want to survey “all US teens”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the difference between a parameter and a statistic?

A

Both are a single number summarizing a larger group of numbers. But Parameters come from Populations and Statistics come from Samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the difference between a sample and a census?

A

With a sample, you get info from a small part of the population. In a census, you get info from the entire population. You can get a parameter from a census, but only a statistic from a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are random variables?

A

If you randomly choose people from a list, then their hair color, height, weight, and any other data collected can be considered random variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the difference between quantitative and categorical variables?

A

Quantitative variables are numerical measures, like height or IQ. Categorical are categories, like eye color or music preference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is frequency?

A

How often something comes up.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How are mean, median, and mode positioned in a skewed left histogram?

A

Mean-median-mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How are mean, median and mode positioned in a skewed right histogram?

A

Mode-median-mean.

24
Q

Who chases the tail?

A

The MEAN chases the tail.

25
Q

How do you find relative frequency?

A

Divide frequency by total.

26
Q

What do we sometimes call a categorical variable?

A

Qualitative.

27
Q

What is a quantitative data?

A

The actual numbers gathered from each subject - 211 pounds, 67 beats per minute.

28
Q

What is categorical data?

A

The actual individual category from a subject, like “blue” or “female” or “sophomore”.

29
Q

What is a random sample?

A

When you choose a sample by rolling dice, choosing names from a hat, or other real randomly generated sample. Humans can’t do this well without the help of a calculator, cards, dice, or slips of paper.

30
Q

Data vs Datum

A

Datum is singular, data is plural.

31
Q

What is frequency distribution?

A

A table or chart that shows how often certain values or categories occur in a data set.

32
Q

What is meant by relative frequency?

A

The percent of time something comes up (frequency/total).

33
Q

What is meant by cumulative frequency?

A

Add up the frequencies as you go. Suppose you are selling 25 pieces of candy. You sell 10 the first hour, 5 the second, 3 the third, and 7 the last. So the cumulative frequency would be 10,15,18,25.

34
Q

What is the difference between a bar chart and a histogram?

A

Bar charts are for categorical data (bars don’t touch) and histograms are for quantitative data (bars touch).

35
Q

What is the mean?

A

The old average we used to calculate. It is the balancing point of the histogram.

36
Q

What is the median?

A

The middlest number, splits area in half (always in the position (n+1)/2).

37
Q

What is the mode?

A

The most common, or peaks of the histogram. Often used with categorical data.

38
Q

Why isn’t the mean always used?

A

It is not resilient, it is impacted by skewness and outliers.

39
Q

When is mode often used?

A

With categorical variables. Also tells the number of bumps in a histogram for quantitative data (unimodal, bimodal).

40
Q

What is relative cumulative frequency?

A

It is the added up percentages. Always end at 100%.

41
Q

What is the difference between a population mean and a sample mean?

A

Population mean is the mean of a population, it is a parameter. Sample mean is the mean of a sample, it is a statistic. Sample statistics are used to make inferences about population parameters.

42
Q

What symbols do we use for population mean and sample mean?

A

Mu for population mean and x-bar for sample mean

43
Q

How can you think about the mean and the median to remember the difference when looking at a histogram?

A

Mean is the balancing point of a histogram, median splits the area of the histogram in half.

44
Q

When we say “the average teenager”, are we talking about the mean, median, or mode?

A

It depends. If we are looking at height, it would be the mean. If we are looking at parental income, it would be median. If we are looking at music preference, it would be mode.

45
Q

What is a categorical variable?

A

Qualitative variables are like categories: blonde, listens to hip hop, female, yes, no, etc.

46
Q

What is a quantitative variable?

A

Quantitative variables are numeric, like: height, age, etc.

47
Q

What is the difference between discrete and continuous variables?

A

Discrete can be counted, and are generally integers. Continuous would be something like weight of a mouse.

48
Q

What is the difference between quantitative and categorical data?

A

The data from quantitative variables would be numbers, like: 125, 234, 145, etc. The data from categorical variables would be words, like: yes, no, no, etc.

49
Q

If I take a random sample of 20 hamburgers from Five Guys and count the number of pickles on a bunch of them, and one of them had nine pickles, then the number 9 from that burger would be called_____?

A

a datum, or a data value.

50
Q

If I take a random sample of 29 hamburgers from Five Guys and count the number of pickles on a bunch of them, and the average number of pickles was 9.5, then 9.5 is considered a____?

A

Statistic (it is a summary of a sample).

51
Q

If I take a random sample of 20 hamburgers from Five Guys and count the number of pickles on a bunch of them, and I do this because I want to know the true average number of pickles on a burger at Five Guys, the true average number of pickles is considered a_____?

A

Parameter, a one number summary of a population.

52
Q

If you are tasting soup, then the flavor of each individual thing in the spoon is the______, the entire spoon is a________. The flavor of all that stuff together is like the_________ and you use that to______ about the flavor of the entire pot of soup, which would be the________.

A

Data, sample, statistic, make an inference, parameter.

53
Q

Use the following words in one sentence: population, parameter, census, sample, data, statistics, inference, population of interest.

A

I was curious about a population parameter, but a census was too costly so I decided to choose a sample, collect some data, calculate a statistic, and use that statistic to make an inference about the population parameter (also known as the population of interest).

54
Q

What is a clear example of where the mean would change but median wouldn’t (which would show its resilience)?

A

If an outlier entered the set of data.

55
Q

Compare data-statistic-parameter using a quantitative example.

A

Data are individual measures, like how long a person can hold their breath: “45 sec, 33 sec”. That is raw data. Statistics and parameters are summaries like “the average breath holding time in the sample was 52.4 seconds”, and the parameter being “the average breath holding time in the population was 52.4 seconds”.

56
Q

Compare data-statistic-parameter using a categorical example.

A

Data are individual measures, like meal preference: “taco, taco, pasta”. Statistics and parameters are summaries. A statistic would be “42% of sample preferred tacos” and a parameter would be “42% of population preferred tacos.”

57
Q

We are curious about the average wait time at Dunkin’ Donuts drive-through in your neighborhood. You randomly sample cars one afternoon and find the average wait time is 3.2 minutes. What is the population parameter? What is the statistic? What is the parameter of interest? What is the data?

A

The parameter is the true average wait time at the Dunkin’ Donuts. this is a number you don’t have and will never know. The statistic is 3.2 minutes. It is the average of the data you collected. The parameter of interest is the same thing as the population parameter. In this case it is the true average wait time of all cars. The data is the wait time of each individual car so that would be like 3.8 minutes 2.2 minutes etc. you take that data and find the average, that average is called a statistic, and you use that to make an inference about the true parameter.