Unit 1 - Exploring Data Flashcards

1
Q

What are 2 measures of center?

A

Mean and median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are 3 measures of spread?

A

Standard Deviation, Interquartile Range, Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between quantitative and categorical data?

A

Quantitative data = number data

Categorical data = word data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What graphs are appropriate for categorical data?

A

Pie charts and bar charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What graphs are appropriate for quantitative data?

A

Histograms, box plots, stem plots, dot plots.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which graphs retain individual observations?

A

Stem plots and dot plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

About how many bars should your histogram have?

A

About 5 or 6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When describing a distribution, what 3 things should you include?

A

Shape, center, spread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A distribution that has approximately the same frequency for each data value is ___________.

A

Uniform

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A distribution that has a greater frequency of large values is __________________.

A

Left Skew

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A distribution that has a greater frequency of small values is _________________.

A

Right Skew

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The mean is pulled in the direction of the _______.

A

Skew and outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

In a left skewed distribution, the median is ___________ than the mean.

A

greater than

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In a unimodal and symmetrical distribution, the mean is _________________ the median.

A

about the same as

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define statistic

A

a number that describes a sample (like sample mean, sample median, sample min, etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define parameter

A

a number that describes a population (like population mean, population range, etc)

17
Q

What is a 5 number summary?

A

Min, Q1, Median, Q3, Max

18
Q

What is the formula for calculating the upper boundary for outliers?

A

Q3 + 1.5(IQR)

19
Q

What is the formula for calculating the lower boundary for outliers?

A

Q1 - 1.5(IQR)

20
Q

For what kind of distributions do we prefer to use the median and IQR?

A

Skewed distributions

21
Q

What measure of center and spread do we use if a distribution is approximately symmetrical?

A

Mean and Standard Deviation

22
Q

What does it mean if a statistic is resistant?

A

The statistic is not easily influenced or changed by skew and outliers

23
Q

What does it mean if a statistic is non-resistant?

A

The statistic is easily influenced and changed by skew and outliers

24
Q

Name 2 resistant statistics.

A

Median, IQR

25
Q

Name 2 non-resistant statistics

A

Mean, Standard Deviation

26
Q

The median of a data set of the price of school lunch items is $2.35. Interpret this number.

A

The average price of school lunch items is $2.35.

27
Q

The standard deviation of a data set of the price of school lunch items is $0.48. Interpret this number.

A

0.48 is the average deviation of school lunch prices from the mean school lunch price

28
Q

Which graphs are most appropriate for large data sets, or data with a large range of numbers?

A

Histograms and box plots

29
Q

What is the symbol for sample mean?

A

X-bar

30
Q

What is the symbol for population mean?

A

Mew

31
Q

What is the symbol for sample standard deviation?

A

s

32
Q

What is the symbol for population standard deviation?

A

sigma

33
Q

If you add or subtract the same value to every number in a data set, what happens to the measures of location (mean, median, min, max, percentiles)?

A

They change.

34
Q

If you add or subtract the same value to every number in a data set, what happens to the measures of spread (range, IQR, S.D.)?

A

The stay the same.

35
Q

If you multiply or divide the numbers in a data set, which summary statistics will change.

A

All of them: mean, median, range, IQR, S.D. etc.