OpenStax - Ch. 2 Descriptive Statistics Flashcards

1
Q

Box plot

A

a graph that gives a quick picture of the middle 50% of the data.

2
Q

First Quartile

A

the value that is the median of the lower half of the ordered data set.

3
Q

Frequency

A

the number of times a value of the data occurs.

4
Q

Frequency Polygon

A

looks like a line graph but uses intervals to display ranges of large amounts of data.

5
Q

Frequency Table

A

a data representation in which grouped data is displayed along with the corresponding frequencies.

6
Q

Histogram

A

a graphical representation in x-y form of the distribution of data in a data set; x represents the data and y represents the frequency, or relative frequency. The graph consists of contiguous rectangles.

7
Q

Interquartile Range

A

also written IQR; the range of the middle 50 percent of the data values. The IQR is found by subtracting the first quartile from the third quartile.
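
The subtraction can be sketched in Python. This is a minimal illustration that assumes the "median of each half" convention from the First Quartile card, leaving the overall median out of both halves when the number of values is odd. The function name `quartiles` and the sample values are made up for the example.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, Q2, Q3) using the median-of-halves convention (an assumption)."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]
    # For an odd count, skip the middle value so it belongs to neither half.
    upper = data[half:] if n % 2 == 0 else data[half + 1:]
    return median(lower), median(data), median(upper)

sample = [1, 2, 4, 5, 7, 8, 9, 11, 12]  # made-up values
q1, q2, q3 = quartiles(sample)
iqr = q3 - q1  # third quartile minus first quartile
print(q1, q2, q3, iqr)
```

Other textbooks and software use slightly different quartile conventions, so results can differ on small data sets.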

8
Q

Interval

A

also called a class interval; an interval represents a range of data and is used when displaying large data sets.

9
Q

Mean

A

a number that measures the central tendency of the data; a common name for mean is ‘average.’

10
Q

Median

A

a number that separates ordered data into halves; half the values are the same number or smaller than the median and half the values are the same number or larger than the median. The median may or may not be part of the data.
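
A short Python sketch of why the median may or may not be part of the data, using the standard library's `statistics.median`. The two lists are made-up values.

```python
from statistics import median

odd_count = [7, 1, 5]       # sorted: 1, 5, 7 -> the middle value is in the data
even_count = [7, 1, 5, 3]   # sorted: 1, 3, 5, 7 -> average of the two middle values

m_odd = median(odd_count)
m_even = median(even_count)

print(m_odd, m_odd in odd_count)     # median is one of the data values
print(m_even, m_even in even_count)  # median falls between two data values
```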

11
Q

Midpoint

A

the mean of an interval in a frequency table.
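
As a small illustration, the midpoint can be computed as the average of an interval's lower and upper boundaries. The helper name `midpoint` and the intervals below are hypothetical.

```python
def midpoint(lower, upper):
    """Average of the lower and upper boundaries of a class interval."""
    return (lower + upper) / 2

intervals = [(0, 10), (10, 20), (20, 30)]  # made-up class intervals
midpoints = [midpoint(lo, hi) for lo, hi in intervals]
print(midpoints)
```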

12
Q

Mode

A

the value that appears most frequently in a set of data.

13
Q

Outlier

A

an observation that does not fit the rest of the data.

14
Q

Paired Data Set

A

two data sets that have a one-to-one relationship, so that both data sets are the same size and each data point in one data set is matched with exactly one point from the other set.

15
Q

Percentile

A

a number that divides ordered data into hundredths; percentiles may or may not be part of the data. The median of the data is the second quartile and the 50th percentile. The first and third quartiles are the 25th and the 75th percentiles, respectively.
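
The link between quartiles and percentiles can be checked with a rough Python sketch. It relies on `statistics.quantiles` from the standard library (Python 3.8 or later) with its default interpolation method, which is only one of several percentile conventions and may not match the textbook's rule on every data set. The sample values are made up.

```python
import math
from statistics import median, quantiles

sample = [1, 2, 4, 5, 7, 8, 9, 11, 12]  # made-up values

percentiles = quantiles(sample, n=100)  # 99 cut points: 1st through 99th percentile
quarters = quantiles(sample, n=4)       # 3 cut points: Q1, Q2, Q3

# The k-th percentile sits at index k - 1 of the percentile list.
p25, p50, p75 = percentiles[24], percentiles[49], percentiles[74]

print(quarters, (p25, p50, p75))
print(math.isclose(p50, median(sample)))  # 50th percentile equals the median
```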

16
Q

Quartiles

A

the numbers that separate the data into quarters; quartiles may or may not be part of the data. The second quartile is the median of the data.

17
Q

Relative Frequency

A

the ratio of the number of times a value of the data occurs in the set of all outcomes to the number of all outcomes.
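
A minimal Python sketch of this ratio, counting each value with `collections.Counter` and dividing by the total number of observations. The function name `relative_frequencies` and the scores are made up for illustration.

```python
from collections import Counter

def relative_frequencies(values):
    """Map each distinct value to (its frequency) / (number of observations)."""
    counts = Counter(values)   # frequency of each value
    total = len(values)        # number of all outcomes
    return {value: count / total for value, count in counts.items()}

scores = [2, 3, 3, 5, 3, 2, 5, 3]  # made-up data
rel = relative_frequencies(scores)
print(rel)
print(sum(rel.values()))  # relative frequencies add up to 1
```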

18
Q

Skewed

A

used to describe data that are not symmetrical;

when the right side of a graph looks “chopped off” compared to the left side, we say the data are “skewed to the left.”

When the left side of the graph looks “chopped off” compared to the right side, we say the data are “skewed to the right.”

Alternatively: when the lower values of the data are more spread out, we say the data are skewed to the left.

When the greater values are more spread out, the data are skewed to the right.

19
Q

Standard Deviation

A

a number that is equal to the square root of the variance and measures how far data values are from their mean; notation: s for sample standard deviation and σ for population standard deviation.

20
Q

Variance

A

mean of the squared deviations from the mean, or the square of the standard deviation;

for a set of data, a deviation can be represented as $x - \bar{x}$, where $x$ is a value of the data and $\bar{x}$ is the sample mean.

The sample variance is equal to the sum of the squares of the deviations divided by the difference of the sample size and one.
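
The definitions on this card and the Standard Deviation card can be sketched in Python as follows. The function names and the data are made up; the population version, which divides by the number of values instead of the sample size minus one, is included only for contrast and is an assumption about how σ would be computed.

```python
import math
import statistics

def sample_variance(values):
    """Sum of squared deviations from the sample mean, divided by n - 1."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)

def population_variance(values):
    """Mean of the squared deviations from the mean (divides by n)."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / n

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up values

s = math.sqrt(sample_variance(data))          # sample standard deviation
sigma = math.sqrt(population_variance(data))  # population standard deviation
print(s, sigma)

# Cross-check against the standard library.
print(math.isclose(sample_variance(data), statistics.variance(data)))
```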