Descriptive Statistics Flashcards

1
Q

Box plot

A

A graph that gives a quick picture of the middle 50% of the data. To graph a box plot, calculate: minimum, Q1, median, Q3, and maximum.
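
As a rough illustration, the five values needed for a box plot can be computed as below. This is only a sketch: the helper name `five_number_summary` and the sample data are made up for the example, and it assumes the median-of-halves rule for Q1 and Q3 (leaving the median itself out of both halves when the count is odd). Other quartile conventions give slightly different values.

```python
from statistics import median

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for at least two values."""
    data = sorted(values)
    n = len(data)
    lower = data[: n // 2]           # lower half of the ordered data
    upper = data[n // 2 + n % 2 :]   # upper half (skips the middle value if n is odd)
    return (data[0], median(lower), median(data), median(upper), data[-1])

# Hypothetical sample data
summary = five_number_summary([1, 3, 4, 6, 7, 9, 12, 15])
print(summary)  # (1, 3.5, 6.5, 10.5, 15)
```

The box spans Q1 to Q3, so in this example the middle 50% of the data lies between 3.5 and 10.5.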

2
Q

First Quartile

A

The value that is the median of the lower half of the ordered data set. Also called Q1 or the 25th percentile.

3
Q

Frequency

A

The number of times a value of the data occurs.

4
Q

Frequency Polygon

A

A line graph that plots the frequency of each interval at the interval's midpoint and connects the points; useful for displaying large data sets in which values repeat.

5
Q

Frequency Table

A

A data representation in which grouped data is displayed along with the corresponding frequencies.
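
One possible way to build a grouped frequency table is sketched below. The function name `frequency_table`, the interval width of 10, and the data are assumptions chosen for illustration. Each interval is treated as including its lower bound and excluding its upper bound, and intervals with no values are simply left out.

```python
from collections import Counter

def frequency_table(values, width):
    """Map each interval (lower, upper) to the number of values falling in it."""
    # Floor each value to the lower bound of its interval, then count.
    counts = Counter((x // width) * width for x in values)
    return {(lower, lower + width): counts[lower] for lower in sorted(counts)}

# Hypothetical sample data
table = frequency_table([3, 7, 12, 15, 18, 21, 25, 29], 10)
for (lower, upper), count in table.items():
    print(f"{lower}-{upper}: {count}")
```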

6
Q

Histogram

A

A graphical representation of the distribution of data, with the x-axis showing the data values grouped into intervals and the y-axis showing the frequency of each interval. Used for large, continuous, quantitative datasets.

7
Q

Time Series Graph

A

Helpful when viewing large amounts of data for one variable over time. Time on x-axis, variable values on y-axis.

8
Q

Interquartile Range (IQR)

A

The range of the middle 50% of the data values, found by subtracting Q1 from Q3 (IQR = Q3 - Q1). Used to detect outliers: a value is a potential outlier if it is greater than Q3 + 1.5(IQR) or less than Q1 - 1.5(IQR).
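
The IQR rule can be sketched as follows. The helper name `iqr_fences` and the data are hypothetical, and the quartiles are assumed to be the medians of the lower and upper halves of the ordered data, which is one of several conventions.

```python
from statistics import median

def iqr_fences(values):
    """Return the (lower, upper) cutoffs Q1 - 1.5*IQR and Q3 + 1.5*IQR."""
    data = sorted(values)
    n = len(data)
    q1 = median(data[: n // 2])            # median of the lower half
    q3 = median(data[n // 2 + n % 2 :])    # median of the upper half
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Hypothetical sample data with one unusually large value
data = [2, 4, 5, 6, 7, 8, 9, 30]
low, high = iqr_fences(data)
outliers = [x for x in data if x < low or x > high]
print(low, high, outliers)  # -1.5 14.5 [30]
```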

9
Q

Interval

A

Also called a class interval; represents a range of data and is used when displaying large datasets.

10
Q

Mean

A

A number measuring central tendency, also called ‘average’. For sample: x̄ = sum/n; for population: μ = sum/N.

11
Q

Median

A

The middle value in ordered data; half the values are below and half above. Preferred when data has outliers.
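
A small sketch of why the median is often preferred when outliers are present. The data set is made up for illustration: one extreme value pulls the mean well away from the bulk of the data, while the median barely moves.

```python
from statistics import median

# Hypothetical sample data; 100 is an outlier
data = [10, 12, 13, 15, 100]

mean_value = sum(data) / len(data)   # sum of the values divided by n
median_value = median(data)          # middle value of the ordered data

print(mean_value)    # 30.0
print(median_value)  # 13
```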

12
Q

Midpoint

A

The mean of the endpoints of an interval in a frequency table, calculated as (lower boundary + upper boundary)/2.

13
Q

Mode

A

The value that appears most frequently in a data set.
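
As a sketch, the mode can be found with Python's standard `statistics.multimode`, which returns every value tied for the highest frequency. The sample data is hypothetical and is chosen so that two values tie.

```python
from statistics import multimode

# Hypothetical sample data in which 2 and 3 each appear twice
data = [1, 2, 2, 3, 3, 4]

modes = multimode(data)  # all values sharing the highest frequency
print(modes)  # [2, 3]
```

A data set can therefore have more than one mode.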

14
Q

Outlier

A

An observation that does not fit the rest of the data. Can be detected using the IQR rule.

15
Q

Paired Data Set

A

Two data sets of the same size in which each point in one is matched with a point in the other.

16
Q

Percentile

A

Divides ordered data into hundredths. For example, the 50th percentile is the median.

17
Q

Quartiles

A

Values dividing the data into quarters. Q1 = 25th, Q2 = 50th (median), Q3 = 75th percentile.

18
Q

Relative Frequency

A

The ratio of the number of times a value occurs to the total number of outcomes.
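
A minimal sketch of computing frequencies and relative frequencies, using made-up survey responses. Each relative frequency is the count for a value divided by the total number of outcomes, so the relative frequencies add up to 1.

```python
from collections import Counter

# Hypothetical sample data
responses = ["A", "B", "A", "C", "A", "B"]

frequency = Counter(responses)       # number of times each value occurs
total = sum(frequency.values())      # total number of outcomes
relative_frequency = {value: count / total for value, count in frequency.items()}

print(frequency["A"], relative_frequency["A"])  # 3 0.5
```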

19
Q

Skewed

A

Describes asymmetry in data. Skewed left: the lower values are more spread out, so the tail extends to the left. Skewed right: the higher values are more spread out, so the tail extends to the right.

20
Q

Standard Deviation

A

Measures spread of data from the mean. Sample: s, Population: σ. Used for comparison of values to the mean.
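
The sketch below computes both versions with Python's standard `statistics` module and then expresses one value's distance from the mean in standard deviations. The data is hypothetical, and treating it as a whole population for the comparison step is an assumption made only for the example.

```python
from statistics import mean, pstdev, stdev

# Hypothetical sample data
data = [2, 4, 4, 4, 5, 5, 7, 9]

center = mean(data)
sigma = pstdev(data)  # population standard deviation (divides by N)
s = stdev(data)       # sample standard deviation (divides by n - 1)

# How many standard deviations the value 9 lies above the mean
distance = (9 - center) / sigma
print(sigma, distance)  # 2.0 2.0
```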

21
Q

Variance

A

The mean of the squared deviations from the mean; the square of the standard deviation. Population variance: sum of squared deviations/N. Sample variance: sum of squared deviations/(n-1).
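
Written out step by step, the calculation might look like this. The data is made up; the only difference between the population and sample versions is the divisor.

```python
# Hypothetical sample data
data = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(data)
mean_value = sum(data) / n

# Sum of the squared deviations from the mean
sum_of_squares = sum((x - mean_value) ** 2 for x in data)

population_variance = sum_of_squares / n      # divide by N
sample_variance = sum_of_squares / (n - 1)    # divide by n - 1

print(population_variance)  # 4.0
```

Taking the square root of either variance gives the corresponding standard deviation.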

22
Q

Skewness & Center

A

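In skewed distributions the mean, median, and mode differ, and the mean is pulled toward the longer tail: typically mean > median when skewed right and mean < median when skewed left. In symmetric data, mean ≈ median ≈ mode.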
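
A small made-up data set with a long right tail shows the pattern that is typical (though not guaranteed) for right-skewed data: the mean is the largest of the three measures and the mode the smallest.

```python
from statistics import median, mode

# Hypothetical right-skewed data: most values are small, one is large
data = [1, 2, 2, 2, 3, 3, 4, 5, 14]

mean_value = sum(data) / len(data)
median_value = median(data)
mode_value = mode(data)

print(mean_value, median_value, mode_value)  # 4.0 3 2
```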

23
Q

Approximating the Mean

A

When data is grouped, estimate the mean by using midpoints multiplied by frequencies, then divide by total frequency.
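
The estimate can be sketched as follows. The intervals and frequencies are invented for the example; each interval's midpoint stands in for all the values in that interval.

```python
# Hypothetical grouped data: ((lower boundary, upper boundary), frequency)
classes = [((0, 10), 3), ((10, 20), 5), ((20, 30), 2)]

total_frequency = sum(freq for _, freq in classes)

# Multiply each midpoint by its frequency and add the results
weighted_sum = sum((lower + upper) / 2 * freq for (lower, upper), freq in classes)

approx_mean = weighted_sum / total_frequency
print(approx_mean)  # 14.0
```

Here the midpoints are 5, 15, and 25, so the estimate is (5·3 + 15·5 + 25·2)/10 = 14.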