Module 2 Flashcards

1
Q

Population Mean

A

Represented as µ. The population mean applies when the data represent all of the items in a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample Mean

A

Represented as (look in interactive lecture). The sample mean is used when the data consist of just a sample taken from the overall population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Median

A

another measure of location, is the value for which there are as many values above it as below it, when arranged in order from lowest to highest. If the number of data items is odd, the median is the data item in the middle of the list. If the number of data items is even, the median is the average of the two middle data items. A quick way to find the position of the median in the ordered list is to use the equation n+1/2
, where n is the number of data items. This computation will tell you the position in the list where the median is located.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Mode

A

It is the value that occurs with the most significant frequency and the final measure of the location to be discussed. If no data items are repeated, then the data set has no mode. If more than one data value has the highest frequency, it will be multimodal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does Skewness of the data mean?

A

The relationship between the mean and the median of a data set. If the median exceeds the mean, the data are skewed to the left, and if the mean, exceeds the median. the data are skewed to the right. If the mean is equal to the median, the data are symmetrical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Second Quartile (Q2)

A

Is the median of the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

First Quartile (Q1)

A

Is the 25th percentile and can be thought of as the median of the lower half of the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Third Quartile (Q3)

A

Is the 75th percentile and can be thought of as the median of the upper half of the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Five-Number Summary

A

To put everything together, we can create what is called a five-number summary, which summarizes the data using the following familiar measures:

  1. Smallest value
  2. First quartile
  3. Median
  4. Third quartile
  5. Largest value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Interquartile Range (IQR)

A

Q3-Q1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Box Plot

A

Based on this five-number summary, we can create a box plot, which is a convenient way of graphically depicting groups of numerical data through their quartiles. The box plot is a graphical representation of the five-number summary (minimum, first quartile (Q1), median (Q2), third quartile (Q3), and maximum). The five-number summary must be calculated first before you begin the process of constructing a box plot. The interquartile range, IQR = Q3-Q1, is also utilized.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Relative Frequency Distribution

A

If you let the number of samples get very large (say, 300 million or more), the relative frequency table becomes a relative frequency distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Range

A

the difference between the highest and lowest values in a set of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Interquartile Range (IQR)

A

A measure of variability that overcomes the dependency on extreme values. This measure of variability is the difference between the third quartile and the first quartile. In other words, the interquartile range is the range for the middle 50% of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Variance

A

another common measure of dispersion that has a mathematical formula needed for the calculation. If you are finding the sample variance, it is the sum of the squared differences between the n data values and the sample mean divided by (n – 1). The population variance is the sum of the squared differences between the n data values and the population mean divided by N, which is the number of data values in the population ( look up example in lecture)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly