CHAPTER 4 Flashcards

1
Q

 The difference between the actual value and the average value.

A

Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the MEASURES OF DISPERSION

A

range
average deviation
variance
standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the MEASURES OF LOCATION

A

Quartiles
Deciles
Percentiles
Midhinge
Interquartile range
Quartile Deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

 The difference of the highest value and the lowest value in the data set.

A

Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

 It is the absolute difference between the element and a given point.

A

Average Deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

 It is a statistical term that provides a good indication of volatility.
 It measures how widely values are dispersed from the average.
 It is calculatedas the square root of variance.

A

Standard Deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

It is the measure of risk

A

Volatility

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

 It is used to approximate or to give a rough estimate of the standard deviation

A

Range Rule of Thumb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

 It is a mathematical expectation of the average squared deviations from the mean.

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

It is the mean of the first (Q1) and third (Q3) quartiles in the data set. It is used to overcome potential problems introduced by extreme values (or outliers) in the data set.

A

Midhinge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A is a measure of statistical dispersion, being equal to the difference between the third and first quartiles.

A

Interquartile Range (IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Interquartile Range is also called

A

midspread
middle fifty

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

It is slightly better measure of absolute dispersion than the range.

It ignores the observation on the tails.

A

Quartile Deviation (QD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The difference samples from a population & calculate their quartile deviations, their values are quite likely to be sufficiently different called

A

sampling fluctuation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

It is calculated from the sample data does not help us to draw any conclusion about the quartile deviation in the population.

A

inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

It is used when one is interested to compare standard deviations of two different units, coefficient of variations can be applied.

A

Coefficient of Variation (CV)

17
Q

is a statistical tool that measures dispersion in a data population that states that no more than 1 / k2 of the distribution’s values will be more than k standard deviations away from the mean.

A

Chebychev’s Theorem

18
Q

Statistical measure used to describe the distribution of observed data around the mean. It measures the relative peakedness or flatness of a distribution (as compared to the normal distribution, which shows a kurtosis of zero)

A

Kurtosis

19
Q

Three Types of Kurtosis

A

Leptokurtic
Mesokurtic
Platykurtic

20
Q

are distributions where values clustered heavily or pile up in the center. (k  0)

A

Leptokurtic

21
Q

are intermediate distribution w/c are neither too peaked nor too flat. (k = 0).

A

Mesokurtic

22
Q

are flat distributions with values more evenly distributed about the center with broad humps and shot tails. (k  0)

A

Platykurtic

23
Q

Measures the general shape of the distribution or the lack of symmetry of a distribution.

Ranges from –3 to +3.

It relates the difference between the mean and the median to the standard deviation.

The direction of the long tail of the distribution points the direction of the skewness.

A

Coefficient of Skewness

24
Q

Data values are evenly distributed.

The distribution is unimodal.

The mean, median, and mode are similar & are at the center of the distribution.

A

Symmetrical

25
Q

Most of the values in the data fall to the left of the mean and group at the lower end of the distribution.

The tail is to the right.

The mean is to the right of the median, and the mode is to the left of the median

A

Positively Skewed (or Right-Skewed).

26
Q

The mass of the data values fall to the right of the mean and group at the upper end of the distribution.

The tail to the left.

The mean is to the left of the median, and the mode is to the right of the median.

A

Negatively Skewed (or Left-Skewed).

27
Q

A data set should be checked for extremely high or extremely low values. These values are called outliers. Outliers can strongly affect the mean and standard deviation of a variable. One method in determining the outliers is when a data value in a data set is less Q1 – 1.5(IQR) or greater than Q3 + 1.5(IQR)

A

Outliers

28
Q

Outliers can strongly affect the mean and standard deviation of a variable.
true or false

A

true

29
Q

Introduced by John Tukey in 1970’s..
It gives the following information:
 If the median is near the center of the box, the distribution is approximately symmetric.
 If the lines are about the same length, the distribution is approximately symmetric.
 If the median falls to the right of the center of the box, the distribution is negatively skewed.
 If the median falls to the right of the center of the box, the distribution is negatively skewed.

A

Boxplot (Box-and-Whisker plot)

30
Q

 If the left line is larger than the right line, the distribution

A

negatively skewed

31
Q

If the right line is larger than the left line, the distribution is

A

positively skewed.

32
Q
A