MEASURES OF DISPERSION OR SCATTERDNESS Flashcards

1
Q

What is a measure of variability

A

a number that describes the dispersion or variation in a set of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is are absolute and relative measures of dispersion

A

A measure of dispersion may be either absolute or relative. Absolute measures of dispersion are expressed in the same statistical unit in which the original data are given. e.g km, kg, dollars etc. A measure of relative dispersion is the ratio of a measure of absolute dispersion to an appropriate average. It is sometimes called the coefficient of dispersion, because it is independent of the unit of measurement (it is a pure number).Each absolute measure of dispersion can be converted into its relative measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give examples of absolute and relative measures of dispersion

A

Absolute measures include
• The range
• The interquartile range and quartile deviation
• The mean deviation or average deviation
• The standard deviation

Relative measures include:
• Co-efficient of Range
• Co-efficient of Quartile Deviation
• Co-efficient of mean Deviation
• Co-efficient of Variation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the four basic purposes of measures of dispersion

A
  1. To determine the reliability of an average
  2. To serve as a basis for control of variability
  3. To compare two or more sets of observations with regard to their variability
  4. To facilitate the use of other statistical measures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the properties of a good measure of variation

A
  1. Be simple to understand
  2. Be easy to compute
  3. Be rigidly defined
  4. Be based on each and every item of the distribution
  5. Be amenable to further algebraic treatment
  6. Have sampling stability
  7. Not be unduly affected by extreme items
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is range and how is it calculated

A

This the simplest measure of variability. The range is the difference between the value of the largest item and the smallest item in the set of observations.
Range=Largest item-Smallest item.=L-S

For grouped data the range can be found using two methods.
The first is to find the difference between the upper limit of the highest class and the lower limit of the lowest class. The second method is to find the difference between the midpoint of the highest class and the midpoint of the lowest class.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the coefficient of range

A

The relative measure corresponding to range , called the coefficient of range is obtained by applying the formula *see notes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the merits, limitations and uses of range

A

Merits of the range:
1. It is the simplest to understand and compute compared to other measures of dispersion.

Limitations:

  1. It is not based on every item of the distribution.
  2. It is subject to fluctuations of considerable magnitude from sample to sample.
  3. It cannot tell us anything about the distribution within the two extreme observations.
  4. It cannot be computed for open ended distributions

Uses of the range

  1. Quality control to ensure the difference between the largest and smallest of mass produced items does not exceed a certain value
  2. Fluctuations in share prices
  3. Weather forecasts e.g difference between maximum and minimum temperature
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Briefly describe interquartile range and quartile deviation

A

The interquartile range=Q3 − Q1
It is reduced to the form of the semi-interquartile range or quartile deviation by dividing it by 2
* See notes for formula .The quartile deviation gives the the average amount by which
by which the two quartiles differ from the median. In a symmetrical distribution the two quartiles
are equidistant from the median. A very small value of quartile deviation indicates that the variation of the middle 50% items is small, likewise a high quartile deviation means variation of
the middle 50% items is large.
The relative measure corresponding to quartile deviation is the coefficient of quartile deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the coefficient of Quartile Deviation

A

*See notes for formula

Coefficient of quartile deviation can be used to compare the degree of variation of different distributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the merits and demerits of quartile deviation

A

Merits of Quartile Deviation

  1. In certain respects it is superior to the range as a measure of dispersion.
  2. Can be used in open ended distributions.
  3. Useful in erratic or badly skewed data

Demerits of Quartile deviation

  1. It does not depend on each and every item in the data set. |It ignores the top 25% and the bottom 25% of items in a distribution.
  2. It is not capable of mathematical manipulation
  3. Its value is affected by sampling fluctuations
  4. It is more of a positional average rather than a measure of dispersion as it does not show scatter around an average.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Briefly describe average deviation or mean deviation

A

This is the average of the absolute values of deviations from the mean. *See notes for formula

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the coefficient of mean deviation

A

The relative measure corresponding to mean deviation is the coefficient of mean deviation
Coefficient of Mean deviation= mean deviation/mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the merits and demerits of average deviation or mean deviation

A

Merits
1. It is simple to understand and easy to compute
2. It is based on each and every item of the data
3. It is less affected by extreme values
Limitations
1. Ignoring of the signs makes the method non algebraic
2. It is not capable of further algebraic treatment.
3. It may not give very accurate results because mean deviation gives best results when deviations are taken from the median, but the median is not a satisfactory measure when variability is high in the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is variance and what are the types

A

Variance is the arithmetic mean of the square of the deviations from the mean. Population and sample variance
*See notes for formula

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the primary use of sample statistics

A

The primary use of sample statistics like Sample variance is to estimate population parameters like Population variance

17
Q

Why is n-1 used in the denominator for sample variance.

A

using n would under estimate the population variance. Dividing by n-1 gives a slightly larger value and an unbiased estimate of the population variance.

18
Q

What is standard deviation

A

The standard deviation is the square root of variance

19
Q

What is coefficient of variation and what is it used for

A

Standard deviation is an absolute measure of variation. The corresponding relative measure of variation is the coefficient of variation. It is used when making comparisons of variability of two or more data sets. The data set with a higher coefficient of variation has more variation and that with a lower value has less variation or is more consistent/uniform/homogeneous.
*See notes for formula

20
Q

What are the merits and demerits of SD

A
  1. It is the best measure of variation. It is based on every item of the data set
  2. It is amenable to further algebraic treatment. It is possible to calculate the combined standard deviation of two or more groups.
  3. It is used in further statistical work e.g in computing skewness, correlation etc

Limitations of Standard deviation

  1. It is difficult to compute
  2. It gives more weight to extreme items and less to those which are near the mean.