MEASURES OF CENTRAL TENDENCY Flashcards

1
Q

What is a measure of central tendency

A

A measure of central tendency is a number that locates the approximate center of a distribution of data. The purpose of a measure of central tendency is to locate the “average” or “typical” case in a distribution of cases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define the term ‘average’ in statistics

A

that value of a distribution which is considered as the most representative or typical value for a group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give examples of measures of central tendency

A

The most commonly used measures of central tendency or averages are the mean, mode and median. Other types of averages include the weighted arithmetic mean, trimmed mean, geometric mean and harmonic mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two main objectives of averaging

A
  1. To get a single value that describes the characteristics of the entire group
  2. To facilitate comparison between groups.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are Properties of a Good (Average)Measure of Central tendency

A
  1. I t should be easy to understand. It should be readily understood otherwise its use will be limited.
  2. It should be simple to compute. It is important to note though that ease of computation should not be at the expense of other advantages.
  3. It should be based on all items. It should depend on each and every item of the data set so that if any item is dropped its value is altered.
  4. It should not be unduly affected by extreme items. If one or two very small or very large items unduly affect the average then it cannot be typical of the entire data set. Extremes may distort its value and reduce its usefulness
  5. It should be rigidly defined. An average should be properly defined preferably by an algebraic formula so that it has only one interpretation. Different people computing it from the same figures should get the same answer.
  6. It should be capable of further algebraic treatment. It can be used for further statistical computations to enhance its usefulness..
  7. It should have sampling stability. If we pick different samples from a population and compute the average for each of them we should expect to get approximately the same value.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is mean, sample mean and population mean and how are they calculated

A
  • See notes for formulas
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the properties of the mean

A
  1. It is the point in a distribution of measurements or scores about which the sum of the
    deviations are equal to zero.
    The mean is therefore characterized as a point of balance. It is the value that balances all scores on either side of it. *See notes
  2. The sum of the square of the deviations of the items from the arithmetic mean is minimum, that is it is less than the sum of squared deviations of items from any other value. This property is of immense use in regression analysis which is topic to be covered later.
  3. If each item in a series is replaced by the mean, then the sum of these substitutions will be equal to the sum of the individual items.
  4. Using the arithmetic mean and number of items of two or more related groups we can compute the combined mean
    * See notes for further explanations and formulas
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the merits of the arithmetic mean

A
  1. It is the simplest average to understand and easiest to compute
  2. Its computation is based on all items of the series
  3. It is rigidly defined. Every one computing the arithmetic mean for the same data set will get the same answer.
  4. It lends itself to further algebraic treatment.
  5. It is relatively reliable in the sense that it does not vary too much when repeated samples are taken from one and the same population.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the limitations of arithmetic mean

A
  1. The mean is very sensitive to extreme values when these are not evenly dispersed on both sides of it. E.g Comparing two data sets 2,3,5,7,8 and 2,3,5,7,33. The mean for the first one is 5, while that for the second one is 10. The large score of 33 in the second group makes the mean of the second group double. When a distribution is markedly skewed the mean provides a misleading measure of central tendency. The mean provides a an appropriate ‘average’ , only when the distribution of a variable is reasonably normal(bell shaped) Income is a commonly studied variable in which the median is preferred over the mean , since the distribution is distinctly skewed in the direction of high incomes.
  2. The mean cannot be computed in a distribution with open ended classes without making assumptions regarding the size of the class interval of the open ended classes which may lead to substantial errors.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the median?

A

The median is the half way point in a data set. It is the point at which 50% of the values in the data set have a value the size of the median value or smaller and 50% of the values have a value the size of the median value or larger.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you calculate median?

A

*See notes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the properties and merits of the median

A

Properties of the median:
It is insensitive to extreme scores

Merits

  1. Useful incase of open ended classes
  2. Not influenced by extreme scores and therefore is preferred to the arithmetic mean in skewed distributions such as income
  3. It is most appropriate when dealing with qualitative data i.e where ranks are given or there are other types of items that are not counted or measured but are scored.
  4. The median can be determined graphically
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the limitations of the median

A

1.Its computation does not involve all items in the series
2.It is not capable of further algebraic treatment e.g. we cannot find the combined median of
two data sets.
3. Its value is affected more by sampling fluctuations than the value of the arithmetic mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you calculate mode?

A

*See notes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the different types of means

A

Distributions that have only one mode are referred to as unimodal. Those with two, three or more mode are referred to as bimodal, trimodal or multimodal respectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you locate the mode graphically

A
  1. Draw a histogram
  2. Draw two lines diagonally on the inside of the modal class bar, starting form each upper corner of the bar to the upper corner of the adjacent bar.
  3. Draw a perpendicular line from the intersection of the two diagonal lines to the horizontal axis which gives us the modal value.IN the histogram below the reading at P gives the modal value.
17
Q

What are the merits of the mode

A
  1. It is not affected by extreme values. It is the most meaningful measure of central tendency in the case of highly skewed or non normal distributions.
  2. It can be determined for open ended distributions.
  3. Can be determined graphically unlike the mean.
  4. Can be used to describe qualitative data particularly nominal data. Useful in comparison of consumer preferences ice cream flavors, soap etc.
  5. It is the most typical or representative value of a distribution. If the modal wage in a company is Ksh 30,000 then more workers in that company receive Ksh 30,000 than any other wage.
18
Q

What are the limitations of the mode

A
  1. The value of the mode can not always be determined. The distribution can be bimodal, trimodal or multimodal.
  2. It does not lend itself to algebraic manipulation.
  3. It is not based on each and every item of the series.
  4. It is not rigidly defined. There are several formulae for calculating the mode all of which usually give somewhat different answers.
19
Q

Describe using graphs the relationship between mean, mode and median for symmetrical and asymmetrical distributions

A

In a symmetrical distribution the values of mean, mode and median coincide. If the mean, mode and median are not equal the distribution is known as asymmetrical or skewed
In a distribution that is skewed to the right or positively skewed the mean is higher than the median. In a distribution that is negatively skewed the mean is lower than the median

20
Q

Describe the relationship of mean and skewness

A

The mean is the balance point of the distribution. Because points further away from the balance point change the center of balance, the mean is pulled in the direction the distribution is skewed.
For example, if the distribution is positively skewed, the mean would be pulled in the direction of the skewness, or be pulled toward larger numbers.

21
Q

Which method of central tendency is most suitable for the following types of variables: Nominal , ordinal, interval/Ratio(not skewed), Interval/Ratio(skewed)

A

Nominal- Mode
Ordinal- Median
Interval/Ratio (not skewed)- Mean
Interval/Ratio (skewed)- Median