2.1. Descriptive Statistics Flashcards

1
Q

Population mean…

A

Can be applied to interval and ratio data.

Is affected by all the values in the dataset - therefore any extreme outliers will stretch the mean value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample mean…

A

A summation of all values we have over the total number of observations.

Sample mean is the mean calculation for a subset of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Grouped data mean…

A

The frequencies of data are categorised into classes.

We can then use a midpoint value to calculate an fx = (frequency * midpoint value).

The sum of the fx is then divided by the total cumulative frequency.

This type of mean can be calculated for both a population and a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Mean and investments…

A

The mean is considered a measure of return on investment.

Standard deviation is considered a measure of risk of an investment, as it measures the rate of an investment’s performance.

The greater the standard deviation, the greater the investment’s volatility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Median of ordered data…

A

Data must be ordered, from smallest to largest, for example.

The median is the value that divides the ordered sample into two parts, with equal numbers of observations in each part.

Can be applied to ordinal, interval and ratio data (not nominal).

The median is not affected by outliers as we just calculate the midpoint.

There are two ways to calculate the median:
- For an odd sample: (n + 1) / 2 - this is the observation that acts as the median.
- For an even sample: the median is between (n/2) and (n/2 + 1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Median of grouped data…

A

Calculate the class interval in which the median observation is held (using the standard calculation method).

Calculate the specific value (check formula on formula sheet).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Mode…

A

This is the most frequently occuring value in the dataset.

This can be applied to all types of data.

Data can be bimodal, where two values have the highest frequency, or multimodal, where three or more values have the highest frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Choice of measures of central tendencies…

A

The measure of central tendency that is most appropriate depends on the scenario.

The mean is the most widely used measure of central tendency.

If observations are symmetrical (and unimodal), the mean, median and mode are the same.

Although data can have the same mean, it may be differently distributed (4, 4, 4, 4 and 2, 4, 4, 6).

Additionally, data that have different means can be similarly dispersed (2, 4, 4, 6 and 4, 6, 6, 8).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Quartiles…

A

Quartiles divide the dataset into four equal parts.

Calculate the class interval in which the lower quartile or the upper quartile is held, using the standard calculation method.

To deduce the 1st and 3rd quartile, divide by either 4 (and then multiply by 3 if applicable).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Range…

A

The difference between the largest and the smallest observation.

It is the simplest and easiest descriptive statistical measurement to calculate.

It ignores all the data points apart from the most extreme.

Interquartile range can also be calculated if we know both the first and third quartile. This shows the spread of 50% of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Box and whisker plots…

A

These are used to illustrate the spread of data.

The square shows us how data is concentrated and the whole spread of data is from the ends of the ‘whiskers’.

They are tools that show us the overall distribution of data.

Skewed data can be seen when the median is further to one side or the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mean absolute deviation…

A

The average of the absolute deviations from the mean.

Looks at dispersion around the central location of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Variance…

A

A measure that considers all of the information available.

Check formula sheet and tutorial document ‘1.2. Tutorial 1 Answers’ for guidance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Standard deviation…

A

The square root of the variance.

We can calculate standard deviation for a population, a sample or for grouped data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Coefficient of variation…

A

A measure of relative dispersion (independent of units of measurement).

Considers how the data varies with the mean and standard deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly