Lecture 2 Flashcards

Measures of Central Tendency

1
Q

Distribution

A

A description of how a dataset looks: which values occur and how often each one occurs

2
Q

Measures of Central Tendency

A

Mean, median and mode. Which one is used depends on the distribution of the data.

3
Q

Another name for central tendency

A

The Average

4
Q

Mean

A

AKA the arithmetic mean; the most common measure of a midpoint. Most useful when the data is roughly symmetric (e.g. normally distributed). Calculated as the sum of all values divided by the total number of values.
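
As a minimal sketch of the calculation, the mean can be worked out by hand (sum divided by count) and checked against Python's standard `statistics.mean`. The list `values` is made-up illustration data, not from the lecture.

```python
from statistics import mean

# Hypothetical sample of five observations (illustration only).
values = [4, 8, 6, 5, 7]

# Mean = sum of all values divided by the number of values.
manual_mean = sum(values) / len(values)

# The standard library gives the same result.
library_mean = mean(values)

print(manual_mean, library_mean)
```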

5
Q

What is the symbol for the mean if we have data on a whole population?

A

μ (pronounced ‘mu’)

6
Q

What is the symbol for the mean if we have data on a sample (from a larger population)?

A

x̄ (pronounced ‘x bar’)

7
Q

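Why is the mean most useful when data is roughly symmetric (e.g. normally distributed)?

A

It is sensitive to outliers and skew: extreme values pull the mean away from where most of the data lies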

8
Q

Median

A

The point at which half of the values (for a given variable in a sample/population) lie below and half of the values lie above

9
Q

Odd vs Even Number of Values

A

If the number of values is odd, the median is the middle ordered value. If even, the median is the average (mean) of the two middle ordered values. Values must be in order to work out the median!
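
The odd/even rule can be sketched as a small function. `median_of` is a hypothetical helper name used for illustration; in practice Python's `statistics.median` does the same job.

```python
def median_of(values):
    """Return the median using the odd/even rule on the sorted values."""
    if not values:
        raise ValueError("median_of() needs at least one value")
    ordered = sorted(values)  # values must be in order first
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: the single middle value.
        return ordered[mid]
    # Even count: the mean of the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median_of([7, 1, 3]))     # odd number of values
print(median_of([7, 1, 3, 5]))  # even number of values
```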

10
Q

What kind of measure is the median?

A

A resistant measure of the data’s centre: it is largely unaffected by outliers. Used instead of the mean if the data is skewed.
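
To see what "resistant" means in practice, the sketch below adds one extreme value to a small made-up dataset and compares how far the mean and the median move. The numbers are illustrative assumptions only.

```python
from statistics import mean, median

# Hypothetical data: five typical values, then the same data plus one outlier.
typical = [30, 32, 35, 38, 40]
with_outlier = typical + [400]

mean_before, median_before = mean(typical), median(typical)
mean_after, median_after = mean(with_outlier), median(with_outlier)

# The mean is dragged towards the outlier; the median barely moves.
print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
```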

11
Q

Median and Quartiles

A

-The 1st quartile has ¼ of the data below it
-The median (2nd quartile) has ½ of the data below it
-The 3rd quartile has ¾ of the data below it
-The interquartile range (IQR) contains the middle ½ of the sample data – the data between the 1st and 3rd quartile
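
A minimal sketch of the quartiles and IQR using `statistics.quantiles` (Python 3.8+). Several conventions exist for computing quartiles and they can give slightly different answers; the choice of `method="inclusive"` here is an assumption, and the data is made up.

```python
from statistics import quantiles

# Hypothetical sample of nine observations (illustration only).
data = [12, 15, 11, 18, 14, 20, 13, 17, 16]

# n=4 splits the data into quarters and returns the three cut points.
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

# The IQR is the width of the middle half of the data.
iqr = q3 - q1

print("Q1:", q1, "median:", q2, "Q3:", q3, "IQR:", iqr)
```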

12
Q

Box and Whisker Plots

A

Very top and bottom lines (the whisker ends) = maximum and minimum values (excluding outliers!). Middle line = median. 2nd and 4th lines (the box edges) = upper and lower quartiles.
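
The five lines of a box plot can be sketched as numbers before drawing anything. The card does not say how outliers are identified; the sketch below assumes the common 1.5 × IQR rule, which is a convention rather than something stated in the lecture. `box_plot_summary` is a hypothetical helper name and the data is made up.

```python
from statistics import quantiles


def box_plot_summary(values):
    """Return the lines of a box plot, assuming the 1.5 x IQR outlier rule."""
    ordered = sorted(values)
    q1, med, q3 = quantiles(ordered, n=4, method="inclusive")
    iqr = q3 - q1
    # Assumed convention: points beyond these fences count as outliers.
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    inside = [v for v in ordered if low_fence <= v <= high_fence]
    outliers = [v for v in ordered if v < low_fence or v > high_fence]
    return {
        "whisker_low": inside[0],    # minimum, excluding outliers
        "q1": q1,                    # lower quartile (box edge)
        "median": med,               # middle line
        "q3": q3,                    # upper quartile (box edge)
        "whisker_high": inside[-1],  # maximum, excluding outliers
        "outliers": outliers,
    }


summary = box_plot_summary([11, 12, 13, 14, 15, 16, 17, 18, 40])
print(summary)
```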

13
Q

Mode

A

The most frequently occurring event/observation (data point). We can have unimodal, bimodal and multimodal data. Useful for nominal variables such as eye colour.
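
As a short sketch, `statistics.multimode` (Python 3.8+) returns every value tied for most frequent, which makes unimodal and bimodal data easy to tell apart. Both lists below are made-up illustration data.

```python
from statistics import multimode

# Nominal variable: the mode is the most common category.
eye_colours = ["brown", "blue", "brown", "green", "blue", "brown"]
colour_modes = multimode(eye_colours)

# Two values tie for most frequent, so this data is bimodal.
scores = [1, 2, 2, 3, 3]
score_modes = multimode(scores)

print(colour_modes, score_modes)
```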
