Statistics Chapter 2 Flashcards

1
Q

What is a measure of location?

A

A single value describing a position in a data set

2
Q

What is a measure of central tendency?

A

A single value describing the centre of a data set

3
Q

What is the mode/modal class?

A

The most frequently occurring value or class

4
Q

What is the median?

A

The middle value of an ordered set of data

5
Q

What is the mean?

A

The sum of the data values divided by the number of data values
x bar = sum of x / n
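
As a quick illustration, the mode, median and mean can be computed with Python's standard `statistics` module. The data values here are made up for the example and are not from the chapter.

```python
import statistics

# Hypothetical data set, invented for illustration
data = [2, 3, 3, 5, 7, 10]

mode = statistics.mode(data)      # most frequently occurring value
median = statistics.median(data)  # middle value of the ordered data
mean = sum(data) / len(data)      # x bar = sum of x / n

print(mode, median, mean)
```

With an even number of values, `statistics.median` takes the midpoint of the two middle values.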

6
Q

When should the mode be used? (2)

A

Qualitative data

Quantitative data with repeating values

7
Q

What is a data set with two modes called?

A

Bimodal

8
Q

When should the median be used?

A

Quantitative data; especially data with extreme values or outliers, as these have little or no effect on the median

9
Q

When should the mean be used?

A

Quantitative data, as it uses every value in the data set; not suitable when there are extreme values, as these have a large effect on the mean
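
A small sketch of why the choice matters: adding one extreme value moves the mean a long way but barely changes the median. The numbers are invented for illustration.

```python
import statistics

# Hypothetical data, invented for illustration
data = [4, 5, 6, 7, 8]
with_outlier = data + [100]  # same data plus one extreme value

mean_before = statistics.mean(data)
median_before = statistics.median(data)
mean_after = statistics.mean(with_outlier)
median_after = statistics.median(with_outlier)

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
```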

10
Q

What is the four pots method used for?

A

Finding quartiles for discrete data

11
Q

How does the four pots method work? (3)

A

Divide the number of pieces of data by four
Place the data values as equally as possible into the four pots
Add remainders as evenly as possible in the spaces between the pots

12
Q

What is linear interpolation used for?

A

Finding quartiles for continuous data
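
A minimal sketch of linear interpolation on grouped data, assuming the values are spread evenly within each class and that the quartile positions are taken as n/4, n/2 and 3n/4. The function name `interpolate_quantile` and the class data are hypothetical, chosen only for this example.

```python
def interpolate_quantile(classes, fraction):
    """Estimate a quantile from grouped data by linear interpolation.

    classes: ordered list of (lower_boundary, upper_boundary, frequency).
    fraction: 0.25 for Q1, 0.5 for the median, 0.75 for Q3.
    """
    total = sum(freq for _, _, freq in classes)
    target = fraction * total  # position of the quantile in the data
    cumulative = 0
    for lower, upper, freq in classes:
        if freq > 0 and cumulative + freq >= target:
            # How far into this class the target position lies
            return lower + (target - cumulative) / freq * (upper - lower)
        cumulative += freq
    raise ValueError("no class contains the target position")


# Hypothetical grouped data, invented for illustration
classes = [(0, 10, 4), (10, 20, 8), (20, 30, 6), (30, 40, 2)]
q1 = interpolate_quantile(classes, 0.25)
q2 = interpolate_quantile(classes, 0.5)
q3 = interpolate_quantile(classes, 0.75)
print(q1, q2, q3)
```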

13
Q

What is a measure of spread?

A

A measure of how spread out the data is

14
Q

What is the range? (2)

A

The difference between the largest and smallest data values

Affected by extreme values

15
Q

What is the interquartile range? (2)

A

Difference between the upper quartile and the lower quartile (Q3 - Q1)

Not affected by extreme values but only considers middle 50%

16
Q

What is the interpercentile range? (2)

A

Difference between two given percentiles

Often uses the 10th and 90th percentiles, as this covers the middle 80% of the data but discounts extreme values
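
The three range measures can be compared side by side. This sketch uses `statistics.quantiles` with `method="inclusive"`; quantile conventions differ between textbooks and libraries, so the values may not match a hand method exactly. The data set is invented for illustration.

```python
import statistics

# Hypothetical data, invented for illustration
data = list(range(1, 12))  # 1, 2, ..., 11

data_range = max(data) - min(data)

# n=4 gives the three quartile cut points: Q1, Q2, Q3
q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

# n=10 gives nine cut points: 10th, 20th, ..., 90th percentiles
deciles = statistics.quantiles(data, n=10, method="inclusive")
p10_p90 = deciles[-1] - deciles[0]

print(data_range, iqr, p10_p90)
```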

17
Q

What is variance?

A

A measure of spread: the mean of the squared deviations of the data values from the mean

18
Q

How is variance calculated? (2)

A

(Sum of x^2 / n) - (x bar)^2

Sxx / n, where Sxx = sum of (x - x bar)^2
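
A short check that the two forms agree, taking Sxx as the sum of (x - x bar)^2. The data values are made up for the example.

```python
import math

# Hypothetical data, invented for illustration
data = [2, 4, 4, 6, 9]
n = len(data)
mean = sum(data) / n

# Form 1: (sum of x^2 / n) - (x bar)^2
var_a = sum(x ** 2 for x in data) / n - mean ** 2

# Form 2: Sxx / n, with Sxx = sum of (x - x bar)^2
sxx = sum((x - mean) ** 2 for x in data)
var_b = sxx / n

# The two results match up to floating-point rounding
print(var_a, var_b, math.isclose(var_a, var_b))
```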

19
Q

What is standard deviation? (2)

A

The higher the number the more spread out the data is

Never negative (zero only when all the data values are equal)

20
Q

How is standard deviation calculated?

A

√[(sum of x^2 / n) - (x bar)^2]

21
Q

How is the standard deviation of grouped data calculated? (2)

A

√[(sum of fx^2 / sum of f) - (sum of fx / sum of f)^2]

Where x = midpoints of each group
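
A minimal sketch of the grouped-data calculation, using class midpoints for x. The classes and frequencies are hypothetical.

```python
import math

# Hypothetical grouped data: (lower boundary, upper boundary, frequency)
classes = [(0, 10, 2), (10, 20, 5), (20, 30, 3)]

sum_f = sum(f for _, _, f in classes)
# x is the midpoint of each class
sum_fx = sum(f * (lo + hi) / 2 for lo, hi, f in classes)
sum_fx2 = sum(f * ((lo + hi) / 2) ** 2 for lo, hi, f in classes)

mean = sum_fx / sum_f
variance = sum_fx2 / sum_f - mean ** 2
sd = math.sqrt(variance)

print(mean, variance, sd)
```

Because every value in a class is treated as sitting at the midpoint, the result is an estimate rather than the exact standard deviation of the raw data.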

22
Q

What is coding?

A

A set of data values can be modified by using a formula:

y = (x - a) / b

23
Q

How is the mean of coded data converted to the mean of the original data? (2)

A

(y bar) = ((x bar) - a) / b, so (x bar) = b(y bar) + a

The mean is affected by both the subtraction and the division, so to undo the coding, multiply the coded mean by b and then add a

24
Q

How is the standard deviation of coded data converted to the standard deviation of the original data? (2)

A

(sigma y) = (sigma x) / b, so (sigma x) = b(sigma y)

As standard deviation is a measure of spread, adding or subtracting has no effect; only the division by b changes it
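
The two conversion rules can be checked numerically. This sketch codes a made-up data set with y = (x - a) / b, then recovers the original mean and standard deviation from the coded ones; the values of a and b are arbitrary choices for the example.

```python
import math
import statistics

# Hypothetical data and coding constants, invented for illustration
x = [110, 120, 130, 150, 190]
a, b = 100, 10

# Code the data: y = (x - a) / b
y = [(xi - a) / b for xi in x]

mean_y = statistics.mean(y)
sd_y = statistics.pstdev(y)  # population standard deviation (divides by n)

# Undo the coding
mean_x = b * mean_y + a  # mean: multiply by b, then add a
sd_x = b * sd_y          # standard deviation: multiply by b only

print(mean_x, statistics.mean(x))
print(sd_x, statistics.pstdev(x))
```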