Lecture 6- Measures of Variability Flashcards

1
Q

What are three measures of central tendency often used in statistics?

A
  • Mean (average)
  • Median (mid point)
  • Mode (most common value)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the aim of measuring central tendency?

A

Summarizes a data set with a single value that is representative of the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In terms of measuring central tendency what occurs when the distribution is approximately normal?

A
  • The mean, median and mode are approximately the same

- Therefore the mean can be used to summarize the data as no information will be ‘missed’ in doing so

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the purpose of measuring variability in a data set?

A

Looks at the extent to which things are not the same i.e. how scores differ from each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two major roles of measuring variability?

A

1) Inferential statistics - the amount of variability affects the kinds of statements we can make about our population and the degree of confidence we can have in those statements. The more variability the lower our confidence should be in our trend
2) Descriptive statistics- variability is an interesting property of a data set in itself

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why can’t we just rely on measures of central tendency to summaries data?

A

Because although two sets of data may average to have the exact same mean the distribution of the data may be completely different. For example, you could have a graph that is very narrow (all data is clustered tightly around the mean) or alternatively data could be spread out with long tails.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What’s the difference between a low variability and high variability data set?

A

Low variability= most data clustered around mean (narrow graph)
High variability= data spread out along the scale (long tails= broad graph)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the simplest way to measure variability?

A

Range.
The largest score in your data - the smallest score.
This shows that all your data points sit within this number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Where can range fall down?

A

It ignores distribution of the data. You can have two graphs that have both the same mean and range but that are still complete different.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What’s an alternative measure of variability to range?

A

Variance (S2)

It is calculated by taking the sum of the squared deviations from the mean and dividing it by the number of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do we get standard deviation from variance?

A

Square root it.
Variance= S^2
SD= S

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is standard deviation?

A
  • Approximately the average distance of the scores in a data set from their mean
  • It is the most useful and most common measure of variability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does S^2 compare to the true population variance (on average)? How do we overcome this?

A

S^2 is a little bit smaller than the population variance on average. we take this into account by dividing the sum of the squared differences in our mean by n-1 not n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the inflection point in a normal distribution?

A

Point where the curve starts bending outward more (start of the bend)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Where is the inflection point in terms of the mean in a normal distribution?

A

A point of inflection will always be 1 SD from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Where is the inflection point in terms of the mean in a normal distribution?

A

A point of inflection will always be 1 SD from the mean

17
Q

What can be said about the distribution of data in a normal distribution using SDs?

A
  • About 68% of all scores are within +/- 1 SD of the mean
  • Nearly 96% are within +/- 2 SD of the mean
  • 99.7% are within +/- 3 SD of the mean
18
Q

How do you calculate a Z score?

A

z = (data point you are interested in- mean) / standard deviation (s)

19
Q

What does a Z score tell us?

A

How far away a score is from the mean, taking into account the
variability of the scores in the data set

It allows us interpret a score in terms of its…
a) relative standing
b) variability (S)
by comparing it to a standard normal curve

20
Q

What do we mean by being able to determine a score’s relative standard?

A

Determining what proportion of scores are less than or

greater than the individual score

21
Q

What are the steps to determining an individual scores relative standard in a distribution curve?

A

1) Calculate z-score
2) Draw a picture
3) Look up value of z in z-table to find the
relative position of xi

22
Q

Given this data calculate the z score…

Assume mean family income = $37,000,
S = $5,000, value interested in is 30,000

A

z = 30,000 30,000 – 37,000/ 5,000
= – 7,000/ 5,000
z = – 1.4

23
Q

If you get a negative z score how do you look it up in a z table?

A

By using the positive entry. The bell curve is symmetrical so this doesn’t matter.

24
Q

For any normal distribution, the z-distribution obtained by converting
every data element to its z-score has what properties?

A

1) It is normal
2) mean = 0
3) S = 1

25
Q

In addition to finding the relative position of a data element in its distribution what can z scores also allow and why?

A

They can be used to compare scores from different distributions
Normally it is hard to do this because distributions vary in both their means and standard deviations, however, because z scores can transform any data into a standardized version it allows comparison