Summarising data Flashcards

1
Q

What are the 2 types of summary descriptive statistics?

A
  1. Measure of central tendency = avg

2. Measure of dispersion = spread of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the different ways to measure central tendency/ typical performance?

A
  • Mean
  • Mode
  • Median
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the dis and ad of using mode?

A

Mode: most frq score in a set of score
AD:
- simple + easy
- only avg which can be used w/ nominal data (categorical)
DIS:
- can be unrepresentative therefore misleading
–> 39 = mode but best of numbers may be low
- may be more than one mode in a set of score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the dis + ad of using mean?

A

Mean: (add all scores)/ total numbers of scores
AD:
- uses info from every single score
- resistant to sample fluctuation
DIS:
- Susceptible to distortion from extreme score - outliers + Skew

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the dis + ad of using median?

A

Median: arrange scores in order, median = middle value or avg of middle 2 scores
AD:
- resistant to the distorting effects of extreme high or low scores
DIS:
- ignores score’ numerical value = wasteful data
- more susceptible to sampling fluctuations than the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the different measures of dispersion/ variability in performance?

A
  1. Range

2. Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the AD and DIS of the range?

A

Range = difference between the highest + lowest score
AD;
- quick + easy to calculate
DIS:
- influenced by extreme scores
- conveys no info about the spread of scores between the highest + lowest scores
–> could have same range but spread of data completely different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is SD?

A
  • The spread of scores around a sample mean
  • tells us how well the mean summarises the sample
  • -> bigger the SD, the more scores differ from the mean + between themselves and less satisfactory the mean becomes as a summary of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the ads + dis of SD?

A

AD;
- like the mean, use info from every score
DIS:
- not intuitively easy to understand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you calculate SD?

A
  1. Work out mean of data
  2. Subtract mean from each score
  3. Square the differences obtained
  4. Add up the squared differences = SS sum of squares
  5. SS/ the total number of scores = Variance
  6. SD = square root of variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some issues with using the mean and SD?

A
  • usually obtain SD/ mean from a sample = cannot extrapolate to the population from our sample = only a good estimate
  • SD tends to underestimate the population SD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can we deal with SD typically underestimating the SD of the population?

A
  • when using sample, divide by n
  • when using the sample SD as an ESTIMATE of the population, divide by n-1
    (makes SD larger)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the relationship between the normal curve and the SD?

A
  • The SD cuts off a constant proportion of the distribution of score
    > 68% of ppl have IQs between 85 + 115 (mean = 100, SD +/- 15)
    > 95% have IQs between 70 - 130 (100, +/- (2*15))
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the chances of 99.7% of a population will have an IQ between 55 - 145 if the mean = 100 and the SD = 15 and the SD constant has been 30?

A
  1. (100 - 99.7)/ 2
    –> 2 there since it is 2 more than normal 1 SD
    = only occurs in 15% of the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Wha is standard error of mean?

A

A type of SD

  • is the SD of a set of sample means
  • shows how much variation there is within a set of sample means
  • -> indicates the reliability of each sample mean as an estimate of the true population mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the formulation for SE?

A

= SD/ square root of n

n = Sample size

17
Q

What can be interpreted from the different sizes of SE?

A
  1. Small SE = our obtained sample mean is more likely to be similar to the true population mean
  2. Increasing n (sample size) reduces size of SE