Descriptive Statistics Flashcards

1
Q

Parameter

A

Describes a population → summary of real behaviour/data present in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Statistic

A

Describes a sample of data → we infer something about the population (parameters) from what we know about the sample (statistics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Descriptive statistics

A

Summarise patterns within the sample

  • Central tendency (location)
  • Dispersion (spread)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Infererential statistics

A

Allow us to draw inferences about the population based on our sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Discrete variable

A
  • Scores can only take on certain values e.g. 5-point Likert-type rating scale
  • Usually whole numbers but e.g. UK shoe sizes are also discrete: 7, 7 1/2, 8 etc
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Continuous variable

A
  • Scores can take any value
  • Any level of precision
  • E.g. age, time, weight etc
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types of data

A
  • Nominal: labels/names/categories e.g. gender
  • Ordinal: numbers ranked or ordered by a category e.g. order of finishing in a race
  • Interval: measurements are made on a scale; differences between points on the scale are equal but there is no ‘natural’ zero point (it is arbitrary) e.g. temperature scale
  • Ratio: same as interval data but there is a ‘natural’ zero point e.g. height/length, weight, time etc
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Measures of central tendency

A
  • Mode: the most frequently occurring score
    • Advantage: simple to calculate and easy to understand
    • Disadvantage: easily unrepresentative
  • Median: middle score when scores ordered by size
    • Advantage: relatively unaffected by untypical extreme scores
    • Disadvantage: may not describe all scores in a data set
  • Mean: add together scores and divide by total number of scores
    • Advantage: only measure which uses every single score
    • Disadvantage: easily distorted by single, extreme scores (outliers)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Measures of dispersion

A
  • Range: how far apart the highest and lowest scores are
    • Advantage: easy to calculate
    • Disadvantage: affected by extreme scores
  • Interquartile range (IQR): ‘trimmed’ range = 75th percentile minus the 25th percentile
  • Standard deviation: the average amount by which scores differ form the mean
    • Standard deviation takes into account all values in the data set
    • Square root of variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sum of squared errors (SS)

A

The total amount that data points deviate from the mean, squared.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Central limit theorem

A
  • If we take lots of samples from a population and take the mean of each sample, these means will be nornally distributed
  • The mean of all sample means will approximate the population mean
  • Need sample size of at least 30
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard error

A

The standard deviation of sampling means = the variability in sample means around the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly