Descriptive statistics Flashcards

1
Q

Define Categorical data

A

Data with two or more categories that have no inherent order (usually binary or nominal)

2
Q

Define discrete data

A

Takes only distinct, separate values (such as whole-number counts) with a logical order (usually ordinal, ratio, or interval variables)

3
Q

Define continuous data

A

Can take any value within a range, including fractional values (usually ratio or interval variables)

4
Q

How can frequencies be presented?

A

As raw frequencies (counts) or as percentage frequencies
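
A minimal sketch of the two presentations, assuming a small list of categorical responses. The data and the variable names (responses, raw, percent) are invented for illustration.

```python
from collections import Counter

# Hypothetical categorical responses, invented for illustration
responses = ["yes", "no", "yes", "yes", "no", "maybe", "yes", "no"]

raw = Counter(responses)       # raw frequency: count per category
total = sum(raw.values())      # total number of responses

# Percentage frequency: each count as a share of the total
percent = {category: 100 * count / total for category, count in raw.items()}

for category in raw:
    print(f"{category}: {raw[category]} ({percent[category]:.1f}%)")
```

Percentage frequencies make groups of different sizes comparable, while raw frequencies keep the actual counts visible.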

5
Q

Define Mode

A

The score occurring most often in a dataset

6
Q

Define Median

A

The middle score in a dataset once the scores have been sorted in order

7
Q

Define Mean

A

Sum of the data points divided by the number of data points
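
The mode, median and mean can be compared on one dataset. This is a sketch using Python's standard statistics module; the scores are hypothetical.

```python
import statistics

# Hypothetical scores, invented for illustration
scores = [2, 3, 3, 5, 7, 8, 14]

mode = statistics.mode(scores)      # score occurring most often
median = statistics.median(scores)  # middle score after sorting
mean = statistics.mean(scores)      # sum of scores / number of scores

print(mode, median, mean)
```

The three values differ here because the single high score pulls the mean upward.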

8
Q

Define Central tendency

A

Where the centre of our frequency distribution lies

9
Q

Which bar on a bar chart would be the mode?

A

The highest bar

10
Q

How would you work out the median in a dataset with an odd number of values?

A

Sort the values; the median is the value at position (n+1)/2

11
Q

How would you work out the median in a dataset with an even number of values?

A

Sort the values; the median is the sum of the two middle values divided by 2
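
The odd and even cases can be sketched as one function. The name median_by_position is hypothetical, and positions are counted from 1, as in the (n+1)/2 rule.

```python
def median_by_position(values):
    """Median using the odd/even position rules (a sketch, not a library API)."""
    ordered = sorted(values)   # the rules only work on sorted data
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty dataset is undefined")
    if n % 2 == 1:
        # Odd n: value at position (n + 1) / 2, converted to a 0-based index
        return ordered[(n + 1) // 2 - 1]
    # Even n: sum of the two middle values divided by 2
    return (ordered[n // 2 - 1] + ordered[n // 2]) / 2


print(median_by_position([7, 1, 5]))
print(median_by_position([4, 1, 3, 2]))
```

With an even number of values the result may not be a value that appears in the data.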

12
Q

What are some pros of the median?

A

Insensitive to outliers, often gives a real data value and is useful for ordinal data

13
Q

What are some cons of the median?

A

Ignores a lot of the data, difficult to calculate without a computer and can’t be used with nominal data

14
Q

What are some pros of the mean?

A

Uses all of the data and is most effective for normally distributed datasets

15
Q

What are some cons of the mean?

A

Sensitive to outliers, its values are not always meaningful, and it is only useful for ratio and interval data
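
The difference in outlier sensitivity between the mean and the median can be illustrated with a small sketch. The values, including the outlier of 300, are invented for illustration.

```python
import statistics

# Hypothetical values, invented for illustration
typical = [20, 22, 23, 25, 30]
with_outlier = typical + [300]   # add one extreme score

# How far each measure moves when the outlier is added
mean_shift = statistics.mean(with_outlier) - statistics.mean(typical)
median_shift = statistics.median(with_outlier) - statistics.median(typical)

print(mean_shift, median_shift)
```

In this example the mean moves far more than the median, which is why the median is often preferred for skewed data.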

16
Q

What are the measures of spread for the mode?

A

There are no measures of spread

17
Q

What are the measures of spread for the median?

A

‘distance-based’ measures e.g. range and interquartile range

18
Q

What are the measures of spread for the mean?

A

‘centre-based’ measures of spread e.g. variance and standard deviation

19
Q

Define the interquartile range

A

The range of the middle 50% of scores (upper quartile minus lower quartile), which ignores the most extreme values

20
Q

Define the lower quartile

A

Median of lower half of data

21
Q

Define the upper quartile

A

Median of upper half of the data
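
A sketch of the quartiles and interquartile range using the median-of-halves definitions. It assumes that, for an odd number of values, the overall median is left out of both halves; other conventions exist and give slightly different quartiles. The function name quartiles is hypothetical.

```python
import statistics


def quartiles(values):
    """Lower and upper quartile as medians of the lower and upper halves."""
    ordered = sorted(values)
    if len(ordered) < 2:
        raise ValueError("need at least two values")
    half = len(ordered) // 2
    lower_half = ordered[:half]    # values below the median position
    upper_half = ordered[-half:]   # values above the median position
    return statistics.median(lower_half), statistics.median(upper_half)


# Hypothetical scores, invented for illustration
q1, q3 = quartiles([1, 3, 4, 6, 8, 9, 11, 15])
iqr = q3 - q1   # range of the middle 50% of scores

print(q1, q3, iqr)
```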

22
Q

What are the pros and cons of the interquartile range identical to?

A

The median pros and cons

23
Q

Define Deviance

A

Take each score and subtract the mean from it

24
Q

Define squared errors

A

Take each deviance score and square it

25
Q

Define the variance

A

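The average of the squared errors: the sum of squared errors divided by the number of scores (or by N - 1 when estimating from a sample)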
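
The three steps (deviance, squared errors, variance) can be sketched as below. The scores are hypothetical, and deviance is taken as score minus mean; squaring removes the sign, so the variance is the same under either sign convention.

```python
# Hypothetical scores, invented for illustration
scores = [2, 4, 4, 6, 9]

mean = sum(scores) / len(scores)

# Step 1: deviance of each score from the mean
deviances = [x - mean for x in scores]

# Step 2: square each deviance so negatives do not cancel positives
squared_errors = [d ** 2 for d in deviances]

# Step 3: average the squared errors
sum_of_squares = sum(squared_errors)
variance_population = sum_of_squares / len(scores)    # divide by N
variance_sample = sum_of_squares / (len(scores) - 1)  # divide by N - 1

print(variance_population, variance_sample)
```

The raw deviances always sum to zero, which is why they are squared before averaging.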

26
Q

What are the pros of variance?

A

Uses all of the data and forms the basis of several other tests

27
Q

What are the cons of variance?

A

Requires a normal distribution and is sensitive to outliers

28
Q

Define Standard deviation

A

A measure of spread expressed in the same units of measurement as the dependent variable

29
Q

How would you calculate the Standard deviation?

A

By taking the square root of the variance
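
A short sketch of that calculation, assuming the scores are treated as a whole population (variance divided by N). The data are hypothetical.

```python
import math
import statistics

# Hypothetical scores, invented for illustration
scores = [2, 4, 4, 6, 9]

variance = statistics.pvariance(scores)  # population variance (divides by N)
sd = math.sqrt(variance)                 # back in the original units

print(variance, sd)
```

Because the variance is in squared units, taking the square root returns the measure to the units of the scores themselves.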

30
Q

What does the Standard deviation allow us to do?

A

Get an unbiased estimate of the population’s standard deviation if we only have access to a sample of data

31
Q

What can Standard deviation estimate?

A

The population's standard deviation, based on a sample
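
The sample-based estimate can be sketched with Python's statistics module, where stdev divides by N - 1 and pstdev divides by N. The sample values are hypothetical.

```python
import statistics

# Hypothetical sample drawn from a larger population
sample = [4, 8, 6, 5, 3, 7]

sd_as_population = statistics.pstdev(sample)  # treats the data as the whole population (N)
sd_estimate = statistics.stdev(sample)        # estimates the population value (N - 1)

print(sd_as_population, sd_estimate)
```

Dividing by N - 1 gives a slightly larger value, which compensates for a sample tending to understate the population's spread.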