Statistics 2 Flashcards

1
Q

What are the two types of data?

A

Discrete & continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an example of discrete and continuous data?

A

Frequencies and weight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the four scales of measurement?

A

Nominal, Ordinal, Interval and Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the nominal scale of measurement?

A

Discrete, mutually exclusive categories, if coded numerically then they have no numerical significance, no arithmetic possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the ordinal scale of measurement?

A

Discrete, categories are not mutually exclusive - instead they form a natural ordering/hierarchy, intervals in between categories are not uniform, no arithmetic possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the interval scale of measurement?

A

Continuous data, intervals are split equally allowing for simple arithmetic, however there is no absolute zero value which means there are no values below a certain point and it also prevents complex arithmetic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the ratio scale of measurement?

A

Continuous data, interval data with an absolute zero point. all arithmetic possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What scales of measurement can a bar chart represent?

A

Discrete (nominal and ordinal) data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is data represented on the bar chart?

A

The length of the bar is proportional to frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is represented on the x and y axis of a bar chart?

A
X-axis = nothing just the names of the different categories
y-axis = frequency/measurement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How would you order nominal and ordinal data on a bar chart along the x-axis?

A

Nominal: order from smallest category to the largest category
Ordinal: order from pre-determined way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What data is represented on a histogram?

A

Continuous (ratio or interval) data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How is data represented on the histogram?

A

The area is proportional to the size of the category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If there is a change to the category size what axis is changed?

A

The x-axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why is there no gaps between the different categories on the x-axis?

A

Because it is continuous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is univariate analysis?

A

analysis of a single variable distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How can distribution be described?

A

Visually (graphs and plots) or numerically (summary statistics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How is central tendency described?

A

Averages

19
Q

What is a bimodal distribution and how does it affect the central tendency?

A

it has two peaks so essentially there is no centre

20
Q

What is dispersion?

A

The spread of the data

21
Q

What are the two methods for representing and measuring dispersion?

A

Median and IQR & Standard deviation and variance

22
Q

When would you use the median and IQR as a method to represent dispersion?

A

Most suitable for non-normal distribution. Also when the sample size is small

23
Q

When would you use standard deviation and variance as a method to represent dispersion?

A

For normal distribution. (ratio and interval)

24
Q

What is standard deviation?

A

Summary statistic that characterises the spread of the distribution around the mean

25
Q

What happens as SD increases?

A

the spread of the data around the mean broadens/increases

26
Q

How much of the data is in 1 standard deviation of the mean?

A

68.26%

27
Q

How much of the data not within 1 SD is on either side of the mean?

A

15.87% on either side

28
Q

What happens to the shape of the normal curve at the point of 1SD?

A

It changes from a concave to convex shape

29
Q

What are important characteristics of a normal distribution?

A

It is spread equally and symmetrically

30
Q

What is variance?

A

Standard deviation squared

31
Q

What scales of measurement can variance not be used ?

A

nominal or ordinal

32
Q

What can the variance never be?

A

non-negative

33
Q

What is the name for values and calculations for a sample and a population?

A

Sample: statistic
Population: parameter

34
Q

What is the coefficient of variation?

A

It allows us to compare different variables of a distribution with different units or magnitude by standardising them.

35
Q

What scale of measurement is the coefficient of variation only valid for?

A

Ratio

36
Q

What is skewness?

A

The characterisation of shape in a normal distribution - measuring its symmetry

37
Q

If the skew is =, less than or more than 0 what does this mean?

A

= 0: No skew
<0: negative skew (more values concentrated to the right of mean)
>0: positive skew (more values concentrated to the left of mean)

38
Q

What is kurtosis?

A

A measure of peakedness

39
Q

What happens if kurtosis increases?

A

the mass of the data gets moved from the shoulders of the data distribution towards the centre

40
Q

What does it mean if the kurtosis is or = 3?

A

<3 means that it is platykurtic (not very peaked)
~3 means that it is mesokurtic (relatively peaked)
>3 means that it is leptokurtic (very peaked)

41
Q

What are the four moments of distribution?

A

1) Central value, 2) dispersion, 3) skewness, 4) kurtosis

42
Q

What is parametric data?

A

ratio or interval data that is normally distributed. n>30

43
Q

What is non-parametric data?

A

ordinal or nominal data OR ratio or interval data that is not normally distributed. n<30