Week 4 Flashcards

1
Q

Can a median be obtained for nominal variables?

A

Median cannot be obtained for nomimal variables, it can be obtained only on ordered variables – ordinal, interval, ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What types of data can the mode be used for

A

Can be used for all types of variables – and often used for nominal and ordinal variables e.g. the most frequent answer was ‘extremely satisfied.’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Can there be multiple modes

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What variables can mean be defined in

A

Ratio and interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Difference between mean and median

A

Mean depends on the actual values, whike median is not affected e.g. one extreme outlier can hugley affect mean but not median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How would you go about creating equal sections of data (use coins)

A

Divide coins into sections containing the same number of data and report where the sections are located
The cut off points dividing these sections are called quantiles
if there were 200 coins - 20 sections of 10 coins each

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When there are 4 sections, they are called _____, and the median is the ____

A

quartiles (1sr-3rd)
2nd quartile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When there are 100 sections, they are ______ (1st-99th) and the median is the ___

A

percentiles
50th percentile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the 2nd moment

A

(distance from mean)^2 to each data point / number of data points
This is the variance
How hard it would be to spin the coins around the mean (torque)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a small variance mean

A

This is when the data has a small spread
Its concentrated more towards the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is standard deviation

A

Square root of variance is called the standard deviation (SD)
The standard distance from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do z scores enable

A

Fair comparisons of deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does skewness measure

A

The degree of asymmetry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewness and the 3rd moment

A

3rd moment = (distance from mean)^3 to each data point / no of data points
To make it dimensionless, this is divided by SD^3 ie. Skewness = 3rd moment/ SD^3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does zero and high skewness mean

A

Zero skewness means data are symmetrically distributed, high skewness means distribution is highly asymmetrical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does positive and negative skewness indicate

A

-Positive/negative skewness indicates which direction data are skewed (see graphs for reference)

17
Q

What does kurtosis meaure

A

Kurtosis meaures the sharpness

18
Q

Kurtosis and the 4th moment

A

4th moment = (distance from mean)^4 to each data point / no of data points
To make it dimensionless, this is divided by SD^4 ie. Kurtosis= 4th moment/ SD^4

19
Q

What do we do to the kurtosis

A

Kurtosis is always positive, but we normally subtract 3 (the kurtosis of the normal distribution). This is called the excess kurtosis

20
Q

Reasons for outliers

A

Can be due to inaccuracies in data processing, problems with the methodology (e.g. measurements, instruments, participants not following instructions, an actual extreme value from an unusual participant

21
Q

2 ways of measuring an outlier

A
  1. Based on z score – outlier if z score is more than 3 or less than –3 I.e. when the distance from the mean is more than 3 times of SD
  2. based on the IQR; width between the 1st and 3rd quartile – outlier if value is greater than 1.5 IQR above the 3rd quartule or smaller than 1.5 IQR below the 1st quartie
22
Q

Cumulative probability

A

Using bionomial distribution, you can also calculate the probability that you get heads in a certain range e.g. what is the probability you get no more than 3 heads in 10 tosses?

23
Q

What is the IQR

A

width of between the 1st and the 3rd quartile

24
Q

Two tailed cumulative probability

A

Sometimes you may want to check the probability that a data is deviated from the centre or mean.

In this case, you need to take the cumulative probability at both ends. Checking this probability is called the two tailed probability

25
Q

What do you need to use to describe a distribution of a continuous variable

A

Continuous distribution

26
Q

In continuous distribution, what indicates the probability

A

And the area under the distribution in that range indicates the probability

27
Q

Y axis on continuous distribution

A

Probability density