Statistics 3 Flashcards

1
Q

Define probability

A

A quantitative description of the likelihood of an event occurring within dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three ways of calculating probability?

A

Subjective: Judgement based on individual opinion and experiences
Theoretical: Calculation based on reasoning using knowledge of the specific dataset characteristics
Experimental: Measurement of an observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is sample space?

A

All the outcomes that could occur within a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does it mean if events are mutually exclusive?

A

Zero probability of two events occurring together (e.g. on a dice there is no way of getting a 5 and a 3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does it mean if events are independent?

A

Events that have no influence on each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What should you typically assume about events?

A

That they are independent but not always

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In a normal distribution what does the area under a curve represent?

A

the probability of different events occurring within the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

In a normal distribution what does the curve represent and what is underneath?

A

the probability density function and underneath are all the associated probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why cant you compare different normal distributions?

A

Because they typically have different means and standard deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do you need to do to compare different normal distributions?

A

Standardisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the standardization formula?

A

X-u/ standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the standardization formula do to the mean and standard deviation of normal distributions?

A

Mean: makes it 0

Standard deviation: makes it 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What happens as you move away/to from the mean?

A

there is losses or gains in the mass of the dataset which is captured which then affects probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is hypothesis testing?

A

Testing to see whether the hypothesis around outcomes of a dataset are statistically significant or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the initial hypothesis called and how is it phrased?

A

Null hypothesis always negatively phrased.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the alternative hypothesis and when is it formed?

A

The alternative to the null that is formed as the test has been carried out and after the null hypothesis has been rejected or accepted based on the test

17
Q

What are 3 examples of null hypothesis? No statistically significant…

A

difference between sample and population
difference between samples a, b, c
relationship between variables a, b, c

18
Q

How is the significance threshold determined?

A

By you - you decide what is a suitable level for a probability to confidently not have occurred due to chance

19
Q

What are 4 typical levels of significance?

A

90%, 95%, 99%, 99.9%.

20
Q

What does the result mean in regards to the statistical significance threshold?

A

If the result falls outside of the significance threshold then it did occur due to chance, but if it does surpass then you can be confident that it did not occur due to chance and it is indeed a statistically significant result

21
Q

Why is it crucial to test for normality in a distribution?

A

To determine whether to carry out parametric or non-parametric data analysis

22
Q

What are the three ways of determining normality in a distribution?

A

Observation
Q-Q plot
K-S test

23
Q

What is the q-q plot?

A

Determining whether there is an even scatter of data that is also closely centred around the drawn straight line

24
Q

What is the K-S test?

A

Observational with empirical analysis of whether a distribution is normally distributed

25
Q

What does the K-S test involve?

A

Comparing the measured/observed distribution against a theorised normal distribution of that dataset.

26
Q

What does it mean visually if the observed distribution resembles that of the theorised distribution?

A

that it is probably normally distributed

27
Q

Why is the K-S similar to hypothesis testing?

A

Because you accept the null or alternative hypothesis

28
Q

What is the null hypothesis and alternative hypothesis for a k-s test?

A

Null: there is no statistically significant difference between the observed dataset and the theorised dataset i.e. it resembles the normal distribution
Alternative: there is statistical significant difference i.e. it does not resemble the normal distribution

29
Q

How do you analyse a K-S test in SPSS?

A

Analyse > non-parametric tests > legacy dialogs > one sample K-S > tick “normal” box

30
Q

What is the product of the K-S test on SPSS?

A

Table - the column titled “sig.” tells you what the level of significance is and therefore enables you to determine whether to accept the null or alternative hypothesis

31
Q

What is the importance of the “statistic” column in the SPSS table produced for K-S?

A

Nothing