stats Flashcards

1
Q

null hypothesis

A

based on normal conditions eg 0.5 probability of heads when tossing a fair coin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

alternative hypothesis

A

when the probability is not “normal”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

significance level

A

you set a lower tail and upper tail with anything above or below these numbers allowing you to reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

combination

A

doesn’t care about the order of things for example 3 cards in order A,b,c is the same as 3 cards in order b,c,a

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

permeantation

A

does care about order eg number for a safe so different orders of the same numbers are different possible answers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If you multiply all your data points by a number what happens to mean and sd

A

mean and sd are both multiplied by that number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

if you add a number to all data points what happens to mean and sd

A

add number to mean, nothing happens to sd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

sum of squared deviation (sxx)

A

sum of (x-mean)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

sd

A
  1. root(sxx/n-1)

2. root(sum of the x^2-n x mean^2/n-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

lower boundary outliers

A
  1. LQ-1.5 x IQR

2. mean - 2 x sd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

upper boundary outliers

A
  1. UQ+1.5 x IQR

2. mean + 2 x sd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how do you know if 2 events are mutually exclusive

A

p(A u B)= p(A)+P(B)

p(A n B)=0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

p(B|A)

A

p(B|A)=P(BnA)/P(A)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

P(B|A’)

A

P(B|A’)=P(BnA’)/p(A’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

parent population

A

set of all possible data points from which you will draw your sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

census

A

When population is small enough data can be collected from every member of the population

17
Q

Sampling fraction

A

Sample size/population size

18
Q

Sampling error

A

The difference between an estimate derived through stats and its true value

19
Q

Simple random sampling

A

assign everyone in the population are number and generate random numbers (shift, decimal point, x by total in pop.). Take information from those with corresponding numbers. You should take no less than 30 samples

20
Q

pros and cons of simple random sampling

A
  • equal chance of getting chosen so it will provide an accurate picture of the population and a spread
  • however it’s time consuming and access to the entire population is unlikely
21
Q

Stratified sampling

A

The population is divided into different groups which will have different information (for example tomatoes sizes with a population divided into tomato varieties). Each group should be represented in your sample asa percentage of the sample it takes up equal to the percentage of the population the group takes up

22
Q

Pros and cons of stratified sampling

A
  • Results are likely to accurately reflect the population studied and take into account a wide spread
  • you can’t always divide the population into groups and sometimes members will not fit into any group or will fit into multiple
23
Q

Cluster sampling

A

The population is in groups but there is no reason to suspect the information between groups will be hugely different so use one or more groups as the sample

24
Q

Pros and cons of cluster sampling

A
  • very easy to conduct

- clusters are likely to have been picked by human bias so limited in how representative they are

25
Q

Systematic sampling

A

Take a sample at regular intervals for example every 10th person

26
Q

Pros and cons of systematic sampling

A
  • it’s very simple and when the population is large you’re likely to get a widespread
  • makes data manipulation easier and so bias/skewed results due to targeted outcome
27
Q

Quota sampling

A

You don’t care about groups in the population so you decide in advance how many from each group you will use

28
Q

Pros and cons of quota sampling

A
  • easy and can accommodate population proportions to improve accuracy
  • unsure if you can fill quota and non-random so gives researchers a lot of say
29
Q

Opportunity sampling

A

You use anyone you can get for example drop a net and use the cod you catch

30
Q

Pros and cons of opportunity sampling

A
  • quick and easy

- easily manipulated by only talking certain people/cold calling in one area

31
Q

Self-selecting sample

A

Use whoever allows you to sample them

32
Q

pros and cons of self-selecting sample

A
  • can allow you to get to get a spread

- tends to be a certain type of person so can make the results exaggerated and unrepresentative