Stats Flashcards

1
Q

How does quota sampling work?
Name one advantage and one disadvantage.

A

Take a set number (a quota) from each category, in proportion to the size of each group in the population
Adv: all categories are represented
Disadv: not random, so can lead to bias

2
Q

How does stratified sampling work?
One advantage
One disadvantage

A

Take a random sample from each stratum, with the sample sizes proportional to the sizes of the strata in the population
Adv: sample accurately reflects the population, selection is random
Disadv: time consuming, depends on a sampling frame being available
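
As a rough illustration of the proportional allocation described above, here is a minimal Python sketch. The function name `stratified_sample`, the year-group strata and their sizes are all made up for the example, and rounding each stratum's share may leave the total slightly off the target when the sizes do not divide neatly.

```python
import random


def stratified_sample(strata, sample_size, seed=None):
    """Draw a proportional random sample from each stratum.

    strata: dict mapping a stratum name to the list of its members.
    """
    rng = random.Random(seed)
    population_size = sum(len(members) for members in strata.values())
    sample = {}
    for name, members in strata.items():
        # Number to take = (stratum size / population size) * sample size
        k = round(len(members) / population_size * sample_size)
        sample[name] = rng.sample(members, k)
    return sample


# Hypothetical school: 60 pupils in Year 12 and 40 in Year 13
strata = {
    "Year 12": list(range(60)),
    "Year 13": list(range(60, 100)),
}
sample = stratified_sample(strata, sample_size=10, seed=1)
print({name: len(chosen) for name, chosen in sample.items()})
# → {'Year 12': 6, 'Year 13': 4}
```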

3
Q

How does systematic sampling work?
One advantage
One disadvantage

A

Number every piece of data in the population, use a random number generator to choose a starting point, then select every nth piece of data
Adv: random starting point, so less likely to lead to bias
Disadv: needs a sampling frame
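
The steps above can be sketched in Python roughly as follows. The name `systematic_sample` is hypothetical, and the sketch assumes the population size is a multiple of the sample size so that n is a whole number.

```python
import random


def systematic_sample(population, sample_size, seed=None):
    """Pick a random start among the first n items, then take every nth item."""
    rng = random.Random(seed)
    n = len(population) // sample_size   # interval between selected items
    start = rng.randrange(n)             # random index from 0 to n - 1
    return population[start::n][:sample_size]


# Hypothetical sampling frame numbered 1 to 100, sample of 10 (so n = 10)
population = list(range(1, 101))
sample = systematic_sample(population, 10, seed=0)
print(sample)
```

Whatever the starting point, the selected values are always 10 apart here, which is what makes the sample systematic rather than simple random.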

4
Q

How does opportunity sampling work?
One advantage
One disadvantage

A

Pick the data as it becomes available
Adv: easy and quick (cheap)
Disadv: not random, so can lead to bias

5
Q

How does simple random sampling work?
One advantage
One disadvantage

A

Number every piece of data in the population, then use a random number generator to pick the numbers in the sample, and keep going until you have your sample
Adv: random and less likely to be biased, each piece of data has an equal chance of being picked
Disadv: requires a sampling frame
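
For comparison, a simple random sample is only a few lines in Python. This is a sketch using the standard library's `random.sample`, which selects without replacement; the function name and the frame of 50 items are assumptions for the example.

```python
import random


def simple_random_sample(sampling_frame, sample_size, seed=None):
    """Every item in the sampling frame is equally likely to be chosen."""
    rng = random.Random(seed)
    return rng.sample(sampling_frame, sample_size)


# Hypothetical sampling frame numbered 1 to 50
frame = list(range(1, 51))
chosen = simple_random_sample(frame, 8, seed=3)
print(sorted(chosen))
```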

6
Q

If the data has outliers, which average and which measure of spread would be best to use?

A

The median, as the mean is distorted by extreme values

The interquartile range, as it is not affected by outliers (it represents the middle 50% of the data)
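
A quick numerical check of the point above, using made-up data in which the largest value is replaced by an outlier. Note that `statistics.quantiles` uses its own default rule for quartiles, which may differ slightly from the method taught on a particular course.

```python
import statistics


def summarise(data):
    """Return the mean, median and interquartile range of a data set."""
    q1, _, q3 = statistics.quantiles(data, n=4)
    return {
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "iqr": q3 - q1,
    }


original = [12, 13, 14, 15, 16, 17, 18]
with_outlier = [12, 13, 14, 15, 16, 17, 95]   # 18 replaced by an outlier

print(summarise(original))
print(summarise(with_outlier))
```

The mean moves from 15 to 26, while the median and interquartile range are unchanged.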

7
Q

Which average and which measure of spread are the most accurate, and why?

A

The mean and standard deviation, as both measures use every value in the data

8
Q

Is the explanatory variable the x or the y values?

A

x values

9
Q

Is the response variable the x or the y values?

A

y values

10
Q

If a regression line y = 17.0 + 15.4x represents the relationship between the percentage (x%) of cocoa solids and the price (y pence) of different chocolate, interpret the value 15.4

A

For every extra 1% of cocoa solids that the chocolate contains, the price increases by 15.4 pence on average
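
To see the interpretation numerically, here is a small sketch that takes 17.0 as the intercept and 15.4 as the gradient, as in the answer above. The function name and the cocoa percentages 40 and 41 are chosen only for illustration.

```python
def predicted_price(cocoa_percent, intercept=17.0, gradient=15.4):
    """Price in pence predicted by the regression line y = intercept + gradient * x."""
    return intercept + gradient * cocoa_percent


# Raising the cocoa content by 1 percentage point changes the prediction by the gradient
increase = predicted_price(41) - predicted_price(40)
print(round(increase, 1))   # → 15.4
```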

11
Q

If the regression is of p on n, is the linear regression line
p = an + b
or
n = ap + b?

A

p = an + b

12
Q

When data is coded, what is the mean affected by?

A

addition / subtraction
multiplication / division

13
Q

Continuous data is ...

A

Data that can take any value within a given range

14
Q

When data is coded, what is the standard deviation affected by?

A

multiplication / division
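
The two coding cards (mean and standard deviation) can be checked with a small example. This sketch assumes a coding of the form y = (x - a) / b with made-up values a = 100 and b = 10, and uses the population standard deviation `statistics.pstdev`.

```python
import statistics

# Hypothetical raw data and coding y = (x - a) / b
x = [110, 120, 130, 140, 150]
a, b = 100, 10
y = [(xi - a) / b for xi in x]   # coded values 1, 2, 3, 4, 5

mean_x, sd_x = statistics.mean(x), statistics.pstdev(x)
mean_y, sd_y = statistics.mean(y), statistics.pstdev(y)

# The mean is affected by both the subtraction and the division
print(mean_y, (mean_x - a) / b)
# The standard deviation is affected only by the division
print(sd_y, sd_x / b)
```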

15
Q

Discrete data is ...

A

Data that can only take specific values, e.g. shoe size

16
Q

In the large data set, which UK locations are on the coast (windy)?
List them from north to south

A

Leuchars, Hurn, Camborne

17
Q

In the large data set, which worldwide locations are on the coast?

A

Jacksonville and Perth

18
Q

In the large data set, what is the daily maximum gust measured in? Give a definition of what this unit means

A

Knots
1 knot = 1 nautical mile per hour ≈ 1.15 mph

19
Q

In the large data set, what are the only 3 categories of data where the data is continuous?

A

daily mean rainfall
daily hours of sunshine
daily max temperature

20
Q

Helpful histogram formula

A

area = k × frequency (the area of each bar is proportional to its frequency)
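
A short sketch of how the formula is usually applied: bar height is the frequency density (frequency divided by class width), which corresponds to k = 1. The class boundaries and frequencies below are invented for the example.

```python
def frequency_densities(classes):
    """classes: list of (lower boundary, upper boundary, frequency) tuples."""
    return [freq / (upper - lower) for lower, upper, freq in classes]


# Hypothetical grouped data with unequal class widths
classes = [(0, 10, 5), (10, 15, 10), (15, 30, 6)]
densities = frequency_densities(classes)

for (lower, upper, freq), height in zip(classes, densities):
    area = (upper - lower) * height   # with k = 1 the area equals the frequency
    print(f"{lower}-{upper}: height {height}, area {area:.1f}, frequency {freq}")
```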

21
Q

What is the definition in words of the standard deviation?

A

A measure of the average distance of the values from the mean
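
For reference, the definition in words corresponds to the usual formula for the standard deviation of n values with mean x̄. Strictly it is the square root of the mean squared distance from the mean, which is why "average distance" is only an informal description. The second form is the one that is normally quicker for calculation.

```latex
\sigma = \sqrt{\frac{\sum (x - \bar{x})^2}{n}}
       = \sqrt{\frac{\sum x^2}{n} - \bar{x}^2}
```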

22
Q

For a hypothesis test that uses the binomial distribution, what are the null and alternative hypotheses for a two-tailed test?

A

H0 : p = p0 (the value of p being tested)
H1 : p ≠ p0

23
Q

For a hypothesis test that is testing for positive correlation, what are the null and alternative hypotheses?

A

H0 : ρ = 0 (ρ is rho, the population correlation coefficient)
H1 : ρ > 0

24
Q

For a hypothesis test that is testing to see if the mean has decreased, what are the null and alternative hypotheses?

A

H0 : μ = μ0 (the original mean)
H1 : μ < μ0

25
Q

If you are using the normal approximation to a binomial distribution with X ~ B(10, 0.2) and need to find P(X > 3), what probability do you find for the normal distribution?

A

Use a continuity correction: P(X > 3) includes the values 4, 5, 6, ..., 10
so find P(Y > 3.5)
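
The continuity correction can be checked against the exact binomial probability. This sketch uses the approximating normal with mean np and variance np(1 - p); with n as small as 10 the approximation is fairly rough, so the two answers agree only approximately.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p = 10, 0.2

# Exact binomial: P(X > 3) = P(X = 4) + P(X = 5) + ... + P(X = 10)
exact = sum(comb(n, r) * p**r * (1 - p) ** (n - r) for r in range(4, n + 1))

# Normal approximation Y ~ N(np, np(1 - p)) with continuity correction: P(Y > 3.5)
approx_dist = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p)))
approx = 1 - approx_dist.cdf(3.5)

print(round(exact, 4), round(approx, 4))
```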

26
Q

Hypothesis test for a mean using the normal distribution (ND): if X ~ N(120, 5²) and a sample of 25 values is taken, what ND do you use for the test, and what is the new value of the standard deviation?

A

The mean stays the same (120)
SD = 5 / √25 = 1, so use X̄ ~ N(120, 5²/25)
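
A minimal sketch of the sample-mean distribution, taking 5 as the population standard deviation so that the standard deviation of the sample mean is σ/√n. The observed sample mean of 118 is a made-up value, used only to show how a test probability would be found.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 120, 5, 25

# The sample mean keeps the same mean, with standard deviation sigma / sqrt(n)
sample_mean_dist = NormalDist(mu, sigma / sqrt(n))
print(sample_mean_dist.mean, sample_mean_dist.stdev)

# Hypothetical observed sample mean for a lower-tail test
observed = 118
p_value = sample_mean_dist.cdf(observed)   # P(sample mean < 118)
print(round(p_value, 4))
```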

27
Q

ACTUAL significance level of a hypothesis test: what are they asking for?

A

The actual probability of the test statistic falling in the critical region, e.g. if the significance level is 0.05 but the probability of the critical region (critical value X = 5) is 0.0223, then 0.0223 is the actual significance level
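
To illustrate the idea, the sketch below finds a lower-tail critical region and its actual significance level. The distribution X ~ B(20, 0.5) and the helper names are assumptions for the example and are not the figures from the card.

```python
from math import comb


def binomial_cdf(k, n, p):
    """P(X <= k) for X ~ B(n, p)."""
    return sum(comb(n, r) * p**r * (1 - p) ** (n - r) for r in range(k + 1))


def lower_critical_value(n, p, significance):
    """Largest c with P(X <= c) <= significance, or None if there is none."""
    critical = None
    for k in range(n + 1):
        if binomial_cdf(k, n, p) <= significance:
            critical = k
        else:
            break
    return critical


# Hypothetical one-tailed test at the 5% level with X ~ B(20, 0.5)
c = lower_critical_value(20, 0.5, 0.05)
actual_level = binomial_cdf(c, 20, 0.5)
print(c, round(actual_level, 4))   # critical region X <= 5, actual level 0.0207
```

The actual significance level is below 5% because the binomial distribution is discrete, so the critical region cannot hit 0.05 exactly.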

28
Q

When doing a two-tailed hypothesis test, what do you need to ensure you always do first?

A

Divide the significance level by 2 (half in each tail)

29
Q

What type of question means you have to use the standard normal distribution Z ~ N(0, 1²)?

A

When the mean, the SD, or both are missing

30
Q

When trying to find the mean or SD using Z ~ N(0, 1²), what is the key formula?

A

z = (x - μ) / σ
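
As a sketch of how the formula is used to recover a missing mean: look up z from the given probability, then rearrange to μ = x - zσ. The numbers (P(X < 30) = 0.9 with SD 4) and the function name are invented for the example.

```python
from statistics import NormalDist


def missing_mean(x, sd, prob_below):
    """Find the mean of a normal distribution given P(X < x) = prob_below and the SD."""
    z = NormalDist().inv_cdf(prob_below)   # z-value from the standard normal Z ~ N(0, 1)
    return x - z * sd                      # rearranged from z = (x - mean) / sd


# Hypothetical question: P(X < 30) = 0.9 and the SD is 4
mean = missing_mean(30, 4, 0.9)
print(round(mean, 2))
```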

31
Q

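What is the general formula for finding probabilities with the binomial distribution using sigma (Σ) notation?

A

Sum from the lower limit to the upper limit, using the binomial coefficient nCr:
P(a ≤ X ≤ b) = Σ (r = a to b) nCr × p^r × (1 - p)^(n - r)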

32
Q

If you are using the linear regression line to make a prediction, what are the two things you need to watch for?

A

Is the value within the range of the data (interpolation) or outside it (extrapolation)? Extrapolation is not reliable

Are you using the independent variable (x) to predict the dependent variable (y)?

33
Q

With vectors, what formula gives the position vector of the final position when the object does not start at the origin?

A

r1 = r0 + s (final position = initial position + displacement)
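
A small worked example of the formula, with vectors stored as (i, j) component pairs. The starting position, velocity and time are made up, and the displacement is found from s = vt, which assumes constant velocity.

```python
def displacement(velocity, t):
    """Displacement s = v * t, assuming constant velocity."""
    return (velocity[0] * t, velocity[1] * t)


def final_position(r0, s):
    """Final position r1 = r0 + s, added component by component."""
    return (r0[0] + s[0], r0[1] + s[1])


# Hypothetical object starting at 2i + 3j, moving with velocity 5i - 2j for 4 seconds
r0 = (2, 3)
s = displacement((5, -2), 4)
r1 = final_position(r0, s)
print(r1)   # → (22, -5)
```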

34
Q

If the string is inextensible, what are you able to assume?

A

The acceleration of the connected objects is equal in magnitude

35
Q

The object is moving in the direction of 5i - 2j, so ...

A

velocity = k(5i - 2j), where k > 0
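
If the speed is also known, k can be found by dividing the speed by the magnitude of the direction vector. A minimal sketch, with a speed of 10 chosen arbitrarily and a hypothetical function name:

```python
from math import hypot


def velocity_from_direction(speed, direction):
    """Scale a direction vector (i, j) so that its magnitude equals the speed."""
    k = speed / hypot(direction[0], direction[1])   # k = speed / |direction|
    return (k * direction[0], k * direction[1])


# Direction 5i - 2j from the card, with an assumed speed of 10
v = velocity_from_direction(10, (5, -2))
print(v)
```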

36
Q

If the string is LIGHT, what are you able to assume?

A

The tension is the same throughout the string

37
Q

If an object is positioned south east of the origin, what does this mean?

A

r = k(i - j), where k > 0

38
Q

Other than air resistance, what may affect an object travelling through the air under gravity?

A

Spin of the object, dimensions of the object, wind effects

39
Q
A