Maths stats Flashcards

1
Q

What is a census

A

Measures or observes every member

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a sample

A

Selection of observation taken from subset of pop used to find out about whole pop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Advantages of census

A

Results should be completely accurate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Disadvantages of census

A

Time consuming, expensive, cannot be used when testing destroys process and hard to process large quantity of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Advantages of sample

A

Less time consuming and cheaper, fewer people have to respond, less data needs to be processed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is random sampling

A

Each member of pop has equal chance of being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is simple random sampling

A

Everything has equal chance of being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Advantages of simple random sampling

A

Free of bias, easy and cheap for small samples and pops and each sampling unit has known and equal chance of selectionDi

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

sadvantages of simple random sampling

A

Sampling frame needed and not suitable for large samples and populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is systematic sampling

A

The required elements are chosen at regular intervals from and ordered list

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Advantages of systematic sampling

A

Simple and quick to use, suitable for large samples and large populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Disadvantages of systematic sampling

A

A sampling frame is needed and bias introduced if sampling frame is needed, Bias introduced if sampling frame is not random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is stratified sampling

A

The population is divided into mutually exclusive strata and a random sample is taken from each strata in proportion to size of strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Equation for stratified sampling

A

(number in stratum x overall sample size) / number in population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Advantages of stratified sampling

A

Sample accurately reflects population structure, proportional representation of group within population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

DIsadvantages of stratified sampling

A

Population must be clearly classified into distinct strata, same disadvantages as simple random within each strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Two types of non random sampling

A

Quota sampling and opportunity sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is quota sampling

A

An interviewer selects a sample that reflects the charecteristics of the whole opulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Advantages of quote sampling

A

Allows small sample to still be representational of whole pop, so sampling frame, quick and cheap and easy comparison between different groups within a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Disadvantages of quota sampling

A

Non random sampling can introduce bias, population must be divided into groups which can be costly or innacurate, increasing scope of study increases number of groups which adds time and money, non responses not recorded

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is opportunity sampling

A

Sample is taken from people who are available at the time and who fits criteriaA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Advantages of opportunity sampling

A

Easy and inexpensive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Disadvantages of opportuinity sampling

A

Unlikely to provide a representitative result and highly dependant on researches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Criteria for a binomial dist

A

The number of observations n is fixed.
Each observation is independent.
Each observation represents one of two outcomes (“success” or “failure”).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
To make binomial suitable what would be ideal
LARGER N
26
Conditions for normal aproximation of binomial
Large n and p close to 0.5
27
State one disadvantage of using quota sampling compared with simple random sampling.
nOT RANDOM SO CANNOT USE RELIABLY FOR INFERENCES
28
Mutually exclusive
Both cannot happen at once
29
P(A or B) mutulaly excluvive
P(A) + P(B)
30
Independent P(A GIVEN B )
P(A)
31
Reason to include outliers
It is a piece of data and we should include all pieces of data
32
Reasons to not include outliers
It is extreme and could unduly influence anaylsis or could be a mistake
33
Do you include NA at all when calculating mean
No
34
State the assumption involved with using class midpoints to calculate an estimate of a mean from a grouped frequency table.
Assumes values are uniformly distributed within the classes
35
Why could random sampling not be used
It is not possible to have a sampling frame
36
Conditions for normal dist
Variable has to be continuous
37
P(x=5) for continuous
0 as continous
38
+- standard deviation for normal
Point of inflection
39
Numbers not in table
3sf
40
Numbers in talbe
4 dp
41
For normal mean=
Mode = median
42
z=
x-mew / o
43
Standard deviation binomial
np(1-p) root that
44
Why would oyu have to times p vbalue by 2
For normal dist, it is ewual on both sides
45
For cumalitive frequenct what value do you plot
Top value
46
Histogram height
Work out area scale factor in relation to frequency (double check this )
47
What is extrapolating
Estimate outside range (unreliable )
48
What is interpolation
Estimate inside range
49
What is the explanatory variables
The one thrat explains the other and that causes change
50
What does close to 1 mean
Positive correlation
51
Normal aprox
Make binomial and then make normal from that i think
52
list
aDD 1/2 FOR MEDIAN
53
Conditions for poisson aprox of binomial
Large n small p np <10
54
variance of (3x-1)
square 3
55
variance poisson
Mean or np(1-p)
56
Conditions for poisson
Events must occur independently, events must occur singly, events cannot occur at same time, events occur at constant
57
H0 for chi squared test
No difference between theoretical frequency and observed frequency
58
Binomial need thing
Need to either succeed or fail
59
What is area
Significance level
60
DOR
(rows-1)(columns-1)
61
EF
Row time column over grand
62
What is needed for central limit theorum
Large sample size unless data is already normally distributed
63
mean for nb
r/p
64
variacne for nb
r(1-p)/p squared
65
What is a type 1 erro
Actual significance error (chance they are lucky)
66
What is type 2 error
Incorrectly accept h0 p(not critical region given h1 i true)
67
What will reducing significane level (type 1 error) do
Increase type 2 error
68
What does increasing n do to erorrs
Reduces type 2
69
What is size
Type 1 errorh
70
What is power
1-type 2 error
71
size
Rejecting h0 given h0 is true
72
Power
rejecting h0 given h1 is true
73
variance
Dont divide by n if its a discrete uniform distribution
74
If chi squared greater than critical value
Reject that it is a fitable modle
75
g(1) always =
1
76
Remember you can rearrange var formula for e(x)squared
ok
77
What happens if proportion of defective things stays same and dof stays same but proportions can be ree allocated
No change in test statistic
78
) Explain the relevance of the Central Limit Theorem in part (a)
CLT applies since the sample size is large B1 3.5b CLT states that the sample mean/ S is (approximately) normally distributed
79
formula for chi squared
(o-e)squared/E
80
E in 2 wasy
Row total times column total / grand total
81
Dof 2 way
(r-1)(c-1)
82