stuff to remember Flashcards
what do you do when the sample size is at least a tenth of the population
when n/N > 0.1, the variance is the usual thing ([➰^2]/n) times the finite population correction factor (1-[n/N])
N being provided in the exam is a suggestion that this is needed
Bias formula
Bias(estimator) = E(estimator) - thing the estimator estimates
it’s positively biased if the bias is positive
(same for negative)
how do you tell which estimator is better than another
the one with the smallest mean squared error is best
MSE(🌐)= Var(🌐) + (Bias (🌐))^2
formula for confidence intervals
sample mean +- estimated standard error x t
when determining which sample size will give a particular tolerance or confidence interval, which value of t do you use
the z value (t infinity), a/2
a/2 is used with all confidence interval questions and two tailed tests
a is used for one tailed tests
when to use t n-1 and when to use z (t infinity) for confidence intervals and hypothesis testing
use t n-1 for:
• only means (never proportions)
• when n<30 AND v is unknown (so you’re given or made to work out the SAMPLE v rather than the v for the whole population)
• differences between two means WHEN variances ARE the same and UNKNOWN (and you have to use a pooled s from the formula sheet (both ns-2 instead of n-1)
• paired data (when you do d=x-y)
use t infinity or z for:
• proportions
• means where n is large and/or ➰is known
• finding n for a confidence interval (always)
• differences between two means UNLESS variances are unknown and equal (even if they’re unknown and unequal)
which error is which
type 1: alpha, false positive, rejecting the null hypothesis when the null hypothesis is true (the worse one)
type 2: beta, false negative, accepting the null hypothesis when it isn’t true (the better one)
what’s a probability distribution and what’s a probability function
function is the { thing
distribution is the table
what is a binomial distribution
values come from a series of independent bernoulli trials so:
2 possible outcomes from each trial (success and failure)
fixed prob of success
fixed no of trials
trials independent
X~Bin(n,pi)
what is a poission distribution
a discrete distribution where values are independent of each other and can occur anywhere on a continuum
X~Pois(lambda)
What assumptions may you make when doing confidence intervals
- samples are independent
- values are normally distributed
- when sample sizes are small and the variances unknown, they may be assumed to be equal
Axioms of probability
- a probability must be equal to or greater than zero
- the probability of s, where s represents a set, is 1
- the probability of a union of mutually exclusive events is the sum of their probabilities
what is a p value
probability of obtaining a certain value, or something more extreme, if the null hypothesis is true
(the probability of making a type 1 error/false positive)❓
when do you use the poisson approximation to the binomial
when n is so big that nCx doesn’t work,
when
n>30
n(pi) <10 because pi is so extremely small
what is the power of a hypothesis test?
the probability that a false null hypothesis will be rejected
1-(probability null hypothesis will be rejected when it’s actually true)
= 1-p(type 2 error)
= 1-beta