Week 5 - Foundations for inference Flashcards

1
Q

sampling variability

A

our point estimate isn’t exactly equal to p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

bias

A

we may systematically under or over estimate the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

central limit theorem

A

as the sample size grows, the distribution of the sample proportion can be approximated by a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

hypothesis testing

A

another way of conducting statistical inference - a formal way of pitting 2 mutually exclusive statements against each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

2 hypothesis statements

A

Null hypothesis - there is nothing going on
- defendant is innocent
Alternative hypothesis - there is something going on
- defendant is guilty

general rule - null is true until we prove otherwise, but we never accept the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

one sided hypothesis test vs two sided

A

one sided = only checking for an effect in one direction
two sided =

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

significance level

A

this defines the threshold p value below which the null hypothesis will be rejected 0.05 is the most common

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

the null distribution

A

the distribution of possible outcomes under the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

type 1 error

A

when we reject a null hypothesis that is true
the probability of a type 1 error is denoted by ‘a’
when the null is true 100 x a% of the time you will observe a p value below ‘a’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

type 2 error

A

when we fail to reject a false null hypothesis

the probability of making a type 2 error is denoted by BETA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

significance level

A

the probability that the event could have happened by chance
the probability of rejecting the null when its actually true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

margin of error

A

the range of values around the sample statistic that is likely to contain the true population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

chi square goodness of fit test

A

used to determine if the data in your sample is drawn from some hypothesized distribution. ie does your expected response match the actual response.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

chi square considerations

A

independence
at least 2 degrees of freedom
at least 5 values in each category it think

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

the t distribution

A

the t distribution is used when we aren’t given the standard deviation. it accounts for the extra uncertainty of another estimated value.

its very similar to the normal distribution but is more spread out to reflect the uncertainty

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

t distribution and degrees of freedom

A

it has one parameter - DF
as df increases, the t distribution becomes less spread out and eventually approximates to the standard normal distribution