Inferential statistics Flashcards

1
Q

why is randomisation important

A

avoid bias and confounders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

sampling methods

A

simple random sampling
stratified sampling
convenience samplong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is data provenance

A
Data provenance is the history of a dataset
– Data cleaning processes
– Imputations for missing data
– How it was collected and by whom
– Access to previous versions etc
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

standard error of the mean

A

SE = s/√n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Hypothesis testing

A

A hypothesis should be something that is testable and falsifiable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

procedure for hypothesis test

A

• a random sample is drawn from a population
• a null hypothesis is formulated
• a test-statistic is calculated, of which we
know the probability distribution
• p-value: evidence for a hypothesis comparing the observed value of the statistic
with the corresponding distribution
• if the p-value<0.05, reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

normality testing

A

Shapiro tests

H0 = Null hypothesis: Data is normally
distributed
H1 = Alternative hypothesis: Data is not
normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

T-test & p value

A

The probability that these two variables are

from the same population, specifically the means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

z test and t test definition

A

• The probability that two means are from the
same populations
• The probability that two means are from
different populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Type I Error

A

– errors where the result is statistically significant
despite the fact that the null hypothesis is true
– i.e., a diagnosis of cancer (“positive”) for healthy
subject

Solution: change alpha value from 5% to 1%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Type II Errors

A

– errors where the result is NOT significant despite
the fact that the hypothesis is true
– i.e., a diagnosis of healthy for a subject who has
cancer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Sensitivity and Specificity

A

• Sensitivity (power): proportion of the
positives that are correctly identified by a
test as being positive
• Specificity: proportion of negatives that are
correctly identified by a test as being
negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly