Inferential Biostatistics Flashcards

1
Q

what are the steps for hypothesis testing?

A
  1. state the null and alternative hypotheses
  2. select the alpha value (usually 0.05)
  3. gather data and perform appropriate statistical tests generating a p value
  4. accept or reject the null
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the null and alternative hypotheses for investigating whether a new drug improves symptoms of tension headache?

A

n: drug produces no change in headache symptoms compared to placebo
a: drug produces a significant change in headache symptoms compared to placebo

  • -> must not say it makes symptoms better because you need to leave open possibility that it makes symptoms worse
  • just say it produces a change or it does not
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the alpha level?

A

designates the level of uncertainty you’re willing to accept
- alpha=0.05… you’re willing to accept that 5 % of the time the test could be wrong, the results are generated by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is the p value?

A

the measure of significance obtained from statistical tests

-the probability that the difference between group means found in the research is due to chance alone

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

describe a parametric test

A
  • focused on population parameters
  • requires a continuous variable (ex cholesterol levels)
  • assumes a normal distribution
  • generally more powerful than nonparametric
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what are examples of parametric tests

A

anova, t-test, paired t-test, confidence intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the central limit theorem

A

creates a normal distribution by sampling the population and distributing the means of each sample

  • states that for a sufficiently large sample size, the means will aways tend to be normal irrespective of the shape of the population from which the samples were drawn
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

why are sampling distributions (central limit theorem) useful?

A

-allows parametric (ones that require normal distribution) tests to be done even if the underlying distribution is not normal so long as the sample size is large enough

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

describe a non-parametric test

A

not focused on population parameters
can be applied to discrete variables
used for smaller samples
makes few assumptions about distribution
-used more frequently than parametric because they don’t need continuous data and don’t make assumptions about distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are examples of non-parametric test

A

chi-square

—also mann-whitney, fischer’s wilcoxon, etc (don’t need to know specifics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does a paired t-test compare

A

compares means of a single group before/after an intervention

ex: pre/post testing to access of value of a workshop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what does a t-test compare

A

compare means between two groups

ex: experimental drug group and placebo group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what does an ANOVA compare?

A

compares means between more than one group

ex: birth weights in non, light, and heavy smokers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does a chi-square test do?

A
  • non-parametric test
  • used for testing hypothesis about nominal scale data
  • test of proportions

ex: is there a significance difference in seat belt use between high school graduates and college graduates?
ex: is there a significant difference in growth of plants treated with water vs plants treated with grape solution?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is type I error?

A

false positives

-null is actually true but it was rejected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is type II error

A

false negatives

-null was actually accepted but alternative is true

17
Q

what is power?

A

estimate of the ability of a study to detect a false null hypothesis

    • the ability of a test to detect a difference if there is one
    • probability of avoiding a type II error, a false negative
18
Q

what does power depend on?

A

multiple aspects of a study design, but most importantly the sample size
-the larger the sample size, the greater the power

19
Q

how do you calculate power?

A

1-beta

beta=probability of making Type II error

20
Q

what is beta?

A

probability of making type II error

  • for medical research, beta=0.2
  • –> 20% chance of making a type II error, a false negative
21
Q

if beta=0.2, what is power of the study?

A

=1-0.2

power = 0.8
there is an 80% chance of avoiding type II error, 80% chance of avoiding false negatives

22
Q

what are the probabilities of type I and type II errors if alpha=0.05 an beta=0.2

A

type I - probability of making false positive is 5%
type I - probability of making false negative is 20%

power of this study is 0.8; 80% chance of avoiding false negatives

23
Q

what is a confidence interval?

A

gives measure of the precision of the study

-95% CI is the range of values within which we can be 95% certain that the actual population value lies

  • also indicates whether a difference found in a study is statistically significant
  • -> if the 95% CI does not include the value of equality, then the difference is significant at less than 0.05

-the narrower the CI, the more precise the estimate

24
Q

interpret:

a study finds the mean systolic blood pressure of 20-40 year old males to be 125 with a 95% CI of 120-130

A

we can be 95% certain that the actual population mean systolic blood pressure for 20-40 year old males lies between 120-130

25
Q

interpret:

a study demonstrates that penicillin cures strep throat in 90% of children, tetracyclin cures strep throat in 40% of childre. the difference in cure rates of strep throat using two different antibiotic regimens is 50% with a 95% CI (25%-75%)

A

we can be 95% certain that there is an actual difference between these two treatment regimen

since 0% is not included in the CI, we can be certain the difference between the antibiotic cure rates is significant at 0.05

26
Q

what determines the width of confidence intervals?

A

variability of the sample, size of the sample

27
Q

what is standard error of the mean?

A
  • standard deviation of the means in the sampling distribution (generated by CLT)
  • used in generating the t-test and confidence intervals

–> limited use as a descriptive statistic

28
Q

why is the SEM not a good descriptive statistic?

A

SD- measure of variability of the underlying population; ie, magnitude of SD depends primarily on the variability of the population

SEM - measure of the variability of the sampling distribution of means; ie, magnitude of SEM depends primarily on the sample size used in the CTL calculations

thus: SEM is always smaller than SD, and therefore is misleading
- usually only indicates a huge sample size, but often used in literature to make results look tighter