Biostatistics Flashcards

1
Q

What is continuous data? Examples

A

Logical order with values that can increase or decrease by the same amount

Examples: Ratio and IntervalW

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is ratio data?

A

Absolute 0 where 0 means none

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is interval data

A

No meaningful zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is categorical data? Examples

A

Data fits into a limited number of categories

Ex: Nominal and ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nominal vs ordinal data

A

Nominal: Arbitrary order (order of categories doesn’t matter)

Ordinal: Categories can be ranked

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the mean? When is it preferred?

A

Average

Continuous data with a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is median? When is it preferred?

A

Number in middle with data is organized numerically

Ordinal or continuous data that is skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is mode? When is it used?

A

Most frequently data point

Nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is range?

A

The difference between highest and lowest values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is standard distribution?

A

How spread out data is from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a large standard dev mean?

A

Large amount of data is dispersed away from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is another name for bell-shaped curve?

A

Gaussian

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Where do the central tendencies lie on a Gaussian distribution

A

Mean, median, mode are the same value

68% of data falls within 1 SD of the mean

95% of data falls with 2 SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What makes a distribution skewed?

A

Number of values (sample size) is small or contains outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Method of measuring central tendencies in skewed data?

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the difference between outliers of high vs low values?

A

High: Right (positive) skew → mode to median to mean (left to right)

Low: Left (negative) skew → mean to median to mode (left to right)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the difference between independent and dependent variables?

A

Independent: can be manipulated by the research

Dependent: affected by the independent variables (outcomes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the null hypothesis?

A

That there is no statistically significant difference between groups

Researchers aim to disprove or reject

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is an alternative hypothesis?

A

There is a statistically significant difference between groups

Researchers aim to prove or accept

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the importance for alpha

A

Maximum permissible error margin

Commonly 5%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do we interpret p-value using alpha (0.05)?

A

p-value < a: reject the null proving statistical significant results (alternative hypothesis accepted)

p-value > a: accept the null stating there is no statistical significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a confidence interval

A

Provides information on the significance of p-value

CI = 1-a

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

When would you use CI of 0 vs 1?

A

Comparing differences of means: use 0
Comparing ratio data: use 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How do you interpret CI when looking at means of data?

A

CI includes 0 → Not statistically significant

CI doesn’t include 0 → Statistically significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

How do you interpret CI when looking at ratio data?

A

CI includes 1 → Not statistically significant

CI doesn’t include 1 → Statistically significance

26
Q

How do interpret a 0.95 CI 0.06, 9.35?

A

95% confidence that the true value lies between 6-35%

27
Q

What is the difference between narrow and wide CI?

A

Narrow: high precision
Wide: poor precision

28
Q

What is a false positive? How is it represented?

A

The alternative hypothesis was accepted when and the null was rejected in error

(Null is supposed to be accepted and no statistical significant difference present)

Represented as alpha (Type 1 error)

29
Q

What is a false negative? How is it represented?

A

The null was accepted and the alternative hypothesis was rejected

(Alternative is supposed to be accepted → there is a statistically significant difference present)

Represented by beta (Type 2 error)

30
Q

What is power?

A

The probability that a test would reject the null hypothesis correctly → the avoidance of type 2 errors

31
Q

How do you interpret power?

A

More power → less likely of a type 2 error to occur

32
Q

How do you calculate risk and risk ratio?

A

Risk = number of subjects of an unfavorable event/Total of subjects in group

Risk ratio = risk of treatment group / risk of control group

33
Q

How do you interpret a risk ratio?

A

RR = 1 implies no difference in risk between the 2 groups
RR > 1 implies there is a greater risk in the treatment vs control
RR < 1 implies there is a greater risk in the control vs treatment

34
Q

How to interpret a RR of .57?

A

The treatment group is 57% AS LIKELY to have the progression of outcomes as the control group

35
Q

How do you calculate relative risk reduction?

A

RRR = (% risk of control - % risk of treatment) / % risk of control

RRR = 1-RR (as a decimal)

36
Q

How do you interpret RRR of 47%

A

Treatment group is 47% less likely to have the outcome than the control

37
Q

Why is absolute risk reduction more useful than RR or RRR?

A

RR and RRR is comparative data between treatment and control → no meaning of absolute risk

ARR includes reduction of risk and the incidence rate of outcomes

38
Q

How do interpret a study if ARR was not reported?

A

Risk reduction is minimal to start with, therefore ARR would have been small

39
Q

How do you calculate ARR?

A

I % of control - % of treatment I

40
Q

How do you calculate NNT and NNH?

A

1/ (risk in control - risk in treatment)

1/ARR (in decimal)

NNT: Round up
NNH: Round down

41
Q

How do interpret a NNT of 9 in a study that lasted for 1 yr

A

9 patients needed to receive treatment for 1 yr to see a progression prevented in 1 patient

42
Q

What is NNT?

A

Number of patients needed to treat for a period in order to see a benefit in 1 patient

43
Q

What is NNH?

A

Number of patients needed to be treated for a period in order to see one patient harmed

44
Q

How do interpret a NNH of 9 in a study that lasted for 1 yr

A

9 patients needed to receive treatment for 1 yr for 1 patient to be harmed

45
Q

What study primarily uses odds ratio?

A

Case control

46
Q

What are odd ratio?

A

The estimated risk of an unfavorable event associated with a treatment of intervention

47
Q

How do you calculate OR?

A

OR = (# outcome with exposure/ # no outcome with exposure) / (# outcome without exposure / # no outcome without exposure)

48
Q

What data is used in survival analysis?

A

Hazard rate

49
Q

How do you calculate hazard ratio

A

Hazard rate in treatment / Hazard rate in control

50
Q

How do you interpret OR and HR?

A

OR or HR = 1: the event rate is same (no advantages to the treatment)
OR or HR >1: the event in the treatment group is higher than the control group (control is more advantageous)
OR or HR <1: the event is the control group is higher than treatment (treatment is more advantageous)

51
Q

What is the difference between primary and composite endpoints?

A

Primary: main result that is measured for significant benefit

Composite: combines individual endpoints into one measurement

52
Q

What is the criteria of making a composite endpoint?

A

All endpoints must be of similar magnitude and similar importance to the patient

53
Q

What is the difference between parametric and nonparametric test?

A

Parametric requires normal distribution

Nonparametric: for data not normally distributed

54
Q

Differentiate the types of t-tests? What are they for?

A

T-tests are used for continuous data that is normally distributed

One-sample: 1 sample
Dependent/paired: 1 sample (before and after)
Independent/unpaired student: 2 samples (treatment and control)

55
Q

What test is used for continuous data with a normal distribution and ≥3 samples?

A

Analysis of variance: ANOVA (F-test)

56
Q

What is a sign test

A

Skewed distribution of 1 sample

57
Q

What is Wilcoxon signed rank test for?

A

Continuous skewed distribution of 1 sample with 2 measurements

Categorical data of 1 sample with 2 measurements

58
Q

What is Mann-Whitney test used for?

A

Continuous Skewed distribution of 2 samples

Ordinal data with 2 samples

59
Q

What is Kruskal-Wallis test?

A

Continuous Skewed distribution of ≥3 samples

Categorical data of ≥3 samples

60
Q

When is chi-square tests used?

A

Categorical data of 1 or 2 samples

61
Q

When is Fisher’s extract used?

A

Categorical data of 2 samples