CM- Biostatistics Flashcards

1
Q

What are the 6 steps in the process of statistics?

A
  1. formulate a hypothesis
  2. design a study to test the hypothesis
  3. collect valid and accurate data
  4. manage the data effectively
  5. analyze the data creatively
  6. interpret the findings validly to answer the question
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 4 practical questions for analyzing data?

A
  1. what are the variables?
    - predictor [independent]
    - outcome [dependent]
  2. What types of variables are they?
    - what is the scale
    - what is the category [dichotomous, continuous, nominal, ordinal]
  3. If it is continuous, does it meet normality assumption?
  4. Independent or Paired groups?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 4 categories of variables?

A
  1. dichotomous
    - sex, death [yes or no]
  2. multi-category nominal
    - name, hospital, service
    - infection rates by specialty
  3. multi-category ordinal
    - class rank quartiles
    - age categories [15 to 24, 25-40, 40-60, etc]
  4. Continuous [interval]
    - height, weight, age
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can you tell if a continuous outcome variable is normally distributed ?

A

It will be a standard bell curve where the mean equals the median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When the data is skewed right, what is the better measure of the data, median or mean?

A

Median because the mean will be disproportionately affected by the outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between an independent and paired group?

A

Independent is when the groups don’t have effect on each other.
Ex. ALL UTSW faculty vs. ALL medical students

Dependent is when the groups would have an effect on each other
Ex. Medical students vs. Advisors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

You are performing a study and the independent variable is dichotomous. The dependent variable is also dichotomous.

What should you use to analyze the data?
What measures of association are used?
What tests of statistical significance apply?

A

Analyze with a 2x2 contingency table.

Measures of association:

  1. RR
  2. AR
  3. contingency coefficienct [phi]

Tests:

  1. chi-squared
  2. fisher’s exact test
  3. 95% confidence interval of RR
  4. McNemar’s test [maintain pairing]
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the difference between relative risk and odds ratio?

A

They are calculated the same way with a 2x2 contingency table.

RR = cohort study
Odds Ratio = case control study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between Chi2 and fisher’s exact test when evaluating statistical significance?

A

Chi2 is an approximation that allows you to calculate a p-value.
-Estimation, but good with large numbers

Fisher’s exact is an exact value for statistical significance [not an estimation], but it uses factorials so it is not good with large numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does the 95% confidence interval of the relative risk change with sample size?

A

As the sample size increases the confidence interval becomes more narrow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When is McMemar’s test used?

A

paired observations when the independent and dependent variables are dichotomous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When you are conducting a test with:
Independent variable being multi-category nominal
Dependent variable being dichotomous.

What is used for analysis?
What is the measure of association?
What is the test of statistical significance?

A

Analysis: nX2 contingency table

Measure of association: uncertainty coefficient

Test of Stat Sig : Chi-square
*this will not help you see trends, just differences between the nominal groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

You are conducting a test with:
Independent variable being multi-category ordered
(ex. no, light, medium, heavy smokers)
Dependent variable being dichotomous

What is used for analysis?
What is the measure of association?
What is the test of statistical significance?

A

Analysis: nX2 contingency table

Measure of association: Goodman-Kruskal gamma

Test of Stat Sig:

  1. Chi-square
  2. Chi-square test for trend
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

You are conducting a study with:
Independent variable being continuous
Dependent variable being dichotomous.

What is used for analysis?

A
  1. recode the independent variable into categories and analyze like multi-category ordinal or nominal
    - nX2 contingency table
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

You are performing a study where the independent variable is dichotomous and the dependent variable is continuous.
When you chart the data, the normality assumption is satisfied.

How do you analyze the data?
What is measure of association?
What is the test of stat sig?

A

Analyze: compare the means

Measure of association = subtract the means

Test of stat sig:

  1. student’s t-test
  2. paired t-test
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

You are performing a study where the independent variable is dichotomous and the dependent variable is continuous.
When you chart the data, the data is skewed to the right.

How do you analyze the data?
What is measure of association?
What is test of stat sig?

A

Analyze:

  1. transform the data [log, square root, etc] and then compare means
  2. use non-parametric tests
    - Mann-Whitney U test, Wilcoxon Rank-Sum Test, MEdian test, Signed Rank Test [paired]

Measure of association:
Difference between transformed means

Test of Stat Sig:

  1. student’s T-test
  2. Paired T-test
17
Q

You are performing a study where the independent variable is multi-category. The dependent variable is continuous.
When you chart the data, the normality assumption is satisfied.

How do you analyze the data?
What is the measure of association?
What is the test of stat sig?

A

Analyze:
1. compare means

Measure of association:
1. difference between means

Test of stat sig:

  1. ANOVA [t-test but for more than 2 groups]
  2. F test
18
Q

You are performing a study where the independent variable is multi-category. The dependent variable is continuous.
When you chart the data, the normality assumption is NOT satisfied.

What are the 2 ways you can analyze the data?

A
  1. transform the data [log, etc] and then compare means, measure association by taking difference between means, and use ANOVA as test of stat sig,
  2. use nonparametric tests like
    - Kruskal Wallis
    - non parametric ANOVA
19
Q

You are performing a study where the independent and dependent variables are both continuous.
You chart the data and the normality assumption is satisfied.

How do you analyze the data?
What is the measure of association?
What is the test of stat sig?

A

Analysis:
1. regression analysis [linear or multivariable]

Measure of association:
1. regression coefficient [B] and intercept [A]

Stat significance:
1. regression F test

20
Q

When making a scatterplot for regression analysis of 2 continuous variables, What is on the Y and X axis?
What is B and a?

A

Y is the dependent variable
X is the independent variable
B is the strength of the effect of X on Y [slope]
a is the intercept

21
Q

What is the p-value?
What value must it be for a study to be statistically significant?
When can a higher or lower value be used?

A

Probability that the observed distribution occurred by chance alone.
The higher the P value, the less real your association is.
For most studies, the p value must be below 0.05 to be “statistically significant”

Higher p-value is used with Bayesian stat when the severity of the situation is great
Lower p-values are utilized for small sample sizes

22
Q

How is chi-square calculated?

What increases the value?

A

[AD-BC]sqN divided by [AB+CD+AC+BD]

Increases with:

  1. sample size
  2. degree of diff
23
Q

How do you determine the degrees of freedom to find a chi2 value?

A

DF = max number of rows in the table minus one

The more degrees of freedom, the more difficult it is to get a significant result

24
Q

What is the standardized mortality ratio?

A

cases observed/# expected x 100

It is used to test if the incidence of disease is greater than expected.