Introduction to biostatistics Flashcards

1
Q

When is Spearman’s rank correlation suitable?

A

It’s suitable if

  • there is a non-linear relationship
  • there are outliers
  • variables are on an ordinal scale (e.g. economic status: low, medium, and high)
  • the sample size is small
  • x or/and y are not normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

Assumptions for a chi-squared test

A

“Chi-squared test is a way to compare whether the variation in the data is due to chance or whether it is due to one of the variables we are actually testing”

  • two variables are ordinal or nominal (i.e. categorical data)
  • two or more independent groups
  • for a 2x2 test all expected values must be at least 5
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Parameters defining normal distribution

A

mean (µ) and standard deviation (sigma)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Assumptions when using a one-way ANOVA

A
  • normally distributed data in each group
  • independence of observations
  • variances (SD) are equal in all groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can one use for the analysis of categorical data?

A

The chi-squared test (Pearson’s chi-squared test) to test whether there is an association between two categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Some parametric methods

A
  • pearson correlation
  • T-test
  • ANOVA
  • linear regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is numerical data?

A

Numerical data is quantitative data (actual values as data) and consists of discrete and continous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

You have numerical data of more than two groups. Which parametric and non-parametric test do you use?

A

parametric: One-way ANOVA4

non-parametric: Kruskal-Wallis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Type 2 error

A

H0 is not rejected when it is false

“false non-significant results”

–> happen often when the sample size is too small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is continous data?

A

Continuous data describes data that can take any given value Example: BMI, height, weight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

You have numerical data of two unrelated groups. Which parametric and non-parametric test do you chose?

A

parametric: unpaired t-test (compares the means) non-parametric: Mann-Whitney U test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is categorical data?

A

Categorical data is qualitative data and consists of nominal and ordinal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Assumptions when using a Kruskal-Wallis test

A
  • independence of observations
  • can be used when your variables are not normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Different types of correlation analysis

A
  • Pearson correlation
  • Spearman’s rank correlation (non-parametric)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

SPSS

A

look at the handout if you think that’s important. I don’t ;)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is nominal data?

A

Nominal data is data that is not specially ordered and no comparisons about better/worse can be made. Examples: marital status, gender, blood group, name, …

15
Q

How do you check normality?

A

1) Histogram, Q-Q plot, mean and meadian are close to each other (then it is normally distributed)
2) Kolmogorov-Smirnov test and Shapiro-Wilk test (cave!! p has to be higher than 0.05 in order to say that data is normally distributed!)

16
Q

Assumptions when using an unpaired t-test

A
  • independence of observations
  • normally distributed data in each group
  • variances (SD) are equal in both groups
17
Q

Normal distribution

A

also called Guassian distribution or bell curve

18
Q

Some non-parametric methods

A
  • Spearman’s Rank Correlation
  • Mann-Whitney test
  • Kruskal-Wallis test
  • Wilcoxon test
19
Q

Alternative hypothesis

A

assumes an effect (“there is difference/association”)

21
Q

Type 1 error

A

H0 is rejected when it is true

“false significant results”

22
Q

What is discrete data?

A

Discrete data describes the number of different events and is a whole number. Example: number of pregnancies

24
Q

Assumptions when using a Mann-Whitney u test

A
  • independence of observations
  • can be used when your two variables are not normally distributed
25
Q

Linear regression analysis

A

to predict the value of a variable based on the value of another variable

(a lot of “mathematical” expressions; look at the handout)

26
Q

Null hypothesis

A

assumes no effect (“There is no difference/association”)

27
Q

What is ordinal data?

A

Ordinal data is data that indicates different levels and can be coded by 0-n in order to rank the data Examples: disease stage, education level

28
Q

What are the null- and the alternative hypotheses for the chi-squared test?

A
  • H null: there is no difference between the observed and expected frequencies
  • H1: there is a difference between the observed and the expected frequencies