Choosing statistics Flashcards

1
Q

Give important points to remember when selecting a study design, including the choice of statistical test.

A

The stats must test the hypothesis.
The patient population or study sample (i.e. inclusions and exclusion criteria) must be selected to allow for a comparison that will test the hypothesis.
The patient outcome measures (i.e. variables) must also be useful for testing the hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the best way to select the most appropriate statistical test for the hypothesis once the data has been collected (or even before)?

A

Use a statistical decision tree.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When is a Pearson statistical test used?

A

The hypothesis proposes a correlation, the variables are continuous and have a normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When is a Spearman rank statistical test used?

A

The hypothesis proposes a correlation, the variables are continuous and do not have a normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When is a Chi-squared statistical test used?

A

The hypothesis proposes either a correlation or a comparison between groups, and the variables are discrete.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When is ANOVA used?

A

The hypothesis proposes a comparison between groups, the variables are continuous and have a normal distribution, there are >2 groups and only one variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When is a t-test used?

A

The hypothesis proposes a comparison between groups, the variables are continuous and have a normal distribution, there are 2 groups of either paired or independent data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is a Kruskal-Wallis statistical test used?

A

The hypothesis proposes a comparison between groups, the variables are continuous and do not have a normal distribution, there are >2 groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When is a Wilcoxon statistical test used?

A

The hypothesis proposes a comparison between groups, the variables are continuous and do not have a normal distribution, there are 2 groups of paired data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is a Mann-Whitney statistical test used?

A

The hypothesis proposes a comparison between groups, the variables are continuous and do not have a normal distribution, there are 2 groups of independent data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the 2 main types of data that may be analysed?

A

Quantitative and qualitative (categorical).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is quantitative data?

A

Numeric information about quantities. Information that can be measured and written down with numbers, including dimensions such as height, width, length and mass or other measured variables such as temperature, blood pressure and price etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is qualitative data?

A

Information about qualities. Information that can’t actually be measured. Qualitative data deals with categories such as gender (male or female), life status (alive or dead) and stages of disease (e.g. hypertension).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Give an example of a variable that could be measured either quantitatively or qualitatively.

A

Eye colour can be measured quantitatively by assessing the RGB scale or qualitatively by categorising into blue, brown or green etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the different types of quantitative data?

A

Continuous and counted (discrete).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the different types of qualitative (categorical) data?

A

Nominal (unordered) and ordinal (ordered).

17
Q

Give examples of nominal (unordered) data.

A
Gender (male/female/other).
Life status (alive/dead).
18
Q

Give examples of ordinal (ordered) data.

A

Fitness (not fit, quite fit, very fit).

Stages of hypertension.

19
Q

Give examples of continuous data.

A

Age.

Heart rate.

20
Q

Give examples of counted (discrete) data.

A

Number of males/females in a group.

Number of people with hypertension.

21
Q

What types of data are parametric?

A

Continuous quantitative data.

22
Q

What types of data are nonparametric?

A

Nominal and ordinal qualitative data.

Counted (discrete) quantitative data.

23
Q

What does ‘normality’ measure, and what is it used for?

A

The central tendency and dispersion of data, used to decide how to describe the properties of large data-sets i.e. the descriptive statistics which are presented instead of the raw data.

24
Q

What are the features of a normal distribution?

A

Normal distribution has a symmetric distribution with well-behaved tails i.e., many data points at the central region of the range and a symmetrical disruption either side, a.k.a. ‘bell curved’ or ‘Gaussian’.

25
Q

What are the features of a skewed data distribution?

A

Asymmetric with many data points in the high or low end of the range and an uneven tail (long on one side and short on the other).

26
Q

What are the features of a kurtosis data distribution?

A

Heavy-tailed or light-tailed relative to a normal distribution. Data sets with high kurtosis tend to have heavy tails, or outliers that create a very wide distribution. Data sets with low kurtosis tend to have light tails, or lack of outliers that create a very narrow distribution.

27
Q

What are the statistical tests of normality?

A

Shapiro-Wilks test: used to test for normality with small sample sizes (n<50).
Kolmogorov-Smirnov: used to test for normality with large sample sizes (n>50).

28
Q

How do paired data arise?

A

Typically, paired observations arise from measuring the same variable in the same subject at different time-points (this could be referred to as a longitudinal experiment).

29
Q

How do independent data arise?

A

Unpaired, or independent, observations are seen when comparing two groups with no common factors (this could be referred to as a cross-sectional study).

30
Q

What are parametric statistics?

A
A parametric statistical test is one that makes assumptions about the parameters (defining properties) of the population distribution(s) from which the data are drawn, while a non-parametric test is one that makes no such assumptions.
Parametric statistics (e.g. t-test, ANOVA) are used when the data from the population is well described by the mean and standard deviation- normally distributed.
31
Q

What are non-parametric statistics?

A

Non-parametric tests (e.g. Mann-Whitney, Wilcoxon signed rank test) are adopted when the population is not well described by the mean and standard deviation. For example, when quantitative data is not normally distributed. Non-parametric are also used when the data is qualitative.

32
Q

What are the measures of centrality?

A

Mean, median and mode.

33
Q

What are the measures of dispersion?

A

Standard deviation, standard error of the mean, CI.