RDS Exam Flashcards

1
Q

What is a hypothesis?

A

A statement that predicts the finding of a research study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a p-value

A

Probability that an observed difference between groups (e.g. fracture incidence) was observed by chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the alpha a -value represent?

A

Probability of rejecting the hypothesis when it is true. It is the value at which you reject the hypothesis when the p-value is less than it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a research question?

A

The question the research is trying to answer. The research question provides the frame for the entire research project

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different types of data?

Describe each

A

Quantitative – Can be split into counted (discrete e.g. number of children in a family) and measured (continuous e.g. temperature)

Qualitative – So like descriptive and free text

Categorical (kinda between qual and quant) - Nominal (individual terms like colours or gender) and ordinal (where there is a scale and can be converted into numbers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you test for normality?

A

Kolmogorov Smirnov test for sample size above 50

Shapiro-Wilks test for sample size below 50

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is skewness? Describe it in terms of positive and negative skew and where the medians and modes and means are.

A

Asymmetric distribution with many data points in the high or low end. Left-skewed is also negatively skewed (long tail is on left or lower end). Right skew is also called positive skew. The mode is the highest point, mean is towards on the side of the long tail and the median is between mode and mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is kurtosis?

A

Kurtosis describes data that are heavy-tailed or light-tailed relative to a normal distribution. They either make a ‘wider’ normal distribution (heavy kurtosis) or a ‘narrow’ normal distribution (low kurtosis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the variance?

A

A measure of the spread of the numbers away from the mean value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is standard deviation?

A

Square root of the variance. Measures the spread of a set of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the range?

A

difference between largest data value and smallest data value. Measures how far a set of number are spread out from their average value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the interquartile range?

A

Q3 - Q1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the standard error of mean?

A

Standard deviation divided by the square root of the sample size. Measures how well the sample mean approximates to the population mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are confidence intervals?

A

Standard error multiplied by 1.96

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When would you use a chi-square test?

A

When comparing two discrete sets of data. Either finding differences or similarities (you also know how to do a chi-square test: i.e. finding “expected” values and comparing those with the “observed” values and seeing if there is a significant difference)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When would you use a Wilcoxon test?

A

To compare two non-parametric sets of paired data to find differences.

17
Q

When would you use a Mann Whitney test?

A

To compare two non-parametric sets of unpaired data to find differences.

18
Q

When would you use an independent t-test?

A

To compare two parametric sets of unpaired data to find differences.

19
Q

When would you use a paired t-test?

A

To compare two parametric sets of paired data to find differences.

20
Q

When would you use a Kruskal Wallis test?

A

To test for differences between 3 or more sets of non-parametric data, which are paired

21
Q

When would you use ANOVA? What are the different types?

A

Compare differences in means of 3 or more sets of parametric data
One-way ANOVA tests unpaired sets
Repeated measures one-way ANOVA compare paired sets

22
Q

When would you use a Friedman test?

A

To test for differences between 3 or more sets of non-parametric data, which are unpaired

23
Q

What is the difference between correlation and regression?

A

They both look at similarities between two sets and the relationship between two variables.
Correlation is looking at the strength of the association or relationship between two variables.
Regression looks at what happens to one variable when you change the other variable by a certain amount. - becomes linear regression y = mx + c

24
Q

How do you test correlation in a parametric data sets?

A
Pearson's Correlation test: returns an r-value (can be either negative or positive):
= 0-0.2 => very low correlation
0.2-0.4 => low correlation
0.4-0.6 => reasonable correlation
0.6-0.8 => high correlation
0.8-1.0 => very high correlation

Can also return a r^2 value which represents how closely your data is fitted to the correlation line. (same rule, higher r squared value means line is more reliable).

25
Q

How do you test correlation in a non-parametric data sets?

A

Spearman rank correlation gives two things:

  • A correlation coefficient (Spearman’s rho, denoted by ρ) is the equivalent of the Pearson r-value.
  • The p-value, once again, tells you how reliable the rho-value is. The smaller the p-value, the more reliable the rho-value
26
Q

How is regression defined?

A

By the equation y = mx + c (or y = a + bx)
c = y-intercept
m = gradient

27
Q

What do you use as a measure of centrality in a normally distributed set of data? and in a non-normally distributed set?

A

Normal: mean

Non-normal: median (as the skew could cause the mean to be unrepresentative of the centre)

28
Q

When to use a pie chart?

A

To show relative sizes of the whole or portions.

29
Q

When to use a bar chart?

A

Comparative data where one data set is categorical and the other is numerical

30
Q

When to use a scatter plot?

A

To identify similarities or a relationship between two continuous variables

31
Q

What are the values on a box and whisker plot?

A
Max value
Q3
Median
Q1
Min value
32
Q

When to use a histogram?

A

Comparing two data sets, one of which is quantitative counted (e.g. frequency) and one is quantitative continuous/measured

33
Q

When to use a Dot-plot?

A

Graphical display of data using dots. It is very similar to bar-chart but the bars have been replaced by dots. Provides better visualisation of spread/dispersion of data but used for smaller data sets.

34
Q

How do you calculate odds ratio?

A

Odds ratio = (number of cases/number of controls in exposed group) ÷ (number of cases / number of controsl in unexposed group)

35
Q

How do you calculate relative risk?

A

RR = (number of cases / total in exposed group) ÷ (number of cases / total in unexposed group)