Data analysis 1: comparing means and medians Flashcards

1
Q

Why do we perform statistical analysis?

A
  • to determine whether there is sufficient evidence to accept or reject the experimental hypothesis
  • to test whether an observed effect is both genuine (could it have arisen by random chance?) and meaningful (is the effect size biologically or clinically relevant?)
2
Q

What is a p value?

A
  • the calculated probability of observing an effect at least as large as the one seen if there were no genuine difference between the groups (i.e. if all of the data points for the groups being compared actually came from the same population)
3
Q

How do you interpret a p value: when do you reject the null hypothesis based on the p value and the alpha value?

A
  • p values are compared to a threshold value, 'alpha'
  • p < alpha = the difference is statistically significant (reject the null hypothesis)
  • p >= alpha = the difference is not statistically significant (fail to reject the null hypothesis; this is not proof that the null hypothesis is true)
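
The decision rule can be written as a few lines of Python. This is only a minimal sketch: the function name `interpret_p_value` and the default `alpha=0.05` are illustrative choices, not part of any library.

```python
def interpret_p_value(p: float, alpha: float = 0.05) -> str:
    """Apply the rule: significant only when p is strictly below alpha."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie between 0 and 1")
    if p < alpha:
        return "statistically significant: reject the null hypothesis"
    return "not statistically significant: fail to reject the null hypothesis"


# The same p value can lead to different conclusions under different alphas.
print(interpret_p_value(0.03))               # below the default alpha of 0.05
print(interpret_p_value(0.05))               # equal to alpha counts as not significant
print(interpret_p_value(0.03, alpha=0.001))  # a stricter threshold
```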
4
Q

What is an alpha value?

A

the threshold value that the p value is compared against to decide whether the results are statistically significant.

5
Q

Why is the most commonly used alpha value 0.05?

A
  • in the 1920s, Fisher judged a 1-in-20 chance (5%) to be a fairly unlikely thing to happen
  • it is a value of convention
6
Q

The choice of alpha value (e.g. 0.05 or 0.001) considers …?

A
  • the nature of the data in the study and the relative consequences of type 1 and type 2 errors
  • a small alpha is more likely to result in a false negative (type 2 error)
  • a large alpha is more likely to result in a false positive (type 1 error)
7
Q

What is a type 1 error?

A

  • false positive: rejecting the null hypothesis when it is actually true

8
Q

What is a type 2 error?

A
  • false negative: failing to reject the null hypothesis when it is actually false
9
Q

What does the choice of statistical test depend on when analysing whether a difference is statistically significant? (Why not use a t-test all the time?)

A
  1. whether the data adhere to a normal distribution (parametric or non-parametric?)
  2. whether the data come from independent samples (paired or unpaired?)
  3. how many groups or differences are being compared
10
Q

Which statistical test do you use for analysing parametric data from independent samples (unpaired data), and why?

A
  • unpaired t-test
  • the method by which the p value is calculated in a t-test assumes the data adhere to a normal distribution (parametric)
  • so a t-test is only appropriate for analysing parametric data
11
Q

What determines whether a p value is large or small?

A

large: p > alpha

  • small sample size
  • small difference in group means
  • large standard deviation

small: p < alpha

  • large sample size
  • large difference in group means
  • small standard deviation
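
These three factors can be seen in the t statistic for an unpaired t-test: the further the statistic is from zero, the smaller the p value. The sketch below assumes Student's equal-variance form of the test and uses made-up numbers. The helper names `unpaired_t_statistic` and `stretch` are hypothetical, and converting t into a p value (which needs the t distribution) is left to a statistics package.

```python
from math import sqrt
from statistics import mean, variance


def unpaired_t_statistic(a, b):
    """Student's t statistic for two independent samples (equal variances assumed)."""
    n_a, n_b = len(a), len(b)
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / (n_a + n_b - 2)
    standard_error = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(a) - mean(b)) / standard_error


def stretch(values, factor):
    """Scale the spread of values around their mean, leaving the mean unchanged."""
    centre = mean(values)
    return [centre + factor * (v - centre) for v in values]


# Made-up example data: group means differ by 1.
group_a = [10.0, 11.0, 12.0, 13.0, 14.0]
group_b = [11.0, 12.0, 13.0, 14.0, 15.0]

t_base = unpaired_t_statistic(group_a, group_b)
t_bigger_difference = unpaired_t_statistic(group_a, [v + 4 for v in group_b])
t_bigger_sample = unpaired_t_statistic(group_a * 4, group_b * 4)
t_bigger_spread = unpaired_t_statistic(stretch(group_a, 3), stretch(group_b, 3))

for label, t in [
    ("baseline", t_base),
    ("larger difference in means", t_bigger_difference),
    ("larger sample size", t_bigger_sample),
    ("larger standard deviation", t_bigger_spread),
]:
    print(f"{label}: |t| = {abs(t):.2f}")
```

A larger difference in means or a larger sample pushes |t| up (smaller p), while a larger standard deviation pulls it down (larger p).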
12
Q

What is a two-tailed test?

A
  • a test in which the analysis assesses the statistical significance of a potential difference in either direction (positive or negative) between one group and another
13
Q

What is a one-tailed test?

A
  • a test in which the analysis is only concerned with the significance of a difference in one specific 'direction'
14
Q

When do you use a one-tailed t-test and when a two-tailed t-test?

A

1 tail: the mean weight of samples in group 1 is significantly greater than the mean weight of samples in group 2 (one specific direction).
2 tail: the mean weight of samples in group 1 is significantly different to the mean weight of samples in group 2 (either direction).
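
The effect of the number of tails on the p value can be illustrated numerically. Note that the sketch below swaps in an exact permutation test on the difference in means rather than a t-test, because it can be run with the Python standard library alone. The weights and the function name `permutation_p_values` are made up for illustration.

```python
from itertools import combinations
from statistics import mean


def permutation_p_values(group1, group2):
    """Exact permutation test on the difference in means (group1 - group2).

    Returns (one_tailed_p, two_tailed_p). The one-tailed p only asks whether
    group1 is greater than group2; the two-tailed p counts differences that
    are at least as large in either direction.
    """
    pooled = list(group1) + list(group2)
    n1 = len(group1)
    observed = mean(group1) - mean(group2)
    tolerance = 1e-12  # guards against floating-point noise in the comparisons
    greater = extreme = total = 0
    for idx in combinations(range(len(pooled)), n1):
        chosen = set(idx)
        relabelled1 = [pooled[i] for i in idx]
        relabelled2 = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        diff = mean(relabelled1) - mean(relabelled2)
        total += 1
        if diff >= observed - tolerance:
            greater += 1
        if abs(diff) >= abs(observed) - tolerance:
            extreme += 1
    return greater / total, extreme / total


# Made-up weights: every value in group 1 exceeds every value in group 2.
weights_group1 = [24.1, 25.3, 26.0, 27.2]
weights_group2 = [20.5, 21.0, 22.4, 23.3]

one_tailed, two_tailed = permutation_p_values(weights_group1, weights_group2)
print(f"one-tailed p = {one_tailed:.4f}")
print(f"two-tailed p = {two_tailed:.4f}")
```

For this data the two-tailed p value is twice the one-tailed one, because a difference of the same size in the opposite direction also counts as evidence against the null hypothesis.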

15
Q

Why can't a t-test be used to analyse non-parametric data?

A
  • parametric methods generate p values by assuming a specific relationship between the variation of data points about the mean and the probability of observing them. For non-parametric data, that relationship doesn't exist, so any test that assumes it exists will generate an inaccurate p value.
16
Q

What is an example of a non-parametric test?

A

Mann-Whitney U test.

17
Q

How do non-parametric tests handle data points?

A
  • they put the data points into ranks and calculate the probability of observing a particular distribution of ranks between the two experimental groups.
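
The ranking step can be sketched for the Mann-Whitney U statistic. This is an illustrative standard-library version with made-up data and hypothetical function names. It stops at the U statistic and does not compute the p value, which a statistics package would derive from the distribution of ranks.

```python
def average_ranks(values):
    """Rank values from 1 (smallest); tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1  # mean of 1-based positions start+1..end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared_rank
        start = end + 1
    return ranks


def mann_whitney_u(group1, group2):
    """Return (U1, U2) computed from the rank sum of group1 in the pooled data."""
    n1, n2 = len(group1), len(group2)
    ranks = average_ranks(list(group1) + list(group2))
    rank_sum_1 = sum(ranks[:n1])
    u1 = rank_sum_1 - n1 * (n1 + 1) / 2
    u2 = n1 * n2 - u1
    return u1, u2


# Made-up data: pooled ranks are 3->1, 4->2, 5->3, 8->4, 9->5, 10->6, 12->7.
u1, u2 = mann_whitney_u([3, 5, 8], [4, 9, 10, 12])
print(u1, u2)
```

Here U1 equals the number of (group 1, group 2) pairs in which the group 1 value is the larger one. Only the order of the values matters, not their distance from the mean.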
18
Q

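How do we determine whether a data set is parametric?

A
  1. subjectively evaluate the data by making a histogram
  2. use a test of normality

=> a low p value (p < alpha) from a test of normality means we reject the null hypothesis that the data are normally distributed: there is sufficient evidence to suggest the data are NOT normally distributed, so they are treated as non-parametric.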
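
For the first step, a rough text histogram is often enough to eyeball the shape of the data. The sketch below uses only the standard library. The values, the bin width and the function name `text_histogram` are made up for illustration, and a formal test of normality would come from a statistics package.

```python
from collections import Counter


def text_histogram(values, bin_width):
    """Return lines such as '5-6 | ###', one per bin, from lowest to highest."""
    counts = Counter(int(v // bin_width) for v in values)
    lines = []
    for b in range(min(counts), max(counts) + 1):
        low = b * bin_width
        lines.append(f"{low:g}-{low + bin_width:g} | {'#' * counts[b]}")
    return lines


# Made-up measurements, roughly bell-shaped around 6.
measurements = [4.1, 5.0, 5.2, 5.9, 6.1, 6.3, 6.4, 6.8, 7.0, 7.5, 8.2]
histogram_lines = text_histogram(measurements, bin_width=1)
print("\n".join(histogram_lines))
```

A single central peak with roughly symmetric tails is consistent with a normal distribution. Strong skew or several peaks suggests the data should be treated as non-parametric.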