Multiple Testing: Flashcards

Question

What is the advantage of a normal quantile-quantile plot (Q-Q plot) over a histogram?

Answer 1

It gives a clearer indication of a normal distribution

Answer 2

It is slightly more complicated than a histogram

Answer 3

- It compares the quantiles of the data (sample) with the theoretical quantiles from a normal distribution - A straight line indicates a normal distribution

Answer 4

Percentiles

Answer 5

By using the function to produce the summary statistics for both populations and compare the variance

Answer 6

Summarise the data and compare the standard deviations

Answer 7

In the psych package

Answer 8

It doesn't assume that the variance is equal

Answer 9

- Using the argument var.equal = TRUE - This increases the power of the test a little bit - In most situations, there is little advantage to doing this

Answer 10

- Performing a positive and negative control | - Carried out different independent variables/carried out same variable at different concentrations

Answer 11

Many t-tests would have to be conducted

Answer 12

- It takes a lot of effort and time | - The family wise error rate (FWER)

Answer 13

That there is a difference (in what?)

Answer 14

The probability of getting a false positive if the null hypothesis is true across a group of tests

Answer 15

Usually a group of tests on the same data set or the number of tests

Answer 16

- We strongly reject the null hypothesis five times out of a hundred based only on variation in samples - Every 20th comparison, we are likely to reach the 5% significance level and falsely reject the null hypothesis

Answer 17

Type 1 error (false positive)

Answer 18

The type 1 error rate is no longer equal to alpha but increases with the number of tests

Answer 19

(1-alpha)^m

Answer 20

1- (1-alpha)^m

Answer 21

The probability increases with the number of tests performed

Answer 22

The comparison of the variance within the samples with the variance between the samples

Answer 23

The analysis of variance (ANOVA)

Answer 24

Are all the values that have been measured in our samples are from the same population, or is at least one group from a different population?

Answer 25

That all free samples were of the same population

Answer 26

The third sample is very unlikely to be measured from the same population as the other two samples

Answer 27

- Squaring standard deviation - Taking sum of squared difference between each observation and the sample mean then dividing it by degree of freedom - Mean of the squares minus square of the mean

Answer 28

The number of observations minus 1

Answer 29

The dispersion of the sample (how spread out it is)

Answer 30

Both variance and standard deviation show the dispersion of data and are very closely linked

Answer 31

- Calculate the mean of all samples - And up the values of the mean of every sample - Divide by the total number of samples

Answer 32

The sum of squared differences within the groups

Answer 33

- Work out the difference between observations made and the mean for each sample - Square each individual value - Add up each individual value

Answer 34

The squared differences between the mean of each group and overall mean

Answer 35

The number of observations

Answer 36

It accounts for samples with a different number of replicates

Answer 37

- If we know the sample mean, we can work out the missing value if only one data point from the sample is missing - This means we don't need need to know all the data points-it is enough to know one less than the total number of observations - This means that the last value is set and not free to be any value - This concept is called the degrees of freedom -

Answer 38

The total number of observations (sum of all n of each group)

Answer 39

The number of observations in each group

Answer 40

The number of groups or samples

Answer 41

The degrees of freedom change for the variance between groups and within groups

Answer 42

The degrees of freedom when calculating the standard deviation for a sample with n observations

Answer 43

Because the overall mean was calculated from other sample means

Answer 44

- All observations and sample means from each group is used - Therefore degrees of freedom is defined by the total number of observations from all samples (N) minus the number of sample means (equal to number of groups-K)

Answer 45

The ratio between the variance between and within a group

Answer 46

Variance between groups --------------------------------------- Variance within groups

Answer 47

The F-statitsic

Answer 48

- We need to find the p-value for the specific ratio of the variance between groups to the variance within groups to analyse the output of ANOVA - The F-distribution is strongly dependent on the degrees of freedom of the two variances (between and within groups)

Answer 49

It automatically calculates the degrees of freedom and uses these values to obtain the p-value

Answer 50

The probability of getting the calculated F ratio or a value more extreme

Answer 51

- Include the most important output of the ANOVA test | - Also common to include degrees of freedom for within and between groups, the F-value/F-statistic and the p-value

Multiple Testing: Flashcards

(78 cards)