Exam Revision Flashcards
What is the third assumption of ANOVA?
The variance is the same in all k populations.
When is it appropriate to use a contingency table?
When the response variable is categorical and the explanatory variable is categorical.
What are the four components of an effective figure caption?
- title
- description of the techniques or methods used
- statement of the main results
- explanation of symbols, error bars or legends
The effects of pesticide and predator presence on tadpole survival were examined. 16 tubs were each filled with four tadpoles. Four tubs were randomly assigned to each combination of treatments (predator & pesticide, predator no pesticide, no predator & no pesticide, no predator and pesticide). What are the independent sampling units?
tubs
What is the second assumption of ANOVA?
The variable is normally distributed in each of the k populations.
When should you use a Tukey-Kramer test?
To test all pairs of means to find out which groups stand apart, following an ANOVA.
When should you use ANOVA?
To compare more than two means to each other. The response variable is numerical and the explanatory variable is categorical.
Define ‘Type II error’.
A Type II error is failing to reject a false null hypothesis.
What are the four goals of graphing?
- Show the data.
- Make patterns in the data easy to see.
- Represent magnitudes honestly.
- Draw graphical elements clearly.
Define ‘P-value’
The P-value is the probability of obtaining the data (or data showing as great or greater difference from the null hypothesis) if the null hypothesis were true.
When is it appropriate to use a mosaic plot?
When the response variable is categorical and the explanatory variable is categorical.
Does randomisation eliminate bias or reduce sampling error?
eliminate bias
Define ‘Type I error’.
A Type I error is rejecting a true null hypothesis. The significance level, alpha, sets the probability of committing a Type I error.
When is it appropriate to use a strip chart?
When the response variable is numerical and the explanatory variable is categorical.
What does a Q-Q plot tell us?
The Q-Q plot shows whether Y is normally distributed across X.
Define ‘variable’.
A variable is a characteristic that differs among individuals.
What is the third assumption of linear regression?
The variance of Y values is equal at all values of X.
When should you use a chi-squared contingency test?
To compare frequencies or proportions to each other. The response variable is categorical and the explanatory variable is categorical.
When should you use a two sample t-test?
To compare two means to each other. The response variable is numerical and the explanatory variable is categorical.
Does blinding eliminate bias or reduce sampling error?
eliminate bias
What is the formula when planning for power?
n = 16(SD/D)^2
When should you use a chi-squared goodness of fit test?
To compare frequencies or proportions to null values. The response variable is categorical.
When should you use linear regression?
To compare slopes or trends to a null value. The response variable is numerical and the explanatory variable is numerical.
When is it appropriate to use multiple histograms?
When the response variable is numerical and the explanatory variable is categorical.
Define ‘random sample’.
A random sample is a sample in which each member of a population has an equal and independent chance of being selected.
What is the first assumption of linear regression?
Y is linearly related to X.
Does balance eliminate bias or reduce sampling error?
reduce sampling error
What is the fourth assumption of linear regression?
Values of Y are randomly sampled at all values of X.
What does a residual plot tell us?
The residual plot shows whether the relationship between Y and X is linear.
When is it appropriate to use a scatter plot?
When the response variable is numerical and the explanatory variable is numerical.
When is it appropriate to use a grouped bar graph?
When the response variable is categorical and the explanatory variable is categorical.
Define ‘data’.
Data are the measurements of one or more variables made on a sample of individuals.
Define ‘replication’.
Replication is the application of every treatment to multiple, independent experimental units.
Define ‘sampling error’.
Sampling error is the chance difference between an estimate and the population parameter being estimated caused by sampling.
When should you use a paired t-test?
To compare two paired means to each other. The response variable is numerical and the explanatory variable is categorical.
Define ‘interaction’.
An interaction between two or more explanatory variables means that the effect of one variable depends upon the state of the other variable.
Define ‘observational study’.
An observational study is a study in which the assignments of treatments is not made by the researcher.
When should you use a one sample t-test?
To compare one mean to a null value. The response variable is numerical.
Define ‘bias’.
Bias is a systematic discrepancy between the estimates we would obtain, if we could sample a population again and again, and the true population characteristic.
What is the second assumption of linear regression?
The distribution of Y values is normal at all values of X.
What is the first assumption of ANOVA?
The measurements in every group represent a random sample from the corresponding population.
Do simultaneous control groups eliminate bias or reduce sampling error?
eliminate bias
Define ‘experimental study’.
An experimental study is a study in which the researcher assigns treatments randomly to individuals.
Does consistency of conditions eliminate bias or reduce sampling error?
reduce sampling error
Define ‘factorial experiment’.
A factorial experiment investigates all treatment combinations of two or more variables. A factorial design can measure interactions between treatment variables.
Which feature of experimental design cannot be included in observational studies?
randomisation
Does replication eliminate bias or reduce sampling error?
reduce sampling error
Does blocking eliminate bias or reduce sampling error?
reduce sampling error
Define ‘parameter’.
A parameter is a quantity describing a population.
When is it appropriate to use a box plot?
When the response variable is numerical and the explanatory variable is categorical.