Stats 5 - Writing Statistics Flashcards

1
Q

What are descriptive statistics what should you report?

A

Descriptive Statistics - Important characteristics:

  1. Define sample size (n)
  2. Provide measures of central tendency and spread (depending on the data type – normal, Poisson or binomial)

Descriptive statistics is reported at the start of the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When reporting a One or Two sample T-Test what should you report?

A

Reminder

  1. One-sample t-test compares your sample mean to a global mean
  2. Two-sample t-test compares the means of two samples

Reporting

  1. What test you performed? Why you performed it?
  2. Report results –> Statistically different or not? –> Include:
  3. T-Value
  4. df
  5. p-value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When reporting a simpel F-test to compare variances what should you report?

A

Reminder:

  • Simple F-test compares the variance between two
  • Useful way to investigate the homogeneity of variance assumption for two-sample t-tests, as homogenous variance is a prerequisite for the two-sample t-test.

Reporting

  1. What test you performed and why?
  2. Report the results –> variation is statistically different or not? –> Include F-Value, df and p-value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When reporting a Wilcoxon test, what should you include?

A

Wilcoxon Test - Remember:

  1. A Wilcoxon test compares either two sample medians or a sample median to a global median (Non-Normal T-Test)
  2. The comparison is great for non-normally distributed data, such as count or proportion data.

Reporting

  1. What test you performed and why?
  2. Result of the test –> Median is statistically significantly or not? –> Include W-value and P-value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When reporting a One-way analysis of variance, what should you report?

A

One-way Analysis of Variance - Remember:

  1. An analysis of variance compares between-group variance to within-group variance to discern whether the grouping factor(categorical) – here “Climate” – has an effect on the response variable (log Mass in grams).

Basically, used to decide whether a term explains a significant amount of observed variation

Reporting

  1. What test you performed and why?
  2. Report results - did you term explain a significant amount of the variation –> Report the F value, Df for Sum sq. + residuals, and P-value

Note –> you can follow up with a Tukey test to investigate the intricacies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When reporting a correlation test, what should you include?

A

Correlation test - Reminder:

  1. A correlation looks at the association/correlation of two variables. It can NOT be used to infer causation –> we don’t which one is the independent or dependent variable, we just know they are associated
  2. Pearson’s correlation test is used for two normally distributed variables and Spearman’s fr two non-normal variables.

Reporting

  1. What test you performed and why?
  2. Report result –> what correlation/association was found? –> Include correlation score, T-value or S-value, df and p-value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When reporting a simple linear regression, what should you include?

A

Simple Linear Regression - Reminder:

  1. A linear regression examines the effect of a continuous explanatory variable on a continuous response –> Continuous Response ~ continuous explanatory variable
  2. Different to correlation tests, because a simple linear regression is used to define causality by fitting a linear line with an intercept and slope – we assign independent and dependent variables.

Reporting

  1. What test you performed and why?
  2. Did you find a signficant regression? –> include F-Value, Df, p-value and adjusted R2
  3. Propose equation using co-efficients given that it is signifcant

Note -> You may also may consider including the Coefficient table –> more formal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When reporting a two-way analysis of variance, what should you include?

A

Reminder:

  1. A two-way analysis of variance compares the between-group variance and within-group variance for main effects (a single grouping factor) and interaction effects (two or more interacting grouping factors)

Basically…

Looking at multiple categorical groups (factors) and their interaction –> how much variation is explained by each term

Reporting

  1. What test was performed + justification
  2. Report results from the Two-way analysis of variance test –> Include F-Value, df and P-value.

Note - You can follow up Tukey HSD to perform pairwise comparison

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When reporting a analysis of Covariance, what should you include?

A

Reminder:

  1. An analysis of covariance compares the between-group variance and the within-group variance for a grouping factor of interest whilst controlling for a continuous covariate. Your primary interest is on the group variable not the continuous covariate
  2. Note – You can apply an ANCOVA for additive models and interactive models

Reporting

  1. What test was performed + justification
  2. Report results from the ANCOVA –> F-Value, df and P-value –> Even though you are controlling for the continious variable you can also refer to it.
    - Follow up post-hoc test –> looking at the intricacies between the climactic variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When reporting a Multiple Linear regression, what should you include?

A

Reminder:

  1. A multiple linear regression examines the effect of more than one variable on a continuous response variable –> effect of multiple explanatory variables on continuous response variable
  2. Can include Categorical and Continious –> But emphasis is placed on continious

Reporting

  1. What test was performed + justification
  2. Report results from the multiple linear regression
    a) Did you obtain a signficant regression equation? –> Report F-value, df, P-value and adjusted R2
    b) if you did obtain a signifcant regression –> Outline the different regression equations
  3. Interpretation/takeaway message –> Which regression equation showed the most effect on the response?

Example - The effect of mammal mass is therefore weaker in tropical than temperate climates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When reporting a Model selection/simplification process, what should you include?

A

Reminder

Model selection starts with maximal model and produces a minimum adequate model.

Reporting

  1. Outline the simplification procedure –> stepwise backwards model selection
  2. Outline maximal model
  3. Report minimal adequate model obtained (F-value for model as support + R2) –> including regression equations
  4. Interpretation of the linear equations obtained from the minimal adequate model –> putting the numbers in biological context
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When statistical reporting, what does spin refer to?

A

Spin

Reported results differ from statistical results –> distorted interpretation of non-significant results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some example poor statisitcal practices that can still be found in the lierature?

A
  1. Selectively reporting outcomes
  2. Reporting statistical analyses that are NOT specified/different to previously specified analysis plans
  3. Graphs can NOT be interpreted unambiguously (only interpreted in one way –> Graphs that are NOT clear
  4. Reported results differ from statistical results –> distorted interpretation of non-significant results (spin) –> interpreting p-value between 0.05 and 0.10 as significant
  5. Summarizing data variability using standard errors/standard deviation of the mean.
  6. Not reporting exact P-Values for primary analyses + post-hoc tests
  7. Not plotting raw data to calculate variability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why is using Standard error of mean (SEM) and Standard deviation (SD) a problem in papers?

A

Summarizing the data as mean and SE or SD often causes readers to wrongly infer that the data are normally distributed with no outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly