Statistics Flashcards

1
Q

Three types of statistics

A

1) Descriptive
2) Tests for differences
3) Tests for relationships

2+3 Subdivided into Parametric + non-parametric

Parametric - assumed normal distribution
Non parametric - no such assumption

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Descriptive statistics

A

Measures central tendency - whats a typical value for this population?

Mean, mode, median

Measures of spread - how variable is the collected data?/ How much does individual observations differ from ‘typical’ value

SD, SE, Range, variance (sample not pop)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Variance

A

Population variance - not often used
Sample variance - S2 - use

n-1 instead of n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Standard Deviation

A

Variance has square units

SD is the square root of variance

Variance between sample values

Variance of 100mm^2 - SD = 10mm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Standard Error

A

SD of sample means - rather than sample values themselves

measures how good the estimate of the population mean is

Higher sample size = lower SE

SE=SD/sqrt(n/sample size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Parametric statistics

A

Some tests need to assume the variable measured has a normal distribution in the population

+/- 1.96 SD from mean = 95% of observations in a normal distribution

Assumptions:

  • normality
  • Independence
  • Homogeneity of variances
  • T-test (independent samples)
  • Paired T-test (repeated measures data)
  • Analysis of variance (ANOVA)
  • 2 way ANOVA
  • Repeated measures (ANOVA)
  • ANCOVA (ANOVA with covariates)
  • Linear Regression
  • Pearson product moment and correlation
  • Spearman’s ranking
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

T-test

A

Parametric
Independent samples

Does the mean differ between two populations?

Gives t statistic - look up against degrees of freedom (DFs) (n1 + n2 - 2) in table for p-value

Incorporates all variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Paired T-test

A

Parametric
Repeated measures data

For paired values (before + after study)
Concerned with differences rather than mean

Only incorporates variance between each pair of data, not inter-site

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

ANOVA

A

Parametric

analysis of variance

difference between two or more sample means?

What is the chance the samples belong to the same population

Better than multiple individual T -tests (decreased chance of false positive)

  • between groups variance (F stat): var. of combining all group data together
  • Within groups variance: avg. var. within each indiv. sample

ANOVA is: (between/within) ratio, large = real difference, small = random variation

F statistic compared against between groups (groups -1) and within groups (total DF -between) DFs to obtain P value

R^2 value - proportion of variance explained by dependant variable - how close the data fits the fitted regression line

Check residuals show normality on Q-Q plot (assumption of ANOVA)

Levene’s test can be applied to check homogeneity of variances (assumption of ANOVA)

Tukey post hoc test - show which varieties were different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

2-way ANOVA

A

Parametric

ANOVA for studies with two independent variables

3 Null hypothesis:

  • The population means of the first factor are equal
  • The population means of the second factor are equal
  • There is no interaction between the two factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Repeated measures ANOVA

A

Parametric

Paired T-test for 2 or more repeated measures - are levels changing over time?

Sphericity - Equal variances across all time points - Mauchy’s W test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

ANCOVA

A

Parametric

ANOVA with covariates

Is there a difference between populations with or without taking a specific factor into account (usually if factor cannot be controlled)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Linear Regression

A

Parametric

Assumed causation, allows prediction and extrapolation

Allows description of relationship between x and y as well as strength and significance

y=mx+c

Regression sum of squares (SSregression) - measure vertically from mean (y) to best fit line at each data point -> sum + square these

residual sum of squares (SSresidual, noise) - measure distance from each data point to fitted line -> sum and square these
- if all points on line, SSresidual would = 0 - therefore further away - greater noise

small slope = large residuals - non signifcance

large slope - small residuals - signifcant

Assumptions:

  • Residuals are normal
  • Equal variances in Y across the range of X
  • Relationship is linear
  • No relationship between residuals and x or y
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Pearson product moment + correlation

A

Parametric

Doesn’t assume causation - doesn’t allow same as regression

Quantified by the correlation coefficient (r) - based on concept of covariance (multiply rather than squaring in variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Spearman’s ranking

A

Parametric

Tests statistical dependance between the rankings of two variables - avoids assumptions (more robust) of normality and homoscedasticty (variance around regression line the same for all values of predictor variable, x_

Decreased test power (sensitivity, less chance to get p<0.05)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Non-parametric statistics

A

Normal distribution not assumed, not required to fit a normal distribution

Mann-Whitney U test

Wilcoxon signed Ranks test

Kruskal Wallis test

17
Q

Mann-Whittney U test

A

Non parametric equivalent of T-test (independant samples)

Are two population medians different

18
Q

Wilcoxon signed ranks test

A

Non parametric test of paired T test - are pair samples different

19
Q

Kruskal-Wallis Test

A

Non parametric equivalent of one way ANOVA