Statistics Flashcards
Three types of statistics
1) Descriptive
2) Tests for differences
3) Tests for relationships
2 + 3 are subdivided into parametric + non-parametric
Parametric - assumes a normal distribution
Non-parametric - no such assumption
Descriptive statistics
Measures of central tendency - what's a typical value for this population?
Mean, mode, median
Measures of spread - how variable is the collected data? / How much do individual observations differ from the 'typical' value?
SD, SE, range, variance (sample, not population)
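A minimal Python sketch of these measures using the standard library (the data values are invented for illustration):

    import statistics

    data = [12, 15, 11, 15, 14, 18, 13]  # invented sample

    # Central tendency: what's a typical value?
    print(statistics.mean(data))    # arithmetic mean
    print(statistics.median(data))  # middle value when sorted
    print(statistics.mode(data))    # most frequent value (15)

    # Spread: how variable are the observations?
    print(statistics.stdev(data))     # sample SD (n - 1 denominator)
    print(statistics.variance(data))  # sample variance
    print(max(data) - min(data))      # range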
Variance
Population variance - not often used
Sample variance - s^2 - uses n - 1 instead of n in the denominator
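In NumPy this distinction is the ddof argument - a small sketch with invented data:

    import numpy as np

    x = np.array([4.0, 7.0, 6.0, 5.0, 8.0])  # invented sample

    pop_var = np.var(x, ddof=0)   # population variance: divides by n
    samp_var = np.var(x, ddof=1)  # sample variance s^2: divides by n - 1
    print(pop_var, samp_var)      # sample variance is slightly larger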
Standard Deviation
Variance has square units
SD is the square root of variance
Measures variation between individual sample values
e.g. variance of 100 mm^2 -> SD = 10 mm
Standard Error
SD of sample means - rather than sample values themselves
measures how good the estimate of the population mean is
Higher sample size = lower SE
SE = SD / sqrt(n), where n = sample size
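Sketch of the formula, cross-checked against scipy.stats.sem (data invented):

    import numpy as np
    from scipy import stats

    x = np.array([10.2, 9.8, 11.1, 10.5, 9.9, 10.7])  # invented sample

    se_manual = np.std(x, ddof=1) / np.sqrt(len(x))  # SE = SD / sqrt(n)
    se_scipy = stats.sem(x)                          # same calculation
    print(se_manual, se_scipy)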
Parametric statistics
Some tests need to assume the variable measured has a normal distribution in the population
+/- 1.96 SD from mean = 95% of observations in a normal distribution
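Where 1.96 comes from - the 97.5th percentile of the standard normal, leaving 2.5% in each tail:

    from scipy import stats

    print(stats.norm.ppf(0.975))  # ~1.96: z below which 97.5% of values lie
    # fraction of observations within +/- 1.96 SD of the mean
    print(stats.norm.cdf(1.96) - stats.norm.cdf(-1.96))  # ~0.95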
Assumptions:
- normality
- Independence
- Homogeneity of variances
Tests:
- T-test (independent samples)
- Paired T-test (repeated measures data)
- Analysis of variance (ANOVA)
- 2 way ANOVA
- Repeated measures (ANOVA)
- ANCOVA (ANOVA with covariates)
- Linear Regression
- Pearson product-moment correlation
- Spearman's rank correlation (non-parametric alternative)
T-test
Parametric
Independent samples
Does the mean differ between two populations?
Gives a t statistic - look it up against the degrees of freedom (DF = n1 + n2 - 2) in a table for the p-value
Incorporates all variation
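A minimal sketch with scipy.stats.ttest_ind (the two group samples are invented):

    from scipy import stats

    group1 = [5.1, 4.9, 6.0, 5.5, 5.8, 5.2]
    group2 = [6.2, 6.8, 5.9, 6.5, 7.0, 6.4]

    t, p = stats.ttest_ind(group1, group2)  # assumes equal variances by default
    df = len(group1) + len(group2) - 2      # DF = n1 + n2 - 2
    print(t, p, df)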
Paired T-test
Parametric
Repeated measures data
For paired values (before + after study)
Concerned with the paired differences rather than group means
Only incorporates variance between each pair of data, not inter-site
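A minimal sketch with scipy.stats.ttest_rel (before/after values invented):

    from scipy import stats

    # same subjects measured before and after treatment
    before = [72, 75, 68, 80, 77, 74]
    after = [70, 73, 69, 76, 74, 71]

    t, p = stats.ttest_rel(before, after)  # tests mean paired difference vs 0
    print(t, p)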
ANOVA
Parametric
analysis of variance
Is there a difference between two or more sample means?
What is the chance the samples belong to the same population?
Better than multiple individual t-tests (decreases the chance of a false positive)
- Between-groups variance: variance of the group means around the overall (grand) mean
- Within-groups variance: average variance within each individual sample
F statistic = between-groups / within-groups variance ratio; large = real difference, small = random variation
F statistic compared against between-groups DF (groups - 1) and within-groups DF (total DF - between-groups DF) to obtain the p-value
R^2 value - proportion of variance in the dependent variable explained by the model - how closely the data fit the fitted regression line
Check residuals show normality on Q-Q plot (assumption of ANOVA)
Levene’s test can be applied to check homogeneity of variances (assumption of ANOVA)
Tukey post-hoc test - shows which groups (e.g. varieties) were different
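A sketch of the whole workflow in SciPy (stats.tukey_hsd needs SciPy >= 1.8; the three group samples are invented):

    from scipy import stats

    a = [20.1, 21.5, 19.8, 22.0, 20.7]
    b = [23.2, 24.0, 22.8, 23.5, 24.1]
    c = [20.5, 21.0, 19.9, 20.8, 21.2]

    print(stats.levene(a, b, c))  # check homogeneity of variances first

    f, p = stats.f_oneway(a, b, c)  # F = between-groups / within-groups
    print(f, p)

    print(stats.tukey_hsd(a, b, c))  # post hoc: which groups differ?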
2-way ANOVA
Parametric
ANOVA for studies with two independent variables
3 null hypotheses:
- The population means of the first factor are equal
- The population means of the second factor are equal
- There is no interaction between the two factors
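A sketch with statsmodels, testing all three null hypotheses at once (factor names and data invented):

    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    # invented 2x2 factorial data, 3 replicates per cell
    df = pd.DataFrame({
        "fert": ["low"] * 6 + ["high"] * 6,
        "water": (["dry"] * 3 + ["wet"] * 3) * 2,
        "y": [4.1, 3.9, 4.3, 5.0, 5.2, 4.8,
              5.5, 5.9, 5.7, 7.2, 7.0, 7.4],
    })

    # 'C(fert) * C(water)' fits both main effects plus the interaction
    model = smf.ols("y ~ C(fert) * C(water)", data=df).fit()
    print(sm.stats.anova_lm(model, typ=2))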
Repeated measures ANOVA
Parametric
Paired T-test for 2 or more repeated measures - are levels changing over time?
Sphericity - equal variances of the differences between all pairs of time points - tested with Mauchly's W
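A sketch using statsmodels' AnovaRM (subjects and scores invented; Mauchly's test itself isn't in statsmodels - a package such as pingouin would be needed for the sphericity check):

    import pandas as pd
    from statsmodels.stats.anova import AnovaRM

    # invented: 4 subjects each measured at 3 time points
    df = pd.DataFrame({
        "subject": [1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 4],
        "time": ["t1", "t2", "t3"] * 4,
        "score": [5.0, 6.1, 7.2, 4.8, 5.9, 6.8,
                  5.2, 6.0, 7.0, 4.9, 6.2, 7.1],
    })

    res = AnovaRM(df, depvar="score", subject="subject", within=["time"]).fit()
    print(res)  # F test: are levels changing over time?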
ANCOVA
Parametric
ANOVA with covariates
Is there a difference between populations with or without taking a specific factor (covariate) into account? (usually used when the factor cannot be controlled)
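A sketch in statsmodels - the group effect adjusted for a covariate (all names and data invented):

    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    # invented: does y differ by group once initial size is accounted for?
    df = pd.DataFrame({
        "group": ["ctrl"] * 5 + ["treat"] * 5,
        "size0": [2.0, 2.5, 3.0, 3.5, 4.0, 2.1, 2.6, 3.1, 3.4, 4.1],
        "y": [5.0, 5.6, 6.1, 6.8, 7.2, 6.0, 6.7, 7.1, 7.6, 8.3],
    })

    # the covariate enters as an additive term alongside the factor
    model = smf.ols("y ~ C(group) + size0", data=df).fit()
    print(sm.stats.anova_lm(model, typ=2))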
Linear Regression
Parametric
Assumes causation; allows prediction and extrapolation
Allows description of relationship between x and y as well as strength and significance
y=mx+c
Regression sum of squares (SSregression) - measure vertically from the mean of y to the best-fit line at each data point -> square and sum these
Residual sum of squares (SSresidual, noise) - measure the distance from each data point to the fitted line -> square and sum these
- if all points lay on the line, SSresidual would = 0 - the further away points are, the greater the noise
Small slope + large residuals = non-significance
Large slope + small residuals = significance
Assumptions:
- Residuals are normal
- Equal variances in Y across the range of X
- Relationship is linear
- No relationship between residuals and x or y
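A minimal sketch with scipy.stats.linregress, which returns m, c, r, and the p-value for the slope (x/y invented):

    from scipy import stats

    x = [1, 2, 3, 4, 5, 6]
    y = [2.1, 3.9, 6.2, 8.0, 9.8, 12.1]

    res = stats.linregress(x, y)
    print(res.slope, res.intercept)  # m and c in y = mx + c
    print(res.rvalue ** 2)           # R^2: proportion of variance explained
    print(res.pvalue)                # significance of the slope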
Pearson product-moment correlation
Parametric
Doesn’t assume causation - doesn’t allow same as regression
Quantified by the correlation coefficient (r) - based on concept of covariance (multiply rather than squaring in variance)
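A minimal sketch with scipy.stats.pearsonr (data invented):

    from scipy import stats

    x = [1.0, 2.0, 3.0, 4.0, 5.0]
    y = [2.3, 2.9, 4.1, 4.8, 6.2]

    r, p = stats.pearsonr(x, y)  # r = cov(x, y) / (SD_x * SD_y)
    print(r, p)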
Spearman's rank correlation
Non-parametric
Tests statistical dependence between the rankings of two variables - avoids the assumptions of normality and homoscedasticity (variance around the regression line the same for all values of the predictor variable, x), making it more robust
Decreased test power (sensitivity - less chance of getting p < 0.05 when a real effect exists)
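A minimal sketch with scipy.stats.spearmanr (data invented to be monotonic but not linear):

    from scipy import stats

    x = [1, 2, 3, 4, 5, 6]
    y = [1.2, 1.8, 2.0, 8.0, 9.5, 40.0]  # monotonic, not linear

    rho, p = stats.spearmanr(x, y)  # correlates the ranks, not raw values
    print(rho, p)                   # rho = 1.0: perfectly monotonic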