stats Flashcards

1
Q

assumption of independence

A

an assumption shared by all inferential tests is that the observations in your sample are independent of each other
measurements for each sample subject are in no way influenced by or related to the measurements of other subjects
pseudoreplication (false independence) occurs when this assumption isn’t met
example: you culture bacteria in triplicate to calculate growth rate
- calc the growth rate of each flask and then calc the mean growth rate
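
A minimal sketch of the triplicate example, assuming each flask was read several times. The readings and flask names are made up for illustration. The point is that the flask is the independent unit, so repeated readings are collapsed to one value per flask before averaging.

```python
from statistics import mean

# Hypothetical growth-rate readings (per hour), three readings per flask.
readings = {
    "flask_1": [0.50, 0.52, 0.51],
    "flask_2": [0.47, 0.46, 0.48],
    "flask_3": [0.55, 0.54, 0.56],
}

# Collapse to one value per flask: readings from the same flask are not independent.
flask_means = {flask: mean(values) for flask, values in readings.items()}

# The sample size is the number of flasks (3), not the number of readings (9).
n_independent = len(flask_means)
overall_mean = mean(list(flask_means.values()))
```

Treating all nine readings as n = 9 would be pseudoreplication.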

2
Q

p value

A

the probability, under the assumption of no effect or no difference (null hypothesis), of obtaining a result equal to or more extreme than what was actually observed
by convention, p < 0.05 (5%) is treated as a significant difference

3
Q

effect size

A

effect size: the degree to which the treatment shifted the observations
when n is small, a given effect size may not be significant
when n is higher, the same effect size can be significant
effect sizes are much easier to interpret than p values, as they are reported in the units of the thing we are measuring
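
A rough sketch of why the same effect size can be non-significant at small n and significant at larger n. The effect, SD, and group sizes are hypothetical, and the helper `t_for_difference` is an illustrative name, assuming two groups of equal size and equal SD.

```python
import math

def t_for_difference(effect, sd, n_per_group):
    """t statistic for a difference between two group means (equal n, equal SD)."""
    se = sd * math.sqrt(2 / n_per_group)  # standard error of the difference
    return effect / se

effect = 2.0  # hypothetical difference in means, in measurement units
sd = 3.0      # hypothetical within-group standard deviation

t_small = t_for_difference(effect, sd, 5)    # 5 subjects per group
t_large = t_for_difference(effect, sd, 50)   # 50 subjects per group
```

The effect size is 2.0 units in both cases; only the standard error shrinks as n grows, so t rises from about 1.05 to about 3.33 and crosses the usual threshold of roughly 2.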

4
Q

categorical predictors

A

levels are qualitatively different
we estimate the effect per level
mean value per level
examples:
species
sex
chemical

5
Q

continuous predictors

A

levels are numerical
we estimate coefficients of a continuous function (e.g. line, surface)
response variables are usually continuous
slope and intercept
examples:
concentration
mass
time

6
Q

t-test

A

single categorical predictor with 2 levels
estimate the mean for the control (wild type)
estimate the mean difference between treatment and control (knock out - wild type)
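
A small sketch of the two estimates on this card, using made-up measurements. The pooled-variance (Student) form of the t statistic is an assumption here; the card does not say which variant is meant.

```python
import math
from statistics import mean, variance

wild_type = [10.0, 12.0, 11.0, 13.0]  # hypothetical control measurements
knock_out = [14.0, 15.0, 13.0, 16.0]  # hypothetical treatment measurements

b0 = mean(wild_type)       # mean for the control
b1 = mean(knock_out) - b0  # mean difference (knock out - wild type)

# Pooled-variance standard error of the difference (assumes equal variances).
n1, n2 = len(wild_type), len(knock_out)
pooled_var = ((n1 - 1) * variance(wild_type)
              + (n2 - 1) * variance(knock_out)) / (n1 + n2 - 2)
se_diff = math.sqrt(pooled_var * (1 / n1 + 1 / n2))

t_value = b1 / se_diff     # coefficient / standard error
```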

7
Q

linear regression

A

single continuous predictor
estimate the mean yield when fertilizer = 0 (intercept, i.e. the control)
estimate the rate at which yield increases with fertilizer (slope)
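
A minimal least-squares sketch for the fertilizer example. The data are invented, and the variable names are illustrative only.

```python
from statistics import mean

fertilizer = [0.0, 1.0, 2.0, 3.0, 4.0]    # hypothetical doses
crop_yield = [2.1, 3.9, 6.2, 7.8, 10.0]   # hypothetical yields

x_bar, y_bar = mean(fertilizer), mean(crop_yield)

# Slope = sum of cross-products / sum of squares of x.
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(fertilizer, crop_yield))
sxx = sum((x - x_bar) ** 2 for x in fertilizer)

slope = sxy / sxx                   # change in yield per unit of fertilizer
intercept = y_bar - slope * x_bar   # predicted yield when fertilizer = 0
```

With these numbers the slope comes out at about 1.97 and the intercept at about 2.06.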

8
Q

one-way ANOVA

A

single categorical predictor, 5 levels
estimate the mean for the control
estimate the differences between the control mean and each treatment mean (b, c, d, e)

9
Q

multiple regression

A

single categorical predictor interacting with single continuous predictor
estimate the y-intercept of species 1
the difference in y-intercept of species 2 (relative to species 1)
the slope of species 1
the difference in slope of species 2 (relative to species 1)
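
A sketch of how the four coefficients combine into a prediction. The function name `predict`, the coefficient names, and the coefficient values are all hypothetical.

```python
def predict(x, species, b0, b0_diff, b1, b1_diff):
    """Predicted response for species 1 (baseline) or species 2."""
    is_species_2 = 1 if species == 2 else 0
    intercept = b0 + b0_diff * is_species_2  # species 2 shifts the intercept
    slope = b1 + b1_diff * is_species_2      # species 2 shifts the slope
    return intercept + slope * x

# Hypothetical coefficient estimates.
coefs = dict(b0=1.0, b0_diff=0.5, b1=2.0, b1_diff=-0.5)

y_sp1 = predict(3.0, 1, **coefs)  # 1.0 + 2.0 * 3.0
y_sp2 = predict(3.0, 2, **coefs)  # (1.0 + 0.5) + (2.0 - 0.5) * 3.0
```

If both difference coefficients were zero, the two species would share one line.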

10
Q

confronting the model with data

A
  1. estimate coefficients that describe the numerical relationship between predictors and response (parameter estimation) and test whether they could in reality be zero (t-test)
  2. estimate whether the model explains more variation in the data than expected by chance (ANOVA)
11
Q

null hypothesis

A

asks: what if the coefficients were all zero?
the point of statistical analysis is to use evidence (data) to reject (or fail to reject) the null hypothesis

12
Q

errors

A

deviations of the observed values from the expected value (also called errors or residuals)
the expected value is the value that minimizes the sum of squared deviations

13
Q

error sums of squares

A

the best fit: minimizing the error sum of squares
what value of the coefficient gives the smallest residuals?
method of ‘least squares’
error = observed - expected
error sum of squares (ESS) = sum(error^2)
sum of errors = zero, by definition
b0 = mean(observed) = expected gives the smallest ESS
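
A quick numerical check of the claim that the mean gives the smallest ESS. The observations and the function name `error_sum_of_squares` are made up for illustration.

```python
from statistics import mean

observed = [4.0, 6.0, 5.0, 9.0]  # hypothetical observations

def error_sum_of_squares(b0, data):
    """ESS when b0 is used as the expected value for every observation."""
    return sum((y - b0) ** 2 for y in data)

b0 = mean(observed)
errors = [y - b0 for y in observed]  # error = observed - expected
ess_at_mean = error_sum_of_squares(b0, observed)

# Any other candidate value for b0 gives a larger ESS.
ess_nearby = [error_sum_of_squares(b0 + shift, observed)
              for shift in (-1.0, -0.5, 0.5, 1.0)]
```

Here the errors are -2, 0, -1 and 3, which sum to zero, and the ESS at the mean is 14, while the shifted candidates give 18, 15, 15 and 18.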

14
Q

sample coefficient isn’t the ‘true’ parameter

A

every sample coefficient we estimate is a distribution, not a single number
specifically a t-distribution
width of the curve is defined by the standard error (SE)
the fewer independent points (degrees of freedom) we have, the fuzzier our estimate

15
Q

t value

A

coefficient / standard error
the bigger the t value, the bigger the coefficient is relative to its standard error, so the more precise the estimate and the stronger the evidence that the coefficient is not zero

16
Q

confidence interval

A

roughly, an interval that has some probability of containing the ‘true’ parameter
typically 95% of the area under the probability curve
95% CI is approximately b +/- 2 SE
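
A sketch of the b +/- 2 SE rule for the simplest coefficient, a sample mean. The sample is hypothetical. Note that 2 is only a rough multiplier: with very few degrees of freedom the exact t multiplier is larger (about 2.78 for 4 degrees of freedom).

```python
import math
from statistics import mean, stdev

sample = [4.0, 6.0, 5.0, 9.0, 6.0]  # hypothetical observations

b = mean(sample)                               # estimated coefficient (the mean)
se = stdev(sample) / math.sqrt(len(sample))    # standard error of the mean

t_value = b / se                               # coefficient / standard error

# Approximate 95% confidence interval: b +/- 2 SE.
ci_low, ci_high = b - 2 * se, b + 2 * se
```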

17
Q

model sum of squares

A

the sum of squared differences between the predicted values from our equation and the overall grand mean of the data

18
Q

degrees of freedom

A

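independent pieces of information
model degrees of freedom = number of coefficients (not counting the intercept when the model is compared against the grand mean)
error degrees of freedom = number of independent data points - number of coefficients (including the intercept)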

19
Q

mean squares

A

sum of squares/ degrees of freedom
amount of variation per degree of freedom

20
Q

ANOVA and F-test

A

F = ratio of explained to unexplained variance
F = model MS / error MS
expected F under the null hypothesis is 1
if F is sufficiently greater than 1 (depending on DF), the model is statistically significant
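
A sketch of the full calculation for a straight-line fit on invented data: sums of squares, degrees of freedom, mean squares, then F. One assumption to flag: model DF is counted here as 1 (the slope only), because model SS is measured against the grand mean, which already uses up the intercept.

```python
from statistics import mean

x = [0.0, 1.0, 2.0, 3.0, 4.0]    # hypothetical predictor values
y = [2.1, 3.9, 6.2, 7.8, 10.0]   # hypothetical responses

# Least-squares line.
x_bar, y_bar = mean(x), mean(y)
slope = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
         / sum((xi - x_bar) ** 2 for xi in x))
intercept = y_bar - slope * x_bar
predicted = [intercept + slope * xi for xi in x]

# Sums of squares.
model_ss = sum((p - y_bar) ** 2 for p in predicted)         # explained
error_ss = sum((yi - p) ** 2 for yi, p in zip(y, predicted))  # unexplained
total_ss = sum((yi - y_bar) ** 2 for yi in y)

# Degrees of freedom (assumption: intercept not counted in model DF).
model_df = 1
error_df = len(y) - 2   # data points - coefficients (intercept and slope)

# Mean squares and F ratio.
model_ms = model_ss / model_df
error_ms = error_ss / error_df
f_ratio = model_ms / error_ms
```

Model SS and error SS add up to total SS. Whether a given F is significant still depends on comparing it with the F distribution for these degrees of freedom.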
