part one Flashcards
what are the two types of variability
- intrinsic (natural system)
- extrinsic (measurement error)
when do you define the population
the population must be defined before the sampling process begins, as it dictates how you sample
how are frequency curves characterised
by 2 key parameters: location (eg mean, median, mode) and dispersion (spread, eg variance)
why are the parameters of the frequency curve important
we can never know the true population parameters, therefore we must infer them from samples
what is mu (μ)
population mean
what is x̄ (x bar)
sample mean
what is σ
population standard deviation
what is s
sample standard deviation
what is SEM
standard error of the mean: a measure of the variability of the sample mean
what are the 6 main steps of the logical framework ?
- observations
- models
- hypothesis
- null hypothesis
- experiment and sampling
- interpretation and results
what is the next step after you retain the null
you refute model and hypothesis. therefore you go back to observations and find out what was missing
what is the next step after you reject the null
you retain the model and hypothesis. you don't stop: you ask why is this the case, ie what are the mechanisms that make this model true?
what are the 2 types of observations
- casual (personally seen in nature, with no prior knowledge)
- previously quantified in the literature
what types of phrases must be used when making casual observations
it appears, it seems, it looks like
ie not certain
what is a model
the reason behind observations used to explain process
how do you state model from a casual observation
it is correct because it happens in nature in this location where i saw it
how do you state a model from a quantified observation
literature behind process eg this is because…
what is a hypothesis
what you predict if the model is true.
use the structure… if I do this then i will observe this/i will expect this
what is the difference between mensurative and manipulative
mensurative experiments are observational, they do not change the experiment.
manipulative experiments change system to understand patterns (you need literature first)
what is the point of a null hypothesis and what is this approach called
falsificationist approach. the hypothesis can never be proven because the whole population can't be measured. instead you set up a null and try to disprove it; what survives repeated attempts at falsification is provisionally accepted as true.
limited by its design, a mensurative study can only give certain interpretations. what are they
correlative not causational
it doesn't let you understand cause and effect or mechanisms. merely descriptive/qualitative
what is required for appropriate manipulative studies
appropriate controls and adequate prior biological knowledge of the system
what is the difference between precision and accuracy
precision is a measure of spread (precise = narrow, imprecise = wide). you can assess it using the standard error of the mean
accuracy is a measure of how close the sample mean is to the population mean (usually you cannot test accuracy)
SEM
s/sqrt n
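the SEM formula can be checked with a short Python sketch (the sample values are made up for illustration):

```python
import math
import statistics

# hypothetical sample measurements (illustrative values only)
sample = [4.2, 5.1, 3.8, 4.9, 5.3, 4.4]

s = statistics.stdev(sample)   # sample standard deviation, s
n = len(sample)
sem = s / math.sqrt(n)         # SEM = s / sqrt(n)
print(round(sem, 3))           # → 0.236
```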
when should you use random sampling
when information is not known about the population
when should you use stratified sampling
when you know information about the population and want to best represent it. this increases precision and accuracy
is random sampling always representative
no
by chance it can be or it can not be. therefore preliminary tests can be performed with lots of replicates to decide how to sample representatively
assumptions that must be accounted for PRE sampling
independence
randomness
these are KEY
assumptions that are analysed POST sampling
homogeneity of variances
normality of residuals
how to ensure independent data
replicates need to be independent of each other (eg separated through space); look for possible relationships between replicates
what is pseudoreplication
'replicates' that are non-independent of each other and therefore not true replicates, as you are not accounting for relationships between individuals. this increases type 1 error
what is confounding
when you reject the null (ie your hypothesis is supported) however this is not because your model is correct rather you have not accounted for other factors/variables that cause this relationship
how to mitigate confounding effects
by performing a manipulative study where you can control the variables
why do we perform statistical tests
as we are taking a sample of the population that is subject to error, we can only make probabilistic statements rather than absolute statements. statistics allows us to quantify that uncertainty
what are the three components of a statistical test
a null hypothesis
a test statistic
rejection region and critical value
what is the logical null
everything not included in hypothesis (eg equal or opposite)
what is the statistical null
there is no difference between groups
what is the t-test
testing the difference between 2 means
when do you use 2 tailed t test
when there is no direction in your hypothesis (ie there is no specified direction for the proposed difference)
when do you use a 1 tailed t test
when you have a directional hypothesis (eg this pop is greater than this pop)
what is a type 1 error
when the null is true however you reject it
what is a type 2 error
when the null is false however you support it
how can you control type 1 error
critical value (eg alpha = 0.05)
why are the rejection regions smaller for 2 tailed t tests
the total probability is always alpha (eg 0.05), so for a 2 tailed test you halve alpha between the two tails (eg 0.05/2 in each tail)
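the halving of alpha can be seen by computing critical values with Python's standard library; the standard normal is used here as a large-sample stand-in for the t distribution:

```python
from statistics import NormalDist

alpha = 0.05
z = NormalDist()  # standard normal, a large-sample approximation to t

one_tailed = z.inv_cdf(1 - alpha)      # one rejection region of size alpha
two_tailed = z.inv_cdf(1 - alpha / 2)  # alpha split across both tails

print(round(one_tailed, 3), round(two_tailed, 2))  # → 1.645 1.96
```

the two tailed critical value is larger, so each rejection region is smaller.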
why is the assumption of homogeneity important
if variances are not equal then the rejection regions will not be comparable across groups; this increases type 1 error
to reduce: large samples and balanced n
can be fixed by transforming
what is a residual
difference between data point and predicted value (ie mean)
why is normality usually not important
the central limit theorem ensures sample means are approximately normal, therefore in a large enough sample it is not necessary
can be fixed post sampling by transforming. normality is only important in really skewed/non-normal data.
what is anova
analysis of variance. compares variation between groups (2 or more) with variation within those groups
why not use a t tests on more than 2 groups
increases the probability of type 1 error (for k independent tests the familywise error rate rises to about 1 - (1 - 0.05)^k, ie well above 0.05). corrections can be used but these increase type 2 error and reduce power
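a minimal sketch of why repeated t tests inflate type 1 error, assuming the tests are independent:

```python
alpha = 0.05
for k in (1, 3, 10):
    # P(at least one false rejection across k independent tests at alpha = 0.05)
    familywise = 1 - (1 - alpha) ** k
    print(k, round(familywise, 3))
# → 1 0.05
# → 3 0.143
# → 10 0.401
```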
what is a factor
ie treatment/group.
you have factors and levels within factor
what is the linear model in descriptive terms
mean + effect of factor + noise
what is the null testing in an anova
that there is no effect ie the levels of a factor dont differ. therefore the MS ratio = 1
what is the alt testing in an anova
that there is an effect, ie the levels of a factor differ. the MS ratio is >1
are covariates of an anova categorical or continuous? why is this unique?
categorical, ie factors. it is still a linear model
when conducting a test in R for homogeneity, how do you read the output ?
the null is that there is no difference (ie the variances are homogeneous, which is what we want). therefore if NOT significant, the variances are not different and therefore homogeneous
what is a one way anova?
1 factor with multiple levels (comparing between levels)
what is a 2 way anova
2 factors with multiple levels (comparing between factors and between levels)
when conducting a t test or anova and you get a very small p value what does this mean?
this does not measure the magnitude of the effect, just that it is indeed significant. to assess the magnitude, look at the data not the p value
what is a post hoc test?
tests conducted after getting a significant p value in an anova (with more than 2 levels) to determine which levels differ
how to read output of SNK post hoc test in R
look at the ranks given to each level to know which ones are being compared. then look at the comparisons (ie 2-1) and check for stars to see which are significant. to see which is bigger, look at the rank means.
what is a correlation
testing for a relationship/association between 2 random variables
what is a regression
testing whether a response variable is driven by an explanatory variable
ie prediction
what is the difference between correlation and regression
a correlation must be conducted first to establish that there is an association/pattern between the variables and to test the strength of that relationship. once this is established you can test for prediction (ie whether one predicts the other)
are covariates of regression/correlation categorical or continuous.
continuous. it is a Linear model
what is needed when sampling for correlation/regression
each unit in the population has a value for each variable
what is the statistic used for a linear correlation
r = Pearson's correlation coefficient. ranges from -1 to +1 (ie perfect negative to perfect positive). 0 = no relationship
what is the r/ cor value
the correlation r value. not the same as r2. measures the strength of the relationship, -1 to +1. 0 = no relationship.
besides random sampling and independence what are the other assumptions for linear correlation
normally distributed variables, and relationships between variables are linear
if a regression is shown between 2 variables, what are your limits with prediction
you can predict new values of Y (response) from new values of X (explanatory), however this is ONLY within your sampled range; you can't extrapolate beyond it.
in the linear model, what is the slope
the relationship. if slope is at 0 there is no relationship.
what is the pearsons r2
tests the precision of prediction: how much of the variation in y is explained by x (closer to 1 = stronger). if below 0.5, a lot of the variance is not explained purely by the relationship with x
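r and r2 can be computed by hand from their definitions; the paired values below are invented for illustration:

```python
import math

# hypothetical paired measurements (x = explanatory, y = response)
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))  # cross products
sxx = sum((xi - mx) ** 2 for xi in x)
syy = sum((yi - my) ** 2 for yi in y)

r = sxy / math.sqrt(sxx * syy)   # Pearson correlation coefficient
r2 = r ** 2                      # proportion of variation in y explained by x
print(round(r, 4), round(r2, 4))  # → 0.9987 0.9973
```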
what is the null for a regression
the slope: either the slope is 0 (no relationship) or it is in the direction opposite to your alternative.
what are the steps to completeing a regression test
- plot a scatter plot to check the relationship looks linear
- perform the anova and look at the p value
- if significant, look at r squared to test strength
what is a big assumption for linear regression
fixed X.
X is assumed to be measured without error. all measurement has error, so in practice the error must be small relative to the scale of measurement (ie measuring in cm with mm-level error is okay)
what is the difference between correlation and causation
correlation is an association between 2 variables; just because there is a relationship it doesn't mean one causes the other.
causation means one variable is caused by the other.
how do you determine causality
you can only disprove nulls, therefore you can never prove causality
however, to infer causality you need to perform manipulative experiments
what is a scale consideration when creating experiments
you usually work with smaller scales, will the same relationship be found at larger scales?
what is a procedural control
another level of a factor to account for experimental artefacts, ie an effect you created with your experiment that previously wasn't in the system (confounding). you need a treatment, a control and a procedural control
categorical covariates (aka explanatory variables) can be defined as..
factors:
fixed vs random
crossed vs nested
can you have an interaction with a 1 way anova
no. an interaction means that the levels of one factor depend on the levels of another factor. you can only have interactions in a 2 or more way anova
what is a nested design, give an example
levels of factor 1 are nested within levels of factor 2
a common one is location:
ie treatments for factor 1 are separated between sites
what is a crossed design
all levels of factor 1 are present within the other factor
ie all treatments are present within each site
in crossed or nested can you find interactions
only in crossed designs can you find interactions
what is an interaction
levels of one factor are dependent on the levels of another factor
ie whether the levels of factor 1 are significant will depend on which site (level) of factor 2 they are in. this means there is inconsistency through space.
in an anova table how do you know if there is an interaction?
look at the bottom
factor1:factor2 and the pvalue
what do you do if there is an interaction
post hoc tests. eg an SNK test will look at each level of factor 2 and compare the levels of factor 1 within it (eg each site and the levels within that site); if they are significant then look at the means
say you dont have homogenous variances but you perform a test anyway when is this an issue ?
if you don't get a significant effect then there is no issue; however if you get a significant effect then you need to transform and test again, because non-homogeneity increases type 1 error (saying there is an effect when there isn't)
will you need to do post hoc if you do a 1 factor anova with no interaction?
yes, if there are more than 2 levels to know which levels are different.
what happens if there is no significant interaction
look at the main effects, ie differences in your response variable between the levels of each factor
what is a mixed model
a combination of fixed factors and random factors
what is a random factor
fixed: treatment, specific.
random: general, representative
example:
fixed sites: you care about each site
random sites: you test for spatial consistency
why is the difference between fixed and random factors important
it will change how the mean square is estimated.
fixed cares about means between levels, whereas random cares only about variability between sites
same for the null: are you looking at means (fixed) or variances (random)
what is the extent of the inference for fixed vs random factors
you cannot extrapolate/generalise from fixed factors. what you get from your experiment is specific to your factors.
for random factors it is more general and inferences can be applied to other spp/sites etc.
why have mixed models?
avoid confounding (spatial or temporal), avoid non-independence, test for consistency
can you pool random sites together to increase sample sizes?
no. this increases type 1 error
are random factors continuous or categorical
always categorical
are chi tests for categorical or continuous variables?
categorical
what are the two ways chi tests can be used
- goodness of fit: whether the sample matches the expected population
- contingency or association test: a test for independence
how to calculate the degrees of freedom for a chi test
(rows - 1)*(columns -1)
how do you generate expected cell counts for chi tests
using the null hypothesis and the total samples
what is the chi test statistic formula
[sum of] (observed - expected)² / expected
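a worked goodness-of-fit example in Python (the counts are invented; note that df here is categories − 1, since the rows × columns formula applies to contingency tables):

```python
# hypothetical counts: 60 observations over 3 categories, null = equal proportions
observed = [28, 18, 14]
total = sum(observed)
expected = [total / len(observed)] * len(observed)  # expected counts under the null

# chi-square statistic: sum of (observed - expected)^2 / expected
chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1   # goodness of fit: df = number of categories - 1
print(round(chi2, 2), df)  # → 5.2 2
```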
what is a key assumption of the chi test goofness of fit
no more than 20% of expected frequencies should be smaller than 5. there can be transformations/pooling if so
how is a test for independence similar and different from a correlation
both test for associations between random variables, however chi is for categorical variables while correlation is for continuous
what is the null for a chi 2 independence test
no association
are there post hoc tests for chi 2
no, only way to know is to plot the data on graphs
what are parametric tests
they make assumptions about the parameters (ie means, variances) of a population's distribution between groups/treatments
eg t test ANOVA linear regression
what are non parametric tests
distribution free: they do not estimate parameters, ie rank based tests
what is a rank based non parametric test
no assumption about the underlying distribution, therefore good when you have very non-normal data, big outliers you want to keep, or the responses are already ranks
assumptions of non parametric tests
independence between samples and homogeneity of variance
how to perform a non parametric rank test
rank all observations from low to high, ignoring groups
randomise the ranks many times to build a probability distribution, then see where the real data falls in that distribution
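the procedure can be sketched as a randomisation test in Python (the group sizes, ranks and rank-sum statistic are illustrative choices, not a specific named test):

```python
import random

random.seed(1)  # reproducible shuffles

# hypothetical pooled ranks already assigned low to high
group_a = [1, 2, 4, 5]   # ranks falling in group A
group_b = [3, 6, 7, 8]   # ranks falling in group B
observed = sum(group_b) - sum(group_a)   # observed rank-sum difference

pooled = group_a + group_b
n_perm = 5000
count_extreme = 0
for _ in range(n_perm):
    random.shuffle(pooled)                     # randomise ranks across groups
    diff = sum(pooled[4:]) - sum(pooled[:4])   # same statistic on shuffled ranks
    if abs(diff) >= abs(observed):
        count_extreme += 1

# proportion of randomised outcomes at least as extreme as the real data
p_value = count_extreme / n_perm
print(p_value)
```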
what is better, a parametric test or a non parametric test
always use a parametric test if you can, it is more powerful. try transforming the data first if it is non-normal. best to use a non parametric test if your data are already ranks
what is the mann whitney wilcoxon test
a non parametric test to compare 1 factor with 2 levels (similar to a t test)
when to use kruskal wallis test
an extension of MWW for a factor with more than 2 levels (similar to an anova)
when to use spearman rank correlation
for non linear but monotonic relationships between continuous variables
when ranking data what do you do with tied observations with the same value
average of the ranks
what is rho (ρ)
the rank coefficient for spearman's correlation. a measure of the strength of the relationship, -1 to +1.
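tied-rank averaging and rho can be sketched together; the helper names below are mine, not from any library:

```python
import math

def ranks_with_ties(values):
    # assign ranks from low to high; tied values share the average of their ranks
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1                      # extend over the run of tied values
        avg = (i + j) / 2 + 1           # average of 1-based positions i..j
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    # Pearson correlation applied to the ranks = Spearman's rho
    rx, ry = ranks_with_ties(x), ranks_with_ties(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    num = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    den = math.sqrt(sum((a - mx) ** 2 for a in rx) *
                    sum((b - my) ** 2 for b in ry))
    return num / den

print(ranks_with_ties([3, 1, 4, 1, 5]))  # → [3.0, 1.5, 4.0, 1.5, 5.0]
```

the two tied values (both 1) occupy rank positions 1 and 2, so each gets 1.5.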
to get independent data when sampling, what is the best method
randomly take 1 sample per treatment within each block
or take all samples for a treatment in the same block and use only their mean
you cannot use all samples individually as that is pseudoreplication