exam 2 Flashcards

1
Q

descriptive, differences, correlation/regression, association projects (differences, projects)

A

descriptive - do not carry out hypothesis, the goal is to describe the situation (various statistical measures may be important), histogram, density and boxplots
differences - compares two or more sets of data (hypothesis will relate to differences you believe may exist), bar charts, side by side boxplots
correlation/regression - attempts to link variables (looking for strength and direction of links between variables), scatterplots, line plots
association - emphasis on links between variables that are categorical, bar charts, pie charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

null and alternative hypothesis

A

we test the null hypothesis, data is gathered to test null
we do not prove the alternative hypothesis, the most we can do is find support for it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

possible outcomes from hypothesis testing

A

reject and fail to reject null - reject = null is not accurate, fail to reject = null is accurate
p-value - the probability that the null hypothesis is correct from the data gathered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

histogram

A

descriptive test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

boxplots

A

descriptive, difference (side by side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

bar charts

A

differences, association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

scatterplots

A

correlation and regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

line plots

A

correlation and regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

pie charts

A

association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

histogram in r

A

hist(object)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

boxplot from object in r

A

boxplot(object)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

set scale of axis in r

A

ylim=c(0,0) xlim=c(0,0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

add axis label and graph title in r

A

xlab = “Title”
ylab=”Title”
main=”Title”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

change colors of bars or boxes in r

A

col=”Color”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

popperian philosophy

A

we learn by being wrong, no amount of evidence can prove something is true (empirical falsification)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

testing a null/ reshuffling

A

to determine what no change would look like, create data that would be reasonable for the system (after plenty of research about what is realistic) to come up with more data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

level of probability that scientists use as a threshold for deciding how to interpret hypothesis

A

.05 p-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

basic study set up for a t-test

A

create hypothesises, collect data, data must be normally distributed, each data point must be independent

19
Q

what happens to t when variables are changed

A

when t increases, mean difference increases, when t decreases, standard deviation increases, when t increases, n increases

20
Q

what test to do to determine if data are appropriate for t-test, how to interpret

A

find if the data are normal (boxplot or shapiro-wilk test)
greater than .05 = the data is normal and a t-test can be done

21
Q

t-test in r

A

t.test(object)

22
Q

one tailed vs two tailed t-test

A

one tailed - more power to detect directional effect (greater than or less than)
two tailed - shows evidence that the difference between means is greater than expected

23
Q

paired t-test

A

repeated observations collected for a single variable with 2 levels (differences between sample point 1 and sample point 2 are compared for the same sample unit)

24
Q

non-parametric test

A

use the rank of data and rank from smallest to largest, compare the ranks
mann-whitney (two sample) and wilcoxon (paired) tests

25
Q

import data as .csv into r

A

data<-read.csv(file.choose())

26
Q

how to take one column and create an object in r

A

object<- dataset$column

27
Q

plotting boxplots and histograms in r

A

boxplot(object)
hist(object)

28
Q

checking for normality in r

A

shapiro.test(object) OR
wilcox.test(object)

29
Q

basic code for t-test in r

A

t.test(object)

30
Q

how to tell if data are normal/not normal

A

parametric - do a shapiro wilk test
non parametric - do a mann-whitney or wilcoxon test

31
Q

how to deal with not normal data

A

can be log transformed OR use non-parametric tests

32
Q

how to use a non-parametric test

A

mann-whitney test - equivalent of 2 paired t test, compares the observed difference in mean of ranks to the maximum possible difference in the mean of ranks
wilcoxon test - matched pairs, compares the ranks of differences

33
Q

how to decide how much data is needed to collect

A

preliminary sampling - small study to refine and evaluate sampling size, acceptability, feasibility, and cost of larger study (determines problems and best methods)
dummy data - learn everything and then make up what you think are plausible data
primary literature investigation - learn everything you can about your system and others like it (what are other people doing)
update analysis as you collect data

34
Q

when do we have independent replicates and when do we not

A

pseudoreplication - if replicates are “tied” to each other in some way
all independent data points must have no connection to other data points

35
Q

simple pseudoreplication

A

only a single replicate per treatment and subsamples are collected from each area

36
Q

sacrificial pseudoreplication

A

experimental units are replicated

37
Q

temporal pseudoreplication

A

only a single replicate per treatment and subsamples are collected from it repeatedly over time

38
Q

phylogenetic pseudoreplication

A

closely related individuals are the units being sampled (seeds, tadpoles, insect larvae)

39
Q

technical replication

A

different observers or instruments are used for different parts of the experiment

40
Q

converting continuous variables to catagories

A

can limit the amount of data visible, can give an inaccurate and varying perspective of the results

41
Q

true positive (p-value)

A

when Ho is true and we fail to reject

42
Q

true negative (p-value)

A

when Ho is false and we reject it

43
Q

false positive (p-value)

A

when Ho is true and we reject it

44
Q

false negative (p-value)

A

when Ho is false and we fail to reject it