Final: Ch 11-20 Flashcards
Numerical Variables from a Single Sample
When is Ȳ normally distributed?
whenever:
- Y is normally distributed, OR
- n is large
Numerical Variables from a Single Sample
If Ȳ is normally distributed, what can we convert its distribution to?
standard normal distribution
Numerical Variables from a Single Sample
What does a standard normal distribution do?
describes the probability distribution of the standardized difference between a sample mean and the population mean
Numerical Variables from a Single Sample
What is used to calculate the confidence interval of the mean?
t-distribution
What does a one-sample t-test do?
compares the mean of a random sample from a normal population, with the population mean proposed in a null hypothesis
What are the hypotheses for a one-sample t-test?
H0: mean of the population is µ0
HA: mean of the population is not µ0
What is the degrees of freedom for a one-sample t-test?
df = n-1
What are the assumptions of a one-sample t-test? (2)
- variable is normally distributed
- sample is a random sample
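A minimal R sketch of a one-sample t-test, using made-up numbers and a hypothetical null value of 98.6; the reported confidence interval is the t-based CI of the mean described above:
```r
# hypothetical measurements; H0: population mean is 98.6
y <- c(98.4, 98.6, 97.8, 98.8, 99.0, 98.2, 98.7, 98.3)
t.test(y, mu = 98.6)            # one-sample t-test, df = n - 1
t.test(y, mu = 98.6)$conf.int   # 95% confidence interval of the mean (uses the t-distribution)
```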
Tests that compare means have what type of variables?
one categorical and one numerical variable
Paired vs. 2-sample t-tests
paired comparisons: allow us to account for a lot of extraneous variation
- ie. before and after treatment
- ie. upstream and downstream of power plant
- ie. identical twins – one with treatment, one without treatment
- ie. how to get earwigs in each ear out – compare tweezers to hot oil
2-sample comparisons: sometimes easier to collect data for
What are paired comparisons?
data from the two groups are paired
- each member of pair shares much in common with the other, except for the tested categorical variable
- there is one-to-one correspondence between the individuals in the two groups
- in each pair, there is one member that has one treatment/group and another who has another treatment/group
What do we use to compare two groups in paired comparisons?
use mean of the difference between the two members of each pair
What is a paired t-test?
one sample t-test on the differences
What does a paired t-test do?
compares mean of the differences to a value given in null hypothesis
for each pair, calculate the difference
What is the number of data points in a paired t-test?
number of pairs – NOT number of individuals
What is the degrees of freedom for a paired t-test?
df = number of pairs - 1
What are the assumptions of a paired t-test?
- pairs are chosen at random
- differences (NOT individuals) have normal distribution
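A minimal R sketch of a paired t-test on made-up before/after data (hypothetical values), showing that it is the same as a one-sample t-test on the differences:
```r
before <- c(12.1, 9.8, 11.5, 10.2, 13.0, 9.4)
after  <- c(11.0, 9.1, 10.9, 9.8, 12.2, 9.0)
d <- after - before                    # the normality assumption applies to these differences
t.test(d, mu = 0)                      # one-sample t-test on the differences
t.test(after, before, paired = TRUE)   # equivalent paired t-test, df = number of pairs - 1
```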
What does a 2-sample t-test do?
compares means of numerical variable between two populations
What is the degrees of freedom for a 2-sample t-test?
df = df1 + df2 = (n1 - 1) + (n2 - 1) = n1 + n2 - 2
What are the assumptions of a 2-sample t-test? (3)
- both samples are random samples
- both populations have normal distributions
- variance of both populations is equal
What does Welch’s t-test do?
compares means of two groups without requiring the assumption of equal variance
What is different about the degrees of freedom for Welch’s t-test compared to other tests?
degrees of freedom is not necessarily an integer
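A minimal R sketch contrasting the 2-sample t-test with Welch's t-test on made-up data (hypothetical values); in R, t.test() performs Welch's test by default:
```r
y1 <- c(5.1, 4.8, 5.6, 5.0, 4.7, 5.3)
y2 <- c(6.0, 5.8, 6.4, 5.9, 6.2, 5.7)
t.test(y1, y2, var.equal = TRUE)   # 2-sample t-test: assumes equal variances, df = n1 + n2 - 2
t.test(y1, y2)                     # Welch's t-test: no equal-variance assumption,
                                   # df is usually not an integer
```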
Wrong Way to Make Comparison of Two Groups
“Group 1 is significantly different from a constant, but Group 2 is not. Therefore Group 1 and Group 2 are different from each other.”
What does Levene’s test do?
compares variances of two (or more) groups
use R to calculate
What does the F test do?
most commonly used test to compare variances
Why do we usually use Levene’s test instead of F test?
F test is very sensitive to its assumption that both distributions are normal
What are the 2 tests that compare variances?
- Levene’s test
- F test
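A minimal R sketch of both variance tests on the same made-up data; var.test() is the F test in base R, and leveneTest() comes from the car package (an assumption about which package the course uses):
```r
y1 <- c(5.1, 4.8, 5.6, 5.0, 4.7, 5.3)
y2 <- c(6.0, 5.8, 6.4, 5.9, 6.2, 5.7)
var.test(y1, y2)                        # F test: ratio of the two sample variances
library(car)                            # install.packages("car") if not already installed
y <- c(y1, y2)
group <- factor(rep(c("A", "B"), each = 6))
leveneTest(y ~ group)                   # Levene's test: less sensitive to non-normality
```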
What 2 tests can conduct two-sample comparisons?
2-sample t-test or Welch’s t-test
What does 2-sample t-test and Welch’s t-test both assume?
normally distributed variables
What assumption differs between 2-sample t-test and Welch’s t-test?
- 2- sample t-test assumes equal variance
- Welch’s t-test does NOT assume equal variance
What can you compare the means of two groups using? (2)
- mean of paired differences
- mean difference between two groups
What are the assumptions of all t-tests? (2)
- random sample(s)
- populations are normally distributed
(for 2-sample t-test only): populations have equal variances
What are methods to detect deviations from normality? (4)
- previous data / theory
- histograms
- quantile plots
- Shapiro-Wilk test
What does normal data look like in a quantile plot?
points form an approximately straight line
What is the Shapiro-Wilk Test used for?
to test statistically whether a set of data comes from a normal distribution
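A minimal R sketch of the graphical and statistical normality checks, using stand-in data drawn from a normal distribution:
```r
set.seed(1)
y <- rnorm(40, mean = 10, sd = 2)   # stand-in data
hist(y)                             # histogram: look for strong skew or outliers
qqnorm(y); qqline(y)                # quantile plot: points near a straight line if normal
shapiro.test(y)                     # Shapiro-Wilk test of normality
```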
What do you do when assumptions are not true? (5)
- if sample sizes are large, sometimes parametric tests work OK anyway
- transformations
- non-parametric tests
- permutation tests
- bootstrapping
Why do parametric tests on large samples work relatively well even for non-normal data?
means of large samples are normally distributed
rule of thumb: if n > ~50, then normal approximations may work
What parametric test is ideal when assumptions are not true?
Welch’s t-test
if sample sizes are equal and large, then even a 10x difference in variance is approximately OK – but Welch’s is still better
What are data transformations?
changes each data point by some simple mathematical formula
then carry out the test on transformed data
When is log transformation useful? (3)
- variable is likely to be the result of multiplication or division of various components
- frequency distribution of data is skewed right
- variance seems to increase as mean gets larger (in comparisons across groups)
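A minimal R sketch of a log transformation on made-up right-skewed data (hypothetical values and null value), with the test then carried out on the transformed scale:
```r
y <- c(1.2, 1.8, 2.3, 3.1, 4.8, 7.9, 12.4, 20.6)   # right-skewed, made-up values
logy <- log(y)                                      # log transformation
hist(y); hist(logy)                                 # transformed values are more symmetric
t.test(logy, mu = log(5))                           # carry out the test on transformed data
```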
What are some other types of transformations? (3)
- arcsine transformation
- square-root transformation
- reciprocal transformation
What are characteristics of valid transformations? (3)
- require same transformation be applied to each individual
- have one-to-one correspondence to original values
- have monotonic relationship with original values (ie. larger values stay larger)
What should you consider when choosing transformations? (3)
- must transform each individual in the same way
- transformed values must still carry biological meaning
- you CANNOT keep trying transformations until P < 0.05
What do non-parametric (“distribution-free”) methods assume?
assume less about underlying distributions
What do parametric methods assume?
assume a distribution or a parameter
What are some non-parametric tests? (3)
- sign test
- RANKS
- Mann-Whitney U test
What does the sign test do?
compares data from one sample to a constant
How is a sign test conducted?
- for each data point, record whether individual is above (+) or below (–) hypothesized constant
- use binomial test to compare result to ½
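A minimal R sketch of a sign test on made-up data, with a hypothetical constant of 50:
```r
y <- c(53, 61, 48, 57, 66, 51, 49, 60, 58, 55)
above <- sum(y > 50)            # number of '+' signs
n <- sum(y != 50)               # values exactly equal to the constant are dropped
binom.test(above, n, p = 0.5)   # binomial test comparing the proportion of '+' to 1/2
```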
Does sign test have high or low power?
has very low power – therefore it is likely NOT to reject a false null hypothesis
What does it mean for a test to have high power?
more power → more information → higher ability to reject false null hypothesis
What is RANKS?
used by most non-parametric methods
rank each data point in all samples from lowest to highest – ie. lowest data point gets rank 1, next lowest gets rank 2, …
What does the Mann-Whitney U test do?
compares central tendencies of two groups using ranks (equivalent to Wilcoxon rank sum test)
How is a Mann-Whitney U Test conducted?
- rank all individuals from both groups together in order (for example, smallest to largest)
- sum the ranks for all individuals in each group → R1 and R2
- calculate U1: number of times an individual from population 1 has lower rank than an individual from population 2, out of all pairwise comparisons
What are the assumptions of the Mann-Whitney U Test? (2)
- both samples are random samples
- both populations have the same shape of distribution – only necessary when using Mann-Whitney to compare means
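A minimal R sketch of ranking and the Mann-Whitney U test on made-up data; in R the test is run with wilcox.test() (Wilcoxon rank-sum):
```r
group1 <- c(2.3, 3.1, 1.9, 2.8, 3.5)
group2 <- c(4.0, 3.8, 4.4, 3.2, 4.1)
rank(c(group1, group2))        # pooled ranks: lowest value gets rank 1, next gets rank 2, ...
wilcox.test(group1, group2)    # Mann-Whitney U / Wilcoxon rank-sum test
```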
What is a permutation test used for?
for hypothesis testing on measures of association – can be done for any test of association between two variables
How is a permutation test conducted?
- variable 1 from an individual is paired with variable 2 data from a randomly chosen individual – this is done for all individuals
- estimate is made on randomized data
- whole process is repeated numerous times – distribution of randomized estimates is null distribution
What does it mean if permutation tests are done without replacement?
all data points are used exactly once in each permuted data set
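A minimal R sketch of a permutation test for a difference in group means, with made-up data; group labels are shuffled without replacement to build the null distribution:
```r
set.seed(1)
y <- c(5.1, 4.8, 5.6, 5.0, 6.0, 5.8, 6.4, 5.9)
group <- factor(rep(c("A", "B"), each = 4))
obs <- diff(tapply(y, group, mean))     # observed difference in group means
perm <- replicate(10000, {
  g <- sample(group)                    # shuffle labels without replacement
  diff(tapply(y, g, mean))              # estimate recalculated on permuted data
})
mean(abs(perm) >= abs(obs))             # two-sided permutation P-value
```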
What are the goals of experiments? (2)
- eliminate bias
- reduce sampling error (increase precision and power)
What are some design features that reduce bias? (3)
- controls
- random assignment to treatments
- blinding
What is a control?
group which is identical to the experimental treatment in all respects aside from the treatment itself
What is random assignment?
individuals are randomly assigned to treatments
How does random assignment reduce bias?
averages out effects of confounding variables
What is blinding?
preventing the experimenter (or patient) from knowing which treatment is given to whom
How do the results of unblinded studies compare to blinded studies?
unblinded studies usually find much larger effects (sometimes 3x higher) – shows the bias that results from lack of blinding
How can you reduce sampling error?
increase signal to noise ratio
if ‘noise’ is smaller, it is easier to detect a given ‘signal’ – can be achieved with smaller s or larger n
What are some design features that reduce the effects of sampling error? (4)
- replication
- balance
- blocking
- extreme treatments
What is replication?
carry out the study on multiple independent experimental units
What is balance?
nearly equal sample sizes in each treatment
What is blocking?
grouping of experimental units – within each group, different experimental treatments are applied to different units
How do extreme treatments reduce effects of sampling error?
stronger treatments can increase the signal-to-noise ratio
How does balance reduce effects of sampling error?
increases precision
for a given total sample size (n1 + n2), standard error is smallest when n1 = n2
How does blocking reduce effects of sampling error?
allows extraneous variation to be accounted for – it is therefore easier to see the signal through the remaining noise
What does ANOVA (analysis of variance) do?
compares means of more than two groups
asks whether any of two or more means is different from any other – is the variance among groups greater than 0?
How does ANOVA compare to a t-test?
like t-test, but can compare more than two groups
What are the hypotheses for ANOVA?
H0: all populations have equal means (variance among groups = 0)
HA: at least one population mean is different
What is ANOVA with 2 groups mathematically equivalent to?
two-tailed 2-sample t-test
In ANOVA, under the null hypothesis, why should the sample mean of each group vary?
because of sampling error
In ANOVA, what is the standard error?
standard deviation of sample means (when true mean is constant)
In ANOVA, if null hypothesis is not true, what should variance among groups be?
variance among groups should be equal to variance due to sampling error plus real variance among population means
if at least one of the groups has a different population mean, the variance among sample means will be larger than expected from sampling error (the standard error) alone
ANOVA
What is k?
number of groups
ANOVA
What is MSgroup?
mean squares group: measures the variation among the group sample means
ANOVA
What is MSerror?
mean squares error: measures the pooled variation within groups (sampling error)
What is the test statistic for ANOVA?
F = MSgroup / MSerror
ANOVA
What should F be if null hypothesis is true?
1
ANOVA
What is F if null hypothesis is false?
F > 1
(but must take into account sampling error – F calculated from data will often be greater than one even when null is true, therefore we must compare F to null distribution)
What is an ANOVA table?
convenient way to keep track of important calculations
scientific papers often report ANOVA results with ANOVA tables
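A minimal R sketch of a single-factor ANOVA on made-up data; summary() prints the ANOVA table (df, sums of squares, mean squares, F, P):
```r
d <- data.frame(
  y = c(5.1, 4.8, 5.6, 5.0, 6.0, 5.8, 6.4, 5.9, 5.3, 5.5, 5.2, 5.4),
  group = factor(rep(c("A", "B", "C"), each = 4))
)
fit <- aov(y ~ group, data = d)
summary(fit)    # ANOVA table: MSgroup, MSerror, F = MSgroup / MSerror, and the P-value
```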
What are the assumptions of ANOVA? (3)
- random samples
- normal distributions for each population
- equal variances for all populations
What is the Kruskal-Wallis Test?
non-parametric test similar to a single factor ANOVA
uses ranks of the data points
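A minimal R sketch of the Kruskal-Wallis test on made-up data for three groups:
```r
g1 <- c(5.1, 4.8, 5.6, 5.0)
g2 <- c(6.0, 5.8, 6.4, 5.9)
g3 <- c(5.3, 5.5, 5.2, 5.4)
kruskal.test(list(g1, g2, g3))   # rank-based analogue of single-factor ANOVA
```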
What is a factor?
categorical explanatory variable
What is multiple-factor ANOVA?
ANOVAs can be generalized to look at more than one categorical variable at a time
- can ask whether each categorical variable affects a numerical variable
- can ask whether categorical variables interact in affecting the numerical variable
Multiple-factor ANOVA Graphs
ANOVA
What are fixed effects?
treatments are chosen by experimenter – not a random subset of all possible treatments
- things we care about
- ie. specific drug treatments, specific diets, season
ANOVA
What are random effects?
treatments are a random sample from all possible treatments
- things that can affect response variable, but we don’t care too much about
- ie. family, location
ANOVA
What is the difference in statistics for fixed or random effects for single-factor ANOVA?
no difference
What is 2-factor ANOVA?
tests multiple null hypotheses at once: the effect of each factor plus their interaction
ie. whether there is a difference based on North vs. South alone, based on the other factor alone, or based on the combination of the two
Multiple Comparisons
What is the equation for probability of Type I error in N tests?
1 - (1-𝛼)^N
ie. for 20 tests, probability of at least one Type I error is ~65%
type 1 error rate for each test = 𝛼
Pr[not making type I error | null is true] = 1-𝛼
Pr[not making type I error in N tests | null is true] = (1-𝛼)(1-𝛼)⋯(1-𝛼) = (1-𝛼)^N
Pr[at least one type I error] = 1- (1-𝛼)^N
Multiple Comparisons
What happens to the probability of type I error every time you do a test?
probability increases
- do too many tests → probability gets too high
- do more tests → will find something that is statistically significant due to chance
What is the Bonferroni Correction for multiple comparisons?
uses smaller 𝛼 value
𝛼’ = 𝛼 / (number of tests)
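A minimal R sketch of the multiple-comparison calculations above, plus the equivalent adjustment of P-values with p.adjust() (hypothetical P-values):
```r
alpha <- 0.05
N <- 20
1 - (1 - alpha)^N    # probability of at least one Type I error in 20 tests (about 0.64)
alpha / N            # Bonferroni-corrected significance level, alpha'
p.adjust(c(0.001, 0.02, 0.04), method = "bonferroni")   # or adjust the P-values directly
```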
What does the Tukey Kramer test do?
compares all group means to all other group means to find which groups are different from which others
When are Tukey-Kramer tests done?
after finding evidence for differences/variation among means with single-factor ANOVA
What are the hypotheses for Tukey-Kramer test?
H0: 𝜇1 = 𝜇2
H0: 𝜇1 = 𝜇3
H0: 𝜇2 = 𝜇3
etc.
What is the probability of making at least one Type I error in Tukey-Kramer test?
probability of making at least one Type 1 error throughout the course of testing all pairs of means is no greater than significance level (𝛼)
Tukey-Kramer Graph
Why do we use Tukey-Kramer instead of a series of two-sample t-tests? (3)
- multiple comparisons would cause t-tests to reject too many true null hypotheses
- Tukey-Kramer adjusts for the number of tests
- Tukey-Kramer also uses information about variance within groups from all the data, so it has more power than t-test with Bonferroni correction
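A minimal R sketch of a Tukey-Kramer test on made-up data, run on the fitted single-factor ANOVA:
```r
d <- data.frame(
  y = c(5.1, 4.8, 5.6, 5.0, 6.0, 5.8, 6.4, 5.9, 5.3, 5.5, 5.2, 5.4),
  group = factor(rep(c("A", "B", "C"), each = 4))
)
fit <- aov(y ~ group, data = d)
TukeyHSD(fit)   # all pairwise comparisons of group means, adjusted for multiple comparisons
```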
What is the parameter for correlation?
⍴ (rho)
value is between -1 and 1
What is the estimate for correlation?
correlation coefficient (r): describes relationship between two numerical variables
What is the coefficient of determination (r^2)?
describes proportion of variation in one variable that can be predicted from the other variable
What is covariance in relation to variance?
variance is a special case of covariance – the variance of a variable is its covariance with itself: Cov(X, X) = Var(X)
What are the assumptions of correlation tests? (3)
- random sample
- X is normally distributed with equal variance for all values of Y
- Y is normally distributed with equal variance for all values of X
Correlation
What does it mean if ⍴ = 0?
- sampling distribution of r is approximately normal with mean = 0
- because the sampling distribution is normal and the standard error is estimated, use the t-distribution for the test
- if ⍴ ≠ 0, the sampling distribution of r is asymmetric
What is Spearman’s Rank correlation?
alternative to Pearson’s correlation that does not make so many assumptions
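A minimal R sketch of Pearson and Spearman correlation on made-up data:
```r
x <- c(1.2, 2.3, 3.1, 4.0, 5.2, 6.1, 7.3, 8.0)
y <- c(2.0, 2.8, 3.5, 3.9, 5.5, 6.0, 7.8, 8.4)
cor.test(x, y)                        # Pearson correlation: estimates r, tests H0: rho = 0
cor.test(x, y, method = "spearman")   # Spearman rank correlation: fewer assumptions
cor(x, y)^2                           # coefficient of determination, r^2
```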
Correlation
What is attenuation?
estimated correlation will be lower if X or Y are estimated with error
What does correlation depend on?
the range of values sampled – a restricted range of X or Y lowers the measured correlation
Are species independent data points?
NO
What is a similarity between correlation and regression?
both compare two numerical variables
What is a difference between correlation and regression?
each ask different questions:
- correlation – symmetrical
- regression – asymmetrical
What does regression do?
predicts Y from X (one variable from another)
What does linear regression assume? (3)
- random sample
- Y is normally distributed with equal variance for all values of X
- relationship between X and Y can be described by a line
Parameters of Linear Regression – graphs
What is the equation for the estimated regression line?
Y = a + bX
What is the least squares regression line?
the best-fit line – the one that minimizes the sum of squared residuals
What is a residual?
residual = observed Y - predicted Y
for every X value, Ŷ (predicted value of Y, by regression line) is value of Y right on the line
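A minimal R sketch of least-squares regression on made-up data; summary() also gives the slope test (H0: 𝛽 = 0, df = n − 2) and R² described in the next cards:
```r
x <- c(1, 2, 3, 4, 5, 6, 7, 8)
y <- c(2.1, 2.9, 3.8, 4.2, 5.5, 5.9, 7.1, 7.8)
fit <- lm(y ~ x)    # least-squares line: Y = a + bX
coef(fit)           # a (intercept) and b (slope)
residuals(fit)      # observed Y minus predicted Y-hat
summary(fit)        # slope test of H0: beta = 0, with df = n - 2, and R^2
```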
Regression
What does the coefficient of determination (r^2) do?
measures the proportion of variance in Y explained by the regression line
Regression
What do you need to be cautious about?
unwise to extrapolate beyond range of the data
What are the hypotheses for regression?
H0: 𝛽 = 0
HA: 𝛽 ≠ 0
Regression
What is the degrees of freedom for residual?
df = n - 2
What are confidence bands?
confidence intervals for predictions of mean Y
What are prediction intervals?
confidence intervals for predictions of individual Y
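A minimal R sketch of confidence bands versus prediction intervals from the same made-up regression:
```r
x <- c(1, 2, 3, 4, 5, 6, 7, 8)
y <- c(2.1, 2.9, 3.8, 4.2, 5.5, 5.9, 7.1, 7.8)
fit <- lm(y ~ x)
newx <- data.frame(x = c(2.5, 5.5))
predict(fit, newx, interval = "confidence")   # confidence bands: mean Y at each X
predict(fit, newx, interval = "prediction")   # prediction intervals: individual Y at each X
```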
How can non-linear relationships be ‘fixed’ (turned linear)? (3)
- transformations
- quadratic regression
- splines
What do residual plots do?
help assess assumptions
What should the residual plot look like?
- the mean of Y should fall on the line for each X, with residuals scattered around it
- residuals should be centred on 0, with roughly equal numbers of positive and negative values
- spread of the residuals should be about the same for every value of X
Polynomial Regression
Why should you NOT fit a polynomial with too many terms? (3)
(sample size should be at least 7x the number of terms)
- very unlikely that new X would fall on the line
- tradeoff between fit and prediction error – would fit better with your particular data set, but would have larger prediction error
What does logistic regression do?
tests for relationship between numerical variable (as the explanatory variable) and binary variable (as the response variable)
ie. does the dose of a toxin affect probability of survival?
ie. does the length of a peacock’s tail affect its probability of getting a mate?
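A minimal R sketch of logistic regression on made-up dose-survival data:
```r
dose     <- c(0, 1, 2, 3, 4, 5, 6, 7, 8, 9)
survived <- c(1, 1, 1, 1, 0, 1, 0, 0, 0, 0)   # 1 = survived, 0 = died (made up)
fit <- glm(survived ~ dose, family = binomial)
summary(fit)                                  # does dose affect the probability of survival?
predict(fit, data.frame(dose = 2.5), type = "response")   # predicted survival probability
```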
What is publication bias?
papers are more likely to be published if P < 0.05 – causes bias in science reported in literature
What are computer-intensive methods for hypothesis testing?
- simulation
- randomization
What are computer-intensive methods for confidence intervals?
bootstrap
What is simulation?
simulates sampling process on computer many times – generates null distribution from estimates done on simulated data
computer assumes null hypothesis is true
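A minimal R sketch of simulation, assuming (hypothetically) a null population that is normal with mean 10 and standard deviation 2, and an observed sample mean of 10.6 from n = 20:
```r
set.seed(1)
obs_mean <- 10.6
null <- replicate(10000, mean(rnorm(20, mean = 10, sd = 2)))   # sampling assuming H0 is true
mean(abs(null - 10) >= abs(obs_mean - 10))                     # two-sided simulation P-value
```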
What is the equation for likelihood?
L[hypothesis A | data] = Pr[data | hypothesis A]
What does likelihood NOT care about?
other data sets – ONLY cares about the specific data set we have
What does likelihood capture?
captures level of surprise
prefer models that make data less surprising, and have higher likelihood
Does likelihood consider more than one possible hypothesis?
yes
What is the law of likelihood?
a particular data set supports one hypothesis better than another if the likelihood of that hypothesis is higher than the likelihood of the other hypothesis
therefore we try to find the hypothesis with maximum likelihood (least surprising data) – all estimates we have learned so far are also maximum likelihood estimates
What are the 2 ways to find the maximum likelihood?
- calculus
- computer calculations
How to Find Maximum Likelihood
Calculus
ie. maximum value of L(p=x) is found when x = ⅜
note that this is the same value we would have gotten by methods we already learned
How to Find Maximum Likelihood
Computer Calculations
- input likelihood formula to computer
- plot value of L for each value of x
- find largest L
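A minimal R sketch of the computer approach, assuming the example above was 3 successes out of 8 trials (which is what gives an estimate of ⅜):
```r
p <- seq(0, 1, by = 0.001)
L <- dbinom(3, size = 8, prob = p)   # likelihood of each candidate value of p
plot(p, L, type = "l")               # plot L for each value of p
p[which.max(L)]                      # maximum likelihood estimate: 0.375 = 3/8
```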
What does hypothesis testing by likelihood do?
compares likelihood of maximum likelihood estimate to null hypothesis
use log-likelihood ratio
What is the test statistic for hypothesis testing by likelihood?
χ² = 2 × (log-likelihood ratio) = 2 × [ln L(maximum likelihood estimate) − ln L(null hypothesis)]
What is the degree of freedom for hypothesis testing by likelihood?
df = number of parameters fixed to specify the null hypothesis
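A minimal R sketch of a likelihood ratio test for the same hypothetical binomial example (3 successes in 8 trials), with H0: p = 0.5:
```r
logL_max  <- dbinom(3, 8, 3/8, log = TRUE)   # log-likelihood at the maximum likelihood estimate
logL_null <- dbinom(3, 8, 0.5, log = TRUE)   # log-likelihood under the null hypothesis
G <- 2 * (logL_max - logL_null)              # chi-squared test statistic
pchisq(G, df = 1, lower.tail = FALSE)        # one parameter fixed by the null, so df = 1
```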
When producing a 95% confidence interval for the difference between the means of two groups, under what circumstances can a violation of the assumption of equal standard deviations be ignored?
two-sample t-tests and confidence intervals are robust to violations of the equal standard deviation assumption as long as:
- sample sizes of the two groups are roughly equal
- neither standard deviation is more than about three times the other
What is the justification for including extreme doses well outside the range of exposures encountered by people at risk in a dose-response study on animals of the effects of a hazardous substance? What are the problems with this approach?
- extreme doses increase power, and so enhance the probability of detecting an effect
- however, effects of a large dose might be very different from effects of a smaller, more realistic dose
- if an effect is detected, then studies of the effects of more realistic doses would be the next step
What does randomization do?
removes effects of confounding variables
What does blinding do?
avoids unconscious bias
What happens if a study has a poor control?
increases possibility of confounding by unmeasured variables
What are planned vs. unplanned comparisons?
unplanned comparisons – intended to search for differences among all pairs of means
planned comparisons – must be few and identified as crucial in advance of gathering and analyzing the data
The largest pairwise difference between means, that between the “medium” and “isolated” treatments, is statistically significant. How is this possible, given that neither of these two means is significantly different from the means of the other two groups?
failure to reject a null hypothesis that the difference between a given pair of means is zero does not imply that the means are equal, because power is not necessarily high, especially when the differences are small
if the means of the “medium” and “isolated” treatments differ from one another, then one or both of them must differ from the means from the other two groups, but we don’t know which
What quantity would you use to describe the fraction of the variation in expression levels explained by group differences?
R^2
Earwig density on an island and the proportion of males with forceps are estimates, so the measurements of both variables include sampling error. In light of this fact, would the true correlation between the two variables tend to be larger, smaller, or the same as the measured correlation?
sampling error in the estimates of earwig density and the proportion of males with forceps means that true density and proportion on an island are measured with error
measurement error will tend to decrease the estimated correlation
therefore, the actual correlation is expected to be higher on average than the estimated correlation.
How do you analyze assumptions of linear regression in scatter plot?
- residuals are symmetric and don’t show any obvious non-normality
- variance of the residuals does not appear to change greatly for different values of X
What is a least squares regression line?
minimizes the sum of squared differences between the predicted Y-values on the regression line for each X and the observed Y-values
What are residuals?
differences between predicted Y-values on the estimated regression line, and the observed Y-values
What does the MSresidual measure?
variance of the residuals
Linear Regression
What does R^2 measure?
fraction of the variation in Y that is explained by X
The data set depicted in the graph includes one conspicuous outlier on the far right. If you were advising the forensic scientists who gathered these data, how would you suggest they handle the outlier?
- first, check the data to ensure this individual was not entered incorrectly
- perform the analysis with and without the outlier included in the data set to determine whether it has an influence on the outcome
- if it has a big influence, then it is probably wise to leave it out and limit predictions to the range of X-values between 0 and about 200 (and urge them to obtain more data at the higher X-values)
What do confidence bands measure?
give the confidence interval for the predicted Y for a given X
Which bands would provide the most relevant measure of uncertainty?
prediction interval, because it measures uncertainty when predicting Y of a single individual
What is ANCOVA?
(analysis of covariance)
compares the slopes of regression lines among two or more groups
What are the hypotheses of ANCOVA?
H0: 𝛽1 = 𝛽2 = 𝛽3 = 𝛽4 = 𝛽5… (all slopes are equal)
HA: at least one of the slopes is different from another
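A minimal R sketch of one way to compare slopes in R, fitting a regression with an interaction term on made-up data (the x:group term tests whether the slopes differ):
```r
d <- data.frame(
  x = rep(1:6, 2),
  y = c(2.0, 2.9, 4.1, 5.0, 6.2, 6.8, 1.8, 2.2, 2.9, 3.1, 3.8, 4.2),
  group = factor(rep(c("A", "B"), each = 6))
)
fit <- lm(y ~ x * group, data = d)
anova(fit)   # the x:group row tests the null hypothesis that the slopes are equal
```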
What is bootstrapping?
method for estimation (and confidence intervals)
- often used for hypothesis testing too
- often used in evolutionary trees
What is the method for bootstrapping?
- for each group, randomly pick with replacement an equal number of data points, from data of that group
- with this bootstrap dataset, calculate bootstrap replicate estimate
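A minimal R sketch of a percentile bootstrap confidence interval for a mean, using made-up data:
```r
set.seed(1)
y <- c(5.1, 4.8, 5.6, 5.0, 6.0, 5.8, 6.4, 5.9, 5.3, 5.5)
boot <- replicate(10000, mean(sample(y, replace = TRUE)))   # resample with replacement
quantile(boot, c(0.025, 0.975))                             # bootstrap 95% confidence interval
```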
Why are paired samples analyzed differently than separate samples?
two individuals in a pair share many things in common with each other but differ from members of other pairs
whatever variation these shared features cause in the response variable is factored out by taking the difference within each pair
by looking at the differences, we potentially avoid much of the error variance in the data
separate samples do not share these properties