Midterm 1: Ch 1-10 Flashcards

1
Q

What is statistics?

A

quantitative technology for empirical science – logic and methodology for the measurement of uncertainty, and for an examination of that uncertainty

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the goals of statistics (2)?

A
  • estimate the values of important parameters

- test hypotheses about those parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is data?

A

measurements of one or more variables made on a collection of individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a variable?

A

characteristic measured on individuals drawn from a population under study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two types of variables?

A
  • response variable (dependent variable)

- explanatory variable (independent variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a response variable?

A

(dependent variable – y-axis) variable that we try to predict or explain from the explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an explanatory variable?

A

(independent variable – x-axis) variable used to predict or explain the response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are parameters?

A

descriptive measures of an entire population

  • population parameters are constants
  • ie. mean length of salmon
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are estimates?

A

descriptive measures of a sample

  • random variables – change from one random sample to the next, from the same population
  • ie. mean of some sample of salmon
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Do samples look exactly like the population?

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a sample of convenience?

A

collection of individuals that happen to be available at the time – biased

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is bias?

A

systematic discrepancy between estimates and the true population characteristic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the goals of estimation? (2)

A
  • accuracy

- precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is accuracy?

A

on average gets the correct answer

  • accurate = unbiased
  • inaccurate = biased
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is precision?

A

gives a similar answer repeatedly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some determinants of precision (when unbiased)?

A
  • sample size

- precision of instrument

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Unbiased and Precise

A
  • on average, answer is correct

- repeated samples/estimates have very similar results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Unbiased and Imprecise

A

on average, anwer is accurate, BUT each individual estimate is off

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Biased and Precise

A
  • most dangerous – may not even realize there’s a problem, and may have a lot of false confidence in the answer
  • repeated samples/estimates have very similar results, BUT average value of estimates is off
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Biased and Imprecise

A
  • on average, answer is incorrect

- unconfident in the answer, but best guess would be wrong anyways – not as deadly as being confident and wrong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are properties of a good sample? (3)

A
  • independent selection of individuals
  • random selection of individuals
  • sufficiently large
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a random sample?

A

each member of a population has an equal and independent chance of being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is independent sampling?

A

chance of an individual being included in the sample does NOT depend on who else is sampled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is sampling error?

A

difference between the estimate and average value of the estimate

measurement of precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Do smaller or larger samples have smaller sampling error?

A

larger samples → smaller sampling error

on average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is high sampling error?

A

every new measurement is different each time we do it

low precision – large difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is low sampling error?

A
  • higher precision – small differences

- low variance between different estimates (each time we do a study)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What are the two types of data?

A
  • categorical variables (class or nominal variables)

- numerical variables (quantitative variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What are categorical variables?

A

fall into categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What are the 2 types of numerical variables?

A

continuous: can be measured – ie. arm length, height, weight, age*
discrete: can be counted – ie. number of limbs, number of offspring, number of petals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is a frequency table?

A

frequency is NOT a variable – not measuring, just gathering data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What graph do you use for graphing categorical variables?

A

bar graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What graph do you use for graphing numerical variables?

A
  • histogram

- cumulative frequency distribution (CDF)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What data do histograms graph?

A

continuous numerical variable

  • no gaps between bars – conveys that these are continuous variables running together
  • widths are the same
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is cumulative frequency of a value?

A

proportion of individuals equal to or less than that value

  • 0 = none of the individuals are less than that value
  • 1 = all individuals are less than that value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is a contingency table?

A

describes association between two (or more) categorical variables by displaying frequencies of all combinations of categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What graphs are used for graphing the association between two categorical variables?

A
  • contingency table
  • grouped bar graph
  • mosaic plot
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What data do mosaic plots use?

A

relative frequencies scaled to 1 – does NOT use discrete numebrs

width of bars indicates number of individuals in the treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What data do stacked bar plots use?

A

discrete numbers or frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What graphs are used for graphing the association between a categorical (x-axis) and numerical (y-axis) variable?

A
  • multiple histogram
  • cumulative frequency distribution (CDF)
  • box plot
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What graphs are used for graphing the association between two numerical variables?

A

scatter plot`

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What are two common descriptions of data?

A
  • location: central tendency

- width: spread – how variable the data is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What are 3 measures of location?

A
  • mean
  • median
  • mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is the mean (or average)

A

add all numbers together and divide by total amount of data points – centre of gravity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is the median?

A

odd number: middle measurement in a set of ordered data

even number: average of two middle numbers in a set of ordered data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is the mode?

A

most frequent measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Why might the mean and median be different?

A

skewed data – lot of the weight is on one side of the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Why might the mean and median be the same or similar?

A

symmetrical distribution of data – bell-shaped

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Mean vs. Median

A
  • mean has nice statistical properties, can be quantified easily using theories
  • mean has good predictive behaviours
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What are the 4 measures of width?

A
  • range
  • variance
  • standard deviation
  • coefficient of variation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is the range?

A

maximum minus minimum

  • poor measure of distribution width – useless in statistics
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Is sample range a biased estimator of the true population range?

A

yes, smaller sample → lower estimates of range

  • sample range is not expected to match population range
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

In the equation for variance, why do we square the value

A

if we took unsquared value, negative and positive deviations cancel out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is sample variance?

A

unbiased estimator of population variance – used to try to learn about population variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is standard deviation?

A

positive square root of the variance

σ: true standard deviation
s: sample standard deviation – unbiased estimator of population standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is the coefficient of variation (CV)?

A

good for comparing distributions of different magnitudes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What is skew?

A

measurement of asymmetry – refers to pointy tail of distribution

right-skewed: pointy tail is on the right
left-skewed: pointy tail is on the left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Mean – Nomenclature

A

population parameter: µ

sample statistic: Ȳ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Variance – Nomenclature

A

population parameter: σ^2

sample statistic: s^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Standard Deviation – Nomenclature

A

population parameter: σ

sample statistic: s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

Manipulating Means

Mean of Sum of Two Variables

A

E[X + Y] = E[X] + E[Y]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Manipulating Means

Mean of Sum of Variable and Constant

A

E[X + c] = E[X] + c

ie. temperature conversions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Manipulating Means

Mean of Product of Variable and Constant

A

E[c X] = c E[X]

ie. measurement conversions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Manipulating Variance

Variance of Sum of Two Variables

A

Var[X + Y] = Var[X] + Var[Y]

ONLY if X and Y are independent

65
Q

Manipulating Variance

Variance of Sum of Variable and Constant

A

Var[X + c] = Var[X]

spread of data has not changed – variance is the same

ie. adding 10 cm to every measurement

66
Q

Manipulating Variance

Variance of Product of Variable and Constant

A

Var[c X] = c^2 Var[X]

variance in units^2, therefore multiply by constant^2

67
Q

What happens every time we take a sample from a population?

A

every sample will look different

68
Q

What happens if we take many samples from a population?

A

samples will look similar to each other

69
Q

How does variance change with sample size?

A

larger sample size = smaller variance of the sampling distribution of the mean

70
Q

What is standard error of an estimate?

A

standard deviation of its sampling distribution

predicts the sampling error of the estimate

71
Q

What is the problem with the equation for the standard error of the mean?

A

in most cases, we don’t know 𝜎 – we only have a sample

72
Q

What is the estimate of the standard error of the mean?

A

gives some knowledge of the likely difference between sample mean and true population mean

73
Q

What is the 95% confidence interval?

A

provides a plausible range for a parameter

  • all values for the parameter within the interval are plausible
  • all values for the parameter outside the interval are unlikely
74
Q

What is the 2SE rule-of-thumb?

A

interval that provides a rough estimate of 95% CI for the mean

assuming normally distributed population and/or sufficiently large sample size

75
Q

Correct or Incorrect:

“we are 95% confident that the population mean lies within the 95% CI”

A

correct

76
Q

Correct or Incorrect:

“there is a 95% probability that the population mean is within a particular 95% CI”

A

incorrect

77
Q

What is pseudoreplication?

A

error that occurs when samples are not independent, but they are treated as though they are

ie. taking multiple measurements from one individual and using each as an individual of the sample

EXAMPLE:

  • taking 10 measurements from each climber (6) to get 60 measurements
  • to avoid pseudoreplication: take mean blood pressure for each climber, so that you have 6 pulse rates, one for each climber (n = 6)
78
Q

What is the probability of an event?

A

its true relative frequency – proportion of times event would occur if we repeated same process over and over again

79
Q

What does mutually exclusive mean?

A

when two events cannot both be true

Pr(A and B) = 0

80
Q

What does independent mean?

A

when the occurrence of one event gives no information about whether the second event will occur

81
Q

What is the probability distribution?

A

describes the true relative frequency of all possible values of a random variable

all probabilities have to sum to 1

82
Q

What is the addition principle?

A

if two events A and B are mutually exclusive

Pr[A or B] = Pr[A] + Pr[B]

83
Q

What is the probability of a range?

A

Pr[number ≥ 6] = Pr[6] + Pr[7] + Pr[8]…

84
Q

What is the probability of ‘not’?

A

Pr[not rolling a 2] = 1 – Pr[rolling a 2] = 5/6

85
Q

What is the general addition principle?

A

Pr[A or B] = Pr[A] + Pr[B] - Pr[A and B]

need to subtract Pr[A and B], otherwise it’ll be counted twice

86
Q

What is the multiplication principle?

A

if two events A and B are independent

Pr[A and B] = Pr[A] x Pr[B]

87
Q

What is the general multiplication principle?

A

Pr[A and B] = Pr[A] Pr[B | A]
Pr[A and B] = Pr[B] Pr[A | B]

therefore, Pr[A] Pr[B | A] = Pr[B] Pr[A | B]

88
Q

What are dependent events?

A

probability of one event depends on the outcome of another event

89
Q

Are variables always independent?

A

no

90
Q

What is the conditional probability of an event?

A

probability of that event occurring given that a condition is met

Pr[X|Y]
probability of X given Y (if Y is true)

91
Q

Law of Total Probability

A

92
Q

When is Bayes’ Theorom used?

A

when you want to flip conditional probability

93
Q

What is hypothesis testing?

A

asks how unusual it is to get data that differ from the null hypothesis

  • if the data would be quite unlikely under H0, we reject H0
  • assumes random sampling
  • about populations, but are tested with data from samples
94
Q

What is the null hypothesis?

A

specific statement about a population parameter made for the purposes of argument

  • simplest statement
  • specific
  • good H0 would be interesting if proven wrong
95
Q

What is an alternate hypothesis?

A

represents all other possible parameter values except that stated in the null hypothesis

  • statement of greatest interest
  • non-specific
96
Q

Steps of Hypothesis Testing

A

population → sample → estimate → test statistic

null hypothesis → test statistic

null hypothesis → construct a new population under H0 → imagined repeated sampling → sample from H0 and calculate test statistic → null distribution of test statistic

test statistic + null distribution of test statistic

  • how weird would these data be if the null hypothesis were true?
  • compare distribution from H0 to observed sample
  • how likely would it be to obtain our data sample if H0 were true?
97
Q

What is a statistic?

A

number calculated to represent/summarize the match between a set of data and the null hypothesis

can be compared to a general distribution to infer probability – for any given value for a test statistic, we can say how likely those possible outcomes are

98
Q

What is a null distribution (sample distribution)?

A

probability distribution of alternative outcomes for a test statistic when a random sample is taken from a population corresponding to the null expectation

99
Q

If H0 is true, do we expect variance between samples?

A

yes

need to evaluate range and distribution of possible test statistics we have sampled, if we sampled repeatedly

100
Q

What is the P-value?

A

probability of getting the data, or something as or more unusual/extreme, if the null hypothesis were true

NOT probability H0 is true
NOT probability HA is true

101
Q

How can we find P-values? (3)

A
  • simulation
  • parametric tests
  • permutation
102
Q

What is the significance level?

A

acceptable probability of rejecting a true null hypothesis

𝜶 = usually 0.05

103
Q

What does it mean to be statistically significant?

A

if p-value for a test is ≤ 𝜶, then H0 is rejected

104
Q

What does it mean to be statistically insignificant?

A

if p-value for a test is > 𝜶, then H0 is NOT rejected

105
Q

How does sample size influence the range of test statistic we see under the null hypothesis?

A

larger sample → estimate has smaller confidence interval

larger sample → more power to reject a false null hypothesis

106
Q

What is a Type I error?

A

rejecting a true null hypothesis

probability of Type I error is 𝜶 (significance level)

107
Q

What is a Type II error?

A

not rejecting a false null hypothesis

probability of Type II error is 𝜷

  • what the real world looks like
  • our sample size
  • our 𝜶

smaller 𝜷 = larger power a test has

108
Q

What is power?

A

ability of a test to reject a false null hypothesis – how likely we will reject it

1 – 𝜷

109
Q

How are power and sample size related?

A

larger power = larger sample size (more information)

increase sample size → decrease standard deviation of null distribution → increase power to reject H0

110
Q

What is a two-tailed test?

A

deviation in either direction would reject the null hypothesis

  • most tests are two-tailed
  • normally 𝜶 is divided into 𝜶/2 on one side, and 𝜶/2 on the other
111
Q

What is a one-tailed test used?

A

only used when the other tail is nonsensical

ie. comparing grades on multiple choice test to that expected by random guessing

112
Q

What is a critical value?

A

value of a test statistic beyond which the null hypothesis can be rejected

  • we never ‘accept the null hypothesis’
113
Q

Where in the 95% CI is the value proposed by the null hypothesis rejected?

A

in general, if a hypothesis test rejects a null hypothesis test (p < 0.05), the value proposed by the null hypothesis is outside the 95% confidence interval

114
Q

2-Sample T-tests

A

115
Q

2-Sample T-tests

The more different the sample means are….

A

(when taking into account sample spread and size, and assuming we’ve randomly sampled), the less likely it is they were drawn from populations with the same mean

116
Q

2-Sample T-tests

What would a P-value of 0.03 mean?

A

there is a 3% chance of getting means that are at least this different if they’re drawn from populations with the same mean

117
Q

2-Sample T-tests

Higher vs. Lower P-values

A

higher p-values:

  • higher probability of 2 sample means being at least this different, if drawn from populations with same mean
  • less evidence of differences between population means

lower p-values:

  • lower probability of 2 sample means being at least this different, if drawn from populations with same mean
  • more evidence of differences between population means
118
Q

What is a confounding variable?

A

unmeasured variable that may be the cause of both X and Y

119
Q

What is a proportion?

A

fraction of individuals having a particular attribute

120
Q

What is a binomial distribution?

A

describes the probability of a given number of ‘successes’ from a fixed number of independent trials

121
Q

What are the 2 properties of binomial distributions?

A
  • mean of number of successes

- variance of number of successes

122
Q

What is the estimate of a proportion?

A

number of ‘successes’ over total sample size

123
Q

What are the 2 properties of sample proportions?

A
  • mean

- variance

124
Q

How does the standard error of the estimate of a proportion change with sample size?

A

larger sample → lower standard error

125
Q

What is the name for the 95% CI for a proportion?

A

Agresti-Coull confidence interval

126
Q

How does the Agresti-Coull confidence interval change with sample size?

A

larger sample → more symmetrical distribution

127
Q

For the Agresti-Coull confidence interval, what are the +2 and +4 factors?

A

‘fudge factors’ that are there for more asymmetrical distributions

128
Q

What is Murphy’s law?

A

anything that can go wrong will go wrong

129
Q

What is the binomial test?

A

uses data to test whether a population proportion (p) matches a null expectation for the proportion

H0: relative frequency of successes in the population is p0
HA: relative frequency of successes in the population is not p0

130
Q

What is a goodness-of-fit test?

A

compares count data to a probability distribution (expected frequencies) of a set of categories

131
Q

What are the hypotheses for a 𝜒2 goodness-of-fit test?

A

H0: data come from a particular probability distribution
HA: data do NOT come from that distribution

132
Q

What is the test statistic for 𝜒2 goodness-of-fit test?

A

𝜒2

133
Q

How does the number of categories affect 𝜒2 goodness-of-fit test?

A

the more categories you have, the more opportunities to deviate from expectations

134
Q

What is the degree of freedom of a test?

A

specifies which of a family of distributions to use

135
Q

What is the equation for degrees of freedom for 𝜒2 test?

A

df = (number of categories) - (number of parameters estimated from the data) - 1

136
Q

What is the critical value?

A

value of the test statistic where P = 𝛼

if observed 𝜒2 > 𝜒2 corrected for df, we reject the null hypothesis

if observed 𝜒2 < 𝜒2 corrected for df, we DO NOT reject the null hypothesis

137
Q

What is a test statistic?

A

number calculated from the data and the null hypothesis that can be compared to a standard distribution to find the P-value of the test

138
Q

Can 𝜒2 goodness-of-fit test substitute for binomial test?

A

yes, because it works even when there are only two categories

  • very useful if the number of data points is large
  • BUT not recommended if binomial test is possible – two categories (success and failure)
139
Q

What are the assumptions of the 𝜒2 goodness-of-fit test?

A
  • no more than 20% of categories have Expected < 5
  • no category with Expected ≤ 1

(if needed, combine categories to satisfy these requirements)

140
Q

What is a discrete distribution?

A

probability distribution describing a discrete numerical random variable

ie. number of heads from 10 flips of a coin
ie. number of flowers in a square meter
ie. number of disease outbreaks in a year

141
Q

What is the Poisson distribution?

A

describes the probability that a certain number of events occur in a block of time or space, when those events happen independently of each other and occur with equal probability at every point in time or space

used to ask questions about random events (by chance)

142
Q

What is contingency analysis?

A

test the independence of two or more categorical variables

143
Q

What is the equation for degree of freedom for 𝜒2 contingency analysis?

A

df = (# of columns - 1) (# of rows - 1)

144
Q

What are the assumptions for 𝜒2 contingency analysis?

A

this test is just a special case of the 𝜒2 goodness-of-fit test, therefore the same rules apply

  • no more than 20% of categories have Expected < 5
  • no category with Expected ≤ 1
145
Q

What is the Fisher’s exact test?

A

for 2 x 2 contingency analysis
- does not make assumptions about the size of expectations

R (or other programs) will do it, but difficult to do by hand

146
Q

What are odds?

A

probability of success divided by the probability of failure

147
Q

What is odds ratio?

A

odds of success in one group divided by the odds of success in another group

OR < 1 means treatment helps
OR > 1 means treatment makes things worse

148
Q

What is a normal distribution?

A
  • distribution fully described by its mean and standard deviation
  • symmetric around its mean
  • mean, median, and mode are all the same
  • 67% of random draws from a normal distribution are within one standard deviation of the mean
  • 95% of random draws from a normal distribution are within two (1.96) standard deviations of the mean
149
Q

For a standard normal distribution, what is the mean?

A

mean (μ) = 0

150
Q

For a standard normal distribution, what is the standard deviation?

A

standard deviation (σ) = 1

151
Q

What is a standard normal table?

A

gives probability of getting a random draw from a standard normal distribution greater than a given value

152
Q

Is a standard normal symmetric?

A

yes

Pr[Z > x] = Pr[Z < -x]

153
Q

What is the total area under the curve of a standard normal distribution?

A

1

Pr[Z < x] = 1 – Pr[Z > x]

154
Q

Are all normal distributions shaped the same?

A

yes, just with different means and variances

155
Q

Can any normal distribution be converted to a standard normal distribution?

A

yes, by Z: standard normal deviate

  • Z tells us how many standard deviations Y is from the mean
  • probability of getting a value > Y is the same as probability of getting a value > Z from a standard normal distribution
156
Q

Are sample means normally distributed?

A

yes, if the variable itself is normally distributed

  • mean of the sample means
  • standard deviation of the sample means
157
Q

What is the standard error of an estimate of a mean?

A

the standard deviation of the distribution of sample means

158
Q

What is the central limit theorem?

A

sum or mean of a large number of measurements randomly sampled from any population is approximately normally distributed

159
Q

Why do we “fail to reject H0” rather than “accept H0” after a test in which the P-value is calculated to be greater than α?

A

failing to reject H0 does not mean H0 is correct, because the power of the test might be limited

null hypothesis is the default and is either rejected or not rejected