lecture 3. statistical inference Flashcards

1
Q

does this describe exploratory data analysis or statistical inference?
its purpose is unrestricted exploration of data, searching for interesting patterns; conclusions apply to the subjects and circumstances for which we have data in hand; conclusions are informal based on what we see in the data

A

exploratory data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

does this describe exploratory data analysis or statistical inference?
purpose is to answer a specific question, posed before the data produced; conclusions apply to a larger group of subjects or a broader class of circumstances; conclusions are formal, backed by a statement of our confidence in them

A

statistical inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

this is a method for drawing conclusions about a population from a sample. uses probability to indicate how trustworthy its conclusions are. assumes a random sample - if this is not the case your conclusions may be faulty

A

statistical inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are the two most common types of statistical inference?

A
  • significance tests
  • confidence intervals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

this common type of statistical inference estimates a population parameter.

A

confidence intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

this common type of statistical inference assess the evidence in the data for some claim about the population

A

significance tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

generating statistics from a relatively small sample to provide an indication of a population value (parameter) is the process of ________

A

estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

________ are fixed for a given population (the mean for a given population is constant)

A

parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

_______ are estimates of parameters and cary sample by sample. in effect they are random variables.

A

sample statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

the ____________ distribution is the distribution of all possible values of that statistic in all possible random sample so the same size n from population N

A

random sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

if a random variable x has a population mean μ and population variance σ2, then the sampling distribution of means (of samples of size n) will have a mean of μ and variance ____

A

σ2/n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

this property means variability of the random sampling distribution depends on _______ and ________

A

sample size and the variability of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

the ______ (smaller/larger) the sample size, the smaller the σ2/n

A

larger

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

_______ (more/less) variance = the more certain we can be

A

less

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

since sample means are normally distributed, we can define a ________ which is a range of values used to estimate the true value of the population parameter; it is the probability (usually expressed as a percentage) or the proportion of times that the ____ actually does contain the population parameter

A

confidence interval (CI)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

states that if you have a population with mean μ and standard deviation σ and take sufficiently large random samples from the population with replacement , then the distribution of the sample means will be approximately normally distributed.

A

central limit theorem

17
Q

In a sample mean, we use the _______ to describe the variability of individual observations

A

standard deviation, σ

18
Q

in a sample mean, we use the ______ to describe the variability of a sampling distribution

A

standard deviation of the sampling distribution of means/THE STANDARD ERROR OF THE MEAN

19
Q

For normal distributions, μ +/- σ contains 68.26% of the observations, μ +/- 2σ contains 95.44% of the observations and so on based on the empirical rule. How does this statement differ for a Sampling distribution?

A

For normal distributions, μ +/- σ contains 68.26% of the observations, μ +/- 2σ contains 95.44% of the observations, etc.

For sampling distribution, mean +/- σ/√n contains 68.26% of the observations, mean +/- 2σ/√n contains 95.44% of the observations, etc.

20
Q

what are the SIX steps in hypothesis testing

A
  1. state the null hypothesis
  2. state the alternative hypothesis
  3. set the decision level (alpha)
  4. choose the test statistic
  5. calculate P (assumes null hypothesis is true)
  6. make a decision concerning null hypothesis
21
Q

this states that any differences in the data are due to chance

A

null hypothesis

22
Q

this states that any differences in the data are “real’ or significant

A

alternative hypothesis

23
Q

if P (null hypothesis is true) is less than 0.05, do you reject or retain the null hypothesis

A

reject null hypothesis

24
Q

if P (null hypothesis is true) is greater than 0.05, do you reject or retain the null hypothesis

A

retain the null hypothesis