module 9 Flashcards

1
Q

Statistical Inference

A

Want to extend implications to the setting where we only have sample data
Want to make a statement about a feature of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Steps for Hypothesis Testing

A
  1. Identify and define the parameter of interest
  2. Define the competing hypotheses to test
  3. Set the evidence threshold, formally called the significance level
  4. Generate or use theory to specify the sampling distribution and check conditions
  5. Calculate the test statistic and p-value
  6. Evaluate your results and write a conclusion in the context of the problem.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Type I Error

A

Rejecting a true null hypothesis (false positive).
Significance level measures this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Type II error

A

Failing to reject a false null hypothesis (false negative)
Beta measures this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

p-value

A

the probability of getting a test statistic as extreme or more extreme (in the direction of the alternative hypothesis), assuming the null hypothesis is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

confidence intervals

A

use when we have 2 competing theories that we are trying to choose between

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Confidence interval steps

A
  1. Identify and define the parameter of interest
  2. Determine the confidence level
  3. Generate or use theory to specify the sampling distribution and check conditions
  4. Calculate the middle region of your sampling distribution, according to your confidence level
  5. Write a conclusion in the context of the problem.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Confidence level interpretation

A

“If we gathered repeated random samples of the same size and calculated a CL% confidence interval for each, we would expect CL% of the resulting confidence intervals to contain the true parameter of interest.”
we expect CL% of our intervals to be correct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

name scenario: 1 categorical

A

proportion (p)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

name scenario: 1 quantitative

A

mean (mu)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

name scenario: 2 categorical

A

proportion_1 - proportion_2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

name scenario: 1 categorical and 1 quantitative

A

mean_1 - mean_2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

name scenario: 2 quantitative

A

simple linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

name scenario: 3 categorical

A

multiple logistic regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

name scenario: 3 quantitative

A

multiple linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

confidence interval interpretation

A

“We are CL% confident that are true parameter of interest lies in (a,b).”

17
Q

describe distribution of simulated sampling distribution

A

Shape
Outliers
Center
Spread

18
Q

p-value interpretation

A

The p-value represents the probability of observing a sample as extreme as the one calculated (or more extreme), assuming the null hypothesis is true.