Lecture 6: p-values and confidence intervals Flashcards

1
Q

What is a point estimate?

A

An inference made about the population based on the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Left-skewed distribution

A

a distribution that has a concentration of data on the upper end and the tail on the left

Skewness depends on where the tail is (so tail on left is left skewness)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does correlation measure?

A

Correlation measures the degree of relationship between two or more variables

It looks at association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the goodness of fit of a model?

A

Goodness of fit describes how well data fits a set of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is standard error of regression?

A

The standard error of the regression (S), also known as the standard error of the estimate, represents the average distance that the observed values fall from the regression line. Conveniently, it tells you how wrong the regression model is on average using the units of the response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

R^2 (Goodness of Fit)

A

R^2 is a goodness-of-fit statistic.
Values: between 0 and 1.
Interpretation: The larger the better.
Meaning: Proportion of the outcome’s variability that the model explains.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some questions to think about for generalising?

A

*Would I get the same coefficient if I built my model using different data?

*How likely am I to estimate the correct value?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

confidence interval

A

statistical range, with a given probability, that takes random error into account

A confidence interval refers to the probability that a population parameter will fall between a set of values for a certain proportion of times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is meant by interval width?

A

Interval width = boundaries of your sample’s estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When should you be concerns about confidence intervals?

A

Be concerned about confidence intervals if they include a contradictory estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Match the confidence level with the interval width

99%
90%
10%

A

99% - very wide
90% - wide
10% - narrow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Confidence level is proportional to interval width

ANALOGY

A

Analogy: A bigger net is more likely to catch the fish

you are looking for.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

If we assume there is no association, what will you expect?

A

Assuming there is no association, you will expect:

  • a zero coefficient is very likely
  • tiny coefficients are somewhat likely
  • and big coefficients are unlikely
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is p-value?

A

the probability of a coefficient at least as big as estimated assuming the coefficient is actually zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Small p-value

A

1.When we assume the True coefficient is zero, the probability of sampling 0.32 is small, i.e. our estimate is unlikely.

2.But we know our sample estimate is 0.32.

3.Therefore, we concede that our assumption is probably wrong.

4.We conclude that an association is likely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Large p-value

A

1.When we assume the True coefficient is zero, the probability of sampling 0.32 is high, i.e. our estimate is likely.

2.And we know our estimated coefficient is 0.32.

3.Therefore, we concede that our assumption could still be right.

4.We conclude that an associations is not likely.

17
Q

Difference between small p-value and large p-value

A

Small p-value:

*Zero-assumption is probably

wrong.

*An association is likely.

Large p-value:

*Zero-assumption is probably right.

*An association is unlikely

18
Q

Example:

p-value is so small that it is <0.001

A

Typically, p-value <0.05 is statistically significant, the threshold is called alpha

GRL

19
Q

What do neither confidence intervals nor p-values tell us?

A

Neither confidence intervals nor p-values tell you anything about the effect of the exposure or how meaningful your conclusion is.

20
Q

What is the order for dealing with data?

O
G
T
A

A

Observe - look at data
Guess - build a model to estimate relationship
Test - how well does model fit (R2 and S)
Assess - confidence intervals etc

21
Q

R^2 summary

A

R^2
large = good
small = bad

22
Q

Confidence interval summary

A

If a confidence interval includes contradiction = bad

23
Q

P-value summary

A

P-value summary
Small = good
Large = bad.

24
Q

Relationship between confidence level and confidence interval width

A

The larger your sample, the more sure you can be that their answers truly reflect the population. This indicates that for a given confidence level, the larger your sample size, the smaller your confidence interval

25
Q

What does it mean if the confidence interval for a coefficient encompasses zero?

A

Zero is the null value of the parameter (in this case the difference in means). If a 95% confidence interval includes the null value, then there is no statistically meaningful or statistically significant difference between the groups.

26
Q

What does a regression coefficient mean?

A

Regression coefficients are estimates of the unknown population parameters and describe the relationship between a predictor variable and the response.

27
Q

What does it mean if the P-value is less than your significance level?

A

If the P value is less than your significance (alpha) level, the hypothesis test is statistically significant –> reject null

(GRL)