Week 1 Flashcards

1
Q

Prior knowledge

A

Bayesian approach. “already known” information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Posterior distribution

A

Prior knowledge is updates with the information in the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Advantages & disadvantages prior knowledge

A

Pro: accumulating knowledge & more powered
:( : results depend on the choice of prior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Frequentist Pr(data|H0)

A

P-value. Probability of observing same or more extreme data given that the null is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bayesian Pr (Hj|data)

A

Probability that hypothesis Hj is supported by the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Frequentist probability

A

Relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Bayesian probability

A

Degree of belief

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

95% confidence interval (frequentist)

A

If we were to repeat the experiment many times and calculate CI each time, 95% of the intervals will include the true parameter value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

95% credible interval (bayesian)

A

There is 95% probability that the true value is in the credible interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

R squared

A

…% of the variance in y is explained by the regression model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Adjusted R squared

A

Corrects for overfitting (having many predictors increase R squared)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Method enter (frequentist)

A

data analist decides what goes in the model (confirmatory)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Method stepwise (frequentist)

A

The best prediction model is determined based on results in this sample (exploratory)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

B-value

A

the unstandardized regression B can be used to predict a score on the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Beta value

A

the standardized regression coefficient can be used to determine the relative importance of the predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Registered report

A

develop idea > design study > stage 1 peer review > collect and analyze data > write report > stage 2 peer review > publish report

17
Q

Cook’s distance

A

Assumption of no outliers on y-axis. Cook’s distance indicates the overall influence of a respondent on the model. Value must be below 1

18
Q

Violation of absence of multicollinearity leads to…

A

Regression coefficients (B) are unreliable
Limits magnitude of R (correlation Y and ^Y)
The importance of individual independent variables can hardly be determined, if at all

19
Q

Tolerance or VIF

A

Determining if multicollinearity is an issue:
Tolerance <.2: indicates potential problem
Tolerance <.1: indicates problem
VIF >10: indicates problem

20
Q

Value of multiple correlation coefficient is same as…

A

R