WEEK 4 Flashcards

1
Q

key parts of a paper

A

intro, tables and figures, rest is for reverence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

purpose of lit review and theory

A

demonstrate plausibility of hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

purpose of research design

A

provide blueprint for replicability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

purpose of results and conclusions

A

demonstrate researcher competency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

to evaluate the causal relationship

A

look at predictions and probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what do plots show: marginal effect plots

A

IV (X axis) DV (y axis)
regression lines and confidence intervals ; predicted values of y given values of x

if y is a binary variable, the prediction is a probability

does not depict control variables, but accounts for them in compiling the image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what do plots show: coefficient plots

A

IV (x-axis) and DV (y-axis)

Predicted coefficient of IV and all control variables

confidence intervals for each coefficient

USEFUL For: OLS, logistic, Poisson

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what to use when evaluating INTERACTIVE EFFECTS

A

Marginal effects plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are coefficient plots commonly used in:

A

OLS Regression: coefficients are in original units (e.g., years of education → +0.5 on income)

Logistic Regression: coefficients are in log-odds

Poisson/Negative Binomial: coefficients are in log count or log IRR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

logit and probit models

A

Both are types of regression models used when your dependent variable (DV) is binary (i.e., 0 or 1, like yes/no, vote/don’t vote, support/don’t support).

Coefficients are in log-odds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what do tables show

A

Coefficients, standard errors, test statistics, and significance of each IV

The intercept (value of “y” when each “x” equals 0) and its errors and significance

Markers of significance and goodness of fit for entire model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

histogram

A

shows the distribution of independent variables, and how many observations are at each value

they can skew in one direction or another

leftward skew can mean that the bulk of the values are concentrated around a certain closer to 0 than to 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

z-score

A

results tell you how many standard deviations value is from the mean

only gives result of 1 observation

cant do for bivariate / multivariate hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

t-test

A

GOAL: compares variation across two groups
- basically compares two groups

can compare with model and without that variable

get it by doing divide coefficient by standard error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

one tailed vs two tailed test

A

in assessing critical value of t-test (comparing 2 samples), have to specify if it is a one-tailed or two-tailed test

one-tailed = population mean is below the standard (shows you where a value is above or below – but rarely use)

two-tailed = tells you whether it is different form the mean, in either direct
—— TWO TAILED can determine significance even if the relationship is opposite of hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

goodness of fit

A

Goodness of fit measures how well your statistical model explains the observed data.

can assess overall strength w/ pearson correlation

2 tests: f-test and r-squared test

17
Q

r-squared

A

a goodness of fit test

only works when one IV

18
Q

multicolinearity

A

vaariables are overllapping and we cant tell what is causing what

19
Q

confidence intervals

A

1.96 standard deviations is the critical value for 95% confidence

range of likely values is the coefficient + or - 1.96 x standard error

usually slimmer with more values

20
Q

RSS V. TSS V. SEE

A

total sum of squares: average distance of observations of y to the average y

regression sum of squares: How much variability the model explains

Residual Sum of Squares SSE (SS Error) What’s left over — the unexplained variability (errors)

21
Q

root mean square data

A

measure of how uch model misses predictions on average

basically, the average distance of an observation to the reg line