regression Flashcards

1
Q

regression analysis

A

a statistical method for examining relationships among variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

linear regression

A

a statistical model that assumes a linear relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

population linear regression model

A

decribes the relationship that holds between Y and X in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

X

A

the independent variable or regressor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Y

A

the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Beta 0

A

the intercept, it measures the point at which the regression line intercepts the Y axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Beta 1

A

the slope of the regression line. It measures the difference in Y associated with a one unit change in X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

u i

A

regression residual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Prediction

A

using the observed values of a given variable to predict the value of another variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

causal inference

A

to determine whether and to what extent a cause-and-effect relationship exists between variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

causality

A

an action is said to cause an outcome in the outcome is the direct consequence of that action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

treatment group

A

recieve the treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

control group

A

does not recieve treatment (counterfactual)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

observational data

A

surveys, administrative records, financial reports

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

cross-sectional data

A
  • data collected at a single point in time for different entities
  • reflects a snapshot of variables at that point
  • we can use this data so study differences across intities in a single time period
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

panel data

A
  • data collected for multiple entities at multiple points in time
  • captures the dynamics of change over time
    -allows for the analysisi of temporal effects across entities
17
Q

time series

A
  • data collected for a single entity at multiple time points
  • allows for the analysis of temporal effects and forecasting
18
Q

ordinary least squares (OLS)

A

it identifies the prameters that minimize the sum of the squared residuals

19
Q

residual

A

the vertical distance from the regression line

20
Q

the sign (±) on Beta 1 for an independent variable

A

the direction of its association with the dependent variable

21
Q

Central limit theorem

A

when the sample is large and properly drawn, the sample mean is distributed normally around the true mean

22
Q

standard error

A

it represents the average distance that the observed values fall from the regression line

23
Q

t-statistic

A

the ratio of the departure of the estimated value of a parameter from its hypothesized value to its standard error

24
Q

95% confidence interval

A

an interval that is a function of the data that contains the true parameter value 95% of the time in repeated samples

25
Q

omitted variable bias (OVB)

A

the bias in the OLS estimator that occurs as a result of an omitted variable (Z)

26
Q

statistical inference entails

A
  1. estimation of the coefficients of interest
  2. hypothesis testing and confidence intervals
27
Q

R-squared

A

a measure of the regression model fit

28
Q

R-squared value of 1

A

perfect explanation of variance

29
Q

R-squared value of 0

A

the model explains none of the variance

30
Q

adjusted R-squared

A

a modified version of the R-squared that does not necessarily increase when a regressor is added to the regression

31
Q

control variable

A

controls for an omitted causal factor in the regression but itself not necessarily have a causal effect on Y

32
Q

endogeneity

A

a situation where the explanatory variable is correlated with the error term

33
Q

omitted variable bias (endogeneity)

A

when a model fails to include one or more relevant variables that influence the dependent variable

34
Q

selection bias (endogeneity)

A

the data sample is not randomly selected

35
Q

reverse causality or simultaneity (endogeneity)

A

two-way causation exists between independent and dependent variables

36
Q

measurement error (endogeneity)

A

if an independent variable is measured with error

37
Q

difference in difference (DiD)

A

this is used to asses the causal effect of an event by comparing the set of units where the event happened (treatment group) in relation to units where the event did not happen (control group).

38
Q

knowledge diffusion

A

disclosure of technical knowledge in the patent document