Tutorials Flashcards

1
Q

Laddering vs Probing

A

Laddering : asking ‘why?’ questions. Up > more abstract, down > more concrete

Probing: asking about concrete / specific / sensory-based examples. ‘what else?’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

formulas to remember

A

• t-value:
mean 1 - mean 2 / standard error

• Marginal effects for OLS moderated regression analysis: 𝝏𝒀𝝏𝑿⁄ = a1 + a3Z

• Direct/Indirect/Total effect in mediation models
total effect = c=c’ + ab
direct effect = c’
indirect effect = a
b

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Reliability

A

Degree to which measure is consistent + free from random errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Calculate reliability

A

Single-item > impossible calculate variance true score, thus reliability
Multi-item > cronbachs alpha

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Validity

A

Measure what it intends to measure?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Key elements validity

A
  1. Content validity > reflect conceptual domain, judged by experts
  2. Face validity > reflect conceptual domain, judged by non-experts
  3. Convergent validity > different measures of same construct should diverge
  4. Discriminant validity > measures of different construct should NOT diverge
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Regression choice

+ solution correlations

A

Depends on data structure

1. Cross-sectional data: sample of units, 1 observation each
2. Time series data: 1 unit, multiple observations over time period 
3. Panel data: sample of units, multiple time periods
4. Clustered / nested data: hierarchical levels (employees, subs, MNE) Problem: observations are correlated Solution 1: cluster standard error  Solution 2: generalized estimating equations  Solution 3: include dummies as control variables  Solution 4: multilevel modeling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

OLS requirements

A
  1. Linear
  2. Random
  3. Mean = 0
  4. No multi-collinearity
  5. Homotheticity
  6. Error terms are normal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Limited Dependent Variable Models

A

range of possible values is restricted in some important way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

marginal effect

A

How DV changes when a IV changes

When X is continuous >
When X is NOT continuous > Y,X=1 - Y,X=2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Difference marginal effects vs corresponding coefficients

A

In OLS: yes
In the simple OLS model with linear effects, estimated coefficients = marginal effects

In MLS: no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

marginal effect of an interaction term

A

There is no marginal effect of an interaction term or any higher order terms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

STATA lessons

A
  • When higher order terms / non-OLS model used > regression coefficients harder to interpretate
  • Create higher order terms within regression command, otherwise imported as separate variables when calculating marginal effects
  • Inform Stata about indicator variable = dummy by adding = i.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Panel data

A

TSCS = sample of units over multiple time periods

Time series = periods can be daily, weekly, monthly etc.
Cross-sectional = units can be individuals, families, cities etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Balanced vs unbalanced panel data

A
Balanced = every unit has all X + Y readings for T periods
Unbalanced = some units have missing values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

FE vs RE models

A

Fixed-effects model: separate fixed intercept controlling unobserved heterogeneity
> allows arbitrary correlation: YES
> not feasible when T is small and change is slow

Random-effects model: separate fixed intercept randomly draws from N
> allows arbitrary correlation: NO
> when interested in effects time-invariant covariates

17
Q

Outliers:

Problem + solution

A

Few observations which disproportionately influence results, especially when n is small
Refer to observations, not variables

Solution:

  • Remove outliers > no
  • Robust regression > only possible for OLS
  • Winsoziring variables = limiting values 1/2%
18
Q

Control variables + note

A

explain the dependent variable, not those associated with independent variables

Note:

  • Include “enough” controls to isolate the effect of our key variables, but should be randomized controlled experiment
  • Data availability (missing values)
  • Sample size (n is small, less control v)
19
Q

How avoid overcontrolling

A
  1. Do not include 2 variables measuring same construct, different dimension is possible
  2. Do not control mechanisms how X influences Y > not possible research total effect