Tutorials Flashcards
Laddering vs Probing
Laddering : asking ‘why?’ questions. Up > more abstract, down > more concrete
Probing: asking about concrete / specific / sensory-based examples. ‘what else?’
formulas to remember
• t-value:
mean 1 - mean 2 / standard error
• Marginal effects for OLS moderated regression analysis: 𝝏𝒀𝝏𝑿⁄ = a1 + a3Z
• Direct/Indirect/Total effect in mediation models
total effect = c=c’ + ab
direct effect = c’
indirect effect = ab
Reliability
Degree to which measure is consistent + free from random errors
Calculate reliability
Single-item > impossible calculate variance true score, thus reliability
Multi-item > cronbachs alpha
Validity
Measure what it intends to measure?
Key elements validity
- Content validity > reflect conceptual domain, judged by experts
- Face validity > reflect conceptual domain, judged by non-experts
- Convergent validity > different measures of same construct should diverge
- Discriminant validity > measures of different construct should NOT diverge
Regression choice
+ solution correlations
Depends on data structure
1. Cross-sectional data: sample of units, 1 observation each 2. Time series data: 1 unit, multiple observations over time period 3. Panel data: sample of units, multiple time periods 4. Clustered / nested data: hierarchical levels (employees, subs, MNE) Problem: observations are correlated Solution 1: cluster standard error Solution 2: generalized estimating equations Solution 3: include dummies as control variables Solution 4: multilevel modeling
OLS requirements
- Linear
- Random
- Mean = 0
- No multi-collinearity
- Homotheticity
- Error terms are normal
Limited Dependent Variable Models
range of possible values is restricted in some important way
marginal effect
How DV changes when a IV changes
When X is continuous >
When X is NOT continuous > Y,X=1 - Y,X=2
Difference marginal effects vs corresponding coefficients
In OLS: yes
In the simple OLS model with linear effects, estimated coefficients = marginal effects
In MLS: no
marginal effect of an interaction term
There is no marginal effect of an interaction term or any higher order terms
STATA lessons
- When higher order terms / non-OLS model used > regression coefficients harder to interpretate
- Create higher order terms within regression command, otherwise imported as separate variables when calculating marginal effects
- Inform Stata about indicator variable = dummy by adding = i.
Panel data
TSCS = sample of units over multiple time periods
Time series = periods can be daily, weekly, monthly etc.
Cross-sectional = units can be individuals, families, cities etc.
Balanced vs unbalanced panel data
Balanced = every unit has all X + Y readings for T periods Unbalanced = some units have missing values