Week 7 Flashcards
What are the 2 types of FE effects?
- State fixed effects- unobserved, time-invariant characteristics
- Time fixed effects - capture common shocks that occur in a given period.
Define factor analysis
This is a technique used to reduce a large amount of information (contained in a number of original variables) into fewer variables, with a minimum loss of information.
Define latent variables
These are variables which can only be inferred indirectly from other directly observed or measured variables (i.e popularity)
Main uses of factor analysis
- Reduce clutter –> Reduce data complexity to a manageable size while retaining as much of the original information as possible.
- Identify / verify patterns
What are the 2 types of factor analysis?
Exploratory–> There is no pre-defined structure to verify.
- Searching for structures
- Understanding the groups of measured variables that relate to factors or theoretical constructs
- Looking for factors to identify.
- Data reducing, decluttering
Confirmatory- this is to verify the factor structure of a set of observed variables.
- The factor structure would be based off previous theories or research
- Confirms existing ideas, research, or measurement
- Assess the degree to which the data meet the expected structure
- Test hypothesis about the structure or the number of dimension underlying a set of variables.
What are underlying dimensions in a factor analysis?
These are known as factors –> These are where variables may be in “clusters of meaningful correlations” which suggest that the variables associated to these clusters are referring to aspects of the same, common underlying assumption.
What is factor loading?
These are the coordinates of a variable.
- Ideally, a variable should have large coordinates for one axis (one factor) and small coordinate for the other axis (if the second factor opposes the qualities of the first).
–> This would infer the variable is related to only one of the factors.
How can you analyze a matrix of factor loadings?
It would look something similar to:
0.87. 0.01
0.96. 0.04
0.92. -0.03
0.00. 0.82
And there would be large brackets on each side. The columns would refer to the factors that are being studied, and the rows would refer to a variable.
i.e. The first variable has the ‘loading’ of 0.87 for the first factor and 0.01 for the second factor.
What is a eigenvalue
A measure of how much the common variance (communality) of the observed variables a factor explains.
-> Any factor with an eigenvalue > 1 explain more variance than a single observed variable.
-> Indicates the importance of a factor, represents the amount of variation explained by a factor.
-> The factors which explain the least amount of variance often get discarded.
The higher the eigenvalue, the better.
Kaiser’s criterion: Keep factors with eigenvalue > 1
Keep factors with eigenvalue > 1
Scree plot
Graphical method to determine number of factors (exploratory factor analysis)
When using a scree plot, how can you determine which (or how many) factors to keep
Find the “point of inflection” (elbow of the graph) and you keep the factors to the left of the elbow.
What is a factor rotation?
Once the factors are extracted, the next step is to examine factor loadings.
Factor rotation is a tool to help interpret factor loadings:
- This is used to ensure that variables:
- Load maximally to only one factor
- Have minimal loading (close to 0) on the other factor.
What are the 2 types of factor rotations
- Orthogonal (varimax)
- Independent / Uncorrelated factors
- May result in a loss of valuable information
- Oblique (oblimin)
- Allows factors to be correlated
- Produce more accurate and reproducible solution.
The choice of which is dependent on whether there is a good theoretical reason to suppose that the factors are related or independent.
Cronbach’s alpha
It is a measure of internal consistency and reliability, NOT VALIDITY. (Between 0 and 1)
High Cronbach’s alpha values –> Response values for each participant across a set of questions are consistent.
Low Conbach’s alpha values –> Response values do not reliably measure the same construct.