Factor analysis Flashcards
What is factor analysis?
- A statistical technique that takes a large number of variables and puts them into a small number of “factors” (groups) with which all of the variables are related to
What’s the general main principle when performing factor analysis?
- Must identify the basic underlying variables which account for the correlations between the actual test scores
When do we use factor analysis?
- Simplify a large data set
- Map out the most important variables
- Theory testing
What’s the conceptual procedure for factor analysis?
1) Complete a correlation matrix for all variables
2) Extract factors from this correlation matrix
3) Decide how many factors are necessary to represent the correlation matrix “best”, often a very important and sometimes difficult decision
4) Once decided, must rotate the factor loading matrix
5) Interpret the rotated factors and label them
What’s important to know when extracting the first factor?
- The first factor is obtained by linear combining the variables so that the factor has the largest variance (i.e., it will encapsulate the largest number of values)
T/F: Want to capture the most variance possible while extracting the fewest factors possible
- TRUE
What are factor loadings?
- The simple correlation between factors and the variables
- We can interpret factors based on factor loadings
What are Eigenvalues?
- The sum of the squared factor loadings of unrotated factors (columnwise)
- This indicates the overall factor size/impact
- Important to use when determining the number of factors to extract
What’s the Eigenvalue-greater-than-one rule?
- A strategy to use when deciding how many factors you want to extract
- You extract factors that only have an Eigenvalue greater than one
- Downside: Can cause you to over extract factors
What’s the Scree plot method?
- Plots Eigenvalues, then select factors that are found before the “elbow” of the graph
What do reproduced correlations mean?
- Depending on how many factors you extract, you can create a new correlation matrix estimated entirely from the two factors retained
- A correlation between any two variables can be estimated by finding the sum of cross-products of factor loadings of the two variables on the same factor
What can residual correlations tell us?
- Original correlation - reproduced correlation
- If residual correlations are small, that means that the factors extracted do a good job at explaining the original observed correlations quite well
- i.e., want the residual correlations to be as small as possible
Why do we rotate factors?
- Want to achieve a simple structure, meaning we want to obtain factors that have high loadings for some variables, but low loadings for other variables
- SImple structure solution = each variable loads highly on only one factor
Why do we want to obtain a simple solution with our factors?
- Want to increase interpretability
- Remember: loadings = simple correlations
What’s orthogonal rotation?
- Means that the 90-degree angles between axes are maintained in the rotations