L10 - Factor analysis Flashcards
Factor analysis definition (Malhotra, 2010)
- a class of procedures primarily used for data reduction and summarization.
- take a large number of variables or objects and searches to see whether they have a small number of factors in common which account for their intercorrelations.
- an interdependence technique in that an entire set of interdependent relationships is examined.
Applications of Factor analysis (Malhotra, 2010)
1) Market segmentation - identify the underlying variable to group the customers.
2) Product research - determine the brand attributes that influence consumer choice
3) Advertising studies - identify the characteristics of price-sensitive consumers.
4) Data reduction; Structure identification; Measurement scale purification; Scale development and Data transformation.
Bartlett’s test of sphericity definition (Malhotra, 2010)
used to examine the hypothesis that the variables are uncorrelated in the population.
> the population correlation matrix is an identity matrix.
Correlation matrix definition (Malhotra, 2010)
a lower triangle matrix showing the simple correlations, r, between all possible pairs of variables included in the analysis.
Factor loading plot definition (Malhotra, 2010)
A plot of the original variables using the factor loadings as coordinates. The correlation of a variable and a factor indicates the degree of correspondence between the variable and a factor.
Factor matrix definition (Malhotra, 2010)
Contain the factor loadings of all variables on all the factors extracted.
Factor score coefficient matrix definition (Malhotra, 2010)
the matrix contains weights (or factor score coefficients) used to combine the standardized variables to obtain factor scores.
KMO (Kaiser-Meyer-Olkin) definition (Malhotra, 2010)
an index used to examine the appropriateness of FA. Value < 0.5 imply that FA may not be appropriate.
Residuals definition (Malhotra, 2010)
the differences between the observed correlations, as given in the input correlation matrix, and the reproduced correlations, as estimated from the factor matrix.
Scree plot definition (Malhotra, 2010; Field, 2013)
- A plot of eigenvalues against the number of factors in order of extraction. (X-axis: factor; Y-axis: eigenvalue)
- The point where curve first begins to straightens out = maximum number of factors.
The procedure for conducting factor analysis (Malhotra, 2010)
1) Formulate the problem
2) Construct the correlation matrix
3) Determine the method of factor analysis
4) Determine the number of factors
5) Rotate the factors
6) Interpret the factors
7) Calculate factor scores or select surrogate variables
8) Determine the fit of the FA model
Step 1: Formulate the problem
Identify the objectives of FA, as well as specifying the appropriate sample size.
Assumptions of the factor analysis (Malhotra, 2010)
- Metric measures variables.
- Sample size > 50 cases, each case > 5 per variable
- Sufficient correlations: > 0.30
Step 2: Construct the correlation matrix
To test the correlation:
- Test Ho with Bartlett’s test of sphericity: identity matrix, sig < 0.05
- Test with KMO: value < 0.5 => FA may not be appropriate
Two methods to derive the factor score coefficients:
Principal components analysis and Common factor analysis.
Principal components analysis definition (Malhotra, 2010)
- Considers the total variance in the data.
- Primary concern is to determine the minimum number of factors that will account for maximum variance in the data for use in subsequent multivariate analysis.