FA; Lec 3 & 4; Lab 2 & 3 Flashcards
Give an example of CFA.
Given a set of data you could determine which factor theory of personality best represents the data
In the FA output, what should you be able to tell from the second column in the table ‘Total Variance Explained’?
How much of the variance is explained by the factors with an Eigenvalue above 1
What is a problem with the Kaiser Guttman criterion?
It is sensitive to the number of items. Therefore, an increase in items = increase in eigenvalue.
The Kaiser Guttmann needs to fulfill one of two criteria to be valid - what are they?
- Either there must be <30 variables and ALL communalities >.7
OR
- The sample size must be greater than >250 and AVERAGE communality must be >.6
How does the Kaiser Guttman criterion work?
Generated factors with eigenvalues above 1 are removed as real factors.
One purpose of FA is to show how many distinct common factors are measured by a set of test items - give an example of this.
Are the supposed different constructs: neuroticism, anxiety, hysteria, ego strength, self-actualisation, and locus of control, 6 independent entities or would they be better described as only 2 factors? (Elements of pathology: neuroticism, anxiety and hysteria; Healthy mechanisms: ego-strength self-actualisation and locus of control)
Rotation has no impact on the overall variance explained - why do we do it?
Because we are searching for a simple structure and it helps us with this; it moves loadings around and cleans up the output. This aids our interpretation of the latent constructs.
What is the most common orthogonal rotation?
Varimax
One purpose of FA is to determine whether tests that purportedly measure the same thing in fact do so - give an example of this.
3 tests that claim to measure anxiety - FA may produce more than one factor indicating something in addition to anxiety is being measured.
How do you conduct PCA (a type of EFA) with a correlation matrix and Varimax rotation in SPSS?
Analyse –> Dimension Reduction –> Factor –> Move all variables to ‘Variable’ box –> Extraction –> Scree plot –> Deselct ‘Unrotated Factor Solution –> Continue –> Descriptives –> KMO and Bartlett’s test of sphericity –> coefficients (for correlation matrix) –> Rotation –> Varimax –> Continue
How do you interpret factors?
You use the factor loadings - anything >.3/>.32
Why does the button ‘Eigenvalues over’ automatically become deselected when you indicated how many factors you want to be selected from the ‘Extraction’ dialogue box - why?
Because you aren’t using Eigenvalues anymore, you are forcing the result into a specific number of factors.
What is an identity matrix?
When the R-matrix has no correlations/all correlations are 0
If it is debatable whether, for example, a 2 or 3 factor solution makes more sense, what should you do?
Report the results of both interpretations and then follow one based on theoretical disposition. Since all the results will be reproduced enough information is available for someone with a different theoretical disposition to interpret the data in an alternative fashion.
In the FA output, what does the table labelled ‘communalities’ tell us?
The first reads ‘Initial’ and indicates from a theoretical position that the communality of any item is potentially one.
The second, labelled ‘Extraction’ gives a different value for the communalities of the items after extraction has taken place.
For data to be suitable for FA, should Bartlett’s be significant or not?
Significant
What are the assumptions re variance of principal components analysis (PCA)?
- All variance explained by the factors
In these two questions:
1. What is the capital of Spain?
2. What is the capital of Italy?
What is the common factor
Geographical knowledge
How do you report Bartlett’s
Χ2(df) chi sq value, p><0.05
Are loading factors with PAF going to seem less or more impressive than PCA?
PAF = Less impressive loading factors, because it allows for specific variance
What must KMO value be for data to be suitable for FA?
Above 0.5
What are the 4 parametric assumptions?
- Must be continuous
- Variables much be normally distributed and outliers must have been appropriately dealt with
- Relationship between all variables appear to be linear, or at least not U-shaped or J shaped
- All variables must be independent
How do you estimate communality for PCA and PAF?
- PCA - it is assumed to be 100% and therefore there is no estimation required
- With PAF there is no agreed way to do this
Whilst there are 7 steps to conducting PCA, this can be simplified to 3, what are they?
- Determine the suitability of the data
- Factor extraction
- Rotation and interpretation
One purpose of FA is to check the psychometric properties of a questionnaire - give an example of this.
Would a different population made of Chinese identify the constructs of extraversion-introversion and neuroticism which have been found in European cultures?
Note: this would need to be done through confirmatory factor analysis
When conducting FA you should look at the R-matrix for two potential problems, what are they?
- Correlations are too low - variables with lots of correlations .9 for two variables or >.6 among many variables also possibly a problem)
specific variance
variance that cannot be explained by the factors - fluke knowledge (e.g. knowing the capital of Spain because you went there, but not actually having good geographical knowledge)
If you decide that the weakest factor is not worth retaining, how do you get rid of it in your SPSS output?
Go to the Factor Analysis ‘Extraction’ dialogue box and in the Box labelled ‘Number of Factors’ type in the number of factors you want to retain.
How do you read a scree plot?
Going from left to right draw the first straight line that shows the data leveling off (elbow).
No. factors above th line = number of factors to be retained
In the FA output, what does the Rotated Component Matrix tell us?
It shows which items load heavily onto which factors based on the rotated solution