Deriving a Scale (Principle Component Analysis) Flashcards
Define:
Principle Components Analysis
A tool that helps identify which questions form a cohesive group versus those that give unique information.
(i.e. a tool to help organise different items measuring the same construct into one scale).
This is done by plotting them in a 3D space with the axes representing certain ‘dimensions’ (i.e. the ‘Principle Components’).
The tool ‘rotates’ the data-set until the ‘best summary’ is found, and then groups the items into these overall ‘Principle Components’.
List the FOUR main steps in developing a scale using PCA:
(PCA = Principle Components Analysis)
- Create Questions/Items.
- Apply Principle Components Analysis.
- Check Assumptions.
- Interpret & Identify Components.
Give THREE tips for developing questions to be analysed using PCA:
(PCA = Principle Components Analysis)
- Generate a large number of (high-quality) questions.
- Create questions that are appropriate for the aim, sample, & operationalisation(s).
- Balance (reverse-)wording and valence when constructing questions.
What is the name of the equivalent/similar tool to PCA that is specific to the field of Psychology?
(PCA = Principle Components Analysis)
‘Exploratory Factor Analysis’
Note: Although EFA is NOT the exact same as PCA, both are ‘variable reduction techniques’.
What are the TWO main forms of ‘cheating’ when using PCA?
Note: ‘Cheating’ may be unintentional too!
Inputting…
- many unrelated items.
- related pairs of items that don’t connect to any others.
- The PCA software will attempt to group regardless of ‘logical relatedness’ and will give results simply based on what was put in (“garbage in, garbage out”).
- This is essentially putting in your own ‘predetermined groups’ in order to verify your own thoughts/beliefs.
List:
The TWO key assumptions when applying PCA to your dataset, and the measures/tests for each to ensure they have been met.
Extra Knowledge Test: Define/describe each assumption and its test/measure.
- Sphericity + Bartlett Test
- Sampling Adequacy + Kaiser-Meyer-Olkin (KMO) Measure
- Sphericity concerns whether there is equal enough variance (in your items/between groups etc.) to form multiple/different scales.
A significant value (p-value <.05) needs to be obtained in a Bartlett Test to indicate the above assumption has been met.
- Sampling Adequacy and the KMO Measure relate to whether the items have unique variance (i.e. are not just pairs of concepts).
The value of the KMO Measure indicates the proportion of the variance in your variables/items that may be caused by underlying factors - the ‘cut-off’ value for this is >.5 , and the closer to 1.0 the better.
What TWO strategies are used for interpretting components?
(In order to determine potential scales)
- Analysing statistical indicators in a Scree plot.
- Applying our theoretical knowledge to the components.
When applying theoretical knowledge, it is important to asses whether or not the grouping outcomes appear to retain construct and internal validity, as well as whether or not these groupings differ to the ones you theorised.
Fill-in-the-Blanks:
A Scree plot helps determine the ____ of ____ that are statistically significant and best summarise the items inputted with PCA.
A Scree plot helps determine the number of components that are statistically significant and best summarise the items inputted with PCA.
Fill-in-the-Blanks:
In a Scree plot…
The ____ essentially represents the amount of variance/variation in the data that can be explained by each principle component.
This value is represented on the ____-axis.
- eigenvalue
- y
The eigenvalue of the first few PCs (i.e. PC1, PC2, PC3, etc.) is usually highest, and thus we tend to only focus on the ones before the plateu.
Fill-in-the-Blanks:
To determine the minimum number of components that best summarise the items, look for the point ____ the line drops to its plateu, and the final point ____ the simulations line.
- before
- above
In the image example, both of these indicators suggest the first two principle components best summarise the items inputted to our PCA.
How do you interpret item loadings and what do they tell you about the relationship between items and components?
Item loadings tell us how much of an item’s variation is explained by a particular component.
Therefore, the higher the item loading value for a certain item + component, the stronger the correlation between them.
List:
The SIX steps for reporting a PCA
- Examine the table of loadings.
- If there are cross-loaded or non-loaded items, consider restarting.
- Convert loadings table into APA format with item labels.
- Identify common themes within components.
- Average items together to form a scale.
- Report the descriptive statistics/reliability of the newly formed scale.