Factor Analysis Flashcards

1
Q

What is the difference between Principle component analysis (PCA) and factor analysis

A

PCA as data compression asks about using the data to find a succinct way of describing the data.
In factor analysis, conceptually we ask what (small number of) processes could give rise to the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In factor analysis, how do DVs relate to a factor

A

Use the correlation btw the scores of a DV and the factor scores, known as the loading on the factor
In the ‘component matrix’ of the PASW output

The larger the correlation of a factor to a DV, the closer the factor is related to the DV (or the closer the DV is related to the factor)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to perform factor analysis

A

look at scree plot (plot eigenvalue against component)–>count the number of eigenvalues (e.g., 2) that are larger then one–> look to see if there 2 factors–> rotate the components so that the loading were as different from each other as possible (e.g., varimax rotation, rotate so that for a given factor, items load either high or low)–>pick the higher loading for each item, label the factors, add the scores from the DVs linked to the latent variable and ignore DVs that do not contribute much, and then try to find the relationship between latent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are the prons and cons of PCA

A

Pros: Is a good way of describing data with small number of variables, operates in a principled way: maximise the variability explained, make sure the components are orthogonal,
know how much of the information in the data set is captured by a ‘solution’

Cons
not so easy to know the best ‘solution’
How many components to use is a trade off

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are the auumptions of PCA

A

multivariate normality

linear (streaight line) relationship btw the measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how to interpret PCA

A

First, how much of the variability of the original data have we captured
Overall: the sum of the eigenvalues included in the ‘solution’
Want to know about the individual DVs
how much of the information (variance) of a particular DV is being used in the components in the ‘solution’
Called the ‘communality’
If the calue is low the variable might not be that useful

How well did you know this?
1
Not at all
2
3
4
5
Perfectly