FA; Lec 3 & 4; Lab 2 & 3 Flashcards

1
Q

Give an example of CFA.

A

Given a set of data you could determine which factor theory of personality best represents the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In the FA output, what should you be able to tell from the second column in the table ‘Total Variance Explained’?

A

How much of the variance is explained by the factors with an Eigenvalue above 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a problem with the Kaiser Guttman criterion?

A

It is sensitive to the number of items. Therefore, an increase in items = increase in eigenvalue.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The Kaiser Guttmann needs to fulfill one of two criteria to be valid - what are they?

A
  1. Either there must be <30 variables and ALL communalities >.7

OR

  1. The sample size must be greater than >250 and AVERAGE communality must be >.6
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How does the Kaiser Guttman criterion work?

A

Generated factors with eigenvalues above 1 are removed as real factors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

One purpose of FA is to show how many distinct common factors are measured by a set of test items - give an example of this.

A

Are the supposed different constructs: neuroticism, anxiety, hysteria, ego strength, self-actualisation, and locus of control, 6 independent entities or would they be better described as only 2 factors? (Elements of pathology: neuroticism, anxiety and hysteria; Healthy mechanisms: ego-strength self-actualisation and locus of control)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Rotation has no impact on the overall variance explained - why do we do it?

A

Because we are searching for a simple structure and it helps us with this; it moves loadings around and cleans up the output. This aids our interpretation of the latent constructs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the most common orthogonal rotation?

A

Varimax

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

One purpose of FA is to determine whether tests that purportedly measure the same thing in fact do so - give an example of this.

A

3 tests that claim to measure anxiety - FA may produce more than one factor indicating something in addition to anxiety is being measured.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you conduct PCA (a type of EFA) with a correlation matrix and Varimax rotation in SPSS?

A

Analyse –> Dimension Reduction –> Factor –> Move all variables to ‘Variable’ box –> Extraction –> Scree plot –> Deselct ‘Unrotated Factor Solution –> Continue –> Descriptives –> KMO and Bartlett’s test of sphericity –> coefficients (for correlation matrix) –> Rotation –> Varimax –> Continue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you interpret factors?

A

You use the factor loadings - anything >.3/>.32

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why does the button ‘Eigenvalues over’ automatically become deselected when you indicated how many factors you want to be selected from the ‘Extraction’ dialogue box - why?

A

Because you aren’t using Eigenvalues anymore, you are forcing the result into a specific number of factors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an identity matrix?

A

When the R-matrix has no correlations/all correlations are 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If it is debatable whether, for example, a 2 or 3 factor solution makes more sense, what should you do?

A

Report the results of both interpretations and then follow one based on theoretical disposition. Since all the results will be reproduced enough information is available for someone with a different theoretical disposition to interpret the data in an alternative fashion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In the FA output, what does the table labelled ‘communalities’ tell us?

A

The first reads ‘Initial’ and indicates from a theoretical position that the communality of any item is potentially one.

The second, labelled ‘Extraction’ gives a different value for the communalities of the items after extraction has taken place.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

For data to be suitable for FA, should Bartlett’s be significant or not?

A

Significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the assumptions re variance of principal components analysis (PCA)?

A
  • All variance explained by the factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

In these two questions:
1. What is the capital of Spain?
2. What is the capital of Italy?
What is the common factor

A

Geographical knowledge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How do you report Bartlett’s

A

Χ2(df) chi sq value, p><0.05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Are loading factors with PAF going to seem less or more impressive than PCA?

A

PAF = Less impressive loading factors, because it allows for specific variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What must KMO value be for data to be suitable for FA?

A

Above 0.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are the 4 parametric assumptions?

A
  1. Must be continuous
  2. Variables much be normally distributed and outliers must have been appropriately dealt with
  3. Relationship between all variables appear to be linear, or at least not U-shaped or J shaped
  4. All variables must be independent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How do you estimate communality for PCA and PAF?

A
  1. PCA - it is assumed to be 100% and therefore there is no estimation required
  2. With PAF there is no agreed way to do this
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Whilst there are 7 steps to conducting PCA, this can be simplified to 3, what are they?

A
  1. Determine the suitability of the data
  2. Factor extraction
  3. Rotation and interpretation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

One purpose of FA is to check the psychometric properties of a questionnaire - give an example of this.

A

Would a different population made of Chinese identify the constructs of extraversion-introversion and neuroticism which have been found in European cultures?

Note: this would need to be done through confirmatory factor analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

When conducting FA you should look at the R-matrix for two potential problems, what are they?

A
  1. Correlations are too low - variables with lots of correlations .9 for two variables or >.6 among many variables also possibly a problem)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

specific variance

A

variance that cannot be explained by the factors - fluke knowledge (e.g. knowing the capital of Spain because you went there, but not actually having good geographical knowledge)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

If you decide that the weakest factor is not worth retaining, how do you get rid of it in your SPSS output?

A

Go to the Factor Analysis ‘Extraction’ dialogue box and in the Box labelled ‘Number of Factors’ type in the number of factors you want to retain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How do you read a scree plot?

A

Going from left to right draw the first straight line that shows the data leveling off (elbow).

No. factors above th line = number of factors to be retained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

In the FA output, what does the Rotated Component Matrix tell us?

A

It shows which items load heavily onto which factors based on the rotated solution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Aside from parametric assumptions, how do you know if data are suitable for EFA?

A
  1. There must be at least some correlations in the matrix that are above .3
  2. There must be at least 100 participants and more participants than items; although partly a function of the ‘strength’ of the data
32
Q

What is an eigenvalue?

Therefore, what does a scree plot indicate?

A

An indication of the amount of variance explained by any one factor.

The scree plot graphically indicates how much variance is explained by the removal of successive possible factors.

33
Q

What is the simplest form of structural equation modelling?

A

CFA

34
Q

What is the most common oblique rotation?

A

Direct Oblimin

35
Q

What does factor rotation do?

A

It changes the position of the factors to ease interpretation. Each factor should have some large loadings and some small ones (simple structure). Large numbers of mediocre loadings should be avoided.

36
Q

FA will produce a correlation matrix. However, if we want to run this in SPSS beforehand how would we do it?

A

Analyse –> Correlate –> Bivariate –> Move the variables we are interested in correlating to the ‘Variables’ box –> OK

37
Q

What is the difference between exploratory factor analysis and confirmatory factor analysis?

A

EFA seeks to determine the number and nature of factors which underpin a set of data; while CFA allows you to choose between alternative hypotheses which purport to represent your data.

38
Q

What are the 4 basic purposes of FA?

A
  1. To show how many distinct common factors are measured by a set of test items
  2. Shows which items relate to which common factors
  3. Determines whether tests that purportedly measure the same thing in fact do so
  4. Checks the psychometric properties of questionnaire - with a different sample do the same factors materialise (CFA only)
39
Q

In the FA output, what should you be able to tell from the third column (Rotation Sums of Squared loadings) in the table ‘Total Variance Explained’?

A

This column identifies the percentage of variance which each of the rotated factors now explains.

40
Q

A person’s score on a factor can be calculated from what?

A

Their responses to items that load onto that factor. Items with greater loadings have higher weighting. You can then use factor scores for subsequent tests.

41
Q

If you expect your factors to be uncorrelated which kind of rotation should you use?

A

orthogonal - look at the rotated component matrix

42
Q

What do we usually use the Kaiser criterion to identify factors, and not the Joliffe criterion?

A

Joliffe retains too many factors

43
Q

In the FA output where do you look for the rotated solution?

A

‘Rotated component matrix’

44
Q

What is a scree test?

A

It is based on eigenvalues of an unrotated solution

45
Q

What is a simple structure?

A

When a factor only have substatial loadings on a few item

46
Q

How does Bartlett’s test of sphericity tell you if the R-matrix is an identity matrix?

A

If it is all correlations will be zero.

A result with significance

47
Q

What is the difference between principal axis factoring and principal components analysis?

A

Principal components analysis (PCA) does not discriminate between common and specific variance, while principal axis factoring (PAF) does.

48
Q

SPSS sometimes produces strange numbers, if it produces a number with an E and a minus number after it e.g. 1.24E-02, what should you do?

A

Move the decimal place to the left the stipulated number of places. In this instance: 0.0124

49
Q

Large correlations between factors should be considered as what?

A

suspect

50
Q

Is there much difference between PCA and PAF?

A

They seem to produce very similar results; so much so that some researchers do not identify which one they are carrying out (although not good practice).

However, since PAF allows for specific variance then an item’s communality is necessarily going to be less than one; therefore, loading factors for items are going to appear less impressive with PAF as opposed to PCA.

51
Q

SPSS sometimes produces strange numbers, if it produces a number with an E and a positive number after it e.g. 1.24E02, what should you do?

A

Move the decimal place to the right the stipulated number of places. In this instance: 124.00

52
Q

Once you have your FA output, what 6 things should you look for to know that the data is suitable for FA?

A
  1. Correlation matrix - some of the variables must be above .3
  2. Bartlett’s test - if its significant this means it is different from data selected at random
  3. KMO - The least you should accept is .5 (.9 is superb)
  4. Communalities - you want some above .4
  5. The relationship between variables should be largely linear. But FA is robust so as long as it isn’t J or U shaped you are okay

6 scree plot - interpret left to right as elbow to get initial idea of no. of factors

53
Q

What are two formal tests to assess data suitability for FA?

A
  1. Kaiser-Meyer-Olkin (KMO) is a measure of sampling adequacy and addresses if the sample is big enough
  2. Bartlett’s test of sphericity - measures if the R-matrix is an identity matrix
54
Q

How would you clean up your component matrix if there were too many small values and it is hard to read?

A

Go to the Factor Analysis ‘Options’ and ‘suppress small coefficients’

This will get rid of values below .3

55
Q

Correlations detected from Bartlett’s are unlikely to be…?

A

The product of chance

56
Q

In the FA output, what does the Component Matrix tell us?

A

It shows intial correlations between the items and the proposed factors.

NOTE: Usually we don’t use this

57
Q

When you report factor analysis what 5 things should you include?

A
  1. Table of factor loadings
  2. Extraction
  3. Rotation methods
  4. Rationale for the number of factors
  5. Bartlett’s and KMO
58
Q

What are the two questions that need to be asked in determining the suitability of a set of data for FA?

A
  1. Is the sample big enough?
  2. Are there at least some correlations between items in the sample (above .3 in magnitude) that would suggest that there might be underlying factors that could summarise some of the items used?
59
Q
  1. What are the two types of exploratory factor analysis?

2. How are they different from one another?

A
  1. Principal components analysis and Principal axis factoring
  2. They make different assumptions regarding unexplained variance
60
Q

When reporting FA what eight details should you include?

A
  1. packages used (e.g. SPSS)
  2. Presence of correlations with values .3 and above
  3. KMO
  4. Bartlett’s
  5. Inspection of scree plot (sometimes)
  6. Number of factors with Eigenvalues exceeding 1 and how much variance (%) each one explains
  7. Justification for removal of any factors and how much variance is explained by each of remaining factors
  8. What kind of rotation was performed and whether this yielded a simple structure
61
Q

When factors are uncorrelated and commonalities are moderate, what does PCA produce?

A

Inflated values of variance accounted for by the components

62
Q

In the FA output, the table Total Variance Explained has three columns: ‘Initial Eigenvalues’, ‘Extraction Sums of Squared Loadings’ and ‘Rotation Sums of squared loadings’, what is the difference between these two?

A

The second column ignores potential factors that have an eigenvalue below 1

63
Q

According to PCA, what is ‘total variance’ equal to?

A

Total variance = common factor variance + measurement error

64
Q

What does variables being independent mean?

A

That they cannot be calculated from other variables - e.g. if item A was height and B was weight, then it would be inappropriate for C to be a height to weight ration since it would necessarily be correlated to both A and B

65
Q

What does varimax do?

A

Tries to equalise variance across all the factors

66
Q

What are 4 possible errors in factor analysis?

A
  1. Interpreting the unrotated solution (SPSS spits this out by default)
  2. Applying rigid rules to the extraction of factors (KG vs scree method)
  3. Replication is very important
  4. Factor validity is not attested to only by item content (face validity), it must also be compared with some other measure
67
Q

What you rotate a solution what changes? What doesn’t?

A

The communality of each variable remains the same.

The eigenvalues of factors do not

68
Q

What is regression and what is it used for?

A

It is a weighted sum of the factor loadings, used to calculate the factor scores.

69
Q

What is a varimax rotation?

A

An orthogonal rotation - tries to make factors independent by turning them to 90 degrees

70
Q

If you expect your factors to be correlated which kind of rotation should you use?

A

Oblique - look at the pattern matrix

71
Q

Why does the sample size have to be big enough?

A

Because FA is easily influenced by error variance

72
Q

Which is the simplest form of FA and suitable for undergraduate level?

A

Principal components analysis (PCA)

73
Q

According to principal components analysis (PCA), all items have a communality of what?

A

1

Therefore the factors will, between them, account for 100% of the variation among the items.

74
Q

KMO is a measure of sampling adequacy and addresses if the sample is big enough. What are its ranges?

A
>.5 min
.5-.7 mediocre
.7-.8 good
.8-.9 great
.9< superb
75
Q

What are the 7 stages of carrying out exploratory factor analysis?

A
  1. Ensure that data are suitable
  2. Decide on the model - PAF or PCA
  3. Decide how many factors are required to represent the data
  4. When using PAF estimate the communality of each factor
  5. Factor extraction
  6. Rotate the factors ensuring that simple structure has been reached
  7. Compute factor scores
76
Q

According to principal axis factoring (PAF), what is ‘total variance’ equal to?

A

Total variance = common factor variance + specific item variance + measurement error

77
Q

Is the scree plot alone enough to identify latent constructs?

A

No. FA should always be conducted in light of psychological theory - how many factors does it make sense to extract from a psychological point of view.