SPSS video lectures Flashcards

1
Q

What is skewness?

A

tail leans to left or right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can you get an indication of the normality (normal distribution) on the data depending on what?

A

By looking at the mean and the median. The closer they are, the more likely it is that its normally distributed. its just a relative question.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is kurtosis?

A

data is peaked or flat. we want a bell-shaped curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the rule of thumb for skewness and kurtosis?

A

an absolute value of one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If you have a test of normality and you have a sample size above 50, what do you use?

A

the Kolmogorov-Smirnov test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If you have a test of normality and you have a sample size below 50, what do you use?

A

the Shaoiro-Wilk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When having a test of normality, what is the rule of thumb for the significance?

A

It needs to be above 0.05. Because a value under 0.05 indicates non-normality. (abnormal distribution).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can you interpret a box plot?

A

The more symmetric the boxplot, the more normally distributed the data is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Correlation varies from what interval?

A

-1 (perfectly negatively correlated) to +1 (perfectly correlated

0 is no correlation

if its below 0.3 don’t consider it
0.3 -> 0.6 medium sized correlation
above 0.6 strong correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does it mean when we say that the correlation coefficient is significantly different from zero?

A

the correlation coefficient is significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is independent samples t-test? Give an example

A

Independent samples t-test is when we have to group means (like men vs women average income) that we want to compare to see if the means are significantly different from each other, the same or just random chance. in this case the two groups are not related to each other.

not to be compared wit repeated measure t-test: womens income at time 1 and then time 2. two dependent group (same group at different times)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When we have dummy variables, always do what when coding?

A

code them as 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Levenes test for equality of variances?

A

this is one of the assumptions of the independent samples t-test. the two groups should have equal variances. if we have quite different group sizes, there is a chance the variance will not be equal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the standard cutoff for significance?

A

0.05

> 0.05 = equal variances

< 0.05 = would be significant difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the rule of thumb for t-tests?

A

if the t value < 2, then its prob not significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the rule of thumbs for Cohens D point estimate?

A

anything around 0.2 would be a small effect
< 0.2, dont interpret it
0.5 medium effect
0.8 large effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is paired samples/repeated measures t-tests?

A

you look at the same period but for different time periods. they are dependent on each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What does the significance level in a paired sampled t-test tell us?

A

if its < 0.05 we can say that there is a significant difference between the means
if its larger than > 0.05 the difference are by chance, there is no significant difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is Exploratory factor analysis?

A

It is a way of establishing construct validity which is made up of two parts: convergent validity and discriminant validity.

20
Q

What is a latent construct? (EFA)

A

Something that is not directly observable, like collaboration or trust so we observe through ex a questionnaire. Everything in like colllaboration should converge. But other factors like trust should diverge from each other. (discriminant). because otherwise trust and collaboration is overlapping.

21
Q

Is EFA subjective or objective?

A

subjective

22
Q

A factor coefficient can be anywhere from… what range?

A

-1 to +1

0.4-0.5 (absolute value) is meaningful

being close 0 is meaningless

23
Q

How can you know if the factor ability of the data is good?

A

Look at the correlation matrix. But we also have the KMO and Bartletts test

24
Q

What is Kaiser-Meyer-Olkin measure (KMO) of sampling adequacy?

A

The Kaiser-Meyer-Olkin (KMO) measure is a statistic that helps you decide if your data is suitable for factor analysis, which is a method used to identify patterns or groupings in data.

Key Points in Simple Terms:
1. Purpose: It checks if the variables in your dataset are related enough to each other to form groups (factors).

  1. Scale:
    • The KMO value ranges from 0 to 1.
    • Higher values (closer to 1) mean that your data is good for factor analysis.
    • Lower values (closer to 0) mean that your data is not suitable for factor analysis because the variables don’t relate well to each other.
  2. Rule of Thumb:
    • KMO ≥ 0.8: Excellent.
    • 0.7 ≤ KMO < 0.8: Good.
    • 0.6 ≤ KMO < 0.7: Acceptable.
    • KMO < 0.6: Poor, factor analysis might not work well.
  3. How It Works: The KMO compares:
    • Correlations: How strongly variables are related.
    • Partial Correlations: How much the relationship between two variables is explained by other variables.
      If correlations are much stronger than partial correlations, the KMO will be high, meaning factor analysis is appropriate.

Example:
If you have a survey with several questions, the KMO helps you know if the questions are related enough to summarize them into fewer factors (like grouping questions about “job satisfaction” together).

In summary, the KMO is like a “test of fit” to see if your data can be simplified and grouped through factor analysis!

the absolute lowest value for KMO is 0.5

0.9 is close to 1 so its Marvelous. you prefer a higher KMO

25
Q

What is the Anti-image matrix?

A

lets say that our KMO is 0.5 or 0.4. its low and we are not happy with it. what we do is look at the anti-image correlation and we look at the diagonal. what could happen is that lets its a bad indicator; that would have great effect on the KMO. so if we remove that from the analysis our KMO would go up. so the anti-image corr matrix shows our each contributor to the KMO. so we can identify one indicator or two that we can remove.

26
Q

What are communalities?

A
27
Q

What is the rule of thumb for communalities?

A

We want them to be above 0.5. Ex, we have 4 that is 0.4, the other are well above 0.5. EFA is subjective and we dont go through it using absoulte, we go through it collecting evidence. So our KMO was fine but here we get an indication that some factors perhaps dont share enough common variance to remain in the model. But we would not remove them. We just know some of them would be problematic.
the lower the communality, the more problematic.

28
Q

What about the total variance explained table in EFA?

A

Here we want to look at the cumulative variance explained for the solution. And how many factors were extracted? Did we want 3? Ok, is it reasonable given how close to 1 the Eigenvalue is? You can force a 3 factor solution

29
Q

What can you do to make a Rotated Factor Analysis easier to read?

A

Suppress small values (below 0.4-0.5).

30
Q

When you have a 3 factor solution, how can you know in percentage how much of the variance is explained in the factor model?

A

the cumulative % under initial eigenvalues

31
Q

When do we know that we have discriminant validity?

A

When each factor indicator loads above 0.5 on one factor and below 0.5 on the other factors

32
Q

We have convergent validity in a rotated factor matrix when…

A

each indicator is loaded on the proper factor (ensamma på sina ställen)

33
Q

We have discriminant validity in a rotated factor matrix when…

A

they are not loading on the wrong places.

34
Q

What is another word for Reliability analysis?

A

Cronbachs alpha

35
Q

What is the generally accepted cutoff value for Chronbachs alpha?

A

above 0.7 gives a reliable construct

36
Q

What is another term for multiple regression model?

A

ordinary least squares regression

37
Q

What is another word for R squared?

A

explained variance

38
Q

What is another word for coefficient of determination? (explained variance)

A

R squared

39
Q

When do you use R-squared?

A

When you are evaluating a single model

40
Q

When do you use R-squared adjusted?

A

When you comparing different models

41
Q

What happens to R-squared when you keep adding more independent variables?

A

R-squared go up

42
Q

What does parsimony mean?

A

a simple regression model is referred over a complicated regression model. 2 or 3 independent variables is better than 10.

43
Q

What is another name for the significance?

A

P value

44
Q

What is the classical cutoff value for significance?

A

below 0.05. it means that the statistics is significant. = r squared is significantly different from zero = good regression model. above 0.05 = the factor does not say anything

45
Q

What can we say about r-squared when the significance is below 0.05?

A

r squared is significantly different from zero. its a good regression model